3
| 2024-7-16
0  |  阅读时长 0 分钟
Accuracy
64.6%
Batch size
256
Loss
1.75
Max seq length
256
Training steps
120k
Loading...
目录