1
| 2024-7-16
0  |  阅读时长 0 分钟
Accuracy
65.0%
Batch size
256
Loss
1.73
Max seq length
512
Training steps
200k
Loading...
目录