5
| 2024-7-16
0  |  阅读时长 0 分钟
Accuracy
67.4%
Batch size
256
Loss
1.58
Max seq length
1536
Training steps
10k
Loading...
目录