Command Palette
Search for a command to run...
Logical Reasoning On Lingoly
评估指标
Delta_NoContext
Exact Match Accuracy
评测结果
各个模型在此基准测试上的表现结果
0 of 11 row(s) selected.
Search for a command to run...
各个模型在此基准测试上的表现结果
Search for a command to run...
各个模型在此基准测试上的表现结果