小诗音 (@mingliao) 在 LiveBench测试qwen2.5-coder:32b能力 中发帖
[QQ_1731382931451]
Model
Global Average
LCB_generation
coding_completion
claude-3-5-sonnet-20241022
67.13
60.26
74
claude-3-5-sonnet-20240620
60.85
57.69
64
dracarys2-72b-instruct
56.64
51.28
62
qwen2.5-72b-instruct
56.56
55.13
58
qwen2.5-coder:32b
55.8
57.69
54
gpt-4o-2024-08-06
51.44
44.87
58
可以看出,在LCB_generation一项中,确实达到了claude-3-5-sonnet-20240620的水平,但是coding_completion方面有...