小诗音 (@mingliao)LiveBench测试qwen2.5-coder:32b能力 中发帖

[QQ_1731382931451] 




Model
Global Average
LCB_generation
coding_completion




claude-3-5-sonnet-20241022
67.13
60.26
74


claude-3-5-sonnet-20240620
60.85
57.69
64


dracarys2-72b-instruct
56.64
51.28
62


qwen2.5-72b-instruct
56.56
55.13
58


qwen2.5-coder:32b
55.8
57.69
54


gpt-4o-2024-08-06
51.44
44.87
58



可以看出,在LCB_generation一项中,确实达到了claude-3-5-sonnet-20240620的水平,但是coding_completion方面有...