小诗音 (@mingliao) 在 LiveBench测试qwen2.5-coder:32b能力中发帖[QQ_1731382931451] ModelGlobal AverageLCB_generationcoding_completionclaude-3-5-sonnet-2024102267.1360.2674claude-3-5-sonnet-2024062060.8557.6964dracarys2-72b-instruct56.6451.2862qwen2.5-72b-instruct56.5655.1358qwen2.5-coder:32b55.857.6954gpt-4o-2024-08-0651.4444.8758可以看出，在LCB_generation一项中，确实达到了claude-3-5-sonnet-20240620的水平，但是coding_completion方面有...

小诗音 (@mingliao) 在 LiveBench测试qwen2.5-coder:32b能力中发帖

[QQ_1731382931451] 




Model
Global Average
LCB_generation
coding_completion




claude-3-5-sonnet-20241022
67.13
60.26
74


claude-3-5-sonnet-20240620
60.85
57.69
64


dracarys2-72b-instruct
56.64
51.28
62


qwen2.5-72b-instruct
56.56
55.13
58


qwen2.5-coder:32b
55.8
57.69
54


gpt-4o-2024-08-06
51.44
44.87
58



可以看出，在LCB_generation一项中，确实达到了claude-3-5-sonnet-20240620的水平，但是coding_completion方面有...