FretBench

A benchmark suite for evaluating guitar fretboard note-name reasoning in language models.
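
As a rough illustration of the kind of question such a benchmark poses, the sketch below computes the note name at a given string and fret for four common tunings. It is not FretBench's code. The `note_at` helper, the `TUNINGS` table, the sharps-only spelling, and the mapping of the leaderboard labels (Std, DropD, HSD, DropDb) to standard, Drop D, half-step-down, and Drop Db tunings are all assumptions made for this example.

```python
# Chromatic scale spelled with sharps; the list index is the pitch class (C = 0).
NOTES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]

# Flat spellings mapped to their sharp equivalents so they can be looked up in NOTES.
FLATS = {"Db": "C#", "Eb": "D#", "Gb": "F#", "Ab": "G#", "Bb": "A#"}

# Open-string notes from the lowest (6th) string to the highest (1st).
# Assumption: these are the tunings the leaderboard labels refer to.
TUNINGS = {
    "Std": ["E", "A", "D", "G", "B", "E"],
    "DropD": ["D", "A", "D", "G", "B", "E"],
    "HSD": ["Eb", "Ab", "Db", "Gb", "Bb", "Eb"],
    "DropDb": ["Db", "Ab", "Db", "Gb", "Bb", "Eb"],
}


def note_at(tuning: str, string: int, fret: int) -> str:
    """Return the note name at `fret` on `string` (6 = lowest, 1 = highest)."""
    if not 1 <= string <= 6:
        raise ValueError("string must be between 1 and 6")
    if fret < 0:
        raise ValueError("fret must be non-negative")
    open_note = TUNINGS[tuning][6 - string]
    open_note = FLATS.get(open_note, open_note)
    # Each fret raises the pitch by one semitone; wrap around after 12.
    return NOTES[(NOTES.index(open_note) + fret) % 12]


print(note_at("Std", 6, 3))    # G
print(note_at("DropD", 6, 5))  # G
```

In this sketch every answer is spelled with sharps, so the 12th fret of the first string in half-step-down tuning comes back as D# rather than Eb. A real grader would need to decide whether enharmonic spellings count as equivalent.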

 

Leaderboard

| # | Model | Provider | Score | Tuning Breakdown | Cost | Last Tested |
|---|-------|----------|-------|------------------|------|-------------|
| 1 | Gemini 3.1 Pro | Google | 100.0% | Std 100%, DropD 100%, HSD 100%, DropDb 100% | $1.296 | Apr 11, 2026 |
| 2 | DeepSeek V3.2 Speciale OW | DeepSeek | 99.6% | Std 99%, DropD 100%, HSD 100%, DropDb 100% | $0.423 | Mar 10, 2026 |
| 3 | DeepSeek V3.2 Speciale (Reasoning) OW | DeepSeek | 99.6% | Std 99%, DropD 100%, HSD 100%, DropDb 100% | $0.437 | Mar 10, 2026 |
| 4 | Qwen 3.5 Plus OW | Alibaba | 99.6% | Std 99%, DropD 100%, HSD 100%, DropDb 100% | $0.605 | Mar 9, 2026 |
| 5 | Kimi K2.5 (Reasoning) OW | Moonshot | 99.6% | Std 99%, DropD 100%, HSD 100%, DropDb 100% | $0.625 | Mar 10, 2026 |