FretBench
A benchmark suite for evaluating guitar fretboard note-name reasoning in language models.
answer
A benchmark suite for evaluating guitar fretboard note-name reasoning in language models.
| # | Model | Score | Tuning Breakdown | Cost | Last Tested | |
|---|---|---|---|---|---|---|
| 1 | Gemini 3.1 Pro Google | 100.0% | Std 100% DropD 100% HSD 100% DropDb 100% | $1.296 | Apr 11, 2026 | |
| 2 | DeepSeek V3.2 SpecialeOW DeepSeek | 99.6% | Std 99% DropD 100% HSD 100% DropDb 100% | $0.423 | Mar 10, 2026 | |
| 3 | DeepSeek V3.2 Speciale (Reasoning)OW DeepSeek | 99.6% | Std 99% DropD 100% HSD 100% DropDb 100% | $0.437 | Mar 10, 2026 | |
| 4 | Qwen 3.5 PlusOW Alibaba | 99.6% | Std 99% DropD 100% HSD 100% DropDb 100% | $0.605 | Mar 9, 2026 | |
| 5 | Kimi K2.5 (Reasoning)OW Moonshot | 99.6% | Std 99% DropD 100% HSD 100% DropDb 100% | $0.625 | Mar 10, 2026 |