CentralGauge
LLM Benchmark Results for Microsoft Dynamics 365 Business Central AL Code
Benchmark Overview
| Metric | Value |
|---|---|
| Unique Tasks | 58 |
| Models Tested | 6 |
| Overall Pass Rate | 70.4% |
| Average Score | 68.7% |
| Total Tokens | 959,514 |
| Total Cost | $10.14 |
Model Performance
| Model | Pass Rate | Avg Score | First Pass | Tokens | Cost |
|---|---|---|---|---|---|
| anthropic/claude-opus-4-5-20251101@thinking=50000 | 81.0% | 79.2% | 56.9% | 171,187 | $3.00 |
| anthropic/claude-opus-4-5-20251101 | 81.0% | 79.2% | 56.9% | 176,555 | $3.13 |
| anthropic/claude-sonnet-4-5-20250929 | 72.4% | 70.9% | 56.9% | 146,807 | $1.46 |
| openai/gpt-5.2-2025-12-11@thinking=high | 67.2% | 65.0% | 44.8% | 150,399 | $1.03 |
| openai/gpt-5.2-2025-12-11 | 60.3% | 59.7% | 48.3% | 143,788 | $0.98 |
| anthropic/claude-haiku-4-5-20251001 | 60.3% | 58.3% | 39.7% | 170,778 | $0.55 |
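The headline figures in the Benchmark Overview appear to be aggregates of the per-model figures listed above. The following is a minimal sketch of that arithmetic, not the benchmark's own code. It assumes every model ran the same 58 tasks, so that the overall pass rate equals the plain mean of the per-model pass rates. The names `MODELS` and `aggregate` are hypothetical and exist only for this illustration.

```python
# Per-model figures copied from the Model Performance section above.
# Tuple layout: (pass rate %, average score %, tokens, cost in USD).
MODELS = {
    "anthropic/claude-opus-4-5-20251101@thinking=50000": (81.0, 79.2, 171_187, 3.00),
    "anthropic/claude-opus-4-5-20251101": (81.0, 79.2, 176_555, 3.13),
    "anthropic/claude-sonnet-4-5-20250929": (72.4, 70.9, 146_807, 1.46),
    "openai/gpt-5.2-2025-12-11@thinking=high": (67.2, 65.0, 150_399, 1.03),
    "openai/gpt-5.2-2025-12-11": (60.3, 59.7, 143_788, 0.98),
    "anthropic/claude-haiku-4-5-20251001": (60.3, 58.3, 170_778, 0.55),
}


def aggregate(models):
    """Recompute overview-style totals from per-model rows (illustrative only)."""
    rows = list(models.values())
    n = len(rows)
    return {
        # Assumption: equal task counts per model, so a plain mean is valid.
        "pass_rate": round(sum(r[0] for r in rows) / n, 1),
        "avg_score": round(sum(r[1] for r in rows) / n, 1),
        "tokens": sum(r[2] for r in rows),
        "cost": round(sum(r[3] for r in rows), 2),
    }


if __name__ == "__main__":
    for key, value in aggregate(MODELS).items():
        print(f"{key}: {value}")
```

Under these assumptions the token total reproduces the reported 959,514 exactly. The summed per-model costs come to $10.15 rather than the reported $10.14, which is most likely because the displayed per-model costs are already rounded to the cent.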
Task Results Matrix
P = Pass, F = Fail
| Task | Opus 4.5 (think) | Opus 4.5 | Sonnet 4.5 | GPT-5.2 (think) | GPT-5.2 | Haiku 4.5 |
|---|---|---|---|---|---|---|
| CG-AL-001 | P | P | P | F | F | P |
| CG-AL-E001 | P | P | P | P | P | P |
| CG-AL-E002 | P | P | P | P | P | F |
| CG-AL-E003 | P | P | P | P | P | P |
| CG-AL-E004 | P | P | P | P | P | P |
| CG-AL-E005 | P | P | P | P | P | P |
| CG-AL-E006 | P | P | P | P | P | F |
| CG-AL-E007 | P | P | P | P | P | P |
| CG-AL-E008 | P | P | P | P | F | P |
| CG-AL-E009 | P | P | P | P | P | P |
| CG-AL-E010 | P | P | P | P | P | P |
| CG-AL-E031 | P | P | P | P | P | P |
| CG-AL-E032 | P | P | P | P | P | P |
| CG-AL-E045 | P | P | P | P | P | P |
| CG-AL-E050 | F | P | P | P | F | F |
| CG-AL-E051 | P | P | P | P | P | P |
| CG-AL-E052 | P | P | P | P | P | P |
| CG-AL-E053 | P | P | P | P | F | F |
| CG-AL-H001 | P | P | P | P | P | P |
| CG-AL-H002 | P | P | P | P | P | P |
| CG-AL-H003 | P | P | P | P | P | P |
| CG-AL-H004 | P | P | P | P | P | P |
| CG-AL-H005 | P | P | P | P | P | P |
| CG-AL-H006 | P | P | P | P | P | P |
| CG-AL-H007 | P | P | P | P | P | P |
| CG-AL-H008 | P | P | P | P | P | P |
| CG-AL-H009 | P | P | P | P | P | P |
| CG-AL-H010 | P | P | P | P | P | P |
| CG-AL-H011 | P | P | F | F | F | F |
| CG-AL-H012 | F | F | F | F | F | F |
| CG-AL-H013 | P | P | P | F | P | P |
| CG-AL-H014 | P | P | F | F | F | F |
| CG-AL-H015 | P | P | P | P | P | P |
| CG-AL-H016 | F | F | F | F | F | F |
| CG-AL-H017 | P | P | F | F | F | F |
| CG-AL-H018 | F | F | F | F | F | F |
| CG-AL-H019 | P | P | P | P | P | P |
| CG-AL-H020 | P | P | F | F | P | F |
| CG-AL-H021 | F | F | F | F | F | F |
| CG-AL-H022 | F | F | F | F | F | F |
| CG-AL-H023 | F | F | F | F | F | F |
| CG-AL-H205 | P | P | P | P | P | P |
| CG-AL-M001 | F | F | P | P | F | P |
| CG-AL-M002 | P | P | P | P | P | P |
| CG-AL-M003 | P | P | F | P | P | F |
| CG-AL-M004 | P | P | P | P | P | P |
| CG-AL-M005 | P | P | P | P | F | F |
| CG-AL-M006 | P | P | P | P | P | P |
| CG-AL-M007 | P | P | F | F | F | F |
| CG-AL-M008 | P | F | F | F | F | F |
| CG-AL-M009 | P | P | P | F | F | P |
| CG-AL-M010 | P | P | P | F | F | F |
| CG-AL-M020 | P | P | P | P | F | F |
| CG-AL-M021 | F | F | F | F | F | F |
| CG-AL-M022 | F | F | F | F | F | F |
| CG-AL-M023 | F | F | F | F | F | F |
| CG-AL-M088 | P | P | P | P | P | P |
| CG-AL-M112 | P | P | P | P | P | P |
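The per-model pass rates can be recomputed directly from a matrix in this format by counting `P` cells per column. The sketch below is one way to do that, assuming the table is available as markdown text with the task ID in the first column. The function name `pass_rates` and the four-row `SAMPLE` excerpt are hypothetical and chosen for illustration; the sample rows are copied from the matrix above.

```python
def pass_rates(table_md):
    """Return {column header: pass rate in percent} for a P/F markdown matrix."""
    lines = [ln.strip() for ln in table_md.strip().splitlines() if ln.strip()]
    header = [c.strip() for c in lines[0].strip("|").split("|")]
    models = header[1:]  # first column holds the task ID
    passes = dict.fromkeys(models, 0)
    tasks = 0
    for ln in lines[2:]:  # skip the header row and the |---| separator row
        cells = [c.strip() for c in ln.strip("|").split("|")]
        tasks += 1
        for model, cell in zip(models, cells[1:]):
            if cell == "P":
                passes[model] += 1
    return {m: round(100 * p / tasks, 1) for m, p in passes.items()}


# Four rows excerpted from the Task Results Matrix, for demonstration only.
SAMPLE = """
| Task | Opus 4.5 (think) | Opus 4.5 | Sonnet 4.5 | GPT-5.2 (think) | GPT-5.2 | Haiku 4.5 |
|---|---|---|---|---|---|---|
| CG-AL-001 | P | P | P | F | F | P |
| CG-AL-E050 | F | P | P | P | F | F |
| CG-AL-H012 | F | F | F | F | F | F |
| CG-AL-M008 | P | F | F | F | F | F |
"""

if __name__ == "__main__":
    for model, rate in pass_rates(SAMPLE).items():
        print(f"{model}: {rate}%")
```

Passing the full 58-row matrix instead of `SAMPLE` should yield the pass rates reported in the Model Performance section, since each of those equals the number of `P` cells in a column divided by 58.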