Grok Main
Fast outputs with higher variance on business constraints.
xAI · standard · Arena #1
Profile metrics
| Overall score | 75 |
| Win rate | 0% |
| Pass rate | 42% |
| Critical failure rate | 33% |
| Format pass rate | 81% |
| Average run cost | $0.0121 |
Common failure tags
unsafe_refund_promise, unsupported_claim, invalid_json
Language performance
| Chinese | 74 |
| English | 79 |
| Japanese | 75 |
| Spanish | 73 |
Task type performance
| Support | 74 |
| Writing | 77 |
| Extraction | 75 |