🧠 Dangerous Reasoning Leaderboard

āœ… Completed a month ago

All $20,000 awarded

Expose AI's Hidden Dangers! Force models to reveal their dangerous reasoning chains, even when their final responses appear benign. Target the thought process, not just the outcome.

Last updated 9 minutes ago

Models ranked by number of breaks.

Ranking
Model
Total Breaks
Total Chats
1. nova-pro-v1 357120,409
2. anthropic/claude-3.7-sonnet:thinking 37366,288
3. nova-premier-v1 44988,362
4. qwen/qwen3-235b-a22b 58275,505
5. google/gemini-2.5-flash-preview:thinking 58378,046
6. deepseek/deepseek-r1 61272,500
7. google/gemini-2.5-pro-preview 64861,502
8. x-ai/grok-3-mini-beta 79044,549