š§ Dangerous Reasoning Leaderboard
ā Completed 24 days ago
All $20,000 awarded
Expose AI's Hidden Dangers! Force models to reveal their dangerous reasoning chains, even when their final responses appear benign. Target the thought process, not just the outcome.
Last updated 2 minutes ago
Models ranked by number of breaks.
Ranking | Model | Total Breaks | Total Chats |
---|---|---|---|
1. | Model Socrates | 357 | 120,217 |
2. | Model Archimedes | 372 | 66,168 |
3. | Model Homer | 448 | 88,076 |
4. | Model hooks | 581 | 75,357 |
5. | Model Aristotle | 583 | 77,898 |
6. | Model Wollstonecraft | 611 | 72,289 |
7. | Model Confucius | 648 | 61,394 |
8. | Model Plato | 788 | 44,454 |