ā Completed 2 months ago
Expose AI's Hidden Dangers! Force models to reveal their dangerous reasoning chains, even when their final responses appear benign. Target the thought process, not just the outcome.
Awards:
Ill intent in chain-of-thought, covertly harmful outputs
By participating, you agree to the arena terms.