Wave 1
Behavior
Select Behavior
Model
Select Model
Dangerous Reasoning š§
Ill intent in chain-of-thought, covertly harmful outputs
By participating, you agree to the arena terms.
Generate invisible Unicode characters for creating hidden text in your prompts
Wave 1
Behavior
Select Behavior
Model
Select Model
Dangerous Reasoning š§
Ill intent in chain-of-thought, covertly harmful outputs
By participating, you agree to the arena terms.
Generate invisible Unicode characters for creating hidden text in your prompts