Gray Swan Arena
Push the boundaries of AI safety and security. Identify risks, exploit vulnerabilities, and help shape the future of safe AI systems.
Ongoing Competition
Compete in our long-running challenges and break AI models in your free time.
Global Participation
Connect with AI enthusiasts and experts from around the world.
Substantial Prizes
Win recognition and rewards for your innovative jailbreaks.
Featured Challenges
Compete in these AI security challenges to win prizes and improve your skills.
Upcoming: 🧪 Proving Ground
Your weekly arena for AI red-team mastery. Fresh challenges every Wednesday across Chat, Image, Agents, and Reasoning attack types.
All $20,000 awarded
Completed: 🧠 Dangerous Reasoning
Expose AI's Hidden Dangers! Force models to reveal their dangerous reasoning chains, even when their final responses appear benign. Target the thought process, not just the outcome.
All $60,000 awarded
Completed: 👁️ Visual Vulnerabilities
Use image inputs to jailbreak leading vision-enabled AI models. Visual prompt injections, chem/bio/cyber weaponization, privacy violations, and more.
All $171,800 awarded
Completed: 🕵️ Agent Red-Teaming
Push the limits of direct and indirect attacks on AI agents.
All $40,000 awarded
Completed: 🤖 Harmful AI Assistant
Jailbreak helpful AI assistants into aiding harmful tasks across six areas.
All $7,000 awarded
Completed: Multi-Turn Harmful Outputs
Elicit harmful outputs from LLMs through long-context interactions across multiple messages.
All $6,000 awarded
Completed: 🖼️ Multimodal Jailbreaks
Jailbreak multimodal LLMs through a combination of visual and text inputs.
All $6,000 awarded
Completed: 👩‍💻 Harmful Code Generation
Find unique ways to return functional code that completes harmful tasks such as opening circuit breakers to cause a system-wide blackout.
All $1,000 awarded
Completed: Revealing Hidden CoT
Attack OpenAI's o1 model to reveal the internal chain of thought (CoT) it uses for complex reasoning.
$38,000 of $42,000 awarded
In Progress: 💣 Single-Turn Harmful Outputs
Attempt to break various large language models (LLMs) using a single chat message.
Top Winners
| Ranking | Participant | Winnings |
|---|---|---|
| 1 | Wyatt Walls | $10,870 |
| 2 | Solomon Zoe | $10,665 |
| 3 | Bob1 | $9,516 |
| 4 | Clovis Mint | $9,483 |
| 5 | zardav | $7,997 |
| 6 | Scrattlebeard | $7,253 |
| 7 | diogenesofsinope | $7,112 |
| 8 | Lyren | $6,660 |
| 9 | Haris Umair (Strawberry_Cake) | $6,575 |
| 10 | P1njec70r | $6,448 |
| 11 | Kkonatruck | $6,165 |
| 12 | WINTER IS COMING → IS HERE → IS GONE → | $5,391 |
| 13 | Schultzika | $5,339 |
| 14 | Strigiformes | $4,667 |
| 15 | h4xor | $4,546 |