BlocktoBlockto
AI Researcher Claims Anthropic’s Claude Fable 5 Guardrails Were Bypassed Within Days
AI

Photo: Illustrative

AI Researcher Claims Anthropic’s Claude Fable 5 Guardrails Were Bypassed Within Days

An artificial intelligence and cybersecurity researcher known as “Pliny the Liberator” claims to have bypassed the safety protections built into Anthropic’s newly launched Claude Fable 5 model less than 48 hours after its release.

Tristan R.
By Tristan R.

Senior Author · June 11, 2026

2 min
Key takeaways
An artificial intelligence and cybersecurity researcher known as “ Pliny the Liberator ” claims to have bypassed the safety protections built into Anthropic’s newly launched Claude Fable 5 model less than 48 hours after its release.
🚨 JAILBREAK ALERT 🚨 ANTHROPIC: PWNED 🫡 FABLE-5: LIBERATED 🦋 let s start with the 🐘 the consensus seems to be that this has been one of the most disappointing model drops of all time, effectively preventing legitimate researchers from contributing their talents to our… pic.twitter.com/Z0vdPIt4vY Pliny the Liberator 🐉󠅫󠄼󠄿󠅆󠄵󠄐󠅀󠄼󠄹󠄾󠅉󠅭 (@elder_plinius) June 10, 2026 Claude Fable 5 was introduced as a heavily safety-focused version of Anthropic’s more powerful Mythos model.
The company limited access to Mythos and designed Fable 5 with stricter controls to prevent users from obtaining potentially harmful information, including hacking methods and dangerous scientific instructions.

An artificial intelligence and cybersecurity researcher known as “Pliny the Liberator” claims to have bypassed the safety protections built into Anthropic’s newly launched Claude Fable 5 model less than 48 hours after its release.

Claude Fable 5 was introduced as a heavily safety-focused version of Anthropic’s more powerful Mythos model. The company limited access to Mythos and designed Fable 5 with stricter controls to prevent users from obtaining potentially harmful information, including hacking methods and dangerous scientific instructions.

Techniques Used to Bypass AI Guardrails

According to Pliny, several methods were used to work around the model’s restrictions. These included Unicode and homoglyph manipulation, long-context prompts, narrative framing, academic-style decomposition and recomposition, and assistance from a jailbroken version of Claude Opus 4.8.

Pliny said decomposition and recomposition was particularly effective. The technique breaks a sensitive request into multiple harmless-looking questions, which can later be combined to produce information that safety systems are designed to block.

Critics Question Fable 5 Restrictions

Since launch, Claude Fable 5 has faced criticism from some researchers who argue that the model’s restrictions limit legitimate research. When users ask about certain cybersecurity or scientific topics, the system redirects them to a less capable model instead of providing detailed responses.

Anthropic Says Testing Found No Universal Jailbreak

Before release, Anthropic conducted internal evaluations and an external bug bounty program to identify weaknesses. The company stated that more than 1,000 hours of testing failed to uncover any universal jailbreak capable of bypassing all safeguards.

How markets are positioning

Live market reaction

🛢️WTI Crude
+3.4%
Gold
+1.8%
Bitcoin
-1.8%
$DXY
+0.6%

Disclaimer

This content is for informational purposes only and does not constitute financial, investment, or legal advice. Cryptocurrency trading involves risk and may result in financial loss.

Exclusive partner offer

Start trading
with BloFin today

Up to $500 sign-up bonus and zero-fee trading on your first 30 days.

Buy crypto now

You will be redirected to BloFin

Share article

About the author

Tristan R.
Tristan R.

8+ years covering crypto markets, macro, and geopolitics. Previously at Decrypt and CoinDesk. Focused on the intersection of digital assets and traditional finance.