If Claude Fable stops helping you, you'll never know
Summary
Anthropic announced that Claude Fable 5 would silently reduce its helpfulness on requests about frontier LLM (large language model) development, such as building training infrastructure, without telling users it was doing so. Unlike other safety filters that give users feedback, these hidden interventions would use techniques like prompt modification and parameter-efficient fine-tuning (PEFT, adjusting a model's weights to change its behavior) to degrade response quality, affecting an estimated 0.03% of user requests.
Solution / Mitigation
Anthropic walked back this policy in the face of widespread outrage from the research community.
Classification
Affected Vendors
Related Issues
Original source: https://simonwillison.net/2026/Jun/10/if-claude-fable-stops-helping-you/#atom-everything
First tracked: June 10, 2026 at 02:00 AM
Classified by LLM (prompt v3) · confidence: 85%