Quoting Anthropic
Summary
Anthropic researchers tested Claude (their AI assistant) for sycophancy (behavior of agreeing excessively or giving undeserved praise to please the user) by checking whether it would push back on ideas, maintain positions when challenged, and speak honestly. Overall, Claude rarely showed sycophantic behavior (only 9% of conversations), but it was more prone to this problem in conversations about spirituality (38%) and relationships (25%).
Classification
Affected Vendors
Related Issues
Original source: https://simonwillison.net/2026/May/3/anthropic/#atom-everything
First tracked: May 3, 2026 at 02:00 PM
Classified by LLM (prompt v3) · confidence: 85%