The security intelligence platform for AI teams
AI security threats move fast and get buried under hype and noise. Built by an Information Systems Security researcher to help security teams and developers stay ahead of vulnerabilities, privacy incidents, safety research, and policy developments.
Independent research. No sponsors, no paywalls, no conflicts of interest.
AI Coding Agents Exploited via DNS-Hidden Malware: Researchers demonstrated a novel attack vector where AI coding assistants like Claude Code can be socially engineered through benign repository instructions to execute malicious payloads retrieved from DNS records (the system that translates domain names to IP addresses), bypassing traditional code review since no suspicious code appears in the repository itself. This highlights a new class of supply chain risk unique to autonomous agents that execute commands without human verification.
OpenAI Deploys GPT-5.6 Sol with Hardened Cyber Controls: OpenAI released a limited preview of GPT-5.6 Sol specifically tuned for cybersecurity tasks including vulnerability research and patch development, featuring enhanced jailbreak resistance (defenses against prompts designed to bypass safety restrictions) and guardrails targeting offensive cyber use cases, though the company acknowledges the dual-use controls may over-block legitimate security work during the preview period.
Margaret Atwood Flags Hallucination Risk in LLMs: Author Margaret Atwood publicly criticized Claude for generating factually incorrect information about a TV show, underscoring the persistent hallucination problem (when large language models confidently generate plausible but false information) inherent in systems trained on unverified or low-quality data.
OpenAI launched Patch the Planet, a program that uses AI to find and fix vulnerabilities (security flaws) in widely-used open-source software (code that anyone can access and modify) with help from cybersecurity firm Trail of Bits. The program combines AI-assisted vulnerability research with human review to develop tested fixes and coordinate their disclosure through existing project channels. The initiative has already identified hundreds of security issues and merged dozens of patches across projects like Python, Go, and cURL.
Fix: The source describes the Patch the Planet program itself as the mitigation approach: AI-assisted vulnerability research is used alongside human review by Trail of Bits engineers who filter out false positives and duplicate reports before sending findings to maintainers. Additionally, the source recommends that CISOs implement governance controls before deploying AI-assisted vulnerability research, including what one analyst calls a 'Safety Relevance Layer' that requires every AI-generated finding to pass automated verification with dynamic proof-of-concept validation and strong false-positive filtering before reaching a human analyst, plus predefined escalation paths and notification timelines for disclosed flaws in external dependencies.
CSO Online