aisecwatch.com
DashboardVulnerabilitiesNewsResearchArchiveStatsDatasetFor devs
Subscribe
aisecwatch.com

Real-time AI security monitoring. Tracking AI-related vulnerabilities, safety and security incidents, privacy risks, research developments, and policy changes.

Navigation

VulnerabilitiesNewsResearchDigest ArchiveNewsletter ArchiveSubscribeData SourcesStatisticsDatasetAPIIntegrationsWidgetRSS Feed

Maintained by

Truong (Jack) Luu

Information Systems Researcher

Industry News

New tools, products, platforms, funding rounds, and company developments in AI security.

to
Export CSV
2874 items

GPT-5.5 Instant: smarter, clearer, and more personalized

infonews
industry
May 5, 2026

OpenAI has released GPT-5.5 Instant, an updated version of ChatGPT's default model that aims to provide smarter, more accurate answers with clearer language and better personalization based on your conversation history. The new model produces 52.5% fewer hallucinated claims (false or made-up statements) compared to the previous version on high-stakes topics like medicine and law, and includes a new 'memory sources' feature that shows you what past context was used to personalize your responses, giving you control to edit or delete outdated information.

Fix: The source mentions the following controls and mitigations for personalization concerns: Users can delete chats they no longer want cited, delete or change items in saved memories through settings, or use temporary chats that don't use or update memory. When a response is personalized, users can see what context was used in 'memory sources' and delete or correct outdated information. Memory sources are not shown to others if you share a chat. The source also notes that 'memory sources are designed to make personalization easier to understand' and OpenAI plans to make this feature 'more comprehensive over time.'

OpenAI Blog

Unlocking large scale AI training networks with MRC (Multipath Reliable Connection)

infonews
industry
May 5, 2026

OpenAI and partners (AMD, Broadcom, Intel, Microsoft, NVIDIA) developed MRC (Multipath Reliable Connection), a new networking protocol that improves data transfer speed and reliability in supercomputer clusters used for AI model training. MRC addresses key challenges in large-scale AI training by reducing network congestion through adaptive packet spraying (distributing data across multiple paths), enabling redundancy to tolerate failures, and using static source routing (predetermined paths that bypass failed connections) to prevent training jobs from crashing when network failures occur.

GPT-5.5 Instant System Card

infonews
safety
May 5, 2026

GPT-5.5 Instant is OpenAI's latest fast-response AI model that uses safety methods similar to previous versions, but is the first Instant model classified as having high capability in cybersecurity and biological/chemical preparedness risks, so it has additional safeguards in place. The document clarifies naming conventions to avoid confusion: GPT-5.5 Instant (also called gpt-5.5-instant) should be compared to GPT-5.3 Instant, and the full GPT-5.5 model is referred to as GPT-5.5 Thinking.

CISOs step up to the security workforce challenge

infonews
policy
May 5, 2026

Cybersecurity leaders face a critical shortage of skilled workers, with 95% of organizations reporting at least one security skills gap and AI identified as the most pressing skill need. While some companies address this by investing in in-house training to develop employees from other technical fields into security roles (a process taking up to two years), AI both helps automate some defensive tasks and simultaneously worsens the problem by enabling attackers to operate at larger scales, increasing overall demand for skilled defenders.

Google DeepMind workers in UK vote to unionize amid deal with US military

infonews
policy
May 5, 2026

Workers at Google DeepMind's UK laboratory voted to form a union, citing concerns about a recently announced deal between Google and the US military. The workers, represented by two unions, worry that the military partnership raises ethical questions about the company's responsibility in developing AI technology.

datasette-llm 0.1a7

infonews
industry
May 4, 2026

Datasette-llm 0.1a7 is a plugin (a software add-on) that lets other plugins use AI models in a coordinated way. The release adds a feature to set default options for specific models, such as specifying which model to use for enrichment operations (adding data to existing information) and adjusting its temperature parameter (a setting that controls how creative or random the AI's responses are).

llm-echo 0.5a0

infonews
industry
May 4, 2026

llm-echo 0.5a0 is a debug plugin (a tool that helps developers test code) for LLM that provides a fake AI model called "echo" for testing purposes instead of running a real LLM. The new version adds a "-o thinking 1" option to simulate reasoning blocks (the internal steps an AI uses to work through problems) and is compatible with LLM 0.32a0 and higher.

Anthropic Mythos spurs White House to weigh pre-release reviews for high-risk AI models

infonews
policysecurity

New ways to buy ChatGPT ads

infonews
industry
May 4, 2026

OpenAI is expanding its ChatGPT advertising pilot by introducing new tools that make it easier for businesses to create and buy ads. Advertisers can now use a beta self-serve Ads Manager (a tool for setting up and managing ad campaigns) or work through partners, and can choose between cost-per-click (CPC, paying only when someone clicks an ad) or cost-per-mille (CPM, paying per 1,000 ad views) bidding options. The platform includes measurement tools that let advertisers see campaign performance without accessing user conversations, maintaining privacy.

Advancing youth safety and wellbeing in EMEA

infonews
safetypolicy

OpenAI’s president does ‘all the things,’ except answer a question

infonews
security
May 4, 2026

This article covers legal testimony from OpenAI president Greg Brockman in Elon Musk's lawsuit against OpenAI, focusing on his evasive responses and pedantic corrections during cross-examination. The piece suggests Brockman's journal entries are key evidence in the case, while highlighting his reluctance to directly answer questions.

OpenAI sales leader leaves for role at Thrive Capital

infonews
industry
May 4, 2026

James Dyett, a senior sales leader at OpenAI who managed enterprise and API (application programming interface, a set of tools that lets different software communicate) sales, is leaving the company to join venture capital firm Thrive Capital. His departure is the latest in a series of leadership changes at OpenAI, following exits by several other executives in recent months.

OpenAI and PwC collaborate to reimagine the office of the CFO

infonews
industry
May 4, 2026

OpenAI and PwC are collaborating to help finance teams use AI agents (software programs that can autonomously perform tasks) to automate workflows, reduce manual work, and improve decision-making in finance departments. The partnership is building these agents based on real-world experience from OpenAI's own finance organization, where they have already seen results like processing 5 times more contracts with the same team size.

How orphaned applications are quietly fueling your shadow IT problem

infonews
security
May 4, 2026

Orphaned applications are unused software systems that remain running in an organization's network long after their original purpose has ended, often due to workforce changes or shifting business priorities. They create significant security and compliance risks because IT teams lose track of them, meaning updates are missed, access permissions remain active, and sensitive data may continue flowing through them without proper oversight. The source explains that traditional IT asset tracking methods fail to catch these hidden systems because they only record planning decisions rather than what's actually happening on the network right now.

Anthropic teams with Goldman, Blackstone and others on $1.5 billion AI venture targeting PE-owned firms

infonews
industry
May 4, 2026

Anthropic has partnered with Goldman Sachs, Blackstone, and other investment firms to create a $1.5 billion venture that will deploy Claude, Anthropic's AI model, directly into businesses. The partnership aims to address a shortage of experts who can implement AI technology in real-world business operations by embedding engineers inside companies to redesign workflows and integrate AI into core processes, starting with companies owned by the investment firms.

AI platforms reference Nigel Farage more than other leaders when prompted on UK politics, study shows

infonews
research
May 4, 2026

A study found that AI platforms disproportionately reference Nigel Farage and Reform UK more than other UK political leaders when answering questions about British politics. Researchers suggest this indicates Reform UK has achieved unusual visibility in LLMs (large language models, AI systems trained on text data to generate responses).

Week one of the Musk v. Altman trial: What it was like in the room

infonews
policy
May 4, 2026

Elon Musk is suing OpenAI and CEO Sam Altman in federal court, claiming he invested millions expecting OpenAI to remain a nonprofit organization but alleges the company was secretly converted into a for-profit corporation, deceiving him about its original mission. The trial centers on whether Musk was actually deceived and when he discovered this alleged misconduct, with Musk seeking damages and the reversal of OpenAI's restructuring that reduced the nonprofit portion's control.

Musk texted OpenAI's Brockman about settlement two days before trial began

infonews
policy
May 4, 2026

Elon Musk, who co-founded OpenAI in 2015, is suing the company for allegedly breaking its commitment to remain a nonprofit and pursue a charitable mission, claiming they instead commercialized the AI technology. Two days before the trial started, Musk texted OpenAI's president Greg Brockman about settling the case, but when Brockman suggested both sides drop their claims, Musk responded with a threat about making him and CEO Sam Altman "the most hated men in America."

Copirate 365 at DEF CON: Plundering in the Depths of Microsoft Copilot (CVE-2026-24299)

highnews
security
May 4, 2026

This writeup describes vulnerabilities found in Microsoft Copilot products that allow attackers to steal sensitive data through multiple attack chains, including data exfiltration via HTML preview features, hijacking the AI's long-term memory through prompt injection (tricking an AI by hiding instructions in its input), and creating persistent backdoors. The vulnerabilities, assigned CVE-2026-24299, exploited what researchers call the "lethal trifecta," where an AI has access to private data, untrusted content, and external communication channels simultaneously.

Security agencies draw red lines around agentic AI deployments

infonews
securitypolicy
Previous51 / 144Next

Fix: MRC has been released through the Open Compute Project (OCP) as an open standard for the industry to use. The specification extends RDMA over Converged Ethernet (RoCE, a hardware-accelerated data transfer standard) and incorporates SRv6-based source routing to support large-scale AI networking fabrics.

OpenAI Blog
OpenAI Blog

Fix: Some CISOs address skills gaps through in-house training and development: hiring people with solid technical foundations in areas like networking, server administration, or software development, then transitioning them into security roles over approximately two years. Additionally, security leaders are encouraging their teams to leverage AI tools and examine how vendors are using AI, recognizing that AI competency will be essential in cybersecurity's future.

CSO Online
The Guardian Technology
Simon Willison's Weblog
Simon Willison's Weblog
May 4, 2026

The Trump administration is considering requiring advanced AI models to be reviewed before public release, particularly those capable of helping users find software vulnerabilities (weaknesses in code that attackers can exploit). This discussion was prompted by Anthropic's Mythos model, which can identify thousands of high-severity vulnerabilities better than most human programmers, though the company has not released it publicly and instead created Project Glasswing to give selected companies access for defensive purposes (finding and fixing vulnerabilities before attackers do).

CSO Online
OpenAI Blog
May 4, 2026

OpenAI has published a European Youth Safety Blueprint with five practical pillars to help protect young people using AI, including age-appropriate safeguards, privacy-preserving age verification, and parental controls. The company is also funding 12 organizations across Europe, the Middle East, and Africa with €500,000 in grants to conduct research and programs on youth safety, AI literacy, and mental health support in real-world settings.

OpenAI Blog
The Verge (AI)
CNBC Technology
OpenAI Blog
CSO Online
CNBC Technology
The Guardian Technology
MIT Technology Review
CNBC Technology

Fix: Microsoft patched these issues. The source states: "MSRC assigned CVE-2026-24299 and the issues are now patched." No specific patch version number or detailed mitigation steps are provided in the source text.

Embrace The Red
May 4, 2026

Security agencies including CISA have issued joint guidance on safely deploying agentic AI (autonomous AI systems that can take actions independently), warning that prompt injection (tricking an AI by hiding instructions in its input) and other attacks are common threats. The advisory recommends organizations implement strict access controls using the principle of least privilege (giving systems only the minimum permissions they need), continuous monitoring with human oversight, and careful testing before deploying AI agents to production environments.

Fix: The source text outlines recommended design and development guidelines including: strong authentication using Secure by Design principles, enforcing least-privilege principles and isolating agent capabilities, maintaining a clear inventory of agent capabilities and dependencies, implementing continuous monitoring and auditing of AI agent operations, integrating human control and oversight into workflows (including live monitoring during task execution and human approval for decision-making steps), validating how agents interpret inputs to guard against prompt injection, and regular testing of incident response plans.

CSO Online