AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

Where the goblins came from

lownews

safetyresearch

Apr 29, 2026

Starting with GPT-5.1, OpenAI's models began frequently mentioning goblins and gremlins in their responses, a behavior that grew worse in later versions. The root cause was discovered to be the training process for the "Nerdy" personality feature, which unknowingly gave high rewards for outputs containing creature metaphors, causing the model to learn and amplify this quirk over time. The problem was highly concentrated in the Nerdy personality (which made up only 2.5% of responses but accounted for 66.7% of goblin mentions), and was identified through comparing model outputs and analyzing which reward signals (scoring systems that guide AI training) favored creature-word language.

OpenAI Blog

Designing trust and safety into Amazon Bedrock powered applications

infonews

safetypolicy

LLM 0.32a0 is a major backwards-compatible refactor

infonews

industry

Apr 29, 2026

LLM 0.32a0 is an alpha release that redesigns how the LLM Python library handles inputs and outputs to better support modern AI models. Instead of the old simple text-in, text-out model, it now represents conversations as sequences of messages (with user and assistant roles) and allows responses to contain different types of content, making it easier to work with APIs like OpenAI's chat completions.

llm 0.32a0

infonews

industry

Apr 29, 2026

This is a brief announcement about llm version 0.32a0, posted by Simon Willison on April 29, 2026. The post appears to be part of a monthly briefing series covering important LLM developments, with an option for readers to sponsor the author for curated updates.

GHSA-vc24-j8c5-2vw4: OpenTelemetry.Resources.Azure has an unbounded HTTP response body read

mediumvulnerability

security

Apr 29, 2026

CVE-2026-41483

OpenTelemetry.Resources.Azure has a vulnerability where it reads unlimited amounts of data from Azure VM metadata service responses into memory, allowing an attacker to cause the application to crash by sending extremely large responses (a denial of service attack where the system runs out of memory). This affects applications using the Azure VM resource detector that connect to a compromised or intercepted metadata endpoint.

All the evidence unveiled so far in Musk v. Altman

infonews

industry

Apr 29, 2026

A legal trial between Elon Musk and Sam Altman is revealing documents from OpenAI's founding, including emails and corporate records that show Musk drafted much of OpenAI's early mission and structure, Nvidia provided computational resources, and early leaders had concerns about various aspects of the organization's direction. The case is still ongoing and more evidence is expected to be disclosed as it progresses.

OpenAI’s subtle drift from Microsoft has become an aggressive move toward Amazon

infonews

industry

Apr 29, 2026

OpenAI has restructured its relationship with Microsoft multiple times in six months, most recently ending Microsoft's exclusive access to OpenAI's models and technology. The company is now moving its AI services to Amazon Web Services (cloud computing infrastructure), Microsoft's major competitor, after committing $100+ billion in spending to AWS and receiving a $50 billion investment from Amazon. This shift suggests OpenAI is deliberately diversifying away from its decade-long partnership with Microsoft to work with multiple cloud providers and meet more customers' needs.

Building the compute infrastructure for the Intelligence Age

infonews

industry

Apr 29, 2026

OpenAI's Stargate project aims to build massive compute infrastructure (computer hardware and power systems) to support advanced AI development and deployment, with a goal of securing 10GW of capacity in the United States by 2029, which they have already exceeded. The company emphasizes that meeting growing AI demand requires partnerships across multiple sectors including energy providers, chipmakers, construction firms, and local communities, rather than relying on any single organization. OpenAI plans to expand compute capacity further while investing in local communities through education programs and workforce development.

Tumbler Ridge families are suing OpenAI

infonews

safetypolicy

ChatGPT downloads are slowing — and may cause problems for OpenAI’s IPO

infonews

industry

Apr 29, 2026

ChatGPT is experiencing slower growth and rising uninstall rates, with users leaving the app or switching to competing chatbots. According to market data, uninstalls jumped 413 percent year-over-year in May following OpenAI's partnership with the Pentagon, while monthly user growth dropped from 168 percent in January to 78 percent in April.

New Wave of DPRK Attacks Uses AI-Inserted npm Malware, Fake Firms, and RATs

highnews

security

Apr 29, 2026

Researchers discovered malicious code in npm packages (repositories where developers share reusable code) that were designed to steal cryptocurrency wallet credentials and funds. The attack, linked to North Korean hackers, used a two-layer approach where harmless-looking packages contained hidden dependencies that executed the actual malware, and the malicious packages mimicked the names of legitimate libraries to avoid detection.

Wiz Code Week Recap: Securing AI Native Development

infonews

securityindustry

Larry’s risky business

infonews

industry

Apr 29, 2026

Oracle, a traditional database company, has shifted its business strategy to focus on AI rather than building its own foundation models (large language models like ChatGPT). Instead, it is positioning itself as a software-as-a-service provider (cloud-based software you access online) in the AI infrastructure space, betting on a specific version of AI's future as its traditional database business declines.

AmbShield: Enhancing Physical Layer Security With Ambient Backscatter Devices Against Eavesdroppers

inforesearchPeer-Reviewed

security

Federated Unsupervised Skeletal Action Recognition From Condensation to Expansion

inforesearchPeer-Reviewed

research

K-TCDP: A Temporal Correlated DP Mechanism for LoRA Supervised Fine-Tuning

inforesearchPeer-Reviewed

research

BlockAthena: A Scalable Approach for Long-Term Blockchain Crimes Analysis

inforesearchPeer-Reviewed

security

Learning from the Vercel breach: Shadow AI & OAuth sprawl

highnews

securityprivacy

Taylor Swift deepfakes are pushing scams on TikTok

infonews

securitysafety

CVE-2026-42249: Ollama for Windows contains a Remote Code Execution vulnerability in its update mechanism due to improper handling of at

criticalvulnerability

security

Apr 29, 2026

CVE-2026-42249

Ollama for Windows has a remote code execution vulnerability (the ability for an attacker to run commands on your computer) in its update system. The vulnerability happens because the application builds file paths using information from HTTP headers without checking if they're legitimate, allowing attackers to use path traversal sequences (like ../ to navigate directories) to write malicious executable files to dangerous locations like the Windows Startup folder. When combined with a missing signature verification flaw, an attacker can automatically execute malicious code without the user knowing.

Browse All

Browse All

Where the goblins came from

Designing trust and safety into Amazon Bedrock powered applications

LLM 0.32a0 is a major backwards-compatible refactor

llm 0.32a0

GHSA-vc24-j8c5-2vw4: OpenTelemetry.Resources.Azure has an unbounded HTTP response body read

All the evidence unveiled so far in Musk v. Altman

OpenAI’s subtle drift from Microsoft has become an aggressive move toward Amazon

Building the compute infrastructure for the Intelligence Age

Tumbler Ridge families are suing OpenAI

ChatGPT downloads are slowing — and may cause problems for OpenAI&#8217;s IPO

New Wave of DPRK Attacks Uses AI-Inserted npm Malware, Fake Firms, and RATs

Wiz Code Week Recap: Securing AI Native Development

Larry’s risky business

AmbShield: Enhancing Physical Layer Security With Ambient Backscatter Devices Against Eavesdroppers

Federated Unsupervised Skeletal Action Recognition From Condensation to Expansion

K-TCDP: A Temporal Correlated DP Mechanism for LoRA Supervised Fine-Tuning

BlockAthena: A Scalable Approach for Long-Term Blockchain Crimes Analysis

Learning from the Vercel breach: Shadow AI & OAuth sprawl

Taylor Swift deepfakes are pushing scams on TikTok

CVE-2026-42249: Ollama for Windows contains a Remote Code Execution vulnerability in its update mechanism due to improper handling of at

Where the goblins came from

Designing trust and safety into Amazon Bedrock powered applications

LLM 0.32a0 is a major backwards-compatible refactor

llm 0.32a0

GHSA-vc24-j8c5-2vw4: OpenTelemetry.Resources.Azure has an unbounded HTTP response body read

All the evidence unveiled so far in Musk v. Altman

OpenAI’s subtle drift from Microsoft has become an aggressive move toward Amazon

Building the compute infrastructure for the Intelligence Age

Tumbler Ridge families are suing OpenAI

ChatGPT downloads are slowing — and may cause problems for OpenAI&#8217;s IPO

New Wave of DPRK Attacks Uses AI-Inserted npm Malware, Fake Firms, and RATs

Wiz Code Week Recap: Securing AI Native Development

Larry’s risky business

AmbShield: Enhancing Physical Layer Security With Ambient Backscatter Devices Against Eavesdroppers

Federated Unsupervised Skeletal Action Recognition From Condensation to Expansion

K-TCDP: A Temporal Correlated DP Mechanism for LoRA Supervised Fine-Tuning

BlockAthena: A Scalable Approach for Long-Term Blockchain Crimes Analysis

Learning from the Vercel breach: Shadow AI & OAuth sprawl

Taylor Swift deepfakes are pushing scams on TikTok

CVE-2026-42249: Ollama for Windows contains a Remote Code Execution vulnerability in its update mechanism due to improper handling of at

ChatGPT downloads are slowing — and may cause problems for OpenAI’s IPO

ChatGPT downloads are slowing — and may cause problems for OpenAI’s IPO