AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

Prompting Frameworks for Large Language Models: A Survey

info, research, Peer-Reviewed

research

Apr 1, 2026

This is an academic survey paper that reviews different prompting frameworks, which are structured approaches to asking large language models (AI systems trained on huge amounts of text) questions or giving them instructions to complete tasks. The paper, published in a major computer science journal, catalogues and analyzes various methods researchers have developed to improve how effectively people interact with and get useful results from LLMs.

ACM Digital Library (TOPS, DTRAP, CSUR)

Claude Code users hitting usage limits 'way faster than expected'

medium, news

security, safety

Mutation testing for the agentic era

info, news

security, research

CVE-2026-23404: In the Linux kernel, the following vulnerability has been resolved: apparmor: replace recursive profile removal with it

info, vulnerability

security

Apr 1, 2026

CVE-2026-23404

A vulnerability in the Linux kernel's AppArmor security module (a tool that controls what programs can access on a system) causes the system to crash when removing deeply nested profiles, because the recursive removal function exhausts the kernel stack. The fix replaces the recursive profile removal with an iterative approach (a loop that repeats steps instead of a function calling itself) that achieves the same result without growing the stack with each level of nesting.
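The general technique of swapping recursion for an explicit work stack can be sketched as follows. This is a minimal Python illustration, not the kernel patch: the `Profile` class and the `remove_profiles_iteratively` function are hypothetical names invented for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Profile:
    """Hypothetical stand-in for a nested profile; not the kernel's data structure."""
    name: str
    children: list["Profile"] = field(default_factory=list)


def remove_profiles_iteratively(root: Profile) -> list[str]:
    """Detach every profile under root, children before parents, without recursion."""
    removed: list[str] = []
    # Each entry is (profile, expanded); expanded means its children were already queued.
    stack: list[tuple[Profile, bool]] = [(root, False)]
    while stack:
        profile, expanded = stack.pop()
        if expanded:
            # All descendants have been handled, so this profile can be detached.
            profile.children.clear()
            removed.append(profile.name)
            continue
        # Revisit this profile after its children have been processed.
        stack.append((profile, True))
        for child in profile.children:
            stack.append((child, False))
    return removed
```

The work list lives on the heap, so nesting depth is bounded by available memory rather than by a fixed call-stack size.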

Google Addresses Vertex Security Issues After Researchers Weaponize AI Agents

medium, news

security

Apr 1, 2026

Palo Alto Networks revealed security problems in Google Cloud Platform's Vertex AI (Google's AI service for building and deploying machine learning models) after researchers demonstrated how to weaponize AI agents, which are autonomous programs that can perform tasks with minimal human input. Google has begun addressing these disclosed security issues.

Claude Code Source Leaked via npm Packaging Error, Anthropic Confirms

high, news

security, privacy

I wore Meta’s smartglasses for a month – and it left me feeling like a creep

info, news

safety, privacy

Attack Surface Management: A Buyer's Guide

info, news

security, industry

datasette-enrichments-llm 0.2a0

info, news

industry

Mar 31, 2026

This is a brief announcement about datasette-enrichments-llm version 0.2a0, posted by Simon Willison on April 1st, 2026. The content primarily consists of a sponsorship pitch for a monthly email digest covering important LLM (large language model) developments, rather than discussing a specific security issue or technical problem.

datasette-llm-usage 0.2a0

info, news

industry

Mar 31, 2026

datasette-llm-usage version 0.2a0 removed features for tracking allowances and pricing, which moved to a separate tool called datasette-llm-accountant, and added the ability to log complete prompts, responses, and tool calls (automated functions the AI can call) to a database table if enabled through a configuration setting. The simple prompt page was redesigned and now requires specific user permissions to access.

datasette-llm 0.1a5

info, news

industry

Mar 31, 2026

datasette-llm 0.1a5 is a release of a plugin that lets other software tools integrate with large language models. The update improves the llm_prompt_context() plugin hook (a mechanism that other plugins can connect to), so it now tracks both individual prompts and chains of prompts executed together, including tool call loops (repeated back-and-forth exchanges between the AI and external functions).

Anthropic employee error exposes Claude Code source

high, news

security

Mar 31, 2026

An Anthropic employee accidentally exposed the source code for Claude Code (an AI programming tool) by leaving a source map file (.map file, a debugging file that maps minified code back to its original, human-readable source) in a package published on npm (a registry where developers share code). This is a security risk because attackers can use source maps to understand how the code works, find vulnerabilities, and potentially steal secrets such as API keys that might be embedded in the code.
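A pre-publish check for stray source maps can be sketched in a few lines. This is a minimal illustration under stated assumptions, not Anthropic's tooling: the function name `find_source_maps` and the idea of running it against the directory that will be packed are assumptions made for this example.

```python
from pathlib import Path


def find_source_maps(package_dir: str) -> list[str]:
    """Return relative paths of all .map files found under package_dir."""
    root = Path(package_dir)
    return sorted(
        path.relative_to(root).as_posix()
        for path in root.rglob("*.map")
        if path.is_file()
    )
```

A release script could fail the build when this list is non-empty. npm itself offers complementary controls, such as the `files` field in `package.json` and `npm pack --dry-run` for listing what a package would contain.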

Gradient Labs gives every bank customer an AI account manager

info, news

industry

Mar 31, 2026

Gradient Labs has built an AI system that acts as a dedicated account manager for bank customers, handling complex issues like fraud and blocked payments by following strict procedures. The system uses OpenAI models (specifically GPT-5.4 mini and nano for production) and includes 15+ guardrail systems (safety checks running in parallel) to ensure conversations stay compliant and accurate, achieving 97% trajectory accuracy (following the correct procedure path from start to finish) compared to competitors at 88%.

Claude Code source code accidentally leaked in NPM package

high, news

security, privacy

GHSA-ghq9-vc6f-8qjf: TorchGeo Remote Code Execution Vulnerability

high, vulnerability

security

Mar 31, 2026

CVE-2024-49048

TorchGeo versions 0.4–0.6.0 had a remote code execution vulnerability: the `eval` function (a Python built-in that evaluates a string as code) was used in the model weight API, allowing attackers to run arbitrary commands on systems using the library. Any platform exposing TorchGeo's get_weight() or trainers functions to untrusted input was at risk.
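One common way to avoid `eval` for this kind of name resolution is an explicit lookup table. The sketch below illustrates that pattern only; it is not TorchGeo's actual fix, and `DemoWeights`, `_REGISTRY`, and `resolve_weight` are hypothetical names invented for this example.

```python
from enum import Enum


class DemoWeights(Enum):
    """Hypothetical weight enum; not one of TorchGeo's real classes."""
    SMALL = "demo-small.pth"
    LARGE = "demo-large.pth"


# Only names registered here can ever be resolved.
_REGISTRY = {"DemoWeights": DemoWeights}


def resolve_weight(name: str) -> Enum:
    """Resolve 'EnumName.MEMBER' by dictionary lookup; the string is never evaluated."""
    enum_name, sep, member = name.partition(".")
    if not sep or enum_name not in _REGISTRY:
        raise ValueError(f"unknown weight: {name!r}")
    try:
        return _REGISTRY[enum_name][member]
    except KeyError:
        raise ValueError(f"unknown weight: {name!r}") from None
```

Because the input is only ever used as a dictionary key and an enum member name, a string such as `__import__('os').system('id')` is rejected instead of executed.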

CVE-2026-5281: Google Dawn Use-After-Free Vulnerability

info, vulnerability

security

Mar 31, 2026

CVE-2026-5281 (Actively Exploited)

GHSA-g86v-f9qv-rh6m: OpenClaw SSRF guard misses four IPv6 special-use ranges

low, vulnerability

security

Mar 31, 2026

OpenClaw had a vulnerability in its SSRF guard (a security check that blocks requests to internal network addresses), which incorrectly classified certain IPv6 special-use ranges (reserved address groups in the newer internet protocol) as public. This allowed attackers to potentially access internal or non-routable addresses that should have been blocked.
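An address check of this kind can be sketched with Python's standard `ipaddress` module. This is an illustration of the general idea, not OpenClaw's code (which is TypeScript), and the summary does not say which four ranges were missed; the function name `is_public_address` is an assumption made for this example.

```python
import ipaddress


def is_public_address(host_ip: str) -> bool:
    """Return True only if host_ip is a globally routable unicast address."""
    addr = ipaddress.ip_address(host_ip)
    # Unwrap IPv4-mapped IPv6 (::ffff:a.b.c.d) so the embedded IPv4 address is judged.
    if isinstance(addr, ipaddress.IPv6Address) and addr.ipv4_mapped is not None:
        addr = addr.ipv4_mapped
    # is_global relies on Python's own table of special-use ranges;
    # multicast is excluded explicitly.
    return addr.is_global and not addr.is_multicast
```

Treating "global" as the thing to prove, instead of listing ranges to reject, means an address in a range the author forgot about is more likely to be refused than allowed. A real guard would also need to apply the check to every address a hostname resolves to.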

GHSA-m866-6qv5-p2fg: OpenClaw host-env blocklist missing `GIT_TEMPLATE_DIR` and `AWS_CONFIG_FILE` allows code execution via env override

medium, vulnerability

security

Mar 31, 2026

OpenClaw's host environment sanitization (a security check that removes dangerous settings before running code) was missing protections for two environment variables: `GIT_TEMPLATE_DIR` and `AWS_CONFIG_FILE`. An attacker could exploit this by approving a code execution request that redirects git or AWS tools to attacker-controlled files, allowing them to run untrusted code or steal credentials.
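The difference between a blocklist and an allowlist for environment overrides can be sketched as follows. This is a hypothetical Python illustration, not OpenClaw's implementation: the set contents and function names are assumptions, and only `GIT_TEMPLATE_DIR` and `AWS_CONFIG_FILE` come from the advisory summary.

```python
# Hypothetical blocklist; only the last two names come from the advisory summary.
BLOCKED_ENV = {"LD_PRELOAD", "GIT_TEMPLATE_DIR", "AWS_CONFIG_FILE"}

# Hypothetical allowlist of variables considered safe to override.
SAFE_ENV = {"PATH", "HOME", "LANG"}


def sanitize_with_blocklist(overrides: dict[str, str]) -> dict[str, str]:
    """Drop known-dangerous variables; anything not listed passes through."""
    return {k: v for k, v in overrides.items() if k not in BLOCKED_ENV}


def sanitize_with_allowlist(overrides: dict[str, str]) -> dict[str, str]:
    """Keep only variables explicitly considered safe; everything else is dropped."""
    return {k: v for k, v in overrides.items() if k in SAFE_ENV}
```

A blocklist fails open: any dangerous variable that was never added to it is passed through, which is the class of gap described above. An allowlist fails closed, at the cost of rejecting legitimate variables until they are added.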

GHSA-jccr-rrw2-vc8h: OpenClaw safeBins jq `$ENV` filter bypass allows environment variable disclosure

high, vulnerability

security

Mar 31, 2026

OpenClaw's jq safe-bin policy had a security flaw where it blocked direct `env` commands but still allowed access to environment variables through the `$ENV` filter, potentially letting approved commands leak sensitive environment data. This vulnerability affected versions up to 2026.3.24 in the file `src/infra/exec-safe-bin-semantics.ts` (the code that enforces safe command restrictions).
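jq exposes the environment in two ways, the `env` builtin and the `$ENV` variable, so a filter check has to cover both. The sketch below is a rough regex-based illustration in Python, not the logic in OpenClaw's `exec-safe-bin-semantics.ts`; the function name `jq_filter_reads_env` is hypothetical.

```python
import re

# Matches `env` as a standalone identifier (not `.env`, `$env`, or `environment`)
# or the `$ENV` variable.
_ENV_ACCESS = re.compile(r"(?<![\w$.])env\b|\$ENV\b")


def jq_filter_reads_env(jq_filter: str) -> bool:
    """Return True if the filter text appears to read environment variables."""
    return _ENV_ACCESS.search(jq_filter) is not None
```

The check is deliberately conservative: it also flags the word `env` inside a string literal. A text-level pattern like this can still be bypassed or over-match, so a production policy would more plausibly parse the filter or run jq with an emptied environment.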

Claude Code leak exposes a Tamagotchi-style ‘pet’ and an always-on agent

high, news

security, privacy
