AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

CVE-2025-29770: vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. The outlines library is one of the

mediumvulnerabilityLLM-Specific

security

Source: NVD/CVE DatabaseMarch 19, 2025CVE-2025-29770

Summary

vLLM, a system for running large language models efficiently, uses the outlines library to support structured output (guidance on what format the AI's answer should follow). The outlines library stores compiled grammar rules in a cache on the hard drive, which is turned on by default. A malicious user can send many requests with different output formats, filling up this cache and causing the system to run out of disk space, making it unavailable to others (a denial of service attack). This problem affects only the V0 engine version of vLLM.