CVE-2025-32444: vLLM is a high-throughput and memory-efficient inference and serving engine for LLMs. Versions starting from 0.6.5 and prior to 0.8.5, when using the Mooncake integration, are vulnerable to remote code execution via pickle-based serialization over unsecured ZeroMQ sockets.
Summary
vLLM (a system for running AI models efficiently) versions 0.6.5 through 0.8.4 contain a critical vulnerability in the Mooncake integration. Attackers can execute arbitrary code remotely because the integration uses pickle (an unsafe method of converting data into a transmittable format) over unencrypted ZeroMQ sockets (communication channels) that listen on all network interfaces, making them potentially reachable from the internet.
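To illustrate the root cause, here is a minimal sketch (not vLLM's or Mooncake's actual code) of why unpickling bytes received from an untrusted peer hands the sender code execution: pickle invokes an object's __reduce__ method during deserialization, so a crafted payload can run an arbitrary command on the receiver.

```python
# Minimal sketch of pickle's deserialization hazard. This is an
# illustration of the vulnerability class, not vLLM's actual code.
import pickle


class Malicious:
    def __reduce__(self):
        import os
        # On unpickling, pickle calls os.system with this argument,
        # i.e. the sender's chosen command runs on the receiver.
        return (os.system, ("echo arbitrary code executed",))


payload = pickle.dumps(Malicious())

# A receiver that does this with bytes read off the network -- as the
# vulnerable integration did over an unencrypted ZeroMQ socket bound
# to all interfaces -- executes the attacker's command:
pickle.loads(payload)  # prints "arbitrary code executed"
```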
Solution / Mitigation
Update to vLLM version 0.8.5 or later, which has patched this vulnerability.
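As a quick check, a short script can compare the installed package version against the patched release. This sketch assumes vLLM was installed via pip and that the packaging library is available (it ships with most Python environments); it is a convenience check, not an official tool.

```python
# Hedged sketch: verify the installed vLLM version is at or above the
# patched release (0.8.5). Assumes a pip-installed vllm package.
from importlib.metadata import version

from packaging.version import Version

installed = Version(version("vllm"))
if installed < Version("0.8.5"):
    print(f"vLLM {installed} is affected by CVE-2025-32444; upgrade to >= 0.8.5")
else:
    print(f"vLLM {installed} includes the fix for CVE-2025-32444")
```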
Vulnerability Details
CVSS score: 10.0 (Critical)
EPSS: 2.5%
Original source: https://nvd.nist.gov/vuln/detail/CVE-2025-32444
First tracked: February 15, 2026 at 08:44 PM