AI Sec Watch: A Security Intelligence Platform for AI Systems

Luu, T.J.

CVE-2024-5206: A sensitive data leakage vulnerability was identified in scikit-learn's TfidfVectorizer, specifically in versions up to

mediumvulnerability

securityprivacy

Source: NVD/CVE DatabaseJune 6, 2024CVE-2024-5206

Summary

A vulnerability in scikit-learn's TfidfVectorizer (a tool that converts text into numerical data for machine learning) stored all words from training data in an attribute called `stop_words_`, instead of just the necessary ones, potentially leaking sensitive information like passwords or keys. The vulnerability affected versions up to 1.4.1.post1 but the risk depends on what type of data is being processed.