The Atlantic created a searchable database of the music used to train AI
Summary
A reporter at The Atlantic discovered four publicly available datasets containing millions of songs (totaling between 100,000 and 12 million tracks each) that are being used to train AI models. These datasets have been downloaded thousands of times, and companies like Google and Stability have confirmed using them in their research, raising questions about how music is used in AI training without always crediting or compensating artists.
Classification
Affected Vendors
Related Issues
Original source: https://www.theverge.com/ai-artificial-intelligence/953183/the-atlantic-searchable-database-music-ai-training-data
First tracked: June 20, 2026 at 08:00 PM
Classified by LLM (prompt v3) · confidence: 80%