Podcast: Marketplace Tech with Molly Wood
Episode: For data-hungry tech companies, YouTube is a gold mine

Category: Technology
Duration: 00:11:41
Publish Date: 2024-07-30 10:06:48
Description:

Companies competing in the chatbot wars are using something known in the industry as “the Pile” to train their large language models. It’s a trove of open-source data made up of text scraped from all around the internet, including Wikipedia and the European Parliament. Annie Gilbertson, investigative reporter for Proof News, recently took a deep dive into the Pile and discovered something else: a dataset called “YouTube Subtitles.” Marketplace’s Lily Jamali spoke with Gilbertson about her investigation and how YouTube creators feel about their content being used without their consent.
