Search

Home > Marketplace Tech > For data-hungry tech companies, YouTube is a gold mine
Podcast: Marketplace Tech
Episode:

For data-hungry tech companies, YouTube is a gold mine

Category: Technology
Duration: 00:11:41
Publish Date: 2024-07-30 10:06:48
Description:

Companies competing in the chatbot wars are using something known in the industry as “the Pile” to train their large language models. It’s a trove of open-source data made up of text scraped from all around the internet, including Wikipedia and the European Parliament. Annie Gilbertson, investigative reporter for Proof News, recently took a deep dive into the Pile and discovered something else: a dataset called “YouTube Subtitles.” Marketplace’s Lily Jamali spoke with Gilbertson about her investigation and how YouTube creators feel about their content being used without their consent.

Total Play: 0

Some more Podcasts by American Public Media

1K+ Episodes
Marketplace     8