What do MrBeast, John Oliver and the Wall Street Journal have in common? The transcripts of their YouTube videos have been scraped to train the AI used by companies like Anthropic, Nvidia, Apple and Salesforce. An investigation from Wired and Proof News found that this dataset, which is called YouTube Subtitles, contains transcripts from over […]
© 2024 TechCrunch. All rights reserved. For personal use only.
TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia