{"id":102723,"date":"2024-07-18T21:56:31","date_gmt":"2024-07-18T18:56:31","guid":{"rendered":"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/"},"modified":"2024-07-18T21:56:31","modified_gmt":"2024-07-18T18:56:31","slug":"techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia","status":"publish","type":"post","link":"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/","title":{"rendered":"TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia"},"content":{"rendered":"<p>What do MrBeast, John Oliver and the Wall Street Journal have in common? The transcripts of their YouTube videos have been scraped to train the AI used by companies like Anthropic, Nvidia, Apple and Salesforce. An investigation from Wired and Proof News found that this dataset, which is called YouTube Subtitles, contains transcripts from over [\u2026]<br \/>\n\u00a9 2024 TechCrunch. All rights reserved. For personal use only.<br \/>\n<a href=\"https:\/\/techcrunch.com\/video\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/\">TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>What do MrBeast, John Oliver and the Wall Street Journal have in common? The transcripts of their YouTube videos have been scraped to train the AI used by companies like Anthropic, Nvidia, Apple and Salesforce. An investigation from Wired and Proof News found that this dataset, which is called YouTube Subtitles, contains transcripts from over [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[5871,10741,10193,10742,10740],"tags":[1380,9897,80],"class_list":["post-102723","post","type-post","status-publish","format-standard","hentry","category-ai","category-ai-training","category-media-entertainment","category-the-techcrunch-minute","category-youtubers","tag-ai","tag-apple","tag-wall-street-journal"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.3 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia - \u041d\u043e\u0432\u043e\u0441\u0442\u0438 \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0445 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0439<\/title>\n<meta name=\"description\" content=\"What do MrBeast, John Oliver and the Wall Street Journal have in common? The transcripts of their YouTube videos have been scraped to train the AI used by\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/\" \/>\n<meta property=\"og:locale\" content=\"ru_RU\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia - \u041d\u043e\u0432\u043e\u0441\u0442\u0438 \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0445 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0439\" \/>\n<meta property=\"og:description\" content=\"What do MrBeast, John Oliver and the Wall Street Journal have in common? The transcripts of their YouTube videos have been scraped to train the AI used by\" \/>\n<meta property=\"og:url\" content=\"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/\" \/>\n<meta property=\"og:site_name\" content=\"\u041d\u043e\u0432\u043e\u0441\u0442\u0438 \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0445 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0439\" \/>\n<meta property=\"article:published_time\" content=\"2024-07-18T18:56:31+00:00\" \/>\n<meta name=\"author\" content=\"Mobile news chief editor\" \/>\n<meta name=\"twitter:label1\" content=\"\u041d\u0430\u043f\u0438\u0441\u0430\u043d\u043e \u0430\u0432\u0442\u043e\u0440\u043e\u043c\" \/>\n\t<meta name=\"twitter:data1\" content=\"Mobile news chief editor\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\\\/\"},\"author\":{\"name\":\"Mobile news chief editor\",\"@id\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/#\\\/schema\\\/person\\\/659775c2c0130cf3d639e6e8c0aede94\"},\"headline\":\"TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia\",\"datePublished\":\"2024-07-18T18:56:31+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\\\/\"},\"wordCount\":93,\"keywords\":[\"AI\",\"Apple\",\"Wall Street Journal\"],\"articleSection\":[\"AI\",\"AI training\",\"Media &amp; Entertainment\",\"the techcrunch minute\",\"Youtubers\"],\"inLanguage\":\"ru-RU\"},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\\\/\",\"url\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\\\/\",\"name\":\"TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia - \u041d\u043e\u0432\u043e\u0441\u0442\u0438 \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0445 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0439\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/#website\"},\"datePublished\":\"2024-07-18T18:56:31+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/#\\\/schema\\\/person\\\/659775c2c0130cf3d639e6e8c0aede94\"},\"description\":\"What do MrBeast, John Oliver and the Wall Street Journal have in common? The transcripts of their YouTube videos have been scraped to train the AI used by\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\\\/#breadcrumb\"},\"inLanguage\":\"ru-RU\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/phonezone.ru\\\/news\\\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\\\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"\u0413\u043b\u0430\u0432\u043d\u043e\u0435 \u043c\u0435\u043d\u044e\",\"item\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/#website\",\"url\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/\",\"name\":\"\u041d\u043e\u0432\u043e\u0441\u0442\u0438 \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0445 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0439\",\"description\":\"\u041d\u043e\u0432\u043e\u0441\u0442\u043d\u0430\u044f \u043b\u0435\u043d\u0442\u0430: \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0435 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0438\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"ru-RU\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/#\\\/schema\\\/person\\\/659775c2c0130cf3d639e6e8c0aede94\",\"name\":\"Mobile news chief editor\",\"url\":\"https:\\\/\\\/phonezone.ru\\\/news\\\/author\\\/admin\\\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia - \u041d\u043e\u0432\u043e\u0441\u0442\u0438 \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0445 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0439","description":"What do MrBeast, John Oliver and the Wall Street Journal have in common? The transcripts of their YouTube videos have been scraped to train the AI used by","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/","og_locale":"ru_RU","og_type":"article","og_title":"TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia - \u041d\u043e\u0432\u043e\u0441\u0442\u0438 \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0445 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0439","og_description":"What do MrBeast, John Oliver and the Wall Street Journal have in common? The transcripts of their YouTube videos have been scraped to train the AI used by","og_url":"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/","og_site_name":"\u041d\u043e\u0432\u043e\u0441\u0442\u0438 \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0445 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0439","article_published_time":"2024-07-18T18:56:31+00:00","author":"Mobile news chief editor","twitter_misc":{"\u041d\u0430\u043f\u0438\u0441\u0430\u043d\u043e \u0430\u0432\u0442\u043e\u0440\u043e\u043c":"Mobile news chief editor"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/#article","isPartOf":{"@id":"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/"},"author":{"name":"Mobile news chief editor","@id":"https:\/\/phonezone.ru\/news\/#\/schema\/person\/659775c2c0130cf3d639e6e8c0aede94"},"headline":"TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia","datePublished":"2024-07-18T18:56:31+00:00","mainEntityOfPage":{"@id":"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/"},"wordCount":93,"keywords":["AI","Apple","Wall Street Journal"],"articleSection":["AI","AI training","Media &amp; Entertainment","the techcrunch minute","Youtubers"],"inLanguage":"ru-RU"},{"@type":"WebPage","@id":"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/","url":"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/","name":"TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia - \u041d\u043e\u0432\u043e\u0441\u0442\u0438 \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0445 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0439","isPartOf":{"@id":"https:\/\/phonezone.ru\/news\/#website"},"datePublished":"2024-07-18T18:56:31+00:00","author":{"@id":"https:\/\/phonezone.ru\/news\/#\/schema\/person\/659775c2c0130cf3d639e6e8c0aede94"},"description":"What do MrBeast, John Oliver and the Wall Street Journal have in common? The transcripts of their YouTube videos have been scraped to train the AI used by","breadcrumb":{"@id":"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/#breadcrumb"},"inLanguage":"ru-RU","potentialAction":[{"@type":"ReadAction","target":["https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/phonezone.ru\/news\/techcrunch-minute-over-100k-youtube-videos-have-been-scraped-to-train-ai-for-apple-nvidia\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"\u0413\u043b\u0430\u0432\u043d\u043e\u0435 \u043c\u0435\u043d\u044e","item":"https:\/\/phonezone.ru\/news\/"},{"@type":"ListItem","position":2,"name":"TechCrunch Minute: Over 100k YouTube videos have been scraped to train AI for Apple, Nvidia"}]},{"@type":"WebSite","@id":"https:\/\/phonezone.ru\/news\/#website","url":"https:\/\/phonezone.ru\/news\/","name":"\u041d\u043e\u0432\u043e\u0441\u0442\u0438 \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0445 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0439","description":"\u041d\u043e\u0432\u043e\u0441\u0442\u043d\u0430\u044f \u043b\u0435\u043d\u0442\u0430: \u043c\u043e\u0431\u0438\u043b\u044c\u043d\u044b\u0435 \u0442\u0435\u0445\u043d\u043e\u043b\u043e\u0433\u0438\u0438","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/phonezone.ru\/news\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"ru-RU"},{"@type":"Person","@id":"https:\/\/phonezone.ru\/news\/#\/schema\/person\/659775c2c0130cf3d639e6e8c0aede94","name":"Mobile news chief editor","url":"https:\/\/phonezone.ru\/news\/author\/admin\/"}]}},"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/phonezone.ru\/news\/wp-json\/wp\/v2\/posts\/102723","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/phonezone.ru\/news\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/phonezone.ru\/news\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/phonezone.ru\/news\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/phonezone.ru\/news\/wp-json\/wp\/v2\/comments?post=102723"}],"version-history":[{"count":0,"href":"https:\/\/phonezone.ru\/news\/wp-json\/wp\/v2\/posts\/102723\/revisions"}],"wp:attachment":[{"href":"https:\/\/phonezone.ru\/news\/wp-json\/wp\/v2\/media?parent=102723"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/phonezone.ru\/news\/wp-json\/wp\/v2\/categories?post=102723"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/phonezone.ru\/news\/wp-json\/wp\/v2\/tags?post=102723"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}