{"id":51578,"date":"2024-04-07T11:55:06","date_gmt":"2024-04-07T15:55:06","guid":{"rendered":"https:\/\/romanticany.com\/?p=51578"},"modified":"2024-04-07T11:55:06","modified_gmt":"2024-04-07T15:55:06","slug":"openai-transcribio-un-millon-de-horas-de-videos-de-youtube-para-entrenar-gpt-4-dice-new-york-times","status":"publish","type":"post","link":"https:\/\/romanticany.com\/?p=51578","title":{"rendered":"OpenAI transcribed a million hours of YouTube videos to train GPT-4, New York Times says"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\"><em>New York, Apr 6 (EFE).- OpenAI built a program to transcribe more than a million hours of YouTube videos in order to train the text-generation model GPT-4, its most advanced publicly available model, according to an exclusive report published this Saturday by The New York Times (NYT).<\/em><\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The newspaper says that OpenAI, a nonprofit company, developed a program named &#8216;Whisper&#8217; that extracted text from more than a million hours of video to obtain training data for language-generation models, known as LLMs.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Sources consulted by the NYT say the team behind Whisper included Greg Brockman, OpenAI&#8217;s president.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The company held an internal debate over whether extracting text from videos hosted on the Google-owned platform amounted to a violation of its terms of use.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">According to the article, OpenAI concluded in 2021 that it needed more training data and discussed whether to obtain it from YouTube, podcasts or audiobooks.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In a recent interview, 
YouTube chief executive Neal Mohan said that if OpenAI has used videos from the platform to train &#8216;Sora&#8217;, its model for generating realistic videos, it would be violating the platform&#8217;s terms of service.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cContent creators who come to YouTube have certain expectations, among them that the terms of service are upheld. Our terms allow certain content to be extracted, such as the title, the channel name or the creator&#8217;s name, to facilitate the open web,\u201d Mohan explained.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">\u201cDownloading transcripts or parts of videos is not allowed. That is a clear violation of our content terms,\u201d the executive added.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">OpenAI spokesperson Lindsay Held said, in a response to the exclusive obtained by The Verge, that the company creates \u201cunique\u201d datasets and uses \u201cnumerous publicly available sources and strikes agreements to obtain data that is not public\u201d.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Google transcribes YouTube videos to obtain text to feed its text-generation models, something that would violate the rights of the creators who upload their videos to the platform, according to sources consulted by the newspaper.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Rights over the content used to train artificial intelligence models are still not well defined, and competition to build the best models for generating realistic content is 
pushing the boundaries of copyright law.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Tech giant Meta, the creator of Facebook, debated last year whether to buy the publisher Simon &amp; Schuster to gain access to its long-form material, according to the contents of meetings among the company&#8217;s managers, lawyers and engineers to which the NYT had access.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">(c) Agencia EFE<\/p>\n","protected":false},"excerpt":{"rendered":"<p>New York, Apr 6 (EFE).- OpenAI built a program to transcribe more than a million hours of YouTube videos in order to train the text-generation model GPT-4, its most advanced publicly available model, according to an exclusive report published this Saturday by The New York Times (NYT). The newspaper says [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[16],"tags":[],"class_list":["post-51578","post","type-post","status-publish","format-standard","hentry","category-tecnologia"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v24.5 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>OpenAI transcribi\u00f3 un mill\u00f3n de horas de videos de Youtube para entrenar GPT-4, dice New York Times - Romantica NY<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/romanticany.com\/?p=51578\" \/>\n<meta property=\"og:locale\" content=\"es_ES\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"OpenAI transcribi\u00f3 un mill\u00f3n de horas de videos de Youtube para entrenar GPT-4, dice New York Times - Romantica NY\" \/>\n<meta property=\"og:description\" 
content=\"Nueva York, 6 abr (EFE).- OpenAI cre\u00f3 un programa para transcribir m\u00e1s de un mill\u00f3n de horas de videos de Youtube con el objetivo de entrenar el modelo de generaci\u00f3n de texto GPT-4, su modelo m\u00e1s avanzado abierto al p\u00fablico, seg\u00fan una exclusiva de The New York Times (NYT) publicada este s\u00e1bado. El diario asegura [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/romanticany.com\/?p=51578\" \/>\n<meta property=\"og:site_name\" content=\"Romantica NY\" \/>\n<meta property=\"article:published_time\" content=\"2024-04-07T15:55:06+00:00\" \/>\n<meta name=\"author\" content=\"Romantica NY\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Escrito por\" \/>\n\t<meta name=\"twitter:data1\" content=\"Romantica NY\" \/>\n\t<meta name=\"twitter:label2\" content=\"Tiempo de lectura\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 minutos\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/romanticany.com\/?p=51578#article\",\"isPartOf\":{\"@id\":\"https:\/\/romanticany.com\/?p=51578\"},\"author\":{\"name\":\"Romantica NY\",\"@id\":\"https:\/\/romanticany.com\/#\/schema\/person\/be3a3ed4b2ab0af84e4dbe5c3215474c\"},\"headline\":\"OpenAI transcribi\u00f3 un mill\u00f3n de horas de videos de Youtube para entrenar GPT-4, dice New York 
Times\",\"datePublished\":\"2024-04-07T15:55:06+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/romanticany.com\/?p=51578\"},\"wordCount\":530,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/romanticany.com\/#organization\"},\"articleSection\":[\"Tecnolog\u00eda\"],\"inLanguage\":\"es\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/romanticany.com\/?p=51578#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/romanticany.com\/?p=51578\",\"url\":\"https:\/\/romanticany.com\/?p=51578\",\"name\":\"OpenAI transcribi\u00f3 un mill\u00f3n de horas de videos de Youtube para entrenar GPT-4, dice New York Times - Romantica NY\",\"isPartOf\":{\"@id\":\"https:\/\/romanticany.com\/#website\"},\"datePublished\":\"2024-04-07T15:55:06+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/romanticany.com\/?p=51578#breadcrumb\"},\"inLanguage\":\"es\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/romanticany.com\/?p=51578\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/romanticany.com\/?p=51578#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Portada\",\"item\":\"https:\/\/romanticany.com\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"OpenAI transcribi\u00f3 un mill\u00f3n de horas de videos de Youtube para entrenar GPT-4, dice New York Times\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/romanticany.com\/#website\",\"url\":\"https:\/\/romanticany.com\/\",\"name\":\"Romantica NY\",\"description\":\"AHORA CON SU PERIODICO DIGITAL Y LA MUSICA MAS ROMANTICA EN LA GRAN MANZANA, 
USA\",\"publisher\":{\"@id\":\"https:\/\/romanticany.com\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/romanticany.com\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"es\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/romanticany.com\/#organization\",\"name\":\"Romantica NY\",\"url\":\"https:\/\/romanticany.com\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\/\/romanticany.com\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/romanticany.com\/wp-content\/uploads\/2025\/01\/Logo-Romantica-3.png\",\"contentUrl\":\"https:\/\/romanticany.com\/wp-content\/uploads\/2025\/01\/Logo-Romantica-3.png\",\"width\":1024,\"height\":1024,\"caption\":\"Romantica NY\"},\"image\":{\"@id\":\"https:\/\/romanticany.com\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/romanticany.com\/#\/schema\/person\/be3a3ed4b2ab0af84e4dbe5c3215474c\",\"name\":\"Romantica NY\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"es\",\"@id\":\"https:\/\/romanticany.com\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/romanticany.com\/wp-content\/uploads\/2025\/01\/cropped-Logo-Romantica-3-96x96.png\",\"contentUrl\":\"https:\/\/romanticany.com\/wp-content\/uploads\/2025\/01\/cropped-Logo-Romantica-3-96x96.png\",\"caption\":\"Romantica NY\"},\"sameAs\":[\"https:\/\/romanticany.com\"],\"url\":\"https:\/\/romanticany.com\/?author=1\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"OpenAI transcribi\u00f3 un mill\u00f3n de horas de videos de Youtube para entrenar GPT-4, dice New York Times - Romantica NY","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/romanticany.com\/?p=51578","og_locale":"es_ES","og_type":"article","og_title":"OpenAI transcribi\u00f3 un mill\u00f3n de horas de videos de Youtube para entrenar GPT-4, dice New York Times - Romantica NY","og_description":"Nueva York, 6 abr (EFE).- OpenAI cre\u00f3 un programa para transcribir m\u00e1s de un mill\u00f3n de horas de videos de Youtube con el objetivo de entrenar el modelo de generaci\u00f3n de texto GPT-4, su modelo m\u00e1s avanzado abierto al p\u00fablico, seg\u00fan una exclusiva de The New York Times (NYT) publicada este s\u00e1bado. El diario asegura [&hellip;]","og_url":"https:\/\/romanticany.com\/?p=51578","og_site_name":"Romantica NY","article_published_time":"2024-04-07T15:55:06+00:00","author":"Romantica NY","twitter_card":"summary_large_image","twitter_misc":{"Escrito por":"Romantica NY","Tiempo de lectura":"3 minutos"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/romanticany.com\/?p=51578#article","isPartOf":{"@id":"https:\/\/romanticany.com\/?p=51578"},"author":{"name":"Romantica NY","@id":"https:\/\/romanticany.com\/#\/schema\/person\/be3a3ed4b2ab0af84e4dbe5c3215474c"},"headline":"OpenAI transcribi\u00f3 un mill\u00f3n de horas de videos de Youtube para entrenar GPT-4, dice New York 
Times","datePublished":"2024-04-07T15:55:06+00:00","mainEntityOfPage":{"@id":"https:\/\/romanticany.com\/?p=51578"},"wordCount":530,"commentCount":0,"publisher":{"@id":"https:\/\/romanticany.com\/#organization"},"articleSection":["Tecnolog\u00eda"],"inLanguage":"es","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/romanticany.com\/?p=51578#respond"]}]},{"@type":"WebPage","@id":"https:\/\/romanticany.com\/?p=51578","url":"https:\/\/romanticany.com\/?p=51578","name":"OpenAI transcribi\u00f3 un mill\u00f3n de horas de videos de Youtube para entrenar GPT-4, dice New York Times - Romantica NY","isPartOf":{"@id":"https:\/\/romanticany.com\/#website"},"datePublished":"2024-04-07T15:55:06+00:00","breadcrumb":{"@id":"https:\/\/romanticany.com\/?p=51578#breadcrumb"},"inLanguage":"es","potentialAction":[{"@type":"ReadAction","target":["https:\/\/romanticany.com\/?p=51578"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/romanticany.com\/?p=51578#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Portada","item":"https:\/\/romanticany.com\/"},{"@type":"ListItem","position":2,"name":"OpenAI transcribi\u00f3 un mill\u00f3n de horas de videos de Youtube para entrenar GPT-4, dice New York Times"}]},{"@type":"WebSite","@id":"https:\/\/romanticany.com\/#website","url":"https:\/\/romanticany.com\/","name":"Romantica NY","description":"AHORA CON SU PERIODICO DIGITAL Y LA MUSICA MAS ROMANTICA EN LA GRAN MANZANA, USA","publisher":{"@id":"https:\/\/romanticany.com\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/romanticany.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"es"},{"@type":"Organization","@id":"https:\/\/romanticany.com\/#organization","name":"Romantica 
NY","url":"https:\/\/romanticany.com\/","logo":{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/romanticany.com\/#\/schema\/logo\/image\/","url":"https:\/\/romanticany.com\/wp-content\/uploads\/2025\/01\/Logo-Romantica-3.png","contentUrl":"https:\/\/romanticany.com\/wp-content\/uploads\/2025\/01\/Logo-Romantica-3.png","width":1024,"height":1024,"caption":"Romantica NY"},"image":{"@id":"https:\/\/romanticany.com\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/romanticany.com\/#\/schema\/person\/be3a3ed4b2ab0af84e4dbe5c3215474c","name":"Romantica NY","image":{"@type":"ImageObject","inLanguage":"es","@id":"https:\/\/romanticany.com\/#\/schema\/person\/image\/","url":"https:\/\/romanticany.com\/wp-content\/uploads\/2025\/01\/cropped-Logo-Romantica-3-96x96.png","contentUrl":"https:\/\/romanticany.com\/wp-content\/uploads\/2025\/01\/cropped-Logo-Romantica-3-96x96.png","caption":"Romantica NY"},"sameAs":["https:\/\/romanticany.com"],"url":"https:\/\/romanticany.com\/?author=1"}]}},"rttpg_featured_image_url":null,"rttpg_author":{"display_name":"Romantica NY","author_link":"https:\/\/romanticany.com\/?author=1"},"rttpg_comment":0,"rttpg_category":"<a href=\"https:\/\/romanticany.com\/?cat=16\" rel=\"category\">Tecnolog\u00eda<\/a>","rttpg_excerpt":"Nueva York, 6 abr (EFE).- OpenAI cre\u00f3 un programa para transcribir m\u00e1s de un mill\u00f3n de horas de videos de Youtube con el objetivo de entrenar el modelo de generaci\u00f3n de texto GPT-4, su modelo m\u00e1s avanzado abierto al p\u00fablico, seg\u00fan una exclusiva de The New York Times (NYT) publicada este s\u00e1bado. 
El diario asegura&hellip;","_links":{"self":[{"href":"https:\/\/romanticany.com\/index.php?rest_route=\/wp\/v2\/posts\/51578","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/romanticany.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/romanticany.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/romanticany.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/romanticany.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=51578"}],"version-history":[{"count":0,"href":"https:\/\/romanticany.com\/index.php?rest_route=\/wp\/v2\/posts\/51578\/revisions"}],"wp:attachment":[{"href":"https:\/\/romanticany.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=51578"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/romanticany.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=51578"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/romanticany.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=51578"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}