{"id":146002,"date":"2024-10-09T17:07:37","date_gmt":"2024-10-09T15:07:37","guid":{"rendered":"https:\/\/sftarticles.wpenginepowered.com\/es\/?p=341898"},"modified":"2025-06-12T10:34:06","modified_gmt":"2025-06-12T09:34:06","slug":"tiktok-entraine-son-ia-avec-les-donnees-de-tout-internet-plus-rapidement-quopenai-elle-meme","status":"publish","type":"post","link":"https:\/\/cms-articles.softonic.io\/fr\/tiktok-entraine-son-ia-avec-les-donnees-de-tout-internet-plus-rapidement-quopenai-elle-meme\/","title":{"rendered":"TikTok entra\u00eene son IA avec les donn\u00e9es de tout Internet, plus rapidement qu&#8217;OpenAI elle-m\u00eame"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\"><strong>ByteDance<\/strong>, la soci\u00e9t\u00e9 chinoise derri\u00e8re <strong>TikTok<\/strong>, semble acc\u00e9l\u00e9rer sa collecte de donn\u00e9es sur Internet pour entra\u00eener ses mod\u00e8les d&#8217;intelligence artificielle g\u00e9n\u00e9rative. Depuis avril, l&#8217;entreprise a d\u00e9ploy\u00e9 un bot de scraping web appel\u00e9 <strong>Bytespider<\/strong>, selon une \u00e9tude de <strong>Kasada<\/strong>, une soci\u00e9t\u00e9 sp\u00e9cialis\u00e9e dans la gestion des bots, \u00e0 laquelle <strong><a href=\"https:\/\/fortune.com\/2024\/10\/03\/bytedance-tiktok-bytespider-scraper-bot\/\" target=\"_blank\" rel=\"noopener nofollow\" title=\"\">Fortune<\/a><\/strong> a eu acc\u00e8s. Ce bot est l&#8217;un des plus agressifs sur Internet, <strong>d\u00e9passant de loin le rythme de scraping des autres grandes entreprises technologiques telles que Google, Meta, Amazon, OpenAI et Anthropic<\/strong>.<\/p>\n\n\n<div class=\"sc-card-starred-link\">\r\n  <div class=\"sc-card-starred-link__body\">\r\n    <div class=\"sc-card-starred-link__row clearfix\">\r\n      <div class=\"sc-card-starred-link__col-logo\">\r\n        <img decoding=\"async\" class=\"sc-card-starred-link__img\" src=\"https:\/\/articles-img.sftcdn.net\/sft\/articles\/auto-mapping-folder\/sites\/3\/2024\/09\/newsletter.png\" width=\"100px\" height=\"100px\">\r\n      <\/div>\r\n      <div class=\"sc-card-starred-link__col-title\">\r\n        <p class=\"sc-card-starred-link__title\">Abonnez-vous \u00e0 la newsletter de Softonic et recevez les derni\u00e8res nouveaut\u00e9s en technologie, jeux vid\u00e9o et offres directement dans votre bo\u00eete<\/p>\r\n        <a class=\"sc-card-starred-link__button\" href=\"https:\/\/softonic-fr.beehiiv.com\/subscribe\" target=\"_blank\" rel=\"noopener noreferrer sponsored\">Abonnez-vous (c'est GRATUIT) \u25ba<\/a>\r\n      <\/div>\r\n    <\/div>\r\n    <a class=\"sc-card-starred-link__link\" href=\"https:\/\/softonic-fr.beehiiv.com\/subscribe\" target=\"_blank\" rel=\"noopener noreferrer sponsored\"><\/a>\r\n  <\/div>\r\n<\/div>\n\n\n\n<p class=\"wp-block-paragraph\">D&#8217;apr\u00e8s <strong>Sam Crowther<\/strong>, PDG de Kasada, Bytespider extrait des donn\u00e9es \u00e0 <strong>un rythme 25 fois plus rapide que GPTbot, le bot scraper d&#8217;OpenAI<\/strong>. De plus, il d\u00e9passe de 3 000 fois la vitesse de <strong>ClaudeBot<\/strong>, utilis\u00e9 par Anthropic. Au cours des six derni\u00e8res semaines, <strong>l&#8217;activit\u00e9 de scraping de Bytespider a atteint des pics significatifs<\/strong>, ce qui montre que ByteDance intensifie ses efforts pour rattraper son retard dans la course \u00e0 l&#8217;IA g\u00e9n\u00e9rative.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">L&#8217;\u00e9tude de Kasada a r\u00e9v\u00e9l\u00e9 que <strong>Bytespider ne respecte pas robots.txt<\/strong>, un standard d&#8217;exclusion qui indique aux bots de ne pas extraire de donn\u00e9es de certaines pages web. Ce scraping agressif survient dans un contexte d\u00e9licat pour ByteDance, <strong>alors que TikTok pourrait \u00eatre interdit aux \u00c9tats-Unis<\/strong>. En avril, le pr\u00e9sident am\u00e9ricain <strong>Joe Biden<\/strong> a sign\u00e9 une loi <strong>obligeant la soci\u00e9t\u00e9 \u00e0 vendre l&#8217;application pour des raisons de s\u00e9curit\u00e9 nationale ou \u00e0 la fermer<\/strong>.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" src=\"https:\/\/articles-img.sftcdn.net\/auto-mapping-folder\/sites\/2\/2024\/08\/notailawsuit.jpg\" alt=\"\" class=\"wp-image-338503\" \/><\/figure>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\">La collecte de donn\u00e9es sur Internet n&#8217;est pas nouvelle, mais la mont\u00e9e en puissance de l&#8217;IA g\u00e9n\u00e9rative a raviv\u00e9 la controverse, notamment en ce qui concerne <strong>la violation des droits d&#8217;auteur<\/strong>. Les entreprises technologiques utilisent des bots pour copier des donn\u00e9es et entra\u00eener leurs mod\u00e8les, ce qui inqui\u00e8te et irrite les artistes et cr\u00e9ateurs de contenu \u00e0 travers le monde, qui <strong>voient leurs \u0153uvres utilis\u00e9es sans permission, sans scrupule et sans compensation<\/strong>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Des rumeurs circulent selon lesquelles ByteDance d\u00e9veloppe un nouveau mod\u00e8le d&#8217;IA, qui pourrait \u00eatre int\u00e9gr\u00e9 \u00e0 la fonction de recherche de TikTok. Cet outil a \u00e9t\u00e9 mis \u00e0 jour ces derniers mois pour permettre aux utilisateurs de rechercher en temps r\u00e9el les mots-cl\u00e9s les plus populaires, ce qui pourrait aider les annonceurs \u00e0 am\u00e9liorer la visibilit\u00e9 de leurs publicit\u00e9s.<\/p>\n\n\n<div class=\"sc-card-program\">\r\n  <div class=\"sc-card-program__body\">\r\n    <div class=\"sc-card-program__row clearfix\">\r\n      <div class=\"sc-card-program__col-logo\">\r\n        <img decoding=\"async\" class=\"sc-card-program__img\" src=\"https:\/\/images.sftcdn.net\/images\/t_app-icon-s\/p\/25d01a14-3485-42e7-a253-e5050ac51dd1\/1217029392\/tik-tok-Download-Tiktok.jpg\" alt=\"TikTok\" width=\"100px\" height=\"100px\">\r\n      <\/div>\r\n      <div class=\"sc-card-program__col-title\">\r\n        <span class=\"sc-card-program__title\">TikTok<\/span>\r\n        <a class=\"sc-card-program__button sc-card-program-internal\" href=\"https:\/\/tik-tok.fr.softonic.com\/android\" target=\"_self\" rel=\"noopener noreferrer\">T\u00c9L\u00c9CHARGER<\/a>\r\n      <\/div>\r\n      <div class=\"sc-card-program__col-rating\">\r\n        \r\n      <\/div>\r\n    <\/div>\r\n    <div class=\"sc-card-program__row\">\r\n      <span class=\"sc-card-program__description\"><\/span>\r\n    <\/div>\r\n    <div class=\"sc-card-program__row\">\r\n      <img decoding=\"async\" class=\"sc-card-program__bigpic\" src=\"\">\r\n    <\/div>\r\n    <a class=\"sc-card-program__link track-link sc-card-program-internal\" href=\"https:\/\/tik-tok.fr.softonic.com\/android\" target=\"_self\" rel=\"noopener noreferrer\"><\/a>\r\n  <\/div>\r\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>ByteDance, la soci\u00e9t\u00e9 chinoise derri\u00e8re TikTok, semble acc\u00e9l\u00e9rer sa collecte de donn\u00e9es sur Internet pour entra\u00eener ses mod\u00e8les d&#8217;intelligence artificielle g\u00e9n\u00e9rative. Depuis avril, l&#8217;entreprise a d\u00e9ploy\u00e9 un bot de crawl web appel\u00e9 Bytespider, selon une \u00e9tude de Kasada, une soci\u00e9t\u00e9 sp\u00e9cialis\u00e9e dans la gestion des bots, \u00e0 laquelle Fortune a eu acc\u00e8s. Ce bot est l&#8217;un des plus agressifs sur Internet, d\u00e9passant de loin le rythme de scraping d&#8217;autres grandes entreprises technologiques comme Google, Meta, Amazon, OpenAI et Anthropic. Selon Sam Crowther, PDG de Kasada, Bytespider racle des donn\u00e9es \u00e0 un rythme 25 fois sup\u00e9rieur \u00e0 celui de [&hellip;]<\/p>\n","protected":false},"author":9256,"featured_media":146006,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":"","wpcf-pageviews":1},"categories":[16761],"tags":[17130],"usertag":[],"vertical":[],"content-category":[17507],"class_list":["post-146002","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news","tag-app-subdomain-redirectiontiktok","content-category-ia"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/posts\/146002","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/users\/9256"}],"replies":[{"embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/comments?post=146002"}],"version-history":[{"count":1,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/posts\/146002\/revisions"}],"predecessor-version":[{"id":159879,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/posts\/146002\/revisions\/159879"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/media\/146006"}],"wp:attachment":[{"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/media?parent=146002"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/categories?post=146002"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/tags?post=146002"},{"taxonomy":"usertag","embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/usertag?post=146002"},{"taxonomy":"vertical","embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/vertical?post=146002"},{"taxonomy":"content-category","embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/content-category?post=146002"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}