{"id":138730,"date":"2024-05-23T14:42:14","date_gmt":"2024-05-23T12:42:14","guid":{"rendered":"https:\/\/sftarticles.wpenginepowered.com\/es\/?p=332919"},"modified":"2025-06-12T11:02:59","modified_gmt":"2025-06-12T10:02:59","slug":"de-llama-a-cameleon-voici-la-nouvelle-ia-multimodale-de-meta","status":"publish","type":"post","link":"https:\/\/cms-articles.softonic.io\/fr\/de-llama-a-cameleon-voici-la-nouvelle-ia-multimodale-de-meta\/","title":{"rendered":"From &#8220;Llama&#8221; to &#8220;Chameleon&#8221;: meet Meta&#8217;s new multimodal AI"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\"><strong>Meta<\/strong> has introduced <strong>Chameleon<\/strong>, its new multimodal artificial intelligence, designed to meet growing competition in generative AI. Chameleon stands out for its <strong>native multimodality<\/strong>, seamlessly integrating <strong>components from different modalities such as images, text and code<\/strong>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">According to the <a href=\"https:\/\/arxiv.org\/abs\/2405.09818v1\" target=\"_blank\" rel=\"noopener nofollow\" title=\"\">paper<\/a> published by the research team, Chameleon&#8217;s architecture enables <strong>strong performance on tasks that require a deep understanding<\/strong> of both visual and textual information. Chameleon&#8217;s notable capabilities include image captioning and visual question answering (VQA), and it remains <strong>competitive on text-only tasks<\/strong>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Traditionally, multimodal models are built using a process called <strong>&#8220;late fusion&#8221;<\/strong>, in which the AI system processes the different modalities separately and then merges the encodings for inference. 
However, <strong>this approach limits the models&#8217; ability to integrate information seamlessly across modalities<\/strong>.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Chameleon adopts an <strong>&#8220;early-fusion token-based mixed-modal&#8221;<\/strong> architecture, meaning it was designed from the outset to <strong>learn from an interleaved mix of images, text and other modalities<\/strong>. This approach converts images into discrete tokens, much as language models handle words, and uses a unified vocabulary of text, code and image tokens.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Compared with similar models such as <strong>Google Gemini<\/strong>, Chameleon offers <strong>more cohesive integration of modalities when generating content<\/strong>, because it does not require modality-specific components.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img decoding=\"async\" src=\"https:\/\/articles-img.sftcdn.net\/sft\/articles\/auto-mapping-folder\/sites\/2\/2023\/02\/Meta-nueva-IA-1024x576.jpg\" alt=\"\" class=\"wp-image-273855\" \/><\/figure>\n<\/div>\n\n\n<p class=\"wp-block-paragraph\">Chameleon was trained in two stages on a large dataset comprising <strong>4.4 trillion tokens of text<\/strong>, image-text pairs and interleaved sequences of text and images. 
The Chameleon models, with <strong>7 billion and 34 billion parameters<\/strong>, were trained for more than 5 million GPU hours on 80 GB Nvidia A100 GPUs.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Experiments showed that Chameleon can perform a wide range of text-only and multimodal tasks with leading performance. In VQA and image-captioning tests, <strong>Chameleon-34B outperformed models such as Flamingo, IDEFICS and Llava-1.5<\/strong>. In addition, <strong>it matched the performance of other models with fewer in-context training examples<\/strong> and with smaller model sizes.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Despite the complexity of multimodality, Chameleon remains competitive on text-only tasks, <strong>comparable to models such as Mixtral 8x7B and Gemini-Pro in logical reasoning and reading comprehension tests<\/strong>. 
The researchers note that Chameleon unlocks new multimodal reasoning and generation capabilities, producing results that users prefer for documents that interleave text and images.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Meta has introduced Chameleon, its new multimodal artificial intelligence, designed to meet growing competition in generative AI. 
Chameleon stands out for its native multimodality, seamlessly integrating components from different modalities such as images, text and code. According to the paper published by the research team &hellip; <a href=\"https:\/\/cms-articles.softonic.io\/fr\/de-llama-a-cameleon-voici-la-nouvelle-ia-multimodale-de-meta\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;From &#8216;Llama&#8217; to &#8216;Chameleon&#8217;: meet Meta&#8217;s new multimodal AI&#8221;<\/span><\/a><\/p>\n","protected":false},"author":9256,"featured_media":138731,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":"","wpcf-pageviews":1},"categories":[16761],"tags":[16783],"usertag":[],"vertical":[],"content-category":[],"class_list":["post-138730","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news","tag-app-subdomain-redirectionfacebook"],"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/posts\/138730","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/users\/9256"}],"replies":[{"embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/comments?post=138730"}],"version-history":[{"count":1,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/posts\/138730\/revisions"}],"predecessor-version":[{"id":161450,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/posts\/138730\/revisions\/161450"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/media\/138731"}],"wp:attachment":[{"href":"
https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/media?parent=138730"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/categories?post=138730"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/tags?post=138730"},{"taxonomy":"usertag","embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/usertag?post=138730"},{"taxonomy":"vertical","embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/vertical?post=138730"},{"taxonomy":"content-category","embeddable":true,"href":"https:\/\/cms-articles.softonic.io\/fr\/wp-json\/wp\/v2\/content-category?post=138730"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}