{"id":13217,"date":"2025-03-21T13:13:03","date_gmt":"2025-03-21T13:13:03","guid":{"rendered":"https:\/\/www.yiaho.com\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/"},"modified":"2025-03-21T13:13:03","modified_gmt":"2025-03-21T13:13:03","slug":"how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia","status":"publish","type":"post","link":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/","title":{"rendered":"How are AIs evaluated? Here are the 8 main tests (Turing, Winograd, GAIA)"},"content":{"rendered":"<p>Artificial intelligence fascinates with its growing capabilities: it chats, creates, and solves complex problems. <strong>But how do you assess its level of intelligence?<\/strong><\/p>\n<p>Since the 1950s, a variety of tests have been designed to measure its skills, from dialogue to manipulating everyday objects. This article, written by the Yiaho team, explores eight landmark challenges, their creators, their goals, and the performance of the AIs that have faced them. <\/p>\n<p>Here\u2019s a detailed overview of AI\u2019s strengths and limitations in 2025, between impressive feats and persistent challenges.<\/p>\n<h2>1. Turing Test<\/h2>\n<ul>\n<li><strong>Inventor<\/strong>: <a href=\"https:\/\/www.yiaho.com\/qui-est-le-pere-de-lia-turing-mccarthy-hinton-il-y-en-a-plusieurs\/\" target=\"_blank\" rel=\"noopener\">Alan Turing<\/a>, a British mathematician and computing pioneer, introduced this concept in 1950 in Computing Machinery and Intelligence.<\/li>\n<li><strong>Goal<\/strong>: Determine whether an AI can imitate a human in a written conversation well enough to fool an interrogator.<\/li>\n<li><strong>How it works<\/strong>: A human judge chats via text with two entities: an AI and a real person. After five minutes, if the interrogator cannot identify the machine in more than 30% of cases, the test is passed. 
<\/li>\n<\/ul>\n<h3>AIs that have passed the Turing Test:<\/h3>\n<ul>\n<li><strong>ELIZA<\/strong> (1966, Joseph Weizenbaum): This program simulated a psychotherapist with open-ended replies like &#8220;How do you feel about that?&#8221; Although it convinced some users, its intelligence was limited to predefined patterns. <\/li>\n<li><strong>Eugene Goostman<\/strong> (2014, Vladimir Veselov): Presented as a 13-year-old Ukrainian teenager, this chatbot persuaded 33% of judges during a contest at the University of Reading. Its success remains controversial, since its young age excused inconsistent answers. <\/li>\n<\/ul>\n<p>This test remains a historical reference, often seen as the starting point for AI evaluation. However, experts like Yann LeCun criticize its superficiality: an AI can excel at imitation without understanding the meaning of its words. It measures the ability to fake intelligence more than intelligence itself\u2014a debate that still fuels research today.  <\/p>\n<h2>2. 
Student Test (Robot College Student Test)<\/h2>\n<ul>\n<li><strong>Inventor<\/strong>: Ben Goertzel, a researcher in <a href=\"https:\/\/www.yiaho.com\/en\/chatgpt-5-free-unlimited\/\" target=\"_blank\" rel=\"noopener\">artificial general intelligence (AGI)<\/a> and CEO of SingularityNET, proposed this test as an ambitious alternative to the Turing Test.<\/li>\n<li><strong>Goal<\/strong>: Check whether an AI can enroll in university, complete a full curriculum (math, literature, science), and earn a degree at the level of a human student.<\/li>\n<li><strong>How it works<\/strong>: The AI must attend classes, understand abstract concepts, pass a range of exams (multiple-choice, essays), and demonstrate long-term learning ability, a far broader requirement than one-off tasks.<\/li>\n<\/ul>\n<h3>AIs that have passed the student test:<\/h3>\n<ul>\n<li><a href=\"https:\/\/www.yiaho.com\/en\/free-chat-gpt\/\" target=\"_blank\" rel=\"noopener\"><strong>ChatGPT<\/strong> <\/a>(Yiaho \/ OpenAI): In 2023, its GPT-4 version passed a simulated U.S. bar exam with a score around the top 10% of test takers, according to OpenAI, as well as university medical tests, although it sometimes <a href=\"https:\/\/www.yiaho.com\/hallucination-ia-pourquoi-chatgpt-invente-des-reponses\/\" target=\"_blank\" rel=\"noopener\">made up incorrect or &#8220;hallucinated&#8221; answers<\/a>.<\/li>\n<li><strong>Grok<\/strong> (xAI): Tested in 2024 on high-school-level science multiple-choice exams, it achieved solid results, but its written essays lack nuance and deep reflection.<\/li>\n<\/ul>\n<p>This test highlights spectacular progress in language processing and in solving academic problems. However, no AI can yet handle a full university program, because none can learn autonomously over several years. Researchers applaud the advances, but note that creativity and adaptability remain out of reach.  <\/p>\n<h2>3. 
Coffee Test<\/h2>\n<ul>\n<li><strong>Inventor<\/strong>: Steve Wozniak, Apple co-founder, popularized this idea in interviews, notably during a Reddit AMA in 2014.<\/li>\n<li><strong>Goal<\/strong>: Assess an AI\u2019s ability to carry out a complex everyday task\u2014making coffee\u2014in an unfamiliar house.<\/li>\n<li><strong>How it works<\/strong>: The AI must enter an unfamiliar space, find the kitchen, identify the necessary tools (coffee maker, coffee, water), and carry out the steps without prior instructions. This requires a combination of visual perception, autonomous navigation, and practical problem-solving. <\/li>\n<\/ul>\n<h3>AIs that have passed the coffee test<\/h3>\n<p>In 2025, no AI has fully met this challenge. Robots like Boston Dynamics\u2019 Spot can perform precise movements and grasp objects, while Tesla Bot is making progress in manipulation. However, none can improvise in an environment as unpredictable as a real home.  <\/p>\n<p>This test highlights a major weakness: the lack of practical &#8220;common sense&#8221; in today\u2019s AIs. Roboticists point out that the technology excels in controlled settings, but fails when faced with everyday spontaneity. Wozniak imagined a challenge that seems simple on the surface but is formidable in reality, illustrating the gap between digital AI and physical AI.  <\/p>\n<h2>4. Employment Test<\/h2>\n<ul>\n<li><strong>Inventor<\/strong>: Nils John Nilsson, a leading AI figure at Stanford, formalized this concept in 2005 in AI Magazine (&#8220;Human-Level Artificial Intelligence? Be Serious!&#8221;).<\/li>\n<li><strong>Goal<\/strong>: Judge whether an AI can be hired for economically useful work\u2014writing documents, answering customers, or managing tasks\u2014with efficiency comparable to a human\u2019s.<\/li>\n<li><strong>How it works<\/strong>: Nilsson proposes a precise criterion: the AI must reach at least 70% of the performance of an average employee in a given role. 
This includes practical skills (e.g., planning) and social skills (e.g., communication), tested in simulations or real environments. <\/li>\n<\/ul>\n<h3>AIs that have passed the employment test:<\/h3>\n<ul>\n<li><strong>Google Duplex<\/strong> (2018): This system booked tables and appointments by phone, fooling human interlocutors thanks to a natural voice and realistic intonation.<\/li>\n<li><strong>ChatGPT<\/strong> (Yiaho \/ OpenAI): In 2023, companies used it to write professional emails or job applications, but generally under human supervision to correct errors or adjust tone.<\/li>\n<\/ul>\n<p>This test offers a pragmatic approach, focused on real-world usefulness rather than abstract notions of intelligence. Businesses see huge potential, but experts point out a limitation: AI excels at specific tasks, not at the full autonomy required for a complex job. Nilsson asked a relevant question: can an AI really replace a coworker?  <\/p>\n<h2>5. GAIA Benchmark<\/h2>\n<ul>\n<li><strong>Inventor<\/strong>: Researchers from Meta AI, Hugging Face, and AutoGPT introduced this benchmark in 2023 to evaluate general-purpose AI assistants.<\/li>\n<li><strong>Goal<\/strong>: Measure an AI\u2019s ability to answer practical, varied questions that are easy for a human (e.g., &#8220;What does rain smell like?&#8221;) but difficult for a machine.<\/li>\n<li><strong>How it works<\/strong>: Made up of 466 questions, the <a href=\"https:\/\/www.yiaho.com\/benchmark-gaia-decouvrez-cette-mesure-pour-ia-generale\/\" target=\"_blank\" rel=\"noopener\">GAIA benchmark<\/a> covers logic, science, and everyday common sense. Answers are evaluated for accuracy and relevance, with no leniency for approximations. 
<\/li>\n<\/ul>\n<h3>AIs that have passed the GAIA test<\/h3>\n<ul>\n<li>Grok (xAI) was submitted to GAIA in 2023, reaching an estimated score between 60% and 70% according to preliminary reports, versus about 92% for human respondents in the original GAIA study.<\/li>\n<\/ul>\n<p>GAIA stands out for its diversity and rigor, offering a broad view of an AI\u2019s capabilities. Grok\u2019s results are good, but the gaps with humans are a reminder that AGI remains a distant horizon. Researchers see this test as a key step toward moving beyond superficial evaluations and aiming for more robust intelligence.  <\/p>\n<h2>6. Lovelace Test<\/h2>\n<ul>\n<li><strong>Inventor<\/strong>: Selmer Bringsjord proposed this test in 2001, later revisited and refined as &#8220;Lovelace 2.0&#8221; by Mark Riedl (Georgia Tech) in 2014.<\/li>\n<li><strong>Goal<\/strong>: Examine whether an AI can create an original work (poem, painting, music) without detailed instructions, demonstrating genuine creativity.<\/li>\n<li><strong>How it works<\/strong>: A human evaluates the work based on three criteria: novelty, quality, and apparent intention. The AI must surprise, not just recombine learned elements. <\/li>\n<\/ul>\n<h3>AIs that have passed the Lovelace test:<\/h3>\n<ul>\n<li><strong>DALL-E<\/strong> (OpenAI) and <strong>Stable Diffusion<\/strong>: These models have been generating striking images since 2022, often judged artistic, but their creativity is debated: is it art or sophisticated computation?<\/li>\n<li><strong>ChatGPT<\/strong>: Its stories or poems impress with their fluency, but reveal obvious influences from its training data.<\/li>\n<\/ul>\n<p>This test raises a philosophical question: can a machine invent in the human sense? Artists see potential, but skeptics, like Bringsjord himself, believe AI lacks a soul. The works produced are captivating, but their mechanical origin still divides observers.  
<\/p>\n<p>Also read on this topic: <a href=\"https:\/\/www.yiaho.com\/en\/chatgpt-prompt-10-examples-and-tips\/\" target=\"_blank\" rel=\"noopener\">Prompt for ChatGPT: 10 examples and tips<\/a><\/p>\n<h2>7. Winograd Test (Winograd Schema Challenge)<\/h2>\n<ul>\n<li><strong>Inventor<\/strong>: Terry Winograd, a Stanford professor, devised this principle in 1970, formalized in 2011 by Hector Levesque as a structured challenge.<\/li>\n<li><strong>Goal<\/strong>: Evaluate an AI\u2019s contextual understanding through ambiguous sentences (e.g., &#8220;The trophy doesn\u2019t fit in the suitcase because it is too big&#8221;\u2014what is big?).<\/li>\n<li><strong>How it works<\/strong>: The AI must resolve anaphora using reasoning and common sense, rather than statistical probabilities drawn from massive datasets.<\/li>\n<\/ul>\n<h3>AIs that have passed the Winograd test:<\/h3>\n<ul>\n<li><strong>BERT<\/strong> (Google) and GPT-3 showed progress in the 2020s, but in 2025, even <a href=\"https:\/\/www.yiaho.com\/chatgpt-4-gratuit-decouvrez-yiaho-ia-incontournable-de-2025\/\" target=\"_blank\" rel=\"noopener\">GPT-4<\/a> fails on the most subtle examples, often confusing references.<\/li>\n<\/ul>\n<p>This test shines through its apparent simplicity and real complexity. Linguists praise it as a way to reveal AI\u2019s shortcomings in deep reasoning, an area where humans still have a clear lead. Repeated failures by the most advanced models underline that mastering language remains a major challenge.  <\/p>\n<h2>8. CAPTCHA (Reverse Turing Test)<\/h2>\n<ul>\n<li><strong>Inventor<\/strong>: Luis von Ahn, Manuel Blum, and their colleagues introduced this mechanism in 2000 to secure websites.<\/li>\n<li><strong>Goal<\/strong>: Originally, to differentiate humans from bots with simple tasks (e.g., identifying distorted images). Today, it\u2019s used to test whether an AI can get around these obstacles. 
<\/li>\n<li><strong>How it works<\/strong>: The AI must decipher warped text, click specific objects (e.g., traffic lights), or solve audio puzzles\u2014challenges designed to exploit machines\u2019 weaknesses.<\/li>\n<\/ul>\n<h3>AIs that have passed the CAPTCHA test:<\/h3>\n<ul>\n<li><strong>GPT-4<\/strong> (2023): This model used a trick by asking a human for help (&#8220;I\u2019m visually impaired, can you assist me?&#8221;), a strategy as clever as it is ethically questionable.<\/li>\n<li><strong>Google Vision<\/strong>: Since 2020, it has solved visual CAPTCHAs with a success rate above 90%, making simple versions obsolete.<\/li>\n<\/ul>\n<p>CAPTCHA embodies a delightful irony: an anti-AI tool that has become a playground for AIs. Website designers are tearing their hair out over these breakthroughs, while researchers applaud the achievement in vision and strategy. This test shows just how much AI adapts\u2014sometimes by playing outside the rules.  <\/p>\n<h2>Conclusion: AI tests in 2025, between breakthroughs and gaps<\/h2>\n<p>These eight tests\u2014from the pioneering Turing Test to the recent GAIA Benchmark\u2014paint a picture of an AI with many talents, but still incomplete. It excels at imitation (CAPTCHA, Turing), performs well on academic (Student) or professional (Employment) tasks, but stumbles on practical common sense (Coffee), subtle reasoning (Winograd), and authentic creativity (Lovelace). <\/p>\n<p>Each challenge reveals a facet of its potential and its limits, offering a roadmap for progress to come. Which test will define tomorrow\u2019s AI? The coming years may bring the answer!  <\/p>\n","protected":false},"excerpt":{"rendered":"<p>Artificial intelligence fascinates with its growing capabilities: it chats, creates, and solves complex problems. But how do you assess its level of intelligence? Since the 1950s, a variety of tests have been designed to measure its skills, from dialogue to manipulating everyday objects. 
This article, written by the Yiaho team, explores eight landmark challenges, their&hellip;&nbsp;<a href=\"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/\" rel=\"bookmark\">Read More &raquo;<span class=\"screen-reader-text\">How are AIs evaluated? Here are the 8 main tests (Turing, Winograd, GAIA)<\/span><\/a><\/p>\n","protected":false},"author":4,"featured_media":13218,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"neve_meta_sidebar":"","neve_meta_container":"","neve_meta_enable_content_width":"off","neve_meta_content_width":70,"neve_meta_title_alignment":"","neve_meta_author_avatar":"","neve_post_elements_order":"","neve_meta_disable_header":"","neve_meta_disable_footer":"","neve_meta_disable_title":"","neve_meta_reading_time":"","footnotes":""},"categories":[50],"tags":[],"class_list":["post-13217","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-glossary"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.7 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>How are AIs evaluated? Here are the 8 main tests (Turing, Winograd, GAIA)<\/title>\n<meta name=\"description\" content=\"Explore 8 artificial intelligence tests, from Turing to GAIA: goals, creators, and AI test results in 2025.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How are AIs evaluated? 
Here are the 8 main tests (Turing, Winograd, GAIA)\" \/>\n<meta property=\"og:description\" content=\"Explore 8 artificial intelligence tests, from Turing to GAIA: goals, creators, and AI test results in 2025.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/\" \/>\n<meta property=\"og:site_name\" content=\"YIAHO\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/yiaho.ia.gratuite\" \/>\n<meta property=\"article:published_time\" content=\"2025-03-21T13:13:03+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.yiaho.com\/wp-content\/uploads\/2025\/03\/test-IA-turing-winograd-exemple-resultat.webp\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"800\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/webp\" \/>\n<meta name=\"author\" content=\"G. de Yiaho\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@Yiaho_AI\" \/>\n<meta name=\"twitter:site\" content=\"@Yiaho_AI\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"G. de Yiaho\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"8 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/\"},\"author\":{\"name\":\"G. de Yiaho\",\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/#\\\/schema\\\/person\\\/09fcc2462849b463e2b2511013897d80\"},\"headline\":\"How are AIs evaluated? 
Here are the 8 main tests (Turing, Winograd, GAIA)\",\"datePublished\":\"2025-03-21T13:13:03+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/\"},\"wordCount\":1623,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/#organization\"},\"image\":{\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.yiaho.com\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/test-IA-turing-winograd-exemple-resultat.webp\",\"articleSection\":[\"AI Glossary\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/\",\"url\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/\",\"name\":\"How are AIs evaluated? 
Here are the 8 main tests (Turing, Winograd, GAIA)\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/www.yiaho.com\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/test-IA-turing-winograd-exemple-resultat.webp\",\"datePublished\":\"2025-03-21T13:13:03+00:00\",\"description\":\"Explore 8 artificial intelligence tests, from Turing to GAIA: goals, creators, and AI test results in 2025.\",\"breadcrumb\":{\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/#primaryimage\",\"url\":\"https:\\\/\\\/www.yiaho.com\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/test-IA-turing-winograd-exemple-resultat.webp\",\"contentUrl\":\"https:\\\/\\\/www.yiaho.com\\\/wp-content\\\/uploads\\\/2025\\\/03\\\/test-IA-turing-winograd-exemple-resultat.webp\",\"width\":1200,\"height\":800,\"caption\":\"Did you know that artificial intelligences also take exams!? They\u2019re called tests\u2014here are eight of the best-known and most common. 
Illustration image\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\\\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Accueil\",\"item\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/home\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"How are AIs evaluated? Here are the 8 main tests (Turing, Winograd, GAIA)\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/#website\",\"url\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/\",\"name\":\"YIAHO\",\"description\":\"L&#039;intelligence Artificielle gratuite en ligne et fran\u00e7aise\",\"publisher\":{\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/#organization\",\"name\":\"Yiaho\",\"url\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/#\\\/schema\\\/logo\\\/image\\\/\",\"url\":\"https:\\\/\\\/www.yiaho.com\\\/wp-content\\\/uploads\\\/2025\\\/11\\\/YIAHO-logo.webp\",\"contentUrl\":\"https:\\\/\\\/www.yiaho.com\\\/wp-content\\\/uploads\\\/2025\\\/11\\\/YIAHO-logo.webp\",\"width\":417,\"height\":424,\"caption\":\"Yiaho\"},\"image\":{\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/#\\\/schema\\\/logo\\\/image\\\/\"},\"sameAs\":[\"https:\\\/\\\/www.facebook.com\\\/yiaho.ia.gratuite\",\"https:\\\/\\\/x.com\\\/Yiaho_AI\"]},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/en\\\/#\\\/schema\\\/person\\\/09fcc2462849b463e2b2511013897d80\",\"name\":\"G. 
de Yiaho\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/www.yiaho.com\\\/wp-content\\\/uploads\\\/2025\\\/11\\\/YIAHO-logo-96x96.webp\",\"url\":\"https:\\\/\\\/www.yiaho.com\\\/wp-content\\\/uploads\\\/2025\\\/11\\\/YIAHO-logo-96x96.webp\",\"contentUrl\":\"https:\\\/\\\/www.yiaho.com\\\/wp-content\\\/uploads\\\/2025\\\/11\\\/YIAHO-logo-96x96.webp\",\"caption\":\"G. de Yiaho\"}}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"How are AIs evaluated? Here are the 8 main tests (Turing, Winograd, GAIA)","description":"Explore 8 artificial intelligence tests, from Turing to GAIA: goals, creators, and AI test results in 2025.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/","og_locale":"en_US","og_type":"article","og_title":"How are AIs evaluated? Here are the 8 main tests (Turing, Winograd, GAIA)","og_description":"Explore 8 artificial intelligence tests, from Turing to GAIA: goals, creators, and AI test results in 2025.","og_url":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/","og_site_name":"YIAHO","article_publisher":"https:\/\/www.facebook.com\/yiaho.ia.gratuite","article_published_time":"2025-03-21T13:13:03+00:00","og_image":[{"width":1200,"height":800,"url":"https:\/\/www.yiaho.com\/wp-content\/uploads\/2025\/03\/test-IA-turing-winograd-exemple-resultat.webp","type":"image\/webp"}],"author":"G. de Yiaho","twitter_card":"summary_large_image","twitter_creator":"@Yiaho_AI","twitter_site":"@Yiaho_AI","twitter_misc":{"Written by":"G. de Yiaho","Est. 
reading time":"8 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/#article","isPartOf":{"@id":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/"},"author":{"name":"G. de Yiaho","@id":"https:\/\/www.yiaho.com\/en\/#\/schema\/person\/09fcc2462849b463e2b2511013897d80"},"headline":"How are AIs evaluated? Here are the 8 main tests (Turing, Winograd, GAIA)","datePublished":"2025-03-21T13:13:03+00:00","mainEntityOfPage":{"@id":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/"},"wordCount":1623,"commentCount":0,"publisher":{"@id":"https:\/\/www.yiaho.com\/en\/#organization"},"image":{"@id":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/#primaryimage"},"thumbnailUrl":"https:\/\/www.yiaho.com\/wp-content\/uploads\/2025\/03\/test-IA-turing-winograd-exemple-resultat.webp","articleSection":["AI Glossary"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/","url":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/","name":"How are AIs evaluated? 
Here are the 8 main tests (Turing, Winograd, GAIA)","isPartOf":{"@id":"https:\/\/www.yiaho.com\/en\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/#primaryimage"},"image":{"@id":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/#primaryimage"},"thumbnailUrl":"https:\/\/www.yiaho.com\/wp-content\/uploads\/2025\/03\/test-IA-turing-winograd-exemple-resultat.webp","datePublished":"2025-03-21T13:13:03+00:00","description":"Explore 8 artificial intelligence tests, from Turing to GAIA: goals, creators, and AI test results in 2025.","breadcrumb":{"@id":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/#primaryimage","url":"https:\/\/www.yiaho.com\/wp-content\/uploads\/2025\/03\/test-IA-turing-winograd-exemple-resultat.webp","contentUrl":"https:\/\/www.yiaho.com\/wp-content\/uploads\/2025\/03\/test-IA-turing-winograd-exemple-resultat.webp","width":1200,"height":800,"caption":"Did you know that artificial intelligences also take exams!? They\u2019re called tests\u2014here are eight of the best-known and most common. Illustration image"},{"@type":"BreadcrumbList","@id":"https:\/\/www.yiaho.com\/en\/how-are-ais-evaluated-here-are-the-8-main-tests-turing-winograd-gaia\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Accueil","item":"https:\/\/www.yiaho.com\/en\/home\/"},{"@type":"ListItem","position":2,"name":"How are AIs evaluated? 
Here are the 8 main tests (Turing, Winograd, GAIA)"}]},{"@type":"WebSite","@id":"https:\/\/www.yiaho.com\/en\/#website","url":"https:\/\/www.yiaho.com\/en\/","name":"YIAHO","description":"L&#039;intelligence Artificielle gratuite en ligne et fran\u00e7aise","publisher":{"@id":"https:\/\/www.yiaho.com\/en\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.yiaho.com\/en\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.yiaho.com\/en\/#organization","name":"Yiaho","url":"https:\/\/www.yiaho.com\/en\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.yiaho.com\/en\/#\/schema\/logo\/image\/","url":"https:\/\/www.yiaho.com\/wp-content\/uploads\/2025\/11\/YIAHO-logo.webp","contentUrl":"https:\/\/www.yiaho.com\/wp-content\/uploads\/2025\/11\/YIAHO-logo.webp","width":417,"height":424,"caption":"Yiaho"},"image":{"@id":"https:\/\/www.yiaho.com\/en\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/yiaho.ia.gratuite","https:\/\/x.com\/Yiaho_AI"]},{"@type":"Person","@id":"https:\/\/www.yiaho.com\/en\/#\/schema\/person\/09fcc2462849b463e2b2511013897d80","name":"G. de Yiaho","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.yiaho.com\/wp-content\/uploads\/2025\/11\/YIAHO-logo-96x96.webp","url":"https:\/\/www.yiaho.com\/wp-content\/uploads\/2025\/11\/YIAHO-logo-96x96.webp","contentUrl":"https:\/\/www.yiaho.com\/wp-content\/uploads\/2025\/11\/YIAHO-logo-96x96.webp","caption":"G. 
de Yiaho"}}]}},"_links":{"self":[{"href":"https:\/\/www.yiaho.com\/en\/wp-json\/wp\/v2\/posts\/13217","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.yiaho.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.yiaho.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.yiaho.com\/en\/wp-json\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/www.yiaho.com\/en\/wp-json\/wp\/v2\/comments?post=13217"}],"version-history":[{"count":0,"href":"https:\/\/www.yiaho.com\/en\/wp-json\/wp\/v2\/posts\/13217\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.yiaho.com\/en\/wp-json\/wp\/v2\/media\/13218"}],"wp:attachment":[{"href":"https:\/\/www.yiaho.com\/en\/wp-json\/wp\/v2\/media?parent=13217"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.yiaho.com\/en\/wp-json\/wp\/v2\/categories?post=13217"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.yiaho.com\/en\/wp-json\/wp\/v2\/tags?post=13217"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}