The limits of language: AI models still struggle to match humans’ understanding of simple text

Publié le 18 February 2025 à 01h07
modifié le 18 February 2025 à 01h08

AI models struggle to match human understanding of simple texts. *Recent studies reveal notable gaps* in their effectiveness at interpreting the underlying meaning of sentences. *Although these systems are designed to process information*, their ability to capture nuances remains limited. Results indicate that human linguistic comprehension significantly surpasses that of algorithms. *This gap highlights fundamental issues* regarding the integration of AI in contexts beyond simple queries.

Results of the international study

A study conducted by a team of researchers from Rovira i Virgili University (URV) recently shed light on the performances of seven artificial intelligence (AI) models in terms of linguistic comprehension. Although these models have seen success in specific tasks, their effectiveness in understanding simple texts remains insufficient compared to human performance.

Measurement of linguistic comprehension

In this research, scientists subjected forty questions using basic grammatical structures and commonly used verbs to seven AI models. These models include Bard, ChatGPT-3.5, ChatGPT-4, Falcon, Gemini, Llama2, and Mixtral. Simultaneously, a group of four hundred native English speakers responded to the same questions, allowing for a comprehensive comparison of the results.

Disparities in accuracy between AI and humans

The analysis revealed a significant difference in the accuracy of responses. Humans achieved an average accuracy of 89%, vastly outpacing the best AI model, ChatGPT-4, which obtained 83%. Far behind, the other models did not exceed 70% success. These results show that the ability of models to handle complex tasks does not guarantee mastery of simpler tasks.

Nature of large language models

Large language models (LLMs) are neural networks that produce texts based on user queries. Their strength lies in tasks such as generating responses or translation, but a fundamental weakness plagues them: their approach relies on the exploitation of statistical patterns, rather than a true understanding of language. This observation was made by Vittoria Dentella, a researcher at URV: “LLMs do not actually understand language; they simply exploit statistical patterns in their training data.”

Consequences of the lack of understanding

Language models struggle noticeably to provide coherent responses, particularly when faced with repeated questions. During the study, human response consistency was at 87%, while for AI models, it ranged between 66% and 83%. This inability to maintain consistency during inquiries underscores the current fundamental limitations of text comprehension technologies.

Lack of contextual understanding

LLMs fail to interpret meaning in the same way a human does. Human understanding revolves around semantic, grammatical, pragmatic, and contextual factors. The models operate by identifying similarities with previously analyzed examples without truly grasping their implicit meaning. Thus, their apparent humanity is nothing more than an illusion based on predictive algorithms.

Problematic applications of LLMs

This research raises questions about the reliability of AI for critical applications. Dentella’s findings warn that the ability to perform complex tasks does not equate to proficiency in simple interactions, which often require a real understanding of language. These limitations compromise the use of AI in fields where accuracy and understanding are paramount.

Conclusion of the study

The need to improve models in terms of linguistic comprehension is evident. Researchers stress the importance of continuing to advance in this field to enhance the effectiveness and reliability of the underlying artificial intelligences in various applications. Awareness of the limitations of these technologies is the first step toward their future improvement.

Frequently asked questions about the limits of language in artificial intelligence

What are the main challenges facing AI models in understanding human language?
AI models, despite their advances, struggle to comprehend the complexity of linguistic nuances, cultural contexts, and semantic subtleties, which prevents them from competing with human understanding in reading simple texts.
Why don’t language models like ChatGPT understand the meaning of words like a human does?
These models only recognize statistical patterns in training data rather than interpreting the meaning behind these words. They lack the consciousness or experience that would enable them to grasp language contextually as a human would.
How do the performance of AI models compare to humans in simple text comprehension tests?
Studies show that humans achieve an average accuracy of 89%, while even the top-performing AI models usually do not exceed 83% accuracy in similar tests.
Can language models be used for critical applications despite their limitations?
No, their inability to understand the meaning and context of language raises concerns about their reliability for applications where true comprehension is crucial.
What types of tasks do AI models perform better than humans, despite their lack of understanding?
AI models excel in tasks based on fixed rules, such as text generation, machine translation, or simple problem-solving, where creativity or interpretation is not required.
What does “inconsistency of responses” mean in the context of AI models?
It refers to the variation in the accuracy of AI responses when subjected to repeated questions. Humans maintain their responses more consistently compared to AI models.
Are AI models able to process texts containing irony or metaphors?
No, language models still struggle to understand complex linguistic structures like irony or metaphors, limiting their ability to grasp implicit meanings.
What recent research exists on the limits of AI’s understanding of language?
Recent research conducted by international teams, including those led by Rovira i Virgili University, analyzes these limitations and highlights that AI does not reach the level of linguistic understanding of humans.
In what ways are real humans more effective than AI models in understanding simple texts?
Humans use a combination of semantic, grammatical, and contextual knowledge, allowing them to interpret and respond to texts in a more intuitive and appropriate manner.
What efforts are underway to improve the linguistic comprehension of AI models?
Research continues to explore approaches such as teaching contextual understanding or integrating new neural network architectures to enhance their capacity to grasp meaning.

actu.iaNon classéThe limits of language: AI models still struggle to match humans' understanding...

Jim Cramer believes that DeepSeek is no longer a threat to the tech giants, while AI stocks continue to...

découvrez l'analyse de jim cramer sur deepseek et son impact sur les géants de la technologie. alors que les actions d'intelligence artificielle continuent de croître, explorez les implications de cette évolution sur le marché.

Comet: the innovative browser from Perplexity that is revolutionizing AI, emails, and online commerce

découvrez comet, le navigateur innovant de perplexity qui transforme votre expérience en ligne grâce à l'intelligence artificielle. optimisez vos emails, améliorez vos interactions et révolutionnez votre commerce en ligne avec des fonctionnalités avancées.

Transforming digital into an asset of agility and efficiency for engineering and construction companies

découvrez comment tirer parti du digital pour optimiser l'agilité et l'efficacité au sein des sociétés d'ingénierie et de construction. transformez les défis en opportunités grâce à des solutions innovantes et adaptées.

Midjourney unveils its AI-powered video generator, now accessible to everyone

découvrez le nouvel outil révolutionnaire de midjourney : un générateur de vidéos par intelligence artificielle accessible à tous. transformez vos idées en contenus visuels époustouflants en quelques clics grâce à cette innovation qui redéfinit la création vidéo. explorez maintenant les possibilités infinies offertes par la technologie !
découvrez l'évolution fascinante de l'intelligence artificielle, de la simple calculatrice à chatgpt, à travers une odyssée chaotique où les frontières de la technologie deviennent de plus en plus floues. plongez dans les enjeux, les défis et les innovations qui transforment notre compréhension et notre utilisation de l'ia.

Liam Booth-Smith, a pair curator, joins an AI company he met during his time at Downing Street

découvrez comment liam booth-smith, un pair conservateur, a intégré une entreprise d'intelligence artificielle après avoir été en contact avec elle lors de son séjour à downing street. plongez dans les implications de cette collaboration entre politique et innovation technologique.