ChatGPT: a language generator that prioritizes examples and memories like humans do

Publié le 24 June 2025 à 06h07
modifié le 24 June 2025 à 06h08

ChatGPT reinvents language generation. This technological advancement reveals a functioning similar to that of the human mind. Research shows that artificial intelligence is not limited to grammar rules; it relies on accumulated examples and memories, materializing a new analogical approach.

Grounded in in-depth analyses, this discovery challenges traditional perceptions of language model learning. The thesis that data strictly derives from learned rules collapses in the face of the evidence of analogical reasoning, inseparable from human experience. In this light, it is essential to understand the profound implications of this dynamic on the development of artificial intelligence.

Central Thesis of the Study

A recent study led by scientists from the University of Oxford and the Allen Institute for AI revealed that large language models, such as ChatGPT, generalize linguistic patterns analogously to humans, relying on examples rather than strict grammar rules. This study questions the prevailing idea that these models primarily learn by inferring rules from their training data.

Experimenting with Innovative Adjectives

The researchers examined human judgments compared to the predictions of GPT-J, an open-source language model. They employed a common English word formation scheme, converting adjectives into nouns via the suffixes “-ness” and “-ity”. For example, “happy” becomes “happiness”. This experiment included the creation of 200 fictitious adjectives, such as “cormasive”.

Analogy and Memorization

Results showed that GPT-J uses analogical reasoning, relying on similarities with real words encountered in its training data. Instead of applying rules, it generates responses based on analogies. For example, “friquish” is converted into “friquishness” because this suffix resembles words like “selfish”, while for “cormasive”, the influences come from pairs of known words.

Influence of Occurrences in Training Data

The study also highlighted the impact of word form occurrences in the training data. The responses of the LLM were examined across nearly 50,000 English adjectives. The model’s predictions coincided with the statistical patterns of the training data, showing impressive accuracy. The LLM appeared to have formed a memory of each encountered word example during training.

Differences Between Humans and the Language Model

Humans possessed a rich mental dictionary, integrating all meaningful word forms without being limited to their frequency. They recognize that “friquish” and “cormasive” are not English words. To handle these potential neologisms, they operate with analogical generalizations based on their known word bank.

Characteristics of LLMs

LLMs, in contrast, generate their responses by directly drawing on specific instances of words from their training sets without creating unified entries in a mental dictionary. The approach of these models is more rigid, focusing on the repetition of examples rather than on abstraction.

Implications for the Future of AI

Senior author, Janet Pierrehumbert, stated that while LLMs can respond with virtuosity, they lack human abstraction. This limitation likely explains the need for a massive amount of data for language learning, far more than what a human requires.

Collaboration Between Disciplines

Co-author, Dr. Valentin Hofman, emphasized the importance of synergy between linguistics and AI. The results provide an in-depth view of linguistic generation by LLMs and will facilitate advances towards a robust, efficient, and explainable AI.

This project also involved researchers from prestigious institutions such as LMU Munich and Carnegie Mellon University.

For recent developments on technological advancements in AI, check related articles on Google Gemini, NVIDIA, LLM training, artificial intelligence, and AI tools.

Frequently Asked Questions About ChatGPT: A Language Generator That Prioritizes Examples and Memories Like Humans

How does ChatGPT generate sentences similar to those of a human?
ChatGPT uses analogies based on memorized examples, rather than strictly following grammar rules. This allows it to produce sentences by relying on similarities with words it has encountered in its training data.

What methods does ChatGPT use to understand unknown words?
When ChatGPT encounters unknown words, it relies on its knowledge base by comparing these words to similar examples it has memorized, which helps it determine their correct form in a sentence.

Why are examples more important than rules for ChatGPT?
Examples allow ChatGPT to learn in a more intuitive and adaptive manner, like a human does. This enables it to better generate words and phrases, while being limited by its need for accessing varied and numerous data.

How do word frequencies affect ChatGPT’s responses?
The words and expressions that ChatGPT encounters most frequently in its training data will have a stronger influence on its responses. This means it is more likely to create sentences with those words than those it has seen less often.

What is the difference between how humans and ChatGPT form analogies?
Humans create analogies from a mental base of meaningful words, while ChatGPT generates analogies based directly on specific examples from its training dataset, without forming a unified mental dictionary.

Can ChatGPT answer questions on subjects it has never encountered?
While ChatGPT can address new topics, its ability to provide relevant answers depends heavily on its understanding based on the examples that have been provided during its training.

Can ChatGPT’s language generation performance be improved?
Yes, ChatGPT’s performance can be enhanced by incorporating additional and diverse training data, which would allow it to better analyze and generate responses using analogical techniques.

actu.iaNon classéChatGPT: a language generator that prioritizes examples and memories like humans do

The last bastion of erotic deepfakes is extinguished, just a week after the adoption of a law supported by...

découvrez comment le dernier bastion des deepfakes érotiques a été affecté par la récente législation soutenue par melania trump, entraînant sa disparition rapide. explorez les implications de cette loi sur le monde numérique et la lutte contre l'exploitation en ligne.

Revolutionizing healthcare through data-driven innovation: an essential guide

découvrez comment l'innovation guidée par les données transforme les soins de santé. ce guide essentiel vous offre des insights pratiques sur l'intégration de la technologie et des données pour améliorer la qualité des soins et optimiser les processus médicaux.

Trump declares that Catholics ‘worshipped’ a misleading image of him as pope

dans une déclaration controversée, trump affirme que les catholiques ont 'adoré' une représentation trompeuse de lui en tant que pape, suscitant des réactions vives. découvrez les implications de ses propos sur la perception des croyances religieuses et l'image publique.

Google AMIE: a virtual doctor learns to interpret medical images

découvrez google amie, un médecin virtuel révolutionnaire qui utilise l'intelligence artificielle pour analyser et interpréter les images médicales. apprenez comment cette technologie innovante aide les professionnels de santé à poser des diagnostics plus précis et efficaces, tout en améliorant la prise en charge des patients.

OpenAI maintains its commitment to non-profit status in response to criticism and to Elon Musk

découvrez comment openai renforce son engagement à but non lucratif face aux critiques, y compris celles d'elon musk, et explorez l'impact de cette décision sur l'innovation et la recherche en intelligence artificielle.

Sam Altman: OpenAI aims to preserve its nonprofit spirit during its restructuring

découvrez comment sam altman et openai s'engagent à maintenir l'esprit nonprofit de l'organisation malgré sa restructuration. une analyse des enjeux éthiques et des objectifs futurs d'openai pour continuer à innover tout en restant fidèle à sa mission originale.