Day 12 of OpenAI’s revelations: launch of o3, an innovation in AI and a step towards AGI

Publié le 20 February 2025 à 16h17
modifié le 20 February 2025 à 16h17

The shockwave generated by the presentation of o3, the new model from OpenAI, transcends mere technological advancement. This innovation in artificial intelligence embodies not just progress, but a significant step towards AGI. The record-breaking performances achieved by o3 are coupled with revolutionary elements that redefine industry standards.
*The Chileans demonstrate an overwhelming superiority over known benchmarks,* while offering unprecedented flexibility in its use. *The integration of advanced algorithms* propels this tool to the forefront of contemporary requirements, thereby establishing new prospects for the future.
The growing anticipation surrounding this announcement marks a turning point in the evaluation of intelligent systems’ capabilities.

Launch of the o3 model

OpenAI recently announced, during its Shipmas event, the launch of o3, successor to the reasoning model o1. This model, as a “frontier model,” aims to establish new standards for innovation in artificial intelligence. The performances of this new model are particularly remarkable, achieving a score of 87.5% on the ARC AGI benchmark, thus surpassing the average human score of 85%.

Performance and features of o3

The advancements made by o3 are impressive. This model recorded a score of 71.7% on verified Sweetbench, an increase of 20% compared to its predecessor, o1. On the other hand, o3 also excelled in the complex challenge of Frontier Math from Epic AI, with over 25% success, marking a significant turning point in solving advanced mathematical problems.

Optimization with o3 Mini

OpenAI also introduced o3 Mini, an optimized version that offers performance comparable to o1, while costing less and reducing latency. o3 Mini includes three levels of thinking time: low, medium, and high, allowing users to tailor the artificial intelligence to their specific needs.

Safety testing program

Currently, the new models are not accessible to the general public. OpenAI has implemented a safety testing program for researchers, open until January 10. The goal is to ensure that these new innovations meet the necessary safety criteria before deployment.

Improvements in safety

OpenAI has introduced a new safety technique called “deliberative alignment”. This method leverages the reasoning capabilities of the models to more effectively identify potentially problematic requests, thereby promoting responsible use of AI.

Context of OpenAI’s announcements

From December 5 to 20, OpenAI has rolled out a series of announcements and demonstrations, presenting new innovations. This included not only the launch of o3 but also significant improvements made to various applications and services of OpenAI, thus impacting various aspects of artificial intelligence. These revelations reinforce OpenAI’s manifest desire to achieve the ambitious goal of AGI (Artificial General Intelligence).

Frequently asked questions

What is the o3 model announced by OpenAI?
The o3 model is the latest reasoning model developed by OpenAI, which sets new performance records on several artificial intelligence benchmarks, marking progress towards artificial general intelligence (AGI).
What are the main features of the o3 model?
The o3 model enhances reasoning ability, achieving 87.5% on the ARC AGI benchmark and 71.7% on Sweetbench. Additionally, it includes o3 Mini, an optimized version that delivers similar performance to o3 at a reduced cost and latency.
When will the o3 model be available to the public?
Although it is not yet publicly available, OpenAI has announced a safety testing program open to researchers until January 10, 2024, with a launch planned for o3 Mini at the end of January 2024 and o3 shortly thereafter.
What security advantages does o3 offer?
OpenAI introduces a new safety technique called “deliberative alignment,” which utilizes the model’s reasoning capabilities to better detect problematic prompts and enhance overall user safety.
How does o3 differ from OpenAI’s previous models?
O3 sets records on key benchmarks and offers improved reasoning features, surpassing the results of the previous model o1, and represents a significant step towards AGI.
What types of tests have been conducted with the o3 model?
The o3 model has undergone evaluations on benchmarks such as ARC AGI, Sweetbench, and Frontier Math from Epic AI, demonstrating superior performances compared to previous reference standards.
How do o3 and o3 Mini support the needs of developers and researchers?
Both models offer adjustable thinking time levels, allowing customization based on requirements, with o3 Mini being specifically designed to reduce costs and latency while maintaining high performance levels.

actu.iaNon classéDay 12 of OpenAI's revelations: launch of o3, an innovation in AI...

Shocked passersby by an AI advertising panel that is a bit too sincere

des passants ont été surpris en découvrant un panneau publicitaire généré par l’ia, dont le message étonnamment honnête a suscité de nombreuses réactions. découvrez les détails de cette campagne originale qui n’a laissé personne indifférent.

Apple begins shipping a flagship product made in Texas

apple débute l’expédition de son produit phare fabriqué au texas, renforçant sa présence industrielle américaine. découvrez comment cette initiative soutient l’innovation locale et la production nationale.
plongez dans les coulisses du fameux vol au louvre grâce au témoignage captivant du photographe derrière le cliché viral. entre analyse à la sherlock holmes et usage de l'intelligence artificielle, découvrez les secrets de cette image qui a fait le tour du web.

An innovative company in search of employees with clear and transparent values

rejoignez une entreprise innovante qui recherche des employés partageant des valeurs claires et transparentes. participez à une équipe engagée où intégrité, authenticité et esprit d'innovation sont au cœur de chaque projet !

Microsoft Edge: the browser transformed by Copilot Mode, an AI at your service for navigation!

découvrez comment le mode copilot de microsoft edge révolutionne votre expérience de navigation grâce à l’intelligence artificielle : conseils personnalisés, assistance instantanée et navigation optimisée au quotidien !

The European Union: A cautious regulation in the face of American Big Tech giants

découvrez comment l'union européenne impose une régulation stricte et réfléchie aux grandes entreprises technologiques américaines, afin de protéger les consommateurs et d’assurer une concurrence équitable sur le marché numérique.