AWS strengthens its generative AI tools at re:Invent 2024: from inference to training

Published on 21 February 2025 at 08:09
Updated on 21 February 2025 at 08:09

At re:Invent 2024, AWS expanded its generative AI capabilities across the full path from inference to training. New tools and partnerships with startups target lower costs and better operational efficiency for businesses building AI applications in the cloud.

New Features for Generative AI with AWS

On the inaugural day of the re:Invent 2024 conference, Swami Sivasubramanian, Vice President of AI and Data at AWS, unveiled innovative features aimed at enriching Amazon’s generative AI tools. These new options not only aim to simplify the model training process but also to optimize inference while reducing associated operational costs.

Enhancements on Amazon Bedrock

Amazon Bedrock, the generative AI inference platform, gains several new features. The first is prompt caching, which lets users store and reuse the processing already done for prompts that recur across requests. The approach, closely modeled on what competitors already offer, substantially reduces costs and improves response times.
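The caching idea can be sketched in a few lines of Python. This is a toy illustration, not Bedrock's API: the `PrefixCache` class, its `submit` method, and counting tokens by splitting on whitespace are all assumptions made for the example. It shows one common form of the technique, in which a long shared prompt prefix is processed once and only the new part of each later request counts as fresh work.

```python
import hashlib


class PrefixCache:
    """Toy cache (hypothetical): remembers prompt prefixes already processed."""

    def __init__(self):
        self._seen = set()         # hashes of prefixes processed so far
        self.tokens_processed = 0  # tokens that needed fresh processing

    @staticmethod
    def _key(text):
        return hashlib.sha256(text.encode("utf-8")).hexdigest()

    def submit(self, prefix, question):
        """Return how many tokens needed fresh processing for this request."""
        fresh = len(question.split())  # the question is always new work
        key = self._key(prefix)
        if key not in self._seen:
            # First time this prefix appears: process it and remember it.
            fresh += len(prefix.split())
            self._seen.add(key)
        self.tokens_processed += fresh
        return fresh


cache = PrefixCache()
shared_context = "You are a support agent. " * 50  # long prefix reused by every request
first = cache.submit(shared_context, "How do I reset my password?")
second = cache.submit(shared_context, "How do I close my account?")
print(first, second)
```

The second request repeats the same prefix, so only its short question is counted. A real service may still charge a reduced amount for cached content; the sketch ignores that.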

Second, AWS is releasing an intelligent routing feature inspired by Amazon Q. The system sends each request to the most appropriate model based on the request's complexity. This lowers costs and yields more relevant responses, because each request is matched to a model suited to its requirements.
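A minimal sketch of complexity-based routing, under stated assumptions: the model names, the cost figures, the `complexity` heuristic, and the threshold are all hypothetical and chosen only for illustration. How the actual service estimates complexity is not described here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    cost_per_request: float  # illustrative units, not real pricing


SMALL = Model("small-model", 1.0)   # hypothetical cheap model
LARGE = Model("large-model", 10.0)  # hypothetical capable model

REASONING_CUES = {"why", "compare", "explain", "analyze", "prove"}


def complexity(prompt: str) -> int:
    """Crude stand-in for a complexity estimate: word count plus reasoning cues."""
    words = prompt.lower().split()
    cue_count = sum(1 for w in words if w.strip("?,.") in REASONING_CUES)
    return len(words) + 10 * cue_count


def route(prompt: str, threshold: int = 20) -> Model:
    """Send simple prompts to the small model and the rest to the large one."""
    return SMALL if complexity(prompt) < threshold else LARGE


print(route("What time is it?").name)
print(route("Compare these two contracts and explain the legal risks").name)
```

The short factual question stays on the cheaper model, while the request containing two reasoning cues crosses the threshold and goes to the larger one.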

Technical Innovations on Bedrock

Bedrock gains other technical additions as well. Support for structured data enables natural language queries directly against databases. GraphRAG integration helps link related pieces of information, which reduces AI hallucinations.
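To make the graph idea concrete, here is a small sketch of graph-based retrieval in general, not of Bedrock's implementation. The `GRAPH` dictionary, the relation names, and the `retrieve_facts` helper are hypothetical. The point is that following edges between entities lets a system gather connected facts to ground an answer, instead of leaving the model to guess at the link.

```python
# Hypothetical knowledge graph: entity -> list of (relation, entity) edges.
GRAPH = {
    "Bedrock": [("offers", "Data Automation"), ("hosts", "Stable Diffusion 3.5")],
    "Stable Diffusion 3.5": [("made_by", "Stability AI")],
    "Data Automation": [("processes", "unstructured data")],
}


def retrieve_facts(question, graph, hops=2):
    """Collect facts within `hops` edges of entities named in the question."""
    frontier = [e for e in graph if e.lower() in question.lower()]
    seen = set(frontier)
    facts = []
    for _ in range(hops):
        next_frontier = []
        for entity in frontier:
            for relation, target in graph.get(entity, []):
                facts.append(f"{entity} {relation} {target}")
                if target not in seen:
                    seen.add(target)
                    next_frontier.append(target)
        frontier = next_frontier
    return facts


facts = retrieve_facts("Who makes the image model on Bedrock?", GRAPH)
for fact in facts:
    print(fact)
```

The question names only Bedrock, yet the second hop surfaces the fact linking Stable Diffusion 3.5 to Stability AI. These retrieved facts would then be passed to the model as context.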

In addition, Bedrock Data Automation automates the transformation of unstructured data into usable formats. The feature targets the 80% of corporate data that is often difficult to access, so that AI applications can draw actionable insights from it.

Expanded Partnerships with Startups

AWS is not relying only on its own development. It is also partnering with promising startups. The platform now includes models from Poolside, which specializes in code generation, and Stable Diffusion 3.5 from Stability AI for image creation. A collaboration with Luma adds advanced video generation capabilities.

The long-anticipated Bedrock Marketplace was also announced. It offers access to more than a hundred specialized models, including models developed by IBM, further expanding the tools available to businesses.

SageMaker Transforms into an Analytics Platform

SageMaker, AWS’s long-standing machine learning platform, is also gaining tools that extend its analytics capabilities. Flexible training plans simplify the management of computing resources by allocating them automatically according to the needs of each training project. This could reduce planning time by up to 40%.

The task governance feature optimizes the use of GPU clusters by shifting capacity between inference and training according to traffic. Finally, fine-tuning recipes let users customize models with concrete examples of the desired responses, adapting the models to their specific needs.
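The reallocation logic behind task governance can be pictured with a short sketch. It is an assumption-laden illustration, not the SageMaker API: the `split_gpus` function, its parameters, and the per-GPU throughput figure are hypothetical. Inference gets as many GPUs as current traffic requires, and whatever remains goes to training.

```python
def split_gpus(total_gpus, requests_per_second, rps_per_gpu, min_inference=1):
    """Return (inference_gpus, training_gpus) for the current traffic level."""
    # Ceiling division: GPUs needed to serve the current request rate.
    needed = -(-requests_per_second // rps_per_gpu)
    # Keep at least `min_inference` GPUs serving, never more than the cluster has.
    inference = min(total_gpus, max(min_inference, needed))
    return inference, total_gpus - inference


peak = split_gpus(total_gpus=16, requests_per_second=1200, rps_per_gpu=100)
off_peak = split_gpus(total_gpus=16, requests_per_second=150, rps_per_gpu=100)
print(peak, off_peak)
```

At peak traffic most of the cluster serves inference. During off-peak hours the same cluster hands most of its GPUs to training jobs.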

Amazon Q Integrated into SageMaker Canvas and QuickSight

Amazon Q, AWS’s AI assistant, is now integrated into SageMaker Canvas and QuickSight. In SageMaker Canvas, Q lets users build machine learning models through natural-language dialogue and breaks projects down into defined analysis steps.

In QuickSight, the assistant acts as an analyst and helps users understand complex scenarios. From queries written in natural language, Q identifies the relevant data and analyzes it methodically.

These developments position AWS as a serious competitor in the generative AI sector. The platform emerges as a robust solution capable of competing with market leaders thanks to its optimized tools and strategic innovations.

Frequently Asked Questions about AWS Generative AI Tool Enhancements at re:Invent 2024

What are the new features of AWS for generative AI presented at re:Invent 2024?
AWS introduced several new features, including prompt caching, intelligent routing, natural language querying of databases, and automated processing of unstructured data.
How does prompt caching help reduce costs?
Prompt caching stores the processing already done for a prompt. When a later request repeats that content, the stored work is reused instead of being recomputed, which speeds up response times and reduces inference costs.
What is intelligent routing and how does it work?
Intelligent routing directs a request to the most suitable language model based on its complexity. Simple requests are handled by smaller models, while complex requests use more advanced models, thus allowing for better accuracy and cost reduction.
How does AWS facilitate access to unstructured data with the new features of Bedrock?
AWS offers Bedrock Data Automation, which automatically transforms unstructured data (such as documents, images, and videos) into formats usable by AI applications, thus simplifying access to a large portion of business data.
What impact can the new features of Amazon SageMaker have on machine learning?
The new Amazon SageMaker features improve the management of computing resources through flexible training plans, use task governance to free GPU capacity for training during off-peak hours, and provide fine-tuning recipes for customizing models to specific needs.
How does Amazon Q enhance the user experience across AWS services?
Amazon Q facilitates the creation of machine learning models in SageMaker Canvas through natural dialogue, and in QuickSight, it acts as a virtual analyst capable of executing complex analyses by interpreting user requests in plain language.
Which companies have formed partnerships with AWS as part of these new features?
AWS has integrated solutions from Poolside, a French startup specializing in code generation, the image model from Stability AI (Stable Diffusion 3.5), and Luma’s technology for video generation, in order to enrich its offering on the Bedrock platform.
What is AWS’s main goal with these generative AI innovations?
AWS aims to provide businesses with a comprehensive and efficient suite of tools to develop generative artificial intelligence capabilities while reducing costs and optimizing processes, in order to effectively compete with market leaders.
