Meta strengthens AI security with new Llama tools

Publié le 1 May 2025 à 09h13
modifié le 1 May 2025 à 09h14

The maturation of AI calls for increased vigilance in the technological universe. The new threats require effective solutions. Meta, aware of the challenges posed by its development, has just unveiled a series of revolutionary tools to enhance the security of Llama models. The integration of AI opens revolutionary prospects. These innovations aim to protect not only users but also developers in an increasingly complex digital landscape. Security has become a paramount concern. To navigate this context, Meta offers refined tools designed to secure the AI ecosystem with rigor and efficiency.

Meta strengthens the security of Llama models

Meta has recently unveiled enhanced security tools for its Llama AI models, marking a significant advancement in the protection of artificial intelligence technologies. These tools are designed to help cybersecurity teams use AI more securely while strengthening existing structures.

New Llama protection tools

Among the new features, Llama Guard 4 embodies a significant evolution. This multimodal version includes adaptable security filters not only for text but also for images. This evolution addresses the growing needs of visual AI applications and is integrated with the new Llama API, currently in limited preview phase.

Security control with LlamaFirewall

LlamaFirewall represents an essential complement to the security framework for AIs. Its role is to act as a security control center for AI systems. This tool facilitates the management of various security models working collaboratively while connecting to other protection instruments from Meta.

The detection capabilities of LlamaFirewall include identifying and blocking risks that may disrupt the proper functioning of AIs. These include ‘prompt injection’ attacks aimed at manipulating AI, potentially harmful code generations, as well as risky behaviors associated with AI plug-ins.

Enhancement of Llama Prompt Guard

The update of Llama Prompt Guard allows for a considerable reinforcement against jailbreak attempts and prompt injections. This main model, Prompt Guard 2 (86M), has benefited from optimization, asserting its ability to detect threats more effectively.

A new variant, Prompt Guard 2 22M, offers a lighter option, allowing latency and computing costs to be reduced by up to 75% compared to its predecessor. This development is essential for organizations seeking faster responses while adhering to budget constraints.

Cybersecurity tools for defenders

Meta has also listened to the calls from cybersecurity professionals, developing tools specifically designed for defense against cyberattacks. The update of the CyberSec Eval 4 benchmark suite aims to assess the effectiveness of AI systems in terms of security.

The new tool CyberSOC Eval, developed in collaboration with cybersecurity experts like CrowdStrike, allows for the evaluation of AI performance in real-world security operations center environments. Another addition, AutoPatchBench, focuses on the ability of Llama models to automatically identify and fix vulnerabilities in code before malicious exploitation.

Llama Defenders Program

To facilitate access to new solutions, Meta is launching the Llama Defenders Program, aimed at partners and developers. This program offers privileged access to a range of AI tools, including open-source and exclusive options, tailored to various security challenges.

This program also provides the tool for automatic classification of sensitive documents, developed in-house by Meta. Its goal is to securely label documents, thus preventing leaks of sensitive information or inappropriate use in AI systems as in RAG configurations.

Detection of AI-generated audio

The issue of falsified audio, which has become a common tool in scams, is another priority for Meta. The tools Llama Generated Audio Detector and Llama Audio Watermark Detector are made available to partners to identify AI-generated voices in phishing calls or fraud attempts. Companies like ZenDesk, Bell Canada, and AT&T are already engaged in integrating these technologies.

Private processing technology

A potentially revolutionary innovation is on the horizon with private processing on WhatsApp. This technology will allow AI to perform useful tasks, such as synthesizing unread messages or assisting in drafting replies, without Meta or WhatsApp accessing the content of these messages.

Meta is taking transparent measures regarding the security of these systems, publishing its threat model and inviting security researchers to test the robustness of its architecture prior to deployment. An approach that demonstrates their commitment to ensuring user privacy.

Frequently asked questions about AI security with Meta’s new Llama tools

What security tools has Meta recently launched for Llama models?
Meta has introduced several new security tools for Llama models, including Llama Guard 4, LlamaFirewall, and an update of the Llama Prompt Guard. These tools aim to enhance security when using AI.

How does Llama Guard 4 improve the security of Llama models?
Llama Guard 4 is an advanced multimodal security filter that applies security rules not only to text but also to images, which is essential for increasingly visual AI applications.

What is LlamaFirewall and what is its role?
LlamaFirewall functions as a security control center for AI systems, allowing for the management of multiple security models and detecting threats such as ‘prompt injection’ attacks and other risky behaviors.

What is Prompt Guard 2 22M and what improvements does it have over its predecessor?
Prompt Guard 2 22M is a more compact and faster version of the main model, promising to reduce latency and computing costs by up to 75% while maintaining good detection capability for jailbreak attempts.

How does Meta assist cybersecurity teams with the CyberSec Eval 4 tool?
CyberSec Eval 4 is an open-source evaluation suite that helps organizations assess the effectiveness of AI systems in detecting and responding to threats in real security environments.

What is Meta’s Llama Defenders program?
The Llama Defenders program aims to provide partner companies and developers with exclusive access to a variety of AI solutions, including security tools, to meet specific security challenges.

How does the Automated Classification of Sensitive Documents tool work?
This tool automatically assigns security labels to documents within an organization, helping to prevent leaks of sensitive information and avoiding their treatment by AI systems inappropriately.

What are Meta’s new developments regarding the detection of AI-generated audio?
Meta has introduced the Llama Generated Audio Detector and the Llama Audio Watermark Detector to help identify AI-generated voices in phishing calls or fraud attempts, thus enhancing digital security.

What is private processing that Meta plans for WhatsApp?
Private processing would allow AI users to manage useful tasks such as drafting replies without Meta or WhatsApp having access to the content of messages, thereby enhancing the privacy of communications.

actu.iaNon classéMeta strengthens AI security with new Llama tools

the theory about Jony Ive’s AI hardware device is becoming increasingly credible

explorez la théorie captivante sur le dispositif matériel d'intelligence artificielle imaginé par jony ive, qui gagne en crédibilité. découvrez comment ses concepts innovants pourraient révolutionner notre interaction avec la technologie et redéfinir l'avenir des objets connectés.

how artificial intelligence has invested the world of perfumery

découvrez comment l'intelligence artificielle transforme l'industrie de la parfumerie, de la création de nouvelles fragrances à l'optimisation des procédés, en alliant innovation technologique et art de la senteur.

The influence of AI on our language: a study reveals that humans express themselves like ChatGPT

découvrez comment l'intelligence artificielle, à travers des outils comme chatgpt, façonne notre manière de communiquer. cette étude approfondie révèle des tendances fascinantes sur l'évolution de notre langage et les similitudes croissantes entre les expressions humaines et celles générées par l'ia.

Thomas Wolf from Hugging Face: the ambition to democratize robotics through open source

découvrez comment thomas wolf, co-fondateur de hugging face, vise à démocratiser la robotique grâce à l'open source. explorez ses idées innovantes et son engagement pour rendre la technologie accessible à tous.

the 20 most powerful AI models of June 2025: discover the detailed ranking

découvrez notre classement détaillé des 20 modèles d'intelligence artificielle les plus performants de juin 2025. explorez les innovations et les avancées qui façonnent l'avenir de la technologie.

Cédric O facing accusations of conflicts of interest, but receiving support from the HATVP

découvrez comment cédric o se retrouve au cœur de controverses concernant des accusations de conflit d'intérêts, tout en recevant le soutien inattendu de la haute autorité pour la transparence de la vie publique (hatvp).