Huawei supernode 384 shakes up Nvidia’s dominance in the AI market

Publié le 23 June 2025 à 23h13
modifié le 23 June 2025 à 23h14

Huawei triggers a revolution in the field of processors with the Supernode 384, challenging Nvidia‘s supremacy in the AI market. This technological advancement emerges in a tense context of US-Chinese rivalry, where innovation becomes the key to competitiveness. By reinventing data processing architecture, Huawei presents a system capable of competing with industry leaders despite severe trade restrictions. The new possibilities offered by this device significantly increase computing potential, paving the way for more sophisticated AI models.

A major technological advancement in the AI sector

Huawei recently unveiled its new Supernode 384 architecture, an innovation that challenges Nvidia’s supremacy in the processor market. At the Kunpeng Ascend Developer conference held in Shenzhen, company officials demonstrated how this advancement directly competes with Nvidia’s historical positioning amidst rising technological tensions between the United States and China.

A revolutionary architecture

The Supernode 384 architecture represents a turning point in the computing landscape. According to Zhang Dixuan, president of Huawei, this development arose from a necessity: “As the scale of parallel processing increases, the bandwidth between machines in traditional server architectures has become a bottleneck.”

This new architecture abandons the principles of Von Neumann computing to adopt a peer-to-peer model, optimized for modern artificial intelligence workloads. This shift proves particularly effective for Mixture-of-Experts models, which leverage specialized subnetworks to solve complex computational challenges.

Impressive performance

The implementation of CloudMatrix 384 boasts stunning technical specifications. Composed of 384 Ascend processors spread across twelve computing cabinets and four bus cabinets, it generates a computing power of 300 petaflops, coupled with 48 TB of high-bandwidth memory. This integrated infrastructure marks a significant advancement in the field of AI computing.

Real-world performance tests demonstrate the competitive position of this architecture. For instance, dense AI models such as LLaMA 3 from Meta have achieved 132 tokens per second on the Supernode 384, which is a performance 2.5 times higher than that of traditional cluster architectures.

Optimization of communication applications

Applications requiring high communication intensity show an even more marked improvement. Models from Alibaba’s Qwen and DeepSeek families have achieved between 600 and 750 tokens per second, highlighting the optimization of the architecture for next-generation AI workloads. These performance gains result from fundamental redesigns of the infrastructure. Huawei has replaced conventional Ethernet interconnections with high-speed bus connections, improving communication bandwidth by 15 times.

A response to geopolitical pressure

The development of the Supernode 384 is inseparable from the broader technological competition between the United States and China. American sanctions have restricted Huawei’s access to advanced semiconductor technologies, forcing the company to maximize performance within existing constraints. An industry analysis by SemiAnalysis suggests that the CloudMatrix 384 utilizes the AI processor Ascend 910C, which, although lagging in generations, offers undeniable architectural advantages.

Implications for the market

Huawei has already implemented CloudMatrix 384 systems in several data centers in China, notably in the provinces of Anhui, Inner Mongolia, and Guizhou. Such practical deployments validate the viability of this architecture and establish an infrastructure framework for broader market adoption.

The scalability potential of the system, capable of supporting tens of thousands of interconnected processors, makes it a compelling platform for training increasingly sophisticated artificial intelligence models. This development meets the growing needs for large-scale AI implementations across various sectors.

Disruption and future considerations

Huawei’s architectural advancement opens both opportunities and complications for the global AI ecosystem. By providing viable alternatives to Nvidia’s dominant market solutions, Huawei also accelerates the fragmentation of international technological infrastructure along geopolitical lines.

The success of Huawei’s AI computing initiatives will depend on adoption by the developer ecosystem as well as ongoing validation of performance. The company acknowledges, through its active awareness strategy at conferences, that technical innovation alone does not guarantee market acceptance.

FAQ about the Huawei Supernode 384 and its impact on the AI market

What is the Huawei Supernode 384 and what makes it innovative?
The Huawei Supernode 384 is a computing architecture designed for artificial intelligence workloads, abandoning von Neumann computing principles in favor of a peer-to-peer architecture. This innovation allows for overcoming bandwidth bottlenecks present in traditional server architectures.

How does the Supernode 384 compare to Nvidia solutions?
The Supernode 384 offers superior performance, achieving 132 tokens per second per card for dense AI models, 2.5 times more than conventional Nvidia cluster architectures. This demonstrates optimization for next-generation AI workloads.

What types of applications benefit the most from the Supernode 384?
Applications requiring intensive communication, such as those using Alibaba’s Qwen or DeepSeek models, particularly benefit from the Supernode 384, reaching up to 750 tokens per second per card thanks to an optimized architecture.

What is the importance of bandwidth in the Supernode 384 architecture?
Bandwidth is essential for parallel processing. The Supernode 384 replaced traditional Ethernet interconnections with high-speed buses, improving communication bandwidth by 15 times and reducing latency from 2 microseconds to 200 nanoseconds.

How does the Supernode 384 respond to American geopolitical restrictions?
In light of sanctions limiting Huawei’s access to semiconductor technologies, the Supernode 384 represents an innovation that fully utilizes available resources, maximizing performance while circumventing these constraints.

Where is the Supernode 384 already deployed in data centers?
The system is already operational in several data centers in China, notably in the provinces of Anhui, Inner Mongolia, and Guizhou, thus validating its infrastructure framework for broader market adoption.

What are the implications for companies considering investing in AI infrastructure?
The Supernode 384 offers companies a competitive alternative to Nvidia solutions, promoting independence from supply chains controlled by the United States. However, its long-term viability depends on continuous innovation cycles and improved geopolitical stability.

What are the potential challenges associated with adopting the Supernode 384?
The main challenges include acceptance within the developer ecosystem and validation of sustained performance. Achieving significant market presence will depend on Huawei’s ability to overcome these obstacles.

actu.iaNon classéHuawei supernode 384 shakes up Nvidia's dominance in the AI market

ChatGPT innovates with Codex, the assistant that simplifies almost all tasks for developers

découvrez chatgpt et son innovation codex, l'assistant révolutionnaire qui facilite la vie des développeurs en automatisant presque toutes leurs tâches. optimisez votre workflow et gagnez du temps grâce à cette technologie avancée.

Artificial intelligence and intellectual property: a new balance between creation and protection?

découvrez comment l'intelligence artificielle redéfinit les enjeux de la propriété intellectuelle, et explorez les défis et opportunités pour établir un nouvel équilibre entre création et protection des œuvres. un article essentiel pour comprendre l'impact de l'ia sur la créativité et les droits d'auteur.

The real stakes of a potential American embargo on AI chips worldwide

découvrez les véritables enjeux d'un potentiel embargo américain sur les puces d'intelligence artificielle à l'échelle mondiale. analyse des impacts économiques, technologiques et géopolitiques dans un monde de plus en plus dépendant de l'ia.

Spy devices in Chinese inverters and a data leak at Dior: the five cybersecurity news items of the week

découvrez les cinq actualités cybernétiques de la semaine, incluant des dispositifs espions cachés dans des onduleurs chinois et une fuite d'informations chez dior, révélant les enjeux de la sécurité numérique.

The impact of Google AI Overviews on SEO one year after its introduction

découvrez comment l'introduction de google ai overviews a transformé le paysage du seo un an après son lancement. analyse des changements d'algorithmes, des nouvelles pratiques de référencement et des stratégies gagnantes pour s'adapter à cette évolution technologique.

GitHub Copilot or Gemini Code Assist: which of the two AI code assistants stands out the most?

découvrez notre analyse comparative entre github copilot et gemini code assist. explorez les fonctionnalités, les performances et les avantages de ces deux assistants de code alimentés par l'intelligence artificielle pour déterminer lequel se démarque le plus dans l'aide à la programmation.