Chinese artificial intelligence company DeepSeek delayed the release of its new model after failing to train it using Huawei’s chips, highlighting the limits of Beijing’s push to replace US technology.
DeepSeek was encouraged by authorities to adopt Huawei’s Ascend processor rather than use Nvidia’s systems after releasing its R1 model in January, according to three people familiar with the matter.
But the Chinese startup encountered persistent technical issues during its R2 training process using Ascend chips, prompting it to use Nvidia chips for training and Huawei’s for inference, said the people.
The issues were the main reason the model’s launch was delayed from May, causing DeepSeek to lose ground to rivals, said a person with knowledge of the situation.
Training involves a model learning from a large dataset, while inference refers to using a trained model to make predictions or generate a response, such as answering a chatbot query.
DeepSeek’s difficulties show how Chinese chips still lag behind their US rivals for critical tasks, highlighting the challenges facing China’s drive to be technologically self-sufficient.
The Financial Times this week reported that Beijing has demanded that Chinese tech companies justify their orders of Nvidia’s H20, in a move to encourage them to adopt alternatives made by Huawei and Cambricon.
Industry insiders have said the Chinese chips suffer from stability issues, slower inter-chip connectivity, and inferior software compared with Nvidia’s products.
Huawei sent a team of engineers to DeepSeek’s office to help the company use its AI chip to develop the R2 model, according to two people. Yet despite having the team on site, DeepSeek could not conduct a successful training run on the Ascend chip, said the people.
DeepSeek is still working with Huawei to make the model compatible with Ascend for inference, the people said.
Founder Liang Wenfeng has said internally that he is dissatisfied with R2’s progress and has been pushing to spend more time building an advanced model that can sustain the company’s lead in the AI field, they said.
The R2 launch was also delayed because data labeling for the updated model took longer than expected, another person added. Chinese media reports have suggested the model could be released in the coming weeks.
“Models are commodities that can be easily swapped out,” said Ritwik Gupta, an AI researcher at the University of California, Berkeley. “A lot of developers are using Alibaba’s Qwen3, which is powerful and flexible.”
Gupta noted that Qwen3 adopted DeepSeek’s core concepts, such as its training algorithm that makes the model capable of reasoning, but made them more efficient to use.
Gupta, who tracks Huawei’s AI ecosystem, said the company is facing “growing pains” in using Ascend for training, though he expects the Chinese national champion to adapt eventually.
“Just because we’re not seeing leading models trained on Huawei today doesn’t mean it won’t happen in the future. It’s a matter of time,” he said.
Nvidia, a chipmaker at the center of a geopolitical battle between Beijing and Washington, recently agreed to give the US government a cut of its revenues in China in order to resume sales of its H20 chips to the country.
“Developers will play a crucial role in building the winning AI ecosystem,” said Nvidia about Chinese companies using its chips. “Surrendering entire markets and developers would only hurt American economic and national security.”
DeepSeek and Huawei did not respond to a request for comment.
© 2025 The Financial Times Ltd. All rights reserved. Not to be redistributed, copied, or modified in any way.