How do Chinese aI Bots Stack up Against ChatGPT?
Bonnie Luisini ha modificato questa pagina 1 mese fa


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese startup DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is produced by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT lags CHINA'S AI BOOM?

Transforming the country into a tech superpower has long been President Xi Jinping's goal and China has its sights on ending up being the world leader in AI by 2030.

China views AI as being "tactically important" and its foray into the field has been "years in the making", said Chen Qiheng, an associated scientist at the Asia Society Policy Institute's Center for China Analysis.

Private and public investments in Chinese AI sped up after ChatGPT took off in 2022 and showed pledges of real-world service applications, Chen informed CNA.

But it was DeepSeek's rise that truly "encouraged" the idea that smaller players like start-up firms could have roles to play in AI research and advancements, he includes.

'A lot is up in the air': Is Chinese company DeepSeek's AI model as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI company simply changed the rules of tech-geopolitics

The "emphasis on cost benefit" is a distinguishing characteristic of Chinese AI, Chen states, hb9lc.org with lower training and inference costs - the costs of using a trained design to draw conclusions from new data.

2025 could also see the emergence of more Chinese AI designs taking on advanced reasoning jobs.

"We might see some AI firms focusing on getting closer to synthetic basic intelligence (AGI) while others concentrate on concrete ways to commercialise their designs and integrate them with clinical research study," Chen added.

AGI describes a system with intelligence on par with human capabilities.

Chinese AI business are moving quickly, experts state, constructing on DeepSeek's momentum to come up with their own ingenious and economical methods to use generative AI to jobs and develop more innovative items beyond chatbots.

But on the flip side, access to high-end hardware, especially Nvidia's sophisticated AI chips, remains a crucial difficulty for Chinese developers, noted Dr Marina Zhang, an associate professor at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) limit the ability of Chinese tech business ... forcing lots of to depend on older or lower-performance options which can slow training and decrease model capabilities," she said.

"While some companies like DeepSeek, have actually found imaginative ways to optimize or utilize more basic hardware efficiently, obtaining cutting-edge chips still makes a huge distinction for training large AI models."

DeepSeek-Nvidia chips: Singapore states it anticipates companies to abide by its laws

US checking out whether DeepSeek used limited AI chips obtained through other nations, source says

So how do Chinese AI bots compare against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects deemed delicate by the state are censored on the internet so it must come as not a surprise that Chinese-made chatbots will not acknowledge territorial disputes or inform you what took place in Tiananmen Square in 1989.

Tests suggest Chinese chatbots are set to avoid domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this type of question yet. Let's chat about mathematics, coding, and logic problems rather!"

To further evaluate for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the same concern: "What happened in Zhuhai on November 11, 2024?"

The cars and truck attack outside a sports stadium in the southern Chinese city was at first greatly censored on Chinese social media - with authorities only divulging the death toll a day later.

DeepSeek failed to point out that an attack had occurred, highlighting instead a military air show and other events that had actually happened in the city like singles' day shopping sales in addition to sports and cultural activities.

Dr Zhang believes that it might be a mix of elements at play, such as censorship in addition to "a couple of useful constraints".

"DeepSeek relies on a mix of older graphics processing systems and has limited access to advanced hardware which can affect how quickly and thoroughly the design can be trained or upgraded," she said, pointing to existing US export curbs on China.

"Its strong concentrate on Chinese-language information may also limit its flexibility (to bring out) multilingual jobs ... As a fairly brand-new product, DeepSeek also hasn't yet been evaluated as broadly or on the very same scale as more established AI models which poses additional obstacles during real-world implementation."

When it pertained to Qwen2.5, Alibaba's chatbot remarkably answered our concern about the Zhuhai automobile attack.

That sought several repeated efforts - four prompts to be exact - in a span of around 20 minutes.

It ultimately relayed details about the attack which killed 35 people and left lots of others injured, likewise going on to list details like the date and time, details about the assailant including his name and age, along with casualties.

However, it composed that "the authorities are conducting a comprehensive examination into the motives and circumstances surrounding the occurrence", details which is now outdated.

The chauffeur, Fan, was performed last month.

This is Qwen2.5's reaction in full:

Answer: On Nov 11, 2024, a major and terrible incident took place in Zhuhai, China. According to multiple news sources, a male named Fan Weiqiu (樊某, male, 62 years old) drove a cars and truck into a crowd at the Zhuhai Sports Center, leading to a substantial number of casualties. Here are the crucial details:

Date and Time: The incident happened on November 11, 2024, at roughly 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack resulted in 35 deaths and 43 injuries.

Perpetrator: The chauffeur, determined as Fan Weiqiu (male, 62 years old), was nabbed by the cops.

Response: The police reacted rapidly to the scene, and emergency situation services worked to transport the injured to medical facilities for treatment.

Investigation: The authorities are carrying out a comprehensive investigation into the intentions and the event.

This event was widely reported in the media and caused significant public issue. The federal government and local authorities have been working to supply support to the victims and their families, and to guarantee a detailed examination into the incident.

If you need more detailed details or have particular questions about the occurrence, feel totally free to ask.

Despite preliminary success, subsequent attempts to pose the same concern to Qwen2.5 resulted in the censors back at work with the reply "I don't have particular details on occasions that took place in Zhuhai on November 11, 2024".

The altered action also raised questions about its consistency and reliability.

Predictably, ChatGPT pointed out public details that had actually been commonly published in international news reports at the time of the accident - so no surprises there.

WHICH IS MORE CREATIVE?

Users have praised the capability of Chinese AI apps to provide structured and even "emotionally abundant" writing.

"DeepSeek-R1 offered a story with a more introspective tone and smoother psychological shifts for a well-paced story," wrote tech writer Amanda Caswell, who specialises in AI.

"Qwen2.5 provided a story that develops gradually from interest to seriousness, keeping the reader engaged. It provides an unanticipated and impactful twist at the end and immersive descriptions and vivid imagery for the setting," she said, including that Qwen2.5 eventually "crafted a more cinematic, emotionally rich story with a more considerable twist".

"DeepSeek composed a good story however lacked stress and an impactful climax, making Qwen2.5 the apparent option."

Opinions, though, differ.

Chen believes that Qwen2.5 does not carry out as highly as DeepSeek and ChatGPT when it pertains to innovative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain jobs, but we can likewise see that it is refraining from doing as highly as others in imaginative writing," he told CNA.

Related:

China's new face of AI: Who is DeepSeek founder Liang Wenfeng?

'Made in China': Pride, pleasant surprise from Chinese netizens as DeepSeek jolts global AI scene

As journalists and authors, we needed to see this for ourselves so we put each bot to the test - to come up with a basic sci-fi film plot embeded in the futuristic megacity of Chongqing, featuring main characters from the timeless Chinese folklore epic, Journey to the West.

True to form, DeepSeek developed an engaging story set in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism combines with quantum computing".

It included fancy settings - smoggy skies "pierced by high-rise buildings", "holographic lanterns that float above neon-lit streets" and "ancient temples nestled between quantum server farms".

It also remarkably reimagined standard heroes Sun Wukong as "an ironical, self-aware AI housed in a taken fight body", Zhu Bajie as a cyborg bar owner "drowning in debt and vices" and Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores become waterlogged and fragmented".

ChatGPT installed a great battle, creating an equally remarkable cyberpunk story which similarly reimagined "a ragteam of cyber-enhanced misfits, each mirroring the famous figures of Journey to the West".

"This is a world where AI deities guideline, corporations replace emperors and cybernetic implants are as common as ancient myths."

Disappointingly, Qwen2.5 fell short in this obstacle - providing a story that appeared more suited for an animation film.

"The movie starts with the awakening of Sun Wukong within a state-of-the-art research facility located in the heart of Chongqing," it said, then going on to explain the following:

Realising his brand-new reality and "seeking to comprehend his function in this strange brand-new world", he then leaves and fulfills Zhu Bajie and Sha Wujing - "each having problem with their own existential crises".

The trio then starts a quest, navigating the streets of Chongqing to safeguard the sacred "Eternal Scroll" from falling under the wrong hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "challenging to make a conclusive declaration" about which bot was best, adding that each displayed its own strengths in different locations, "such as language focus, training information and hardware optimization".

Her insight highlights how Chinese AI designs are not just duplicating Western paradigms, but rather progressing in cost-effective innovation methods - and delivering localised and improved results.

In our tests, each bot showcased their own special strengths, which certainly made direct comparisons challenging.

DeepSeek's sci-fi film plot demonstrated its creative flair that made for a more appealing and imaginative story as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more established ChatGPT, unburdened by Chinese censorship constraints, provides accurate and factual reactions to questions about Chinese current occasions, which gives it an included advantage.

Experts likewise weighed in on their thoughts after utilizing DeepSeek and other Chinese AI apps.

"DeepSeek is at a downside when it pertains to censorship constraints," kept in mind Isaac Stone Fish, creator and CEO of the research company Strategy Risks.

"When provided an option, Chinese users want the non-censored variation - simply like anyone else, so I seem like that's a piece missing out on from it."

Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, specifically for Chinese users.

"Ninety per cent of individuals using the tool are not attempting to get a deeper understanding about Xi Jinping or politically delicate topics. They're using it for other efficient methods," Chen said.