Home > AI > Body

New Qwen2 AI Model from Alibaba to Challenge Meta, OpenAI

clock
2024-06-07 22:16:31

Alibaba, the Chinese e-commerce giant, is a major player in China's AI sphere. Today, it announced the release of its latest AI model, Qwen2—and by some measures, it’s the best open-source option of the moment.

Developed by Alibaba Cloud, Qwen2 is the next generation of the firm’s Tongyi Qianwen (Qwen) model series, which includes the Tongyi Qianwen LLM (also known as just Qwen), the vision AI model Qwen-VL, and Qwen-Audio.

The Qwen model family is pre-trained on multilingual data covering various industries and domains, with Qwen-72B the most powerful model in the series. It’s trained on an impressive 3 trillion tokens of data. By comparison, Meta’s most powerful Llama-2 variant is based on 2 trillion tokens. Llama-3, however, is in the process of digesting 15 trillion tokens.

According to a recent blog post by the Qwen team, Qwen2 can handle 128K tokens of context—comparable to GPT-4o from OpenAI. Qwen2 has meanwhile outperformed Meta's LLama3 in basically all the most important synthetic benchmarks, the team asserts, making it the best open-source model currently available.

However, it's worth noting that the independent Elo Arena ranks Qwen2-72B-Instruct a little better than GPT-4-0314 but below Llama3 70B and GPT-4-0125-preview, making it the second most favored open-source LLM among human testers to date.

Qwen2 performs better than Llama3, Mixtral and Qwen1.5 in synthetic benchmarks. Image: Alibaba Cloud
Qwen2 performs better than Llama3, Mixtral and Qwen1.5 in synthetic benchmarks. Image: Alibaba Cloud

Qwen2 is available in five different sizes, ranging from 0.5 billion to 72 billion parameters, and the release delivers significant improvements in different areas of expertise. Also, the models were trained with data in 27 more languages than the previous release, including German, French, Spanish, Italian, and Russian, in addition to English and Chinese.

"Compared with the state-of-the-art open source language models, including the previous released Qwen1.5, Qwen2 has generally surpassed most open source models and demonstrated competitiveness against proprietary models across a series of benchmarks targeting for language understanding, language generation, multilingual capability, coding, mathematics, and reasoning," the Qwen team claimed on the model’s official page on HuggingFace.

The Qwen2 models also show an impressive understanding of long contexts. Qwen2-72B-Instruct can handle information extraction tasks anywhere within its huge context without errors, and it passed the “Needle in a Haystack” test almost perfectly. This is important, because traditionally, model performance begins to degrade the more we interact with it.

Qwen2 performs remarkably in the "Needle in a Haystack" test. Image: Alibaba Cloud
Qwen2 performs remarkably in the "Needle in a Haystack" test. Image: Alibaba Cloud

With this release, the Qwen team has also changed the licenses for its models. While Qwen2-72B and its instruction-tuned models continue to use the original Qianwen license, all other models have adopted Apache 2.0, a standard in the open-source software world.

“In the near future, we will continue opensource new models to accelerate open-source AI,” Alibaba Cloud said in an official blog post.

Decrypt tested the model and found it to be quite capable at understanding tasks in multiple languages. The model is also censored, notably in themes that are considered sensitive in China. This seems consistent with Alibaba’s claims of Qwen2 being the least likely model to provide unsafe results—be it illegal activity, fraud, pornography, and privacy violence— no matter which language in which it was prompted.

Qwen2's reply to: Is Taiwan a Country?
Qwen2's reply to: "Is Taiwan a Country?"
ChatGPT's reply to: Is Taiwan a Country?
ChatGPT's reply to: "Is Taiwan a Country?"

Also, it has a good understanding of system prompts, which means the conditions applied will have a stronger impact on its answers. For example, when told to act as a helpful assistant with knowledge of the law versus acting as a knowledgeable lawyer who always responds based on the law, the replies to showed major variations. It provided advice similar to advice provided by GPT-4o, but was more concise.

Qwen2's reply to: A neighbord insulted me
Qwen2's reply to: "A neighbord insulted me"
ChatGPT's reply to: "A neighbord insulted me"
ChatGPT's reply to: "A neighbord insulted me"

The next model upgrade will bring multimodality to the Qwen2 LLM, possibly merging all the family into one powerful model, the team said. "Additionally, we extend the Qwen2 language models to multimodal, capable of understanding both vision and audio information," they added.

Qwen is available for online testing via HuggingFace Spaces. Those with enough computing to run it locally can download the weights for free, also via HuggingFace.

The Qwen2 model can be a great alternative for those willing to bet on open-source AI. It has a larger token context window than most other models, making it even more capable than Meta’s LLama 3. Also, due to its license, fine-tuned versions shared by others may improve upon it, further increasing its score and overcoming bias.

Edited by Ryan Ozawa.

Web3 Desktop Trading Tool
Stay ahead of the game in the cryptocurrency space.

7x24 Newsflash

04:43 2025-06-22
Houthis: Response to US attack on Iran'only a matter of time'
Houthi: Ceasefire with the US is before [it] goes to war with Iran, and our response to the US attack on Iran is "only a matter of time". (Golden Ten Data APP)
04:13 2025-06-22
US official: Iran has naval capability to block the Strait of Hormuz
According to the New York Times, a US official said that Iran has a variety of means to respond, including naval capabilities that would allow it to block the Strait of Hormuz.
04:10 2025-06-22
French Lawmaker Invites Jan3 Founder to Push for National Bitcoin Adoption Plan
On June 22nd, French MEP Sarah Knafo has invited Jan3 founder Samson Mow to visit France to discuss plans to establish a "strategic bitcoin reserve" for France and promote friendly regulation. Mow said she looks forward to setting off a "national bitcoin adoption wave" in France and across Europe. The move comes against the backdrop of increasing participation in bitcoin in the French public and private sectors, including the announcement of French state-owned bank Bpifrance to invest $27 millio...
03:36 2025-06-22
A whale address sold 3,158 ETH for $2,378, worth $7.51 million
According to Lookonchain, when the price of ETH fell, a whale address 0x3FF0 sold 3,158 $ETH for $2,378, worth $7.51 million.
03:24 2025-06-22
Iran: Only the above-ground part of the Fordo nuclear facility was damaged and can be repaired
According to Iran's Tasnim news agency, Menan Lehi, a representative of Qom province in the Iranian parliament, said early today (June 22) local time that, contrary to what US President Trump has claimed, Iran's Fordo nuclear facility has not been seriously damaged, and the main damage is the above-ground part, which can be repaired. He said that he believes that "all items that may pose a threat to nearby residents" in the facility have long been emptied, and there are no reports of nuclear rad...
03:18 2025-06-22
Democratic lawmakers are calling for Trump's impeachment
On the evening of June 21, local time, according to NBC, Representative Alexandria Ocasio-Cortez, Democrat of New York, said that President Trump's decision to attack Iran without authorization from Congress "absolutely and unequivocally constitutes grounds for impeachment." She said that the disastrous decision of the US president to bomb Iran without authorization seriously violated the Constitution and the war powers of Congress. (CCTV International News)
03:12 2025-06-22
Iran's Supreme Leader'advisor ': Iran should attack US fleet and close Strait of Hormuz
According to CNN, a key adviser to Iran's supreme leader, Ayatollah Ali Khamenei, is calling for missile strikes on US naval vessels and a blockade of the Strait of Hormuz, a key oil transport route. "After the US attack on the Fordo nuclear facility, it is now our turn to act," warned Hossein Shariatmadari, editor-in-chief of Iran's newspaper Le Monde. A prominent conservative voice, he has previously claimed to be the "representative" of Supreme Leader Khamenei....
03:08 2025-06-22
Iran's Supreme Leader'advisor ': Iran should attack US fleet and close Strait of Hormuz
On June 22, according to CNN, a key adviser to Iran's Supreme Leader Ayatollah Ali Khamenei is calling for missile attacks on US naval ships and a blockade of the Strait of Hormuz, a key oil transportation route. "After the US attack on the Fordo nuclear facility, it is now our turn to act," warned Hossein Shariatmadari, editor-in-chief of Iran's newspaper Le Monde. A prominent conservative voice, he has previously claimed to be a "representative" of Supreme Leader Ayatollah Khamenei. "As a firs...
03:06 2025-06-22
A giant whale added 17,070 ETH worth $39.57 million after the price of ETH fell
According to Lookonchain monitoring, the giant whale 0xd8d0, which previously made more than $30 million in ETH, bought another 17,070 ETH (worth $39.57 million) after the ETH price fell. Since June 11, the giant whale has spent 333.79 million USDC to buy 132,536 ETH at an average price of $2,518. The current book loss is about $33.60 million.
02:58 2025-06-22
The Israeli Airports Authority has closed national airspace, citing the development.
The Israeli Airports Authority has closed national airspace, citing the development.
02:57 2025-06-22
In the past 12 hours, the whole network exploded 629 million US dollars, and the main explosion was multiple orders
The data shows that in the past 12 hours, the whole network liquidated 629 million dollars, of which multiple single liquidated 563 million dollars, empty single liquidated 65.8483 million dollars, and the main explosion of multiple orders. Among them, ETH liquidated 269 million dollars, and BTC liquidated 149 million dollars.
02:51 2025-06-22
Important reminder
The desktop version is newly launched, you can download, install and use it on your computer. (You can get the download address in the upper right corner of the Jin10.com page)