Meta releases the "strongest open source model", opening a new chapter in the battle between open source and closed source; the large-model landscape may face a reshuffle
邹高清
Posted on 2024-7-28 10:40:19
On July 23 local time, Meta officially released Llama 3.1, the latest version of its large language model. The AI community sees the release as a powerful rebuttal to the argument that open source models inevitably lag behind, and Meta founder and CEO Mark Zuckerberg declared at the launch that "open source AI is the path to the future".
OpenAI has long been criticized for keeping ChatGPT closed: critics say that although the company is called "Open", what it actually does is "closed". Yet the strength of closed source models, represented by GPT-4o, has often discouraged the rest of the industry, to the point that the idea that "closed source large models must outperform open source ones" has come to be treated as a given.
The release of Llama 3.1 seems set to rewrite this pattern. Meta released three versions of Llama 3.1, at 8B, 70B, and 405B parameters, with 405B as the flagship. Meta claims its performance is comparable to the best closed source models.
The Strongest Open Source Model
Why can Llama 3.1 405B compete with the best closed source models? Alongside the release, Meta published a paper titled "The Llama 3 Herd of Models", which describes the development of the Llama 3 models in detail.
First, in terms of usage, Llama 3.1 supports 8 languages, and the context window of all three versions has been extended to 128K tokens, the same as GPT-4 Turbo. Llama 3.1 405B has 405 billion parameters, was trained at a scale about 50 times that of Llama 2, and uses a dense Transformer architecture for more stable performance. With a 128K context window, Llama can process roughly 96,000 words of text at once, so it handles both long and short texts with ease.
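The 96,000-word figure is consistent with a common rule of thumb of about 0.75 English words per token. That ratio is an assumption here, not something the article states. A minimal Python sketch of the arithmetic, using a hypothetical helper named `approx_words`:

```python
# Rule-of-thumb conversion from tokens to English words.
# WORDS_PER_TOKEN is an assumed average, not a figure taken from the article.
WORDS_PER_TOKEN = 0.75


def approx_words(context_tokens: int, words_per_token: float = WORDS_PER_TOKEN) -> int:
    """Estimate how many English words fit in a context window of the given size."""
    return round(context_tokens * words_per_token)


if __name__ == "__main__":
    # 128K context window, taken here as 128,000 tokens
    print(approx_words(128_000))
```

The real ratio depends on the tokenizer and the language of the text, so the result is only a rough estimate.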
In the paper, Meta also published benchmark comparisons between Llama 3.1 405B and closed source models such as GPT-4o and Claude 3.5 Sonnet. The results show Llama 3.1 405B leading in several areas, including general performance, long-text processing, and multilingual processing. For example, on the ZeroSCROLLS benchmark, Llama 3.1 405B scored 95.2, while the latter two both scored 90.5.
Its strong performance and large training scale have earned Llama 3.1 the title of "the strongest open source large model". However, Llama 3.1 is still mainly a language model and does not support processing images, video, or speech. This means ChatGPT retains an advantage in multimodal tasks.
Open source AI is the path to the future
The actual user experience of Llama may not yet be perfect, but the release of Llama 3.1 405B matters greatly to AI practitioners around the world, because it opens a new chapter in the contest between open source and closed source large models.
On Meta's official website, Zuckerberg published an open letter declaring that "open source AI is the path to the future". In the letter, he said that although several companies are developing leading closed source models, open source is rapidly closing the gap. Taking Llama as an example: last year, Llama 2 could only compete with an older generation of models, but this year Llama 3 is competitive with the most advanced models and leads in some areas.
Zuckerberg therefore hopes to make Llama the Linux of the large-model era and the industry standard for open source AI. As he wrote in the letter: "In the early days of high-performance computing, major technology companies invested heavily in developing their own closed source versions of Unix... Today, open source Linux has become the industry standard foundation for cloud computing and for the operating systems that run most mobile devices, and I believe artificial intelligence will develop in a similar way."
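CandyLake.com is an information publishing platform and provides only information storage services.
Disclaimer: The views in this article are those of the author alone. This article does not represent the position of CandyLake.com and does not constitute advice; please treat it with caution.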