Meta releases Llama 3.1, its strongest open-source model; Zuckerberg: it will be a turning point for the industry
胡胡胡美丽_ss
Posted on 2024-7-24 09:38:16
On the evening of July 23, Beijing time, Meta officially released its latest open-source model series, Llama 3.1, further narrowing the gap between open-source and closed-source models. Llama 3.1 comes in three parameter scales: 8B, 70B, and 405B. The 405B model surpasses OpenAI's GPT-4o on multiple benchmarks and is comparable to leading closed-source models such as Claude 3.5 Sonnet.
Meta founder and CEO Mark Zuckerberg published a blog post on the company's website at the same time to promote the release. He said that Llama 3.1 will be a turning point for the industry, after which most developers will begin to primarily use open source, and that open-source AI is the way forward.
NVIDIA Senior Research Scientist Jim Fan congratulated the Meta team on X, stating, "The power of GPT-4 is in our hands, and this is a truly historic moment."
In terms of specifics, the context window of all three Llama 3.1 versions has grown from 8K to 128K tokens, a 16-fold expansion, and the models support 8 languages. The Llama 3.1 405B model was trained on over 15 trillion tokens, and to reach this training scale the team used 16,000 H100 GPUs. According to Meta, the 405B model is the first Llama model trained at this scale.
Meta wrote in its official blog that open-source large language models have mostly lagged behind closed-source models in capability and performance, but that a new era led by open source is now beginning.
In the official blog, Meta said it evaluated performance on over 150 benchmark datasets and compared Llama 3.1 with other models. The flagship Llama 3.1 405B is comparable to GPT-4, GPT-4o, and Claude 3.5 Sonnet across a range of tasks such as general knowledge, steerability, and mathematics. In addition, the smaller 8B and 70B models are competitive with closed-source and open-source models of similar parameter counts.
In human evaluations based on real-world scenarios, Llama 3.1 405B performed better overall than GPT-4o and Claude 3.5 Sonnet.
Meta has also updated its open-source license, for the first time allowing developers to use the output of Llama models (including 405B) to improve other models. Taking aim at GPT-4o, Meta says it is also using a compositional approach to integrate image, video, and speech capabilities into Llama 3, so that the model can recognize images and videos and support voice interaction. These capabilities are still under development and not yet ready for release.
In the official blog, Meta stated that total downloads of all Llama versions have so far exceeded 300 million.
Alongside the model release, Zuckerberg published a long essay on the company's website titled "Open Source AI Is the Path Forward", which makes the case for open source. He believes that open source is good for developers, for Meta, and for the world.
Zuckerberg cited the example of the open-source Linux operating system overtaking closed-source Unix, and said he believes artificial intelligence will develop in a similar way. Several technology companies are developing leading closed models, but open source is quickly narrowing the gap. He noted that last year Llama 2 was only comparable to an older generation of models, while this year Llama 3 is competitive with the most advanced models and leads in some areas.
Zuckerberg believes that open source can promote innovation, reduce costs, and improve security. With open-source models, developers can train, fine-tune, and distill their own models. Every organization has different needs, and those needs are best met with models of different sizes that are trained or fine-tuned on the organization's own data.
Meanwhile, developers can avoid being locked into a closed vendor and can protect their data. Open-source software is often more secure, Zuckerberg said, because its development is more transparent and can be widely reviewed.
Zuckerberg also said that open-source models are cheaper and more efficient: developers can run inference on Llama 3.1 405B on their own infrastructure at roughly 50% of the cost of using closed models like GPT-4o, for both user-facing and offline inference tasks.
In Zuckerberg's view, open-source AI represents the world's best chance of harnessing this technology to create the greatest economic opportunity and security.
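CandyLake.com is an information publishing platform and only provides information storage services.
Disclaimer: The views in this article are those of the author alone. This article does not represent the position of CandyLake.com and does not constitute advice; please treat it with caution.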