首页 News 正文

Meta releases' Strongest Open Source Model ', opening a new page in the battle between open source and closed source. The big model may face a reshuffle

邹高清
1280 0 0

On July 23rd local time, Meta officially released the latest version of its language model Llama3.1. This release is seen by the AI community as a powerful counterattack against the "open source backwardness theory", and Meta founder and CEO Mark Zuckerberg also stated during the release that "open source AI is the path to the future".
OpenAI has always been criticized by the outside world for the closed nature of ChatGPT, claiming that although it is called "Open", it actually does "Close" things. However, the strength of closed source big models represented by ChatGPT-4o often discourages the industry, as if the concept that "closed source big models must have better performance than open source big models" has become the default.
But the release of Llama3.1 this time seems to rewrite this pattern. Meta has released three versions of Llama3.1, namely 8B, 70B, and 405B, with 405B being the "top of the line" version. Meta claims that its performance is comparable to the best closed source models.
The Strongest Open Source Model
Why can Llama3.1 405B compete with the best closed source models? Along with the release of Llama3.1, Meta also published a paper titled 'The Llama 3 Herd of Models', which detailed the development details of the Llama 3 model.
Firstly, in terms of usage, Llama3.1 supports 8 languages and the context windows of all three versions have been extended to 128K, which is the same as GPT-4 Turbo; Meanwhile, Llama3.1 405B has 405 billion model parameters, with a training scale 50 times larger than Llama2, and adopts a dense Transformer architecture to maintain more stable performance. In this way, Llama can process up to 96000 words of text at once, and can handle both long and short texts with ease.
In the paper, Meta also published performance comparison data between Llama3.1 405B and closed source models such as ChatGPT-4o and Claude 3.5 Sonnet. The test results show that Llama3.1 405B leads in multiple aspects such as general performance, long text processing, and multilingual processing. For example, in the ZeroSCROLLS project testing, Llama3.1 405B scored 95.2, while the latter two were both 90.5.
The outstanding performance and large training base of Llama3.1 have earned it the title of "the strongest open-source big model". However, the current Llama3.1 is still a large model mainly focused on language processing and does not support processing images, videos, or speech. This means that ChatGPT still has outstanding capabilities in multimodal task processing.
Open source AI is the path of the future
Perhaps the actual user experience of Llama has not yet reached a perfect level, but the release of Llama 3.1 405B is of great significance to AI workers around the world, as it opens a new page in the open source and closed source struggle for large models.
On the Meta official website, Zuckerberg released an open letter firmly proclaiming that "open-source AI is the path to the future". In the letter, he stated that although multiple companies are developing leading closed source models, open source is rapidly narrowing the gap. Taking Llama as an example, last year Llama 2 could only compete with older versions of the general large model, but this year Llama 3 has achieved competition with the most advanced large models and is leading in some fields.
Therefore, Zuckerberg hopes to turn Llama into the Linux of the big model era and become the industry standard for open source AI. In the early days of high-performance computing, major technology companies invested heavily in developing their own closed source versions of Unix... Today, open-source Linux has become the industry standard foundation for cloud computing and operating systems that run most mobile devices, and I believe artificial intelligence will develop in a similar way
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

  •   知名做空机构香橼研究(Citron Research)周四(11月21日)在社交媒体平台X上发布消息称,该公司已决定做空“比特币大户”微策略(Microstrategy)这家公司,并认为该公司已经将自己变身成为一家比特币投资基金 ...
    caffycat
    12 小时前
    支持
    反对
    回复
    收藏
  •   每经AI快讯,11月20日,文远知行宣布旗下自动驾驶环卫车S6与无人扫路机S1分别在新加坡滨海湾海岸大道与滨海艺术中心正式投入运营。据介绍,这是新加坡首个商业化运营的自动驾驶环卫项目。 ...
    star8699
    前天 19:48
    支持
    反对
    回复
    收藏
  •   上证报中国证券网讯(记者王子霖)11月20日,斗鱼发布2024年第三季度未经审计的财务报告。本季度斗鱼依托丰富的游戏内容生态,充分发挥主播资源和新业务潜力,持续为用户提供高质量的直播内容及游戏服务,进一步 ...
    goodfriendboy
    前天 20:09
    支持
    反对
    回复
    收藏
  •   人民网北京11月22日电 (记者栗翘楚、任妍)2024广州车展,在新能源汽车占据“半壁江山”的同时,正加速向智能网联新能源汽车全面过渡,随着“端到端”成为新宠,智能驾驶解决方案成为本届广州车展各大车企竞 ...
    3233340
    6 小时前
    支持
    反对
    回复
    收藏
邹高清 新手上路
  • 粉丝

    0

  • 关注

    0

  • 主题

    0