Crushing all opponents? Google releases a lightweight open-source model that can run on laptops
老蟹2017
Posted on 2024-2-22 13:16:00
The open-source large-model race has a heavyweight new entrant.
On February 21st local time, Google announced the official launch of Gemma, a new open-source large language model (LLM) aimed at helping developers and researchers build artificial intelligence responsibly.
It is reported that Gemma shares technology and infrastructure with Gemini, Google's largest and most powerful artificial intelligence model. "Inspired by Gemini, Google DeepMind collaborated with other Google teams to develop Gemma, whose name comes from the Latin gemma, meaning 'gem.'"
Compared with Gemini, however, Gemma is more lightweight. It is free to use, its model weights are open, and commercial use is allowed.
Google has released the model in two sizes, Gemma 2B (2 billion parameters) and Gemma 7B (7 billion parameters). Each size comes in pre-trained and instruction-tuned versions, and organizations of any size are permitted to use and distribute them commercially in a responsible manner.
On the same day that Google released Gemma, the popular chip manufacturer Nvidia announced a partnership with Google to ensure that Gemma runs smoothly on its chips. Nvidia also stated that its chatbot software Chat With RTX will soon support Gemma.
Google also emphasizes that Gemma can surpass larger models on key benchmarks. More striking still, Gemma can run on a laptop.
Google describes Gemini as its largest and most powerful AI model widely available today. Compared with other open models, Gemma 2B and 7B achieve best-in-class performance for their sizes. Gemma models can run directly on a developer's laptop or desktop. "It's worth noting that Gemma surpasses larger models on key benchmarks while adhering to our strict standards of safe and responsible output," Google said.
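To see why models of this size are plausible on a laptop, a back-of-the-envelope estimate of the memory needed for the weights alone can help. The sketch below is only an illustration under stated assumptions: it uses the nominal parameter counts quoted above (the exact counts may differ), the function and table names are hypothetical, and it ignores activations, caches, and runtime overhead, so real usage will be higher.

```python
# Rough estimate of the memory needed to hold model weights.
# Assumptions: nominal parameter counts from the article (actual counts may
# differ); activations, caches, and runtime overhead are ignored.
NOMINAL_PARAMS = {"Gemma 2B": 2_000_000_000, "Gemma 7B": 7_000_000_000}

# Standard storage cost per parameter for common numeric formats.
BYTES_PER_PARAM = {"float32": 4.0, "bfloat16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(num_params: int, bytes_per_param: float) -> float:
    """Return the memory needed for the weights alone, in GiB."""
    return num_params * bytes_per_param / 2**30


def estimate_table() -> dict:
    """Map (model, format) to estimated weight memory in GiB."""
    return {
        (model, fmt): weight_memory_gib(count, size)
        for model, count in NOMINAL_PARAMS.items()
        for fmt, size in BYTES_PER_PARAM.items()
    }


if __name__ == "__main__":
    for (model, fmt), gib in estimate_table().items():
        print(f"{model:9s} {fmt:9s} ~{gib:6.2f} GiB")
```

Under these assumptions, the 2-billion-parameter model needs a little under 4 GiB for its weights at 16-bit precision, which fits within the memory of many current laptops, while the 7-billion-parameter model would typically rely on lower-precision formats or a machine with more memory.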
Along with the open-source model, Google released a technical report detailing Gemma's performance, dataset composition, and modeling methods. Researchers reading the report noted that Gemma uses a vocabulary of 256K tokens, which suggests it can provide better and faster support for languages other than English.
Parameter comparison of Gemma and Meta's Llama 2, from Google's official website
Gemma was also made available right away on the well-known open-source model hub Hugging Face and on HuggingChat. Shortly after launch, both the Gemma 2B and 7B models reached the top of Hugging Face's "Large Language Model Leaderboard".
François Chollet, AI industry expert and creator of the deep learning framework Keras, went further, stating that the title of strongest open-source large model has now changed hands.
Gemma's competitor Llama 3 is also about to be released. On January 19th, Meta co-founder and CEO Zuckerberg announced that Meta is training Llama 3 and will continue to open-source its models in a responsible manner.
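CandyLake.com is an information publishing platform and provides only information storage services.
Disclaimer: The views in this article are the author's alone. This article does not represent the position of CandyLake.com and does not constitute advice; please treat it with caution.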