
Google releases Gemini, its most powerful AI model; top securities firms quickly weigh in: continued optimism about the prospects of the AI industry

因醉鞭名马幌

On December 7th, Caixin News Agency reported that US technology giant Google recently announced the launch of Gemini, its largest and most powerful AI model.
The newly released Gemini model is multimodal and delivers significantly improved performance. Gemini is built on a Transformer decoder and can process information in different forms, such as video, audio, and text. Compared with previous technologies, the latest Gemini model can perform more complex reasoning and understand finer-grained information. It can extract key points from hundreds of thousands of documents by reading, filtering, and understanding information, which is expected to help achieve new breakthroughs in many fields, from science to finance.
The Gemini model comes in three versions by size: Gemini Ultra, Gemini Pro, and Gemini Nano, all of which support a 32K-token context length. Among them:
1) Ultra: the most powerful version, which achieves the highest efficiency on the corresponding TPU infrastructure. In multiple tests, the Ultra version outperforms GPT-4V;
2) Pro: a version optimized for cost-effectiveness, with strong capabilities in reasoning, multimodality, and other areas. It scales well and can complete pre-training within a few weeks. In multiple tests, it is second only to GPT-4V and stronger than mainstream large models such as PaLM 2, Claude 2, LLaMA 2, and GPT-3.5;
3) Nano: a 4-bit model distilled from larger models, available in two versions, 1.8B and 3.25B parameters, targeting low-memory and high-memory devices respectively and supporting local deployment.
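To put the Nano figures in perspective, here is a minimal back-of-the-envelope sketch of how much storage the weights alone would need at 4 bits per parameter. It assumes every parameter is stored at exactly the stated bit width and ignores activations, caches, and any quantization metadata; the function name `weight_memory_gb` and the 16-bit comparison are illustrative choices, not something taken from the report.

```python
def weight_memory_gb(num_params: float, bits_per_weight: int = 4) -> float:
    """Approximate storage for the weights only, in gigabytes (10^9 bytes).

    Assumption: every parameter uses exactly `bits_per_weight` bits.
    """
    return num_params * bits_per_weight / 8 / 1e9


# Parameter counts quoted in the article for the two Nano versions.
nano_versions = {"1.8B": 1.8e9, "3.25B": 3.25e9}

for label, params in nano_versions.items():
    four_bit = weight_memory_gb(params, bits_per_weight=4)
    sixteen_bit = weight_memory_gb(params, bits_per_weight=16)
    print(f"Nano {label}: ~{four_bit:.3f} GB at 4-bit vs ~{sixteen_bit:.3f} GB at 16-bit")
```

Under these assumptions the weights of both versions fit in well under 2 GB, which is consistent with the article's point that Nano targets local, on-device deployment.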
The Gemini model, billed as the first multimodal model of its kind from Google and worldwide, supports both cloud and edge (on-device) use. According to the relevant test data, Gemini Ultra outperforms human experts on MMLU (Massive Multitask Language Understanding), and in side-by-side comparisons it surpasses GPT-4 on multiple tasks.
Minsheng Securities stated that, based on evaluations of the Gemini model family on more than 50 benchmarks, quality in reasoning, mathematics/science, and long-text tasks continues to improve as model size increases. Gemini Ultra is the best model across all six capability areas. Gemini Pro, the second-largest model in the family, is also highly competitive in performance and more efficient to serve.
Minsheng Securities pointed out that the Gemini training process also brought innovations in infrastructure, algorithms, and datasets:
In terms of infrastructure, Gemini was trained on Google's TPU v5e and TPU v4 chips, and the training process showed engineering innovation. For example, 4096 TPU v4 chips are connected to a dedicated optical switch that can dynamically reconfigure 4x4x4 chip cubes into a super node with an arbitrary 3D torus topology in about 10 seconds; for Gemini Ultra, some cubes were set aside for hot standby and rolling maintenance. To meet the high inter-chip interconnect speed required by the Ultra version, Google applied multiple patented technologies such as OCS (optical circuit switching), although the article does not give the final speed.
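The arithmetic behind the super node description can be sketched as follows: a 4x4x4 cube holds 64 chips, so 4096 chips correspond to 64 cubes, which can in principle be arranged into several box-shaped 3D layouts. This is only an illustration under simplifying assumptions (every cube is used, and any box arrangement of cubes counts as a valid layout); the helper names `cube_count` and `candidate_topologies` are hypothetical, and real scheduling constraints are not described in the article.

```python
from math import prod

CHIPS_PER_SUPER_NODE = 4096  # figure quoted in the article
CUBE_SHAPE = (4, 4, 4)       # one reconfigurable building block


def cube_count(total_chips, cube_shape):
    """Number of whole cubes needed to hold `total_chips` chips."""
    chips_per_cube = prod(cube_shape)
    if total_chips % chips_per_cube:
        raise ValueError("total_chips must be a multiple of the cube size")
    return total_chips // chips_per_cube


def candidate_topologies(total_chips, cube_shape):
    """Chip-level 3D shapes obtained by arranging all cubes in an a x b x c box.

    Assumption: only a <= b <= c is listed, so rotations are not repeated.
    """
    n = cube_count(total_chips, cube_shape)
    shapes = []
    for a in range(1, n + 1):
        if n % a:
            continue
        for b in range(a, n // a + 1):
            if (n // a) % b:
                continue
            c = n // (a * b)
            if c >= b:
                shapes.append((a * cube_shape[0], b * cube_shape[1], c * cube_shape[2]))
    return shapes


print("cubes per super node:", cube_count(CHIPS_PER_SUPER_NODE, CUBE_SHAPE))
for shape in candidate_topologies(CHIPS_PER_SUPER_NODE, CUBE_SHAPE):
    print("chip-level shape:", shape)
```

The fully symmetric case is a 16x16x16 arrangement; the elongated shapes are listed only to show why a reconfigurable switch is useful when different jobs prefer different layouts.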
In terms of algorithms, techniques such as single-controller programming and the XLA compiler are used to optimize the training process, and stable training is achieved by guarding against issues such as silent data corruption (SDC).
In terms of datasets, tokenization is used to improve Gemini's training and inference speed, and a series of filtering methods ensure the high quality of the training data.
TPU v5p, the latest version of Google's computing chip, was released at the same time. It improves on the earlier TPU v4: compared with TPU v4, TPU v5p delivers twice the floating-point performance and trains large language models 2.8 times faster. CITIC Securities believes that the official release of the multimodal Gemini model can expand application scenarios and drive continued growth in computing power demand. Minsheng Securities remains optimistic about the future prospects of the AI industry and believes that the release of models such as GPT-5 will provide further catalysts.
CITIC Securities stated that in the current search scenario, Gemini can reduce latency by approximately 40%. For the industry as a whole, Google's push toward productization and commercialization will also bring broad changes. At the same time, with the launch of models such as GPT-5, two trends are expected: 1) growth in computing power demand driven by multimodal models; 2) a growing number of AI scenarios and products.
The release of Gemini will raise expectations for multimodal models, which will in turn increase computing power demand across the industry. In the medium to long term, upgrades to multimodal models are expected to enrich the usage scenarios of related products, alongside the cost reductions brought by hardware upgrades and algorithm optimization. Progress in consumer-facing (2C) products is worth looking forward to.
CITIC Securities stated that it remains optimistic about the long-term impact of this round of generative AI on the technology industry and the changes it will bring, and continues to focus on leading companies in areas such as computing power, algorithms, data, and applications.