首页 News 正文

The next generation of AI "super chips" are about to emerge

六月清晨搅
196 0 0

The highly anticipated GTC developer conference of AI chip giant Nvidia is about to be held, and the global trend of AI computing power is receiving attention.
As UK chip architecture company Arm continues to focus on the server market and recently updated its product roadmap for the Arm Neoverse series of server processors, two new Arm Neoverse computing subsystems (CSS) based on the all-new third-generation Neoverse IP have been launched. The outside world will also have a glimpse of the next generation of AI "super chips" that integrate CPUs and GPUs, and whether Nvidia will follow suit will also be closely monitored.
Neoverse is a server processor brand launched by Arm in 2018 for the data center market. Under Arm's planning, Neoverse's N series, V series, and E series each have their own positioning. For example, the V series emphasizes performance first and is used in the high-end server market. The previous generation Neoverse V2 was used in Nvidia's AI chip design.
Last March, Nvidia launched its first "Grace Hopper" GH200 superchip that combines CPU and GPU packaging. "Grace" refers to Nvidia's data center Arm CPU series released in April 2021, while "Hopper" refers to Nvidia's latest architecture GPU production model H100.
A chip industry investor told Interface News that Nvidia's Grace Hopper chip combines CPUs with top AI training products (GPUs) to create a "super chip" and jointly build a complete AI solution.
GH200 can be used for AI training and inference, and Nvidia significantly improves data transmission efficiency between CPUs and GPUs by packaging one CPU and one H100 GPU into a single chip. In November of the same year, Nvidia upgraded the GH200 again, upgrading the 96GB capacity HBM3 memory equipped on the GPU in the GH200 to 144GB HBM3e, significantly improving data transmission efficiency once again.
In the process of Nvidia seizing the AI wave with its GPU products, Arm also benefits from Nvidia's strong position in AI computing, which means that the data center market may adopt more processors based on Arm technology.
Mohammed Awad, General Manager of Arm's Infrastructure Business Unit, explained to Interface News that Nvidia's previously launched Grace Hopper Superchip has redesigned the system architecture. In the past, data centers used a single CPU to manage multiple GPUs, while Grace Hopper chips have been transformed to correspond to only one GPU per CPU. "More CPUs mean memory consistency, which ultimately greatly improves GPU utilization."
Arm stated that as the industry's demand for AI computing power gradually shifts from training to inference, CPU inference will be a key component of generative AI computing applications.
But not all AI processing will be performed on the CPU. Dermot O'Driscoll, Vice President of Product Solutions for Arm Infrastructure Business Unit, cited Grace Hopper as an example, stating that Nvidia's important innovation in this chip lies in memory capacity and shared memory mode. This tightly coupled CPU design, coupled with the configuration of AI accelerators, is very beneficial for the current popular large parameter language models and other AI applications.
In order to make custom chips faster and reduce design difficulty, Arm launched Arm Neoverse CSS last year. In Neoverse CSS, Arm configures, optimizes, and verifies the complete computing subsystem, and configures it for various computing cases. Partners focus on software tuning, customization acceleration, and other work, which can also accelerate product launch time and reduce engineering costs.
Dermot O'Driscoll pointed out that Neoverse CSS is a product launched specifically to help customers quickly build general-purpose computing chips on the Arm CPU platform. It can provide all the interfaces that customers need to choose the accelerator that couples itself. This method can provide both CPU and AI accelerators when needed, achieving the best of both worlds.
Nvidia has always played down its competitive edge with Intel and AMD for its self-developed Arm architecture Grace CPU.
Huang Renxun once told Interface News reporters in 2021 that the vast majority of data centers will continue to use existing x86 CPUs, while Grace will mainly be used in large data intensive sub markets in the computing field and will not have a "game changing" impact on existing CPU manufacturers.
However, the market landscape has changed. In the data center market, Arm is gradually gaining a foothold and posing a challenge to the giants Intel and AMD.
According to a report by market research firm Counterpoint, Arm architecture servers earned over $1 billion in revenue in the data center market for the first time in 2022, with AWS self-developed chips accounting for 3.16% of the market share and Ampere accounting for 1.52%. With the deployment of Microsoft's self-developed Arm chip in 2023 and the shipment of Grace Hopper, it is expected that Arm's market share in the server market will continue to rise.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

  •   每经AI快讯,据亿航智能官微消息,公司EH216-S无人驾驶电动垂直起降航空器(eVTOL)获得巴西国家民航局颁发的试验飞行许可证书,并计划在巴西进行测试和试飞。关于EH216-S无人驾驶eVTOL在巴西的认证,中国民航局 ...
    潇湘才子
    昨天 08:41
    支持
    反对
    回复
    收藏
  •   今年7月,美国三大海外“债主”所持美国国债齐刷刷缩水,其中日本美债持仓已降至去年10月以来最低。   根据美国财政部当地时间9月18日公布的国际资本流动报告(TIC),2024年7月,美国前三大海外“债主”日本 ...
    520hacker
    3 天前
    支持
    反对
    回复
    收藏
  •   上证报中国证券网讯(记者俞立严)9月19日,蔚来全新品牌乐道的首款车型——乐道L60正式上市。新车定位家庭智能电动SUV,在采用BaaS电池租用服务后,L60的售价可低至14.99万元,电池租用月费最低为599元。乐道L6 ...
    anhao007
    前天 11:03
    支持
    反对
    回复
    收藏
  •   每经记者袁园   日前,国务院印发的《关于加强监管防范风险推动保险业高质量发展的若干意见》提出,以新能源汽车商业保险为重点,深化车险综合改革。   “车险综改”从2015年就已经开始逐步推进了,经过 ...
    moshulong
    前天 21:50
    支持
    反对
    回复
    收藏
六月清晨搅 注册会员
  • 粉丝

    0

  • 关注

    0

  • 主题

    30