Nvidia launches "heavyweight" late at night, expected to launch Blackwell Ultra AI chip in 2025 | Big Model World
大和797
发表于 2024-6-3 13:08:19
1242
0
0
On the evening of June 2nd, Huang Renxun, founder and CEO of NVIDIA, gave a speech on stage and revealed many key information. According to him, developers using NVIDIA NIM to deploy AI models on clouds, data centers, or workstations can shorten model deployment time from weeks to minutes. Customers such as Heshuo, Lloyd's, Siemens are all using it.
In addition, the new generation of AI chips and supercomputing platform Blackwell chips, which Nvidia has high hopes for, have begun production and are expected to launch Blackwell Ultra AI chips in 2025.
NVIDIANIM can shorten model deployment time from weeks to minutes
On the evening of June 2nd, Huang Renxun, the founder of NVIDIA dressed in leather, once again played with his own products on stage and introduced NVIDIANIM, a reasoning microservice that can provide models in optimized container form, aiming to assist enterprises of all sizes in deploying AI services.
However, strictly speaking, NVIDIANIM is not a new product and first appeared in March of this year. Nvidia announced on the evening of June 2nd that 28 million developers worldwide can download NVIDIANIM, deploy AI models on clouds, data centers, or workstations, and build generative AI applications such as Copilot (an AI assistant) and ChatGPT chatbots. Starting next month, members of the NVIDIA Developer Program can use NIM for free to conduct research, development, and testing on the infrastructure they choose.
According to Nvidia, new generative AI applications are becoming increasingly complex, often requiring the use of multiple models with different functionalities to generate text, such as images, videos, speech, etc. NVIDIANIM provides a simple and standardized way to add generative AI to applications, which can shorten model deployment time from weeks to minutes.
Huang Renxun also revealed that nearly 200 technology partners, including Cadence, Cloudera, Coheity, DataStax, Network App, Scale AI, and Xinsi Technology, are integrating NIM into their platforms to accelerate the deployment of generative AI. "Every enterprise hopes to integrate generative AI into its operations, but not every enterprise has a dedicated AI research team. NVIDIA NIM can be integrated into any platform, accessible to developers from anywhere, and can run in any environment." Huang Renxun said.
The Daily Economic News reporter learned that NIM is pre built and currently has nearly 40 models available as endpoints for developers to experience; Developers can access NVIDIA NIM microservices suitable for the Meta Llama 3 model from the open-source community platform Hugging Face, and use Hugging Face inference endpoints to access and run the Llama 3 NIM.
It is worth noting that Nvidia has also revealed the usage of a group of major customers, such as electronic manufacturer Foxconn, which is using NIM to develop Large Language Models (LLMs) for specific fields, such as intelligent manufacturing, smart cities, and smart electric vehicles; Heshuo is using NIM for a local mixed expert (MoE) model; Lloyd's is using NVIDIA NIM inference microservices to enhance the experience of employees and customers; Siemens is integrating its operational technology with NIM microservices for workshop AI workloads; Dozens of healthcare companies are also deploying NIM to support generative AI reasoning in application areas including surgical planning, digital assistants, drug discovery, and clinical trial optimization.
Blackwell chips begin production
In addition to the aforementioned products, Huang Renxun also revealed in his speech that Nvidia Blackwell chips have started production and will launch Blackwell Ultra AI chips in 2025.
In May of this year, Huang Renxun stated during a earnings conference call that it is expected that Blackwell architecture chips will bring a significant amount of revenue to the company this year. Nvidia's high expectations for Blackwell chips are still related to strong market demand. According to the latest disclosed financial report data, Nvidia achieved a revenue of $26 billion in the first quarter of the fiscal year 2025, an increase of 262% compared to the same period last year. Among them, the revenue of the data center business was 22.6 billion US dollars, an increase of 427% compared to the same period last year, making it the "leader" in performance revenue.
According to Colette Kress, Chief Financial Officer of Nvidia, the growth of data center business is due to the increase in shipments of Hopper architecture GPUs (such as H100); One of the important highlights of the quarter was Meta's announcement of the launch of the Lama 3 open-source large model, which used nearly 24000 H100 GPUs.
In addition to disclosing the progress of chip mass production, Nvidia has also launched a series of systems using the NVIDIA Blackwell architecture.
It is reported that these systems are equipped with GraceCPU and NVIDIA network and infrastructure to assist enterprises in establishing AI factories and data centers. Among them, the NVIDIA MGX modular reference design platform has added support for NVIDIA Blackwell products, including the NVIDIA GB200 NVL2 platform designed to provide excellent performance for mainstream large language model inference, retrieval enhancement generation, and data processing.
Nvidia emphasizes that the GB200 NVL2 is suitable for emerging fields such as data analysis. With the bandwidth memory performance brought by NVLink C2C interconnect technology and the specialized decompression engine in Blackwell architecture, the data processing speed can be up to 18 times faster than when using X86 CPU, and energy efficiency can be improved by 8 times. "A new round of industrial revolution has begun, and many enterprises and regions are collaborating with NVIDIA to promote the transformation of traditional data centers worth trillions of dollars towards accelerated computing, and building a new type of data center AI factory to produce new products, artificial intelligence," said Huang Renxun.
Nvidia stated that more than 90 released or under development systems from over 25 partners have used the MGX reference architecture, reducing development costs by up to three-quarters compared to before and shortening development time to six months, a two-thirds decrease compared to before. In addition, Nvidia also revealed that more than ten global robotics companies, including BYD Electronics, Siemens, Tereida, and Intrinsic, a subsidiary of Alphabet, are integrating NVIDIAIsaac acceleration libraries, physics based simulations, and AI models into their software frameworks and robotics models to improve the efficiency of factories, warehouses, and distribution centers.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
猜你喜欢
- Nvidia's first two new AI hardware models make their debut before the release of its third quarter report
- Over 10000 Nvidia Blackwell chips have been delivered to Huang Renxun in response to tariff issues
- Financial analysis: Nvidia's Q4 performance guidance falls short of the highest expectations, and the stock price fell more than 5% after the market closed
- Global Finance: Market pays attention to Nvidia's performance. The three major stock indexes of the New York Stock Exchange fluctuated on the 20th
- Nvidia's Q4 performance guidance fell short of the highest expectations, and its stock price fell more than 5% after hours
- Nvidia's third quarter revenue reached $35.082 billion
- NVIDIA's performance growth slows down, Huang Renxun steps in to 'appease' the market! Analyst: Investors Underestimate Demand for Blackwell Chips
- Nvidia's Q4 performance guidance falls short of the highest expected stock price, with a drop of over 5% after the market closed
- The stock price has skyrocketed by 33%! Snowflakes overshadow Nvidia analysts: AI software outperforms semiconductors or trends
- The three major US stock indices collectively closed higher, while the Dow Jones Industrial Average rose more than 1%. Nvidia's stock price hit a new intraday high
-
知名做空机构香橼研究(Citron Research)周四(11月21日)在社交媒体平台X上发布消息称,该公司已决定做空“比特币大户”微策略(Microstrategy)这家公司,并认为该公司已经将自己变身成为一家比特币投资基金 ...
- caffycat
- 昨天 11:18
- 支持
- 反对
- 回复
- 收藏
-
每经AI快讯,11月20日,文远知行宣布旗下自动驾驶环卫车S6与无人扫路机S1分别在新加坡滨海湾海岸大道与滨海艺术中心正式投入运营。据介绍,这是新加坡首个商业化运营的自动驾驶环卫项目。 ...
- star8699
- 3 天前
- 支持
- 反对
- 回复
- 收藏
-
上证报中国证券网讯(记者王子霖)11月20日,斗鱼发布2024年第三季度未经审计的财务报告。本季度斗鱼依托丰富的游戏内容生态,充分发挥主播资源和新业务潜力,持续为用户提供高质量的直播内容及游戏服务,进一步 ...
- goodfriendboy
- 3 天前
- 支持
- 反对
- 回复
- 收藏
-
人民网北京11月22日电 (记者栗翘楚、任妍)2024广州车展,在新能源汽车占据“半壁江山”的同时,正加速向智能网联新能源汽车全面过渡,随着“端到端”成为新宠,智能驾驶解决方案成为本届广州车展各大车企竞 ...
- 3233340
- 昨天 17:06
- 支持
- 反对
- 回复
- 收藏