首页 News 正文

JD technical leader: Large models will become smaller and even finer down to the scene

四夜父脚群
1238 0 0

General big models rely on computing power to build, while enterprise big models rely on business to run out
On July 30th, at the JD Cloud Summit held in Shanghai, Cao Peng, Chairman of the Technical Committee of JD Group and President of JD Cloud Business Unit, expressed the above views. According to his understanding, for large models, data is nourishment and scenarios are training grounds.
Over the past year, there has been a sustained craze for big models, and the industry has experienced a 'thousand model war'. According to statistics from the China Academy of Information and Communications Technology, there are currently over 1000 basic large-scale models worldwide, with China accounting for 35% of the global total.
Although the performance of basic models is constantly improving, in the personal user end, large models have not yet achieved true super applications. Instead, in many enterprise scenarios, they have gradually been deployed based on applications.
At the summit, JD Cloud showcased the latest practices of JD Yanxi's big model landing industry and released eight products including JD Cloud Enterprise Big Model Service, Yanxi Intelligent Agent Platform, Intelligent Programming Assistant JoyCoder, and Yanxi Digital Person 3.0.
According to data provided by JD.com, as of now, JD's big model has been implemented in over a hundred scenarios, covering different industries such as healthcare, e-commerce live streaming, logistics, and finance. Many of JD's own delivery personnel, merchants, doctors, procurement and sales operations, and R&D personnel have received support from the big model application.
For example, the "Jingyi Qianxun" service that serves medical scenarios, according to the head of JD Health Intelligent Algorithm Department, currently has four different sized models internally. One is a small model of about 2b, which provides a single service in a narrow domain. The team envisions that it can even be used on mobile phones in the future; The second is a medium-sized model with 14b and 22B as the core, which completes some medical consulting and service support work; Finally, there is a large model centered around 80s that specializes in serving complex medical decision-making and reasoning abilities.
The above model supports private deployment, even integrated deployment, which is related to industry characteristics. "It is difficult for the medical industry to accept a completely cloud based model, and few hospitals can accept this breakthrough," said the person in charge.
According to its introduction, in actual hospital implementation scenarios, Beijing Medical Qianxun will pay more attention to independently completing patient services in compliance, including triage, pre consultation, registration, appointment, accompanying consultations during consultations, and post consultation health management.
On the first day of GPT's release, everyone thought about the natural conversational ability and so-called anthropomorphic ability of this generation. From this perspective, whether it can better become a doctor's assistant is more valuable than becoming a diagnostic tool for doctors, "the person in charge emphasized.
In the beauty scene, unlike pure live streaming in the past, JD.com is currently attempting to combine digital person makeup testing with digital person anchors internally; In terms of footwear and clothing scenes, there will be a scene where digital people live stream in the front and hosts change their outfits in the back. The live streaming style based on specific category attributes will be transferred to digital people.
When it comes to the development trend of large models, several technical leaders from JD.com have stated that large models will become smaller and smaller. Vertical large models are a relatively certain direction, and can even be further refined to scene large models. The inherent logic is that large models need to adapt to scenarios and industries, so they cannot be too large.
He Xiaodong, Dean of JD Exploration Research Institute and Head of JD Technology's Artificial Intelligence Business, believes that due to limitations in data and computing power, simply increasing the scale of the model may quickly reach the development ceiling, resulting in the economic benefits generated by the large model being insufficient to support its own costs, making it difficult to sustain.
The large-scale models are growing at a rate of 10 times per year, with parameters ranging from billions to trillions. However, commercialization is currently lagging behind and will eventually become a problem in the medium to long term. He also pointed out that the illusion rate of many models is still high, which cannot provide solid guarantees for future industrial applications.
According to He Xiaodong, JD.com starts from the initial strategy model in terms of model self evolution. Firstly, it constructs an initial preference dataset, and then uses a pre trained reward model to score each answer. Based on the high or low score, it constructs new preference data, which will greatly promote model iteration and updates.
In terms of model inference, the cost of big language model inference is currently skyrocketing. Therefore, JD.com has improved model construction efficiency through end-to-end, low bit, high-precision quantization technology, reducing model size and enhancing inference performance without affecting model output accuracy and parameter quantity. He Xiaodong said that his current technical solution has saved 70% of the model's video memory.
When it comes to the large-scale model of enterprise implementation, Cao Peng believes that there are three key points. Firstly, simplicity is crucial. The diversity and fragmentation of scenarios cannot sustain high development costs, and it is necessary to minimize the threshold for using large models in order to cover more applications. Next is openness, based on an open Agent ecosystem, large model ecosystem, and cloud native ecosystem, giving customers the right to choose. The third is security, providing data security and privacy protection, AIGC content compliance, corpus data security management, making enterprise big model services trustworthy and reliable.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

  •   知名做空机构香橼研究(Citron Research)周四(11月21日)在社交媒体平台X上发布消息称,该公司已决定做空“比特币大户”微策略(Microstrategy)这家公司,并认为该公司已经将自己变身成为一家比特币投资基金 ...
    caffycat
    昨天 11:18
    支持
    反对
    回复
    收藏
  •   每经AI快讯,11月20日,文远知行宣布旗下自动驾驶环卫车S6与无人扫路机S1分别在新加坡滨海湾海岸大道与滨海艺术中心正式投入运营。据介绍,这是新加坡首个商业化运营的自动驾驶环卫项目。 ...
    star8699
    3 天前
    支持
    反对
    回复
    收藏
  •   上证报中国证券网讯(记者王子霖)11月20日,斗鱼发布2024年第三季度未经审计的财务报告。本季度斗鱼依托丰富的游戏内容生态,充分发挥主播资源和新业务潜力,持续为用户提供高质量的直播内容及游戏服务,进一步 ...
    goodfriendboy
    3 天前
    支持
    反对
    回复
    收藏
  •   人民网北京11月22日电 (记者栗翘楚、任妍)2024广州车展,在新能源汽车占据“半壁江山”的同时,正加速向智能网联新能源汽车全面过渡,随着“端到端”成为新宠,智能驾驶解决方案成为本届广州车展各大车企竞 ...
    3233340
    昨天 17:06
    支持
    反对
    回复
    收藏
四夜父脚群 新手上路
  • 粉丝

    0

  • 关注

    0

  • 主题

    0