首页 News 正文

Massive financial models entering a price war? Alibaba Cloud announces a 97% price reduction for the Tongyi Qianwen GPT-4 main model

芊芊551
163 0 0

Poster News Reporter Sun Jie reports
On May 21st, Alibaba Cloud released a heavyweight news: the Qwen Long, the main model of the Tongyi Qianwen GPT-4, saw a 97% drop in API input prices from 0.02 yuan/thousand tokens to 0.0005 yuan/thousand tokens. This means that one yuan can buy 2 million tokens, which is equivalent to the amount of text in 5 New China Dictionary books. This model supports up to 10 million tokens of long text input, and after a price reduction, it is about 1/400 of the GPT-4 price, breaking through the global bottom price.
Qwen Long is a long text enhanced version of the Tongyi Qianwen model, with performance benchmarking against GPT-4 and a maximum context length of 10 million. In addition to the input price dropping to 0.0005 yuan/thousand tokens, the output price of Qwen Long has also dropped by 90% to 0.002 yuan/thousand tokens. In contrast, domestic and foreign manufacturers GPT-4, Gemini1.5 Pro, Claude 3 Sonnet, and Ernie 4.0 have input prices of 0.22 yuan, 0.025 yuan, 0.022 yuan, and 0.12 yuan per thousand tokens, respectively, which are much higher than Qwen long.
The price reduction of Tongyi Qianwen this time covers a total of 9 commercial and open source series models. The recently released flagship model Qwen Max from Tongyi Qianwen has reduced its API input price to 0.04 yuan/thousand tokens, a decrease of 67%. Qwen Max is currently the best performing Chinese large model in the industry, with performance comparable to GPT-4 Turbo on the authoritative benchmark OpenCompass, and ranking in the top 15 globally in the big model arena Chatbot Arena.
Not long ago, Sam Altman from OpenAI forwarded the Chatbot Arena ranking to confirm the GPT-4o's capabilities. Among the top 20 global models, only three Chinese models were produced by Tongyi Qianwen.
The industry generally believes that as the performance of large models gradually improves, AI application innovation is entering a period of intensive exploration. However, high inference costs remain a key factor restricting the large-scale application of large models.
At the Wuhan AI Leaders Summit, Liu Weiguang, Senior Vice President of Alibaba Cloud Intelligent Group and President of Public Cloud Business Unit, said, "As China's largest cloud computing company, Alibaba Cloud has significantly reduced the price of large model inference this time in order to accelerate the explosion of AI applications. We expect the number of calls to large model APIs to increase by thousands of times in the future."
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

  •   2024世界人工智能大会,“首次亮相”、“新品发布”成为很多厂商的重要布局。   7月4日,网易多款AI新品首次亮相世界人工智能大会,并发布首个机器人品牌“灵动”。网易灵动是网易伏羲基于自研工业大模型和A ...
    cristianna
    昨天 15:04
    支持
    反对
    回复
    收藏
  •   百度董事长兼CEO李彦宏还记得自己第一次来参加世界人工智能大会(WAIC)是在2022年,那一次大会的主题和元宇宙相关,主办方传话给他,希望他讲一讲元宇宙。他回:“我说我还是讲AI吧,我讲不了元宇宙”。当时, ...
    niemiao
    前天 16:46
    支持
    反对
    回复
    收藏
  •   北京少有的一个阴雨绵绵的早晨,灰色天空给理想纯电车生产基地蒙上一层不同寻常的寂静。这座维持了两个多月喧嚣的新汽车工厂放缓了生产节奏;工人们开始每周只上一天班;正在产线上试制下一款纯电车的理想研发员 ...
    cvpanjun
    前天 15:46
    支持
    反对
    回复
    收藏
  •   据报道,英伟达首席执行官黄仁勋(Jensen Huang)6月份减持了价值近1.69亿美元的该公司股票,这也是他单月减持最多的一次。市场对用于驱动人工智能(AI)的芯片的巨大需求推动英伟达股价再创新高。   美国证 ...
    moshulong
    前天 10:52
    支持
    反对
    回复
    收藏
芊芊551 注册会员
  • 粉丝

    0

  • 关注

    0

  • 主题

    44