首页 News 正文

What is the mysterious new product of Google OpenAI? Latest Speculation: Multimodal AI Assistant

男人的余味偷
1227 0 0

OpenAI is determined to launch a live broadcast and launch new products the day before Google I/O Conference, demonstrating the magical updates of ChatGPT and GPT-4.
What is this mysterious new product? The speculation about GPT-5 and search engines has been personally overturned by OpenAI CEO Altman.
According to the latest reports, the AI assistant built into the phone may be a product that OpenAI is about to release.
Technology media The Information cited insiders as saying that OpenAI plans to launch a multimodal AI model that has visual and auditory functions, can communicate with you, recognize objects, and has better logical reasoning ability than current chatbots. OpenAI has already demonstrated this model to some clients.
OpenAI has developed models that can transcribe audio and text to speech. The report states that the new model is equivalent to a combination of these models, but is more accurate and responsive faster. The new model can help AI assistants distinguish mood, better understand semantics, and theoretically, it can help students learn mathematics or translate real-world gestures.
However, although the new model can surpass the GPT-4 Turbo in answering certain types of questions, there are still hallucinations.
According to developer Anany Arora, OpenAI may launch a service with built-in ChatGPT function on mobile phones for making phone calls. Arora posted screenshots of the above call related code on social media and also found evidence that OpenAI has been configured as a server for real-time audio and video communication.
Using artificial intelligence to make phone calls can save users waiting time, and this service can be seen as one of the functions of AI assistants.
The AI assistant is also a feature that Google has been developing. It is reported that Google Pixel 9 series phones will have a brand new exclusive AI assistant "Pixie" built-in, which can view items through the device's camera and perform operations such as indicating the place of purchase or providing instructions for using the items.
Altman previously revealed in an interview with Salesforce CEO Marc Benioff that his favorite AI movie is "She," a story about a man falling in love with his AI virtual assistant. "The idea of a dialogue language interface has incredible foresight."
The Information reported that Altman hopes to ultimately develop a virtual assistant that can respond quickly, similar to the AI assistant in the movie, and support existing voice assistants such as Apple Siri with this technology.
It is worth noting that according to insiders, Apple is about to reach an agreement with OpenAI to introduce the latter's technology on the new generation iOS operating system. Both parties have been finalizing the terms of an agreement to use the ChatGPT feature in Apple's next-generation iPhone operating system iOS 18.
The new model relies on the cloud for operation and is expected to be included in the free version of ChatGPT in the future
OpenAI believes that AI assistants with visual and auditory capabilities may bring about changes like smartphones. It can observe the environmental information of users, provide suggestions, and potential use cases such as acting as a tutor, translating traffic signs, repairing cars, and so on.
But similar technologies currently require too high hardware barriers to run on personal devices. Media analysis points out that the new model depends on the cloud to run and requires Internet connection to work. It may take several months or even years to make complex artificial intelligence conversations with visual and auditory functions small enough to run on personal devices such as smartphones.
It is currently unclear when OpenAI will provide these new features to paying customers, but according to people who have tried the voice assistant, OpenAI's ultimate plan is to include these features in the free version of ChatGPT, with the goal of lower operating costs than its state-of-the-art model GPT-4 Turbo.
OpenAI did not respond to the above speculation.
What will OpenAI ultimately launch? The answer will be revealed next week, and OpenAI has announced that it will live stream on its official website at 10am Pacific Time on May 13th (1am Beijing Time on May 14th), showcasing some updates to ChatGPT and GPT-4.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

  •   知名做空机构香橼研究(Citron Research)周四(11月21日)在社交媒体平台X上发布消息称,该公司已决定做空“比特币大户”微策略(Microstrategy)这家公司,并认为该公司已经将自己变身成为一家比特币投资基金 ...
    caffycat
    昨天 11:18
    支持
    反对
    回复
    收藏
  •   每经AI快讯,11月20日,文远知行宣布旗下自动驾驶环卫车S6与无人扫路机S1分别在新加坡滨海湾海岸大道与滨海艺术中心正式投入运营。据介绍,这是新加坡首个商业化运营的自动驾驶环卫项目。 ...
    star8699
    3 天前
    支持
    反对
    回复
    收藏
  •   上证报中国证券网讯(记者王子霖)11月20日,斗鱼发布2024年第三季度未经审计的财务报告。本季度斗鱼依托丰富的游戏内容生态,充分发挥主播资源和新业务潜力,持续为用户提供高质量的直播内容及游戏服务,进一步 ...
    goodfriendboy
    3 天前
    支持
    反对
    回复
    收藏
  •   人民网北京11月22日电 (记者栗翘楚、任妍)2024广州车展,在新能源汽车占据“半壁江山”的同时,正加速向智能网联新能源汽车全面过渡,随着“端到端”成为新宠,智能驾驶解决方案成为本届广州车展各大车企竞 ...
    3233340
    昨天 17:06
    支持
    反对
    回复
    收藏
男人的余味偷 新手上路
  • 粉丝

    0

  • 关注

    0

  • 主题

    2