首页 News 正文

What is the mysterious new product of Google OpenAI? Latest Speculation: Multimodal AI Assistant

男人的余味偷
1216 0 0

OpenAI is determined to launch a live broadcast and launch new products the day before Google I/O Conference, demonstrating the magical updates of ChatGPT and GPT-4.
What is this mysterious new product? The speculation about GPT-5 and search engines has been personally overturned by OpenAI CEO Altman.
According to the latest reports, the AI assistant built into the phone may be a product that OpenAI is about to release.
Technology media The Information cited insiders as saying that OpenAI plans to launch a multimodal AI model that has visual and auditory functions, can communicate with you, recognize objects, and has better logical reasoning ability than current chatbots. OpenAI has already demonstrated this model to some clients.
OpenAI has developed models that can transcribe audio and text to speech. The report states that the new model is equivalent to a combination of these models, but is more accurate and responsive faster. The new model can help AI assistants distinguish mood, better understand semantics, and theoretically, it can help students learn mathematics or translate real-world gestures.
However, although the new model can surpass the GPT-4 Turbo in answering certain types of questions, there are still hallucinations.
According to developer Anany Arora, OpenAI may launch a service with built-in ChatGPT function on mobile phones for making phone calls. Arora posted screenshots of the above call related code on social media and also found evidence that OpenAI has been configured as a server for real-time audio and video communication.
Using artificial intelligence to make phone calls can save users waiting time, and this service can be seen as one of the functions of AI assistants.
The AI assistant is also a feature that Google has been developing. It is reported that Google Pixel 9 series phones will have a brand new exclusive AI assistant "Pixie" built-in, which can view items through the device's camera and perform operations such as indicating the place of purchase or providing instructions for using the items.
Altman previously revealed in an interview with Salesforce CEO Marc Benioff that his favorite AI movie is "She," a story about a man falling in love with his AI virtual assistant. "The idea of a dialogue language interface has incredible foresight."
The Information reported that Altman hopes to ultimately develop a virtual assistant that can respond quickly, similar to the AI assistant in the movie, and support existing voice assistants such as Apple Siri with this technology.
It is worth noting that according to insiders, Apple is about to reach an agreement with OpenAI to introduce the latter's technology on the new generation iOS operating system. Both parties have been finalizing the terms of an agreement to use the ChatGPT feature in Apple's next-generation iPhone operating system iOS 18.
The new model relies on the cloud for operation and is expected to be included in the free version of ChatGPT in the future
OpenAI believes that AI assistants with visual and auditory capabilities may bring about changes like smartphones. It can observe the environmental information of users, provide suggestions, and potential use cases such as acting as a tutor, translating traffic signs, repairing cars, and so on.
But similar technologies currently require too high hardware barriers to run on personal devices. Media analysis points out that the new model depends on the cloud to run and requires Internet connection to work. It may take several months or even years to make complex artificial intelligence conversations with visual and auditory functions small enough to run on personal devices such as smartphones.
It is currently unclear when OpenAI will provide these new features to paying customers, but according to people who have tried the voice assistant, OpenAI's ultimate plan is to include these features in the free version of ChatGPT, with the goal of lower operating costs than its state-of-the-art model GPT-4 Turbo.
OpenAI did not respond to the above speculation.
What will OpenAI ultimately launch? The answer will be revealed next week, and OpenAI has announced that it will live stream on its official website at 10am Pacific Time on May 13th (1am Beijing Time on May 14th), showcasing some updates to ChatGPT and GPT-4.
CandyLake.com 系信息发布平台,仅提供信息存储空间服务。
声明:该文观点仅代表作者本人,本文不代表CandyLake.com立场,且不构成建议,请谨慎对待。
您需要登录后才可以回帖 登录 | 立即注册

本版积分规则

  •   苹果知名分析师郭明錤周四(10月31日)在社交媒体上发文表示,苹果明年可能会减少对芯片制造商博通Wi-Fi芯片的依赖,并推出自己的处理器。   郭明錤在社交媒体平台X上写道,“在2025年下半年的新产品(例如iPh ...
    uturn
    昨天 14:42
    支持
    反对
    回复
    收藏
  •   10月30日,小鹏汽车生态企业小鹏汇天宣布,旗下分体式飞行汽车“陆地航母”即将亮相2024中国航展,11月12日将在中国航展第二展区(斗门莲洲)进行全球首次公开飞行,同时“陆地航母”也将在珠海国际航展中心8号 ...
    yxtianyouyou
    前天 11:43
    支持
    反对
    回复
    收藏
  •   交易所监管文件显示,当地时间11月1日,亚马逊创始人杰夫·贝索斯拟出售约1635万股亚马逊股票,预计套现约30.5亿美元。今年7月,贝索斯已申请额外出售约2500万股亚马逊股票,按当时股价计算可套现约50亿美元。 ...
    blueskybb
    13 小时前
    支持
    反对
    回复
    收藏
  •   近日,凯撒海湾目的地(山东)运营管理有限责任公司(简称“凯撒海湾”)与携程旅悦集团签署战略合作协议,双方将围绕“海上目的地运营”、“旅游产品与服务创新”、“研学旅行”、“日韩及海外旅游市场开拓”等 ...
    llyyy2008
    11 小时前
    支持
    反对
    回复
    收藏
男人的余味偷 新手上路
  • 粉丝

    0

  • 关注

    0

  • 主题

    2