
What is OpenAI's mysterious new product ahead of Google I/O? Latest speculation: a multimodal AI assistant

男人的余味偷

OpenAI has scheduled a live stream for the day before the Google I/O conference to unveil new products and demonstrate magical updates to ChatGPT and GPT-4.
What is this mysterious new product? OpenAI CEO Sam Altman has personally dismissed speculation that it is GPT-5 or a search engine.
According to the latest reports, the product OpenAI is about to release may be an AI assistant built into phones.
Technology outlet The Information, citing insiders, reported that OpenAI plans to launch a multimodal AI model with visual and auditory capabilities that can converse with users, recognize objects, and reason better than current chatbots. OpenAI has already demonstrated the model to some customers.
OpenAI has previously developed models that transcribe audio and convert text to speech. According to the report, the new model is roughly a combination of these, but more accurate and faster to respond. It could help AI assistants discern mood and better understand meaning, and in theory it could help students learn mathematics or translate real-world gestures.
However, although the new model can outperform GPT-4 Turbo on certain types of questions, it still hallucinates.
According to developer Anany Arora, OpenAI may launch a service that builds ChatGPT into mobile phones for making calls. Arora posted screenshots of the call-related code on social media and also found evidence that OpenAI has set up servers for real-time audio and video communication.
Letting AI place phone calls can save users waiting time, and such a service can be seen as one of the functions of an AI assistant.
An AI assistant is also something Google has been developing. Google's Pixel 9 series phones will reportedly ship with a new, exclusive built-in AI assistant called "Pixie," which can look at objects through the device's camera and then do things such as suggest where to buy them or explain how to use them.
Altman previously revealed in an interview with Salesforce CEO Marc Benioff that his favorite AI movie is "Her," a story about a man who falls in love with his AI virtual assistant. He said the idea of a conversational language interface was incredibly prescient.
The Information reported that Altman ultimately hopes to develop a fast-responding virtual assistant like the one in the movie, and to use this technology to power existing voice assistants such as Apple's Siri.
Notably, according to insiders, Apple is close to an agreement with OpenAI to bring the latter's technology to its next-generation operating system. The two sides have been finalizing terms for using ChatGPT features in iOS 18, the next iPhone operating system.
The new model runs in the cloud and is expected to reach the free version of ChatGPT in the future
OpenAI believes that AI assistants with visual and auditory capabilities could bring about change on the scale of the smartphone. Such an assistant could observe a user's surroundings and offer suggestions, with potential use cases including acting as a tutor, translating traffic signs, and helping repair cars.
But such technology currently demands too much of the hardware to run on personal devices. Media analysis notes that the new model runs in the cloud and needs an internet connection to work. It may take months or even years before sophisticated conversational AI with visual and auditory capabilities is small enough to run on personal devices such as smartphones.
It is currently unclear when OpenAI will offer the new features to paying customers. According to people who have tried the voice assistant, however, OpenAI's ultimate plan is to include the features in the free version of ChatGPT, and it aims to make the model cheaper to run than GPT-4 Turbo, its most advanced model.
OpenAI did not respond to the above speculation.
What will OpenAI ultimately launch? The answer will be revealed next week. OpenAI has announced a live stream on its official website at 10 a.m. Pacific Time on May 13 (1 a.m. Beijing Time on May 14) to showcase updates to ChatGPT and GPT-4.