OpenAI Might Release a Digital Assistant That Recognizes Images, Audio Faster

OpenAI has been a major player in the AI scene, with several of its products showing impressive capabilities. On top of the AI services the startup already offers, rumors suggest the company is set to announce a multimodal AI digital assistant.

OpenAI's Digital Assistant

A few sources claim to have already seen the new AI tool in action, saying it can recognize objects, talk to people, and even detect sarcasm in a speaker's voice. Of course, these are all rumors for now.

The new model is said to interpret images and audio faster and more accurately than OpenAI's existing transcription and text-to-speech models. It can also reportedly outperform GPT-4 Turbo at answering certain types of questions.

Despite the improved capabilities, it can still get answers wrong, as The Verge noted. If the product is real, the company will likely introduce it at its event on Monday.

The rumored model pairs well with another expected feature: a built-in ability for ChatGPT to make phone calls. The potential feature was spotted by developer Ananay Arora, who also found evidence pointing to real-time audio and video communication with the chatbot.

While rumors of new AI models are circulating, OpenAI CEO Sam Altman has already denied that the company will unveil a model better than GPT-4. Still, the event will likely bring a big announcement from the company, so a new assistant remains a real possibility.

Either way, OpenAI has already released several AI models and tools that outperform those of its competitors. Its video generator Sora, for instance, impressed many people with its ability to generate realistic videos.

A Potential Partnership with Apple

Whatever OpenAI intends to unveil at the event, which will take place on Monday at 10 a.m. PT, it might help close a deal with Apple. The iPhone maker is already in talks with both Google and OpenAI over the AI features it intends to include in the iOS 18 update.

9To5Google reported that Apple is already closing in on a deal with OpenAI, but Google is still in the mix with its Gemini AI model. If the AI startup wins the deal, iPhones might soon have ChatGPT integrated into their system.

Apple has yet to reveal what the partnership will lead to specifically. Possible uses include AI enhancing the assistance features on the smartphone, or functioning as a voice assistant that handles simple tasks, from setting alarms to searching the web.

Whatever the functions turn out to be, partnering with a tech giant like Apple would be a huge step forward for OpenAI, especially since Apple is already behind in the AI race. OpenAI's model could be used for Apple's future AI capabilities as well, which would benefit both parties.

© 2024 iTech Post All rights reserved. Do not reproduce without permission.
