OpenAI introduced an update to its GPT-4 language model, called GPT-4o. This iteration promises increased processing speed and broader functionalities encompassing text, vision and audio processing. The update will be integrated into ChatGPT, OpenAI's flagship chatbot product.
OpenAI stated that GPT-4o will be rolled out incrementally, with text and image processing functionalities becoming available in ChatGPT first. Notably, all users will have access to the base functionalities of GPT-4o, while paid tiers will offer increased capacity limits, reaching up to five times that of the free tier.
Sam Altman, CEO, OpenAI expressed his astonishment at GPT-4o's capabilities, comparing it to "AI from the movies." In a blog post, he wrote, "The new voice (and video) mode is the most impressive computing interface I've ever encountered. It feels like movie-style AI; I'm still amazed that it's real. Achieving human-like response times and expressiveness is a significant change."
He further noted that OpenAI's original goal was to create direct benefits with AI, but it has evolved into a platform for user-driven innovation for the collective good. "We are a business and will find numerous ways to monetise, which will enable us to offer free, exceptional AI services to (hopefully) billions of people," Altman added.
Additionally, OpenAI has also revealed an update to ChatGPT's voice mode. The update leverages GPT-4o's capabilities, transforming the application into a real-time voice assistant capable of responding to user prompts and adapting to surrounding environments. This functionality is a significant improvement over the current voice mode, which is limited to single prompts and relies solely on audio input.
In addition, the company originally aimed to directly create benefits for society through its advanced AI models. However, the update suggests a strategic change, focusing on making these models accessible to developers through paid APIs.
This shift positions OpenAI as a provider of foundational technology, empowering third parties to leverage its capabilities and drive further innovation.
Further, this revelation comes amidst speculation regarding OpenAI's upcoming developments. Prior reports suggested the unveiling of an AI search engine, a GPT-4-powered voice assistant or even a completely new GPT-5 model. Interestingly, the timing coincides with Google I/O, where Google's AI division, Gemini, is expected to showcase its latest advancements. This strategic timing suggests a growing rivalry within the field of artificial intelligence.