What to know about the launch of GPT-4o

Lauren Sforza

May 13, 2024 at 7:23 PM


OpenAI on Monday launched its latest artificial intelligence (AI) model, GPT-4o, which promises improvements in its text, vision and audio capabilities.

OpenAI unveiled the model during a live demonstration Monday, with Chief Technology Officer Mira Murati saying it is a “huge step forward with the ease of use” of the system. The launch came just one day before Google’s annual developer conference, scheduled for Tuesday.

Here’s what to know about the launch of GPT-4o.

Improved voice instruction

Users can now show GPT-4o multiple photos and chat with the model about the uploaded images, according to OpenAI.

This can help students work their way through math problems step by step. One of the demonstrations shown during the launch on Monday walked users through a simple math problem without giving away any answers.

A separate video posted by online instruction company Khan Academy demonstrates how the new model can help teach students in real time. In the video, a student shares his screen and works through a problem while the model guides him through it.

A faster model with improved capabilities

Murati said Monday that GPT-4o provides “GPT-4 level intelligence” that is faster and improves the system’s capabilities across text, vision and audio.

“This is really shifting the paradigm into the future of collaboration, where this interaction becomes much more natural and far, far easier,” she said.

OpenAI said its new model can “respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds.” It noted that this is about the same amount of time it takes for humans to respond in a conversation.

The new model launched Monday

GPT-4o is available starting Monday to all users of OpenAI’s ChatGPT AI chatbot, including those who are using the free version.

“GPT-4o’s text and image capabilities are starting to roll out today in ChatGPT. We are making GPT-4o available in the free tier, and to Plus users with up to 5x higher message limits,” OpenAI wrote in its update Monday.

The new voice mode will come out in the coming weeks for ChatGPT Plus users, OpenAI CEO Sam Altman wrote on the social platform X.

The model is ‘natively multimodal’

Altman also posted on X that the model is “natively multimodal,” which means that the model can generate content and understand commands through voice, text or images.

In a separate blog post, he said the new voice and video mode “is the best computer interface” he has ever used.

“It feels like AI from the movies; and it’s still a bit surprising to me that it’s real. Getting to human-level response times and expressiveness turns out to be a big change,” he wrote in Monday’s post.

For the latest news, weather, sports, and streaming video, head to The Hill.
