OpenAI is rumored to be dropping GPT-5 soon — here's what we know about the next-gen model

OpenAI logo on a phone screen.

Social media is buzzing with rumors of a big OpenAI announcement. This has been sparked by the success of Meta’s Llama 3 (with a bigger model coming in July) as well as a cryptic series of images shared by the AI lab showing the number 22.

As April 22 is OpenAI CEO Sam Altman’s birthday — he’s 39 — the rumor mill is postulating that the company will drop something big such as Sora or even the much anticipated GPT-5.

If it is the latter and we get a major new AI model, it will be a significant moment in artificial intelligence, as Altman has previously declared it will be "significantly better" than its predecessor and will take people by surprise.

I personally think it will more likely be something like GPT-4.5, or even a new update to DALL-E, OpenAI's image generation model — but here is everything we know about GPT-5 just in case.

What do we know about GPT-5?

We know very little about GPT-5, as OpenAI has remained largely tight-lipped on the performance and functionality of its next-generation model. We know it will be "materially better," as Altman made that declaration more than once during interviews.

It is very likely going to be multimodal, meaning it can take input from more than just text, but to what extent is unclear.

Each new large language model from OpenAI is a significant improvement on the previous generation across reasoning, coding, knowledge and conversation. GPT-5 will be no different.

It has been in training since late last year and will either have significantly more than the roughly 1.5 trillion parameters GPT-4 is rumored to have, or a similar number with a stronger underlying architecture, allowing for a major performance improvement without increasing the overall model size.

This is something we’ve seen from others such as Meta with Llama 3 70B, a model much smaller than the likes of GPT-3.5 but performing at a similar level in benchmarks.

As for what that multimodality could look like: Google's Gemini 1.5 models can understand text, image, video, speech, code, spatial information and even music. GPT-5 is likely to have similar capabilities.

What will GPT-5 be able to do?

Sam Altman is leaving OpenAI for Microsoft

One of the biggest changes we might see with GPT-5 over previous versions is a shift in focus from chatbot to agent. This would allow the AI model to assign tasks to sub-models or connect to different services and perform real-world actions on its own.

This is an area the whole industry is exploring, and it is part of the magic behind the Rabbit r1 AI device. Rather than just asking the AI a question, you could ask it to handle calls, book flights or create a spreadsheet from data it gathered elsewhere.

One potential use for agents is in managing everyday life tasks. You could give ChatGPT with GPT-5 your dietary requirements, access to your smart fridge camera and your grocery store account and it could automatically order refills without you having to be involved.
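The chatbot-to-agent shift described above boils down to routing a user's intent to a task handler instead of only generating a text reply. Here is a minimal sketch of that pattern; the tool names and registry are hypothetical illustrations, not OpenAI's actual implementation.

```python
# A minimal sketch of the chatbot-to-agent pattern.
# The tool functions and registry below are hypothetical,
# not how GPT-5 or any real agent framework actually works.

def book_flight(destination: str) -> str:
    """Hypothetical sub-task handler."""
    return f"Flight to {destination} booked."

def order_groceries(items: list[str]) -> str:
    """Hypothetical sub-task handler."""
    return f"Ordered: {', '.join(items)}."

# An agent maps a recognized intent to a registered tool and runs it,
# performing a real-world action rather than just replying with text.
TOOLS = {
    "book_flight": book_flight,
    "order_groceries": order_groceries,
}

def run_agent(intent: str, **kwargs) -> str:
    tool = TOOLS.get(intent)
    if tool is None:
        return "No tool available; fall back to a chat reply."
    return tool(**kwargs)

print(run_agent("book_flight", destination="Berlin"))
```

In a real system the intent and arguments would be extracted by the language model itself; the sketch hard-codes them to show only the dispatch step.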

I think this is unlikely to happen this year, but agents are certainly the direction of travel for the AI industry, especially as more smart devices and systems become connected.

How different will GPT-5 be?

Image of smartphone with OpenAI ChatGPT loaded ready to use

One thing we might see with GPT-5, particularly in ChatGPT, is OpenAI following Google's lead with Gemini and giving it internet access by default. This would remove the data cutoff problem, where the model's knowledge is only as current as the date its training ended.

Expanded multimodality will also likely mean that interacting with GPT-5 by voice or video becomes the default rather than an extra option. That would make it easier for OpenAI to turn ChatGPT into a smart assistant like Siri or Google Gemini.

Finally, I think the context window will be much larger than is currently the case. It is currently about 128,000 tokens, which is how much of the conversation the model can store in its memory before it forgets what you said at the start of a chat.

We're already seeing some models, such as Gemini 1.5 Pro, with million-plus-token context windows, and these larger windows are essential for video analysis because video carries far more data points than plain text or a still image.
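The "forgetting" caused by a finite context window can be sketched in a few lines. This is a rough illustration only: token counts are approximated as whitespace-split words, whereas real models use subword tokenizers, so the actual numbers differ.

```python
# A rough sketch of why a context window limits chat memory.
# Tokens are approximated here as whitespace-split words; real
# models use subword tokenizers, so real counts would differ.

def trim_to_context(messages: list[str], max_tokens: int) -> list[str]:
    """Keep only the most recent messages that fit the token budget,
    dropping the oldest first: the 'forgetting' users notice."""
    kept, used = [], 0
    for msg in reversed(messages):
        cost = len(msg.split())
        if used + cost > max_tokens:
            break
        kept.append(msg)
        used += cost
    return list(reversed(kept))

chat = ["my name is Ada", "what is the weather", "tell me a joke"]
# With a tiny 8-token budget, the earliest message falls out of context.
print(trim_to_context(chat, 8))
```

A bigger `max_tokens` (128,000 today, a million-plus in Gemini 1.5 Pro) simply means far more of the conversation survives before anything is dropped.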

Bring out the robots

OpenAI Figure 01 robot handling an apple

One of the biggest trends in generative AI this past year has been providing a brain for humanoid robots, allowing them to perform tasks on their own without a developer having to program every action and command in advance.

OpenAI has invested heavily in robotics startup Figure, whose Figure 01 robot is powered by GPT-4. GPT-5 will likely have some spatial awareness data as part of its training to make this even more reliable and capable, understanding how humans interact with the world.

Nvidia is also working on AI models in this space that will be widely available, and Professor Amnon Shashua, co-founder of AI startup AI21, has launched Mentee Robotics to create GenAI-powered robots that could find their way into homes and workplaces as early as next year.

Google is also building generative AI-powered robots that could use future versions of the Gemini models, especially those with massive context windows. Meta, meanwhile, is training Llama to understand spatial information for more competent AI-based AR devices like its smart glasses.

What this all means

Essentially we’re starting to get to a point — as Meta’s chief AI scientist Yann LeCun predicts — where our entire digital lives go through an AI filter. Agents and multimodality in GPT-5 mean these AI models can perform tasks on our behalf, and robots put AI in the real world.

OpenAI is facing increasing competition from open-source models from companies like Mistral and Meta, as well as direct competitors like Anthropic with Claude and Google with Gemini. Then there is Microsoft shifting away from its reliance on OpenAI — although I still think OpenAI will feature at Build 2024 in May.

Before we see GPT-5, I think OpenAI will release an intermediate version such as GPT-4.5, with more up-to-date training data, a larger context window and improved performance. GPT-3.5 was a significant step up from the base GPT-3 model and kickstarted ChatGPT.

Altman says OpenAI has a number of exciting models and products to release this year, including Sora, possibly the AI voice product Voice Engine, and some form of next-gen AI language model.
