Google flashes everyone — new Gemini Flash 1.5 takes on GPT-4o

Gemini 1.5 Flash.

Google has launched a new member of its Gemini family of artificial intelligence models. Sitting between the on-device Nano and the cloud-based Pro, Gemini 1.5 Flash is designed for chat, for complex tasks that require a fast response, and for handling images, video and speech.

Unveiled at the annual Google I/O developer event, Gemini 1.5 Flash is a natively multimodal model, similar to OpenAI’s recently unveiled GPT-4o, and was built for speed, making it useful for real-time conversations.

The new model is already available globally for developers to use in their own applications, so we could see a number of third-party live chat apps built on Gemini 1.5 Flash soon.

We also saw an upgrade to Gemini 1.5 Pro, the model first released earlier this year, along with news that it will now power the Gemini Advanced premium chatbot.

What makes Gemini 1.5 Flash different?

Gemini 1.5 Flash

Gemini 1.5 Flash sits just above Nano and just below Pro in the size hierarchy, and what sets it apart, not just from its siblings but from other AI models, is its combination of speed and agility.

In addition to being fast and impressive in its ability to understand text, images, video and speech, 1.5 Flash is cheap, at least compared to Pro, which costs roughly 20 times more to use.

“We know from user feedback that some applications need lower latency and a lower cost to serve,” said Google DeepMind CEO Demis Hassabis. “This inspired us to keep innovating,” he added, unveiling Flash as a “model that’s lighter-weight than 1.5 Pro, and designed to be fast and efficient to serve at scale.”

A good comparison, at least in terms of speed, is with OpenAI’s recently announced GPT-4o model, which is also very fast, natively multimodal and designed for real-time interaction. That said, Gemini 1.5 Flash appears to be a less capable model in terms of reasoning.

What about the massive context window?

Gemini 1.5 Flash tokens

Like other Gemini family models, 1.5 Flash comes with a massive one-million-token context window and the promise of actually being able to utilize it in full. In comparison, GPT-4o has a 128,000-token context window and Claude 3 tops out at 200,000 tokens.
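To put those numbers in perspective, here is a rough back-of-envelope comparison. It assumes the common heuristic of about 0.75 English words per token; the exact ratio varies by tokenizer and by the kind of content being encoded, so treat the word counts as estimates only.

```python
# Rough comparison of the context window sizes mentioned above.
# WORDS_PER_TOKEN is a rule-of-thumb assumption, not an exact figure.
WORDS_PER_TOKEN = 0.75

windows = {
    "Gemini 1.5 Flash": 1_000_000,
    "GPT-4o": 128_000,
    "Claude 3": 200_000,
}

for model, tokens in windows.items():
    approx_words = int(tokens * WORDS_PER_TOKEN)
    print(f"{model}: {tokens:,} tokens (roughly {approx_words:,} words)")
```

By this rough measure, Gemini 1.5 Flash can hold several novels’ worth of text in a single conversation, where GPT-4o would fit closer to one.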

What makes a large context window so important is the ability to hold a massive amount of information in memory within a single conversation. This is vital for analyzing non-text content: an image is worth 1,000 words, and a video even more.

It was also trained by its big brother, Gemini 1.5 Pro. Hassabis said this was done “through a process called ‘distillation,’ where the most essential knowledge and skills from a larger model are transferred to a smaller, more efficient model.”

“1.5 Flash excels at summarization, chat applications, image and video captioning, data extraction from long documents and tables, and more,” as a result of this process, he said.

As these models, including faster but smaller ones like Flash, gain the ability to understand more than just text, that increased context window becomes even more important.
