Google's AI can ingest multiple books. What will it do with all that data?

Alistair Barr

March 25, 2024 at 5:00 AM·4 min read

Google Gemini 1.5 Pro has a context window of 1 million tokens.
Users will be able to submit giant prompts that are equivalent to multiple books.
What will Google do with all the data that users share with the company?

Two decades ago, Google cofounder Larry Page had a dream to digitally scan millions of books. It turned into a long and bitter legal battle that the company eventually won.

Today, the emergence of huge AI models is turning this book-scanning debate on its head.

Google will soon release a powerful new model called Gemini 1.5 Pro that has a context window of 1 million tokens. That's about 750,000 words, or the equivalent of 3 to 7 books depending on the length. It can also suck in 1 hour of video, 11 hours of audio, and more than 30,000 lines of code via user prompts.

"Entirely new capabilities"

Until recently, AI models could only handle a few thousand tokens. This meant that users were limited in their interactions with these systems. It was a bit like having a conversation with a forgetful friend who would have to restart the chat from scratch every so often.

Gemini 1.5 Pro is being previewed to a few lucky early testers. When it rolls out fully, users will be able to dump in whole book series, codebases, entire legal case histories, or really anything they want. This Google model can ingest all this information quickly and then answer questions about the data.

"Longer context windows show us the promise of what is possible," Google CEO Sundar Pichai said when unveiling Gemini 1.5 in February. "They will enable entirely new capabilities."

A giant digital vacuum

What is Google going to do with the data people share through Gemini 1.5?

After trying so hard for so many years to scan millions of books itself, Google will now have users willingly dumping whole volumes into its AI model, along with mountains of other text, code, images, and video.

It's highly likely that this information will be used as training data to help Google build other models. The emergence of generative AI has sparked a global race for high-quality data, so a huge context window can work as a giant digital vacuum.

Google says data shared with Gemini "helps improve and develop Google products, services, and machine-learning technologies."

Machine learning is a type of AI. So it's safe to interpret this comment as a yes: Google will use this data to train future AI models.

Developers versus corporate customers

The internet giant treats information shared with its AI models and services differently, depending on the offering.

Google AI Studio is a new developer tool for Gemini. For this service, the company says content submitted "may be used to improve our services, including our machine-learning technologies."

Vertex AI is an enterprise platform for larger corporate customers. Google told BI that in this case the company "doesn't use customer data to train Google models without that customer's permission."

Gemini 1.5 Pro, the fanciest Google AI model with the largest context window, is not fully available yet, so the terms of service aren't out. A Google spokesperson declined to comment on which data-use approach will apply to this top model. "We'll prioritize transparency, choice, and control," they added.

A brave new AI world

Either way, this is a brave new AI world of information sharing. It's probably why some big companies have sent around warnings again recently prohibiting employees from sharing sensitive data with AI models.

Google also warns users about sharing certain data with its models.

"Do not submit sensitive, confidential, or personal information to the Services," the company says in bold writing in one of its current Gemini terms of service.

Prompt data controls

Here are some other important tips for controlling how Google uses any prompts you submit to its AI models. These are from a company spokesman and Google terms of service.

You can turn Gemini Apps Activity off through this dashboard. This prevents your future conversations from being used to improve Google's generative AI models.
If this setting is off, your conversations will still be saved for up to 72 hours to help Google provide the Gemini AI service and process any feedback you want to share with the company.
In those 72 hours, unless you give feedback, your conversations also won't be used to improve Google products, including its AI models.
If you're 18 or older, Google stores your Gemini Apps activity in your Google Account for up to 18 months by default. You can cut that down to 3 or 36 months in your Gemini Apps Activity settings.
You can also review or delete your activity in that same dashboard at any time.

Read the original article on Business Insider

TechCrunch
Google's Gemini Pro 1.5 enters public preview on Vertex AI
Gemini 1.5 Pro, Google's most capable generative AI model, is now available in public preview on Vertex AI, Google's enterprise-focused AI development platform. Undoubtedly its headlining feature is the amount of context that it can process: between 128,000 tokens to up to 1 million tokens, where "tokens" refers to subdivided bits of raw data (like the syllables "fan," "tas" and "tic" in the word "fantastic").
18d ago
TechCrunch
Google goes all in on generative AI at Google Cloud Next
This week in Las Vegas, 30,000 folks came together to hear the latest and greatest from Google Cloud. What they heard was all generative AI, all the time. Google Cloud is first and foremost a cloud infrastructure and platform vendor.
14d ago
TechCrunch
Google's Gemini comes to databases
Google wants Gemini, its family of generative AI models, to power your app's databases — in a sense. At its annual Cloud Next conference in Las Vegas, Google announced the public preview of Gemini in Databases, a collection of features underpinned by Gemini to — as the company pitched it — "simplify all aspects of the database journey." In less jargony language, Gemini in Databases is a bundle of AI-powered, developer-focused tools for Google Cloud customers who are creating, monitoring and migrating app databases.
18d ago
TechCrunch
Google Cloud Next 2024: Everything announced so far
Google’s Cloud Next 2024 event takes place in Las Vegas through Thursday, and that means lots of new cloud-focused news on everything from Gemini, Google’s AI-powered chatbot, to AI to devops and security. Last year's event was the first in-person Cloud Next since 2019, and Google took to the stage to show off its ongoing dedication to AI with its Duet AI for Gmail and many other debuts, including expansion of generative AI to its security product line and other enterprise-focused updates and debuts. Don’t have time to watch the full archive of Google's keynote event?
17d ago
TechCrunch
Google rolls out Gemini in Android Studio for coding assistance
Google continues rolling out Gemini to different products as today the company announced that Android Studio's bot is getting upgraded with Gemini Pro. In May 2023, during the Google I/O developer event, the company introduced Studio Bot powered by the PaLM-2 foundation model. The company is rolling out Gemini in Android Studio in more than 180 countries for the Android Studio Jellyfish version.
19d ago
Engadget
Google's new AI video generator is more HR than Hollywood
Google Vids is not a replacement for AI-powered video generation tools like OpenAI's Sora. Instead, it uses stock footage and personal documents and media to spit out videos that office workers would be proud of.
18d ago
TechCrunch
Google releases Imagen 2, a video clip generator
Google doesn't have the best track record when it comes to image-generating AI. In February, the image generator built into Gemini, Google's AI-powered chatbot, was found to be randomly injecting gender and racial diversity into prompts about people, resulting in images of racially diverse Nazis, among other offensive inaccuracies. Google pulled the generator, vowing to improve it and eventually re-release it.
18d ago
Engadget
Google reverses course and brings its Gemini AI to the regular Pixel 8
Google is releasing the Nano version of its Gemini AI for the Pixel 8 smartphone. This follows a roll out last year to the Pixel 8 Pro.
a month ago
Engadget
Apple has reportedly resumed talks with OpenAI to build a chatbot for the iPhone
Apple has resumed talks with OpenAI, the maker of ChatGPT, to build an AI-powered chatbot into the iPhone, according to a new report.
15h ago
Yahoo Finance
Tesla and Microsoft presented 2 distinct versions of AI. Investors liked both.
Earnings presentations from four of the biggest tech companies in the world showed two very different visions of AI: the moonshot and the incremental gains.
3h ago
Yahoo Sports
Ezekiel Elliott, Dallas Cowboys moving closer to a reunion
Elliott played last season with the Patriots after seven years in Dallas.
2h ago
Yahoo Life Shopping
'Loose fitting without being frumpy' These comfy capri pants are down to $22
Over 5,000 shoppers give these comfy pants a perfect five-star rating — you can get them over 60% off.
2h ago
Engadget
I played Fire Emblem Engage on easy mode, and it got me back into gaming
"Sure, winning battles and matches in more difficult modes will feel more rewarding, but not every gaming experience has to be a challenge."
3h ago
Yahoo Life Shopping
The spring top shoppers say can 'help hide any figure flaws' is on sale for just $27 — that's nearly 50% off
Drapey and flattering, more than 3,000 reviewers rave about this blouse.
3h ago
Yahoo Entertainment
I went on the 'Sex and the City' tour. For the show's fans, Carrie Bradshaw's New York City is still alive.
New fans and critics of "Sex and the City" are born every day, so I spent some time with longtime viewers to learn why the show still resonates decades later.
5h ago
Yahoo Sports
2024 NFL Draft: Fantasy football winners and losers after Rounds 2 and 3
Scott Pianowski examines the potential fantasy impact of intriguing receivers and running backs taken on Day 2 of the NFL Draft.
10h ago
Yahoo TV
'The Simpsons' has been on the air for 34 years. Why a character's shocking death is rare for the series.
He was only the ninth recurring character to ever be killed off in the show's 34-year history.
14h ago
TechCrunch
It's a sunny day for Google Cloud
Google Cloud, Google's cloud computing division, had a blockbuster fiscal quarter, blowing past analysts' expectations and sending Google parent company Alphabet's stock soaring 13%+ in after-hours trading. Google Cloud revenue jumped 28% to $9.57 billion in Q1 2024, bolstered by the demand for generative AI tools that rely on cloud infrastructure, services and apps. Google Cloud's operating income grew nearly 5x to $900 million, up from $191 million.
2d ago
Yahoo Sports
NBA Playoffs: Kawhi Leonard looks far from 100% in Clippers' chippy loss to Mavericks
It was ugly all over for the Clippers in Game 3.
12h ago
Yahoo Tech
'Nothing short of exceptional': Apple's 'killer' AirPods Pro are down to $200
But are they worth the spend? No question, fans say: 'These aren't just a little bit better than the competition; they are a lot better.'
18h ago

News

Life

Entertainment

Finance

Sports

New on Yahoo

Google's AI can ingest multiple books. What will it do with all that data?

"Entirely new capabilities"

A giant digital vacuum

Developers versus corporate customers

A brave new AI world

Prompt data controls

Recommended Stories

Google's Gemini Pro 1.5 enters public preview on Vertex AI

Google goes all in on generative AI at Google Cloud Next

Google's Gemini comes to databases

Google Cloud Next 2024: Everything announced so far

Google rolls out Gemini in Android Studio for coding assistance

Google's new AI video generator is more HR than Hollywood

Google releases Imagen 2, a video clip generator

Google reverses course and brings its Gemini AI to the regular Pixel 8

Apple has reportedly resumed talks with OpenAI to build a chatbot for the iPhone

Tesla and Microsoft presented 2 distinct versions of AI. Investors liked both.

Ezekiel Elliott, Dallas Cowboys moving closer to a reunion

'Loose fitting without being frumpy' These comfy capri pants are down to $22

I played Fire Emblem Engage on easy mode, and it got me back into gaming

The spring top shoppers say can 'help hide any figure flaws' is on sale for just $27 — that's nearly 50% off

I went on the 'Sex and the City' tour. For the show's fans, Carrie Bradshaw's New York City is still alive.

2024 NFL Draft: Fantasy football winners and losers after Rounds 2 and 3

'The Simpsons' has been on the air for 34 years. Why a character's shocking death is rare for the series.

It's a sunny day for Google Cloud

NBA Playoffs: Kawhi Leonard looks far from 100% in Clippers' chippy loss to Mavericks

'Nothing short of exceptional': Apple's 'killer' AirPods Pro are down to $200