Microsoft's new Phi-3 is one of the smallest AI models available — and it performs better than its larger rivals

Ryan Morrison

April 23, 2024 at 7:58 AM·3 min read

Microsoft Copilot app running on a phone with Microsoft logo in background.

Microsoft has revealed its impressive new Phi-3 artificial intelligence model. This is a tiny model in comparison to the likes of GPT-4, Gemini or Llama 3 but it packs a punch for its size.

Known as small language models, these lightweight AIs make it cheaper and easier to run certain tasks without having to use the heavy computing power of the bigger models.

Despite its tiny size Phi-3 mini has already performed as well as Llama 2 on some benchmarks with Microsoft saying it is as responsive as a model 10 times its size.

What isn't clear is whether this might form part of a future update to Copilot as Microsoft looks to bring more of its functionality on-device, or whether this will remain as a standalone project.

What is Microsoft Phi-3?

Microsoft Phi-3 was trained with a "curriculum", according to VP of Azure Eric Boyd. Speaking to The Verge, Boyd said they took a list of 3000 words and asked a larger language model to make children's books to teach Phi. It was then trained on these new books.

We introduce phi-3-mini ... whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5 despite being small enough to be deployed on a phone.

Phi-3 comes in three sizes; mini which is just 3.8 billion parameters, a 7 billion parameter small and the 14 billion parameter medium model. In comparison GPT-4 has more than a trillion parameters and the smallest Llama 3 model has 8 billion.

Phi-3 is built on top of the learning from Phi-1, which focused on coding, and Phi-2 which was taught to reason. This improved reasoning is how it can match GPT-3.5 in response quality despite training on a much smaller dataset.

This shift to smaller models performing as well as, or even outperforming big players is a growing trend. Meta's Llama 3 70B has almost reached GPT-4 levels in some benchmarks and smaller models seem to be finding specific niches.

Phi-3’s performance “rivals that of models such as Mixtral 8x7B and GPT-3.5,” the researchers from Microsoft explained in a paper on the new model. This happened “despite being small enough to be deployed on a phone.”

The innovation was entirely down to the dataset, according to the team. This was made up of heavily filtered web data and the synthetic data from the children’s books made by other AI.

Why do we need Phi-3?

Microsoft Phi-3 is designed to run on a wider range of devices and much faster than is possible with larger models. It sits in a family of models like StabilityAI Zephyr, Google Gemini Nano and Anthropic’s Claude 3 Haiku that could run on a laptop or phone — no internet required.

In future these models could be bundled with a smartphone, embedded on a chip that sits inside a smart speaker or even built into your fridge to give you advice on your dietary habits.

While cloud-based models like Google Gemini Ultra, Claude 3 Opus and GPT-4-Turbo will always outperform the smaller models across all areas, they have several drawbacks including cost, speed and that they require an internet connection to be usable.

These tiny models will allow you to chat to your virtual assistant even if you have no internet connection, have the AI summarize content without sending data offline or even work in an internet of things device without you knowing AI is at play.

Apple is said to be relying almost entirely on these local models to power the next generation of generative AI features in iOS 18 and Google has deployed Gemini Nano to more Android handset.

More from Tom's Guide

Engadget
Microsoft's lightweight Phi-3 Mini model can run on smartphones
Microsoft has unveiled its latest light AI model called the Phi-3 Mini designed to run smartphones and other devices.
TechCrunch
Google Gemini: Everything you need to know about the new generative AI platform
Google's trying to make waves with Gemini, its flagship suite of generative AI models, apps and services. To make it easier to keep up with the latest Gemini developments, we've put together this handy guide, which we'll keep updated as new Gemini models, features and news about Google’s plans for Gemini are released. Gemini is Google's long-promised, next-gen GenAI model family, developed by Google's AI research labs DeepMind and Google Research.
TechCrunch
Too many models
Other large language models like LLaMa or OLMo -- though they technically share a basic architecture -- don't actually fill the same role. There's some deliberate confusion about these two things, because the models' developers want to borrow a little of the fanfare associated with major AI platform releases, like your GPT-4V or Gemini Ultra.
TechCrunch
Meta releases Llama 3, claims it's among the best open models available
Meta has released the latest entry in its Llama series of open generative AI models: Llama 3. Or, more accurately, the company has debuted two models in its new Llama 3 family, with the rest to come at an unspecified future date. Meta describes the new models -- Llama 3 8B, which contains 8 billion parameters, and Llama 3 70B, which contains 70 billion parameters -- as a "major leap" compared to the previous-gen Llama models, Llama 2 8B and Llama 2 70B, performance-wise.
TechCrunch
Meta adds its AI chatbot, powered by Llama 3, to the search bar across its apps
Meta's making several big moves today to promote its AI services across its platform. The company has upgraded its AI chatbot with its newest large language model, Llama 3, and it is now running it in the search bar of its four major apps (Facebook, Messenger, Instagram and WhatsApp) across multiple countries. This confirms and extends a test that TechCrunch reported on last week, when we spotted that the company had started testing Meta AI on Instagram's search bar.
TechCrunch
OpenAI opens Tokyo hub, adds GPT-4 model optimized for Japanese
OpenAI is expanding to Japan, with the opening of a new Tokyo office and plans for a GPT-4 model optimized specifically for the Japanese language. Japan is the current G7 chair and President of the G7's Hiroshima AI Process, an initiative to promote AI safety, including stronger AI governance.
Yahoo Finance
The Oracle of Omaha warns about AI: What Buffett said at Berkshire
Warren Buffett told Berkshire Hathaway shareholders that he has mixed feelings about artificial intelligence — in a tone he's expressed before.
Yahoo Sports
Kyle Larson beats Chris Buescher at Kansas in closest finish in NASCAR history
Larson won by 0.001 seconds.
Yahoo Sports
Shohei Ohtani punctuates Dodgers sweep of Braves with 2 home runs to tie MLB lead
Ohtani tagged Braves ace Max Fried for a two-run shot in the first inning, then hit a solo shot in the eighth as the Dodgers prevailed in a battle of NL favorites.
Yahoo Sports
Twins' 12-game win streak ends with 9–2 loss to Red Sox
The Minnesota Twins' 12-game winning streak ended on Sunday with a 9–2 loss to the Boston Red Sox at Target Field.
TechCrunch
Women in AI: Catherine Breslin helps companies develop AI strategies
To give AI-focused women academics and others their well-deserved — and overdue — time in the spotlight, TechCrunch has been publishing a series of interviews focused on remarkable women who’ve contributed to the AI revolution. Catherine Breslin is the founder and director of Kingfisher Labs, where she helps companies develop AI strategies.
Yahoo Sports
The NBA Loser Lineup: Paolo Banchero, Magic on the rise in fantasy and reality despite playoff exit
Basketball analyst Dan Titus breaks down what the teams and stars who were booted from the NBA Playoffs must do to remain in good fantasy standing next season.
TechCrunch
Hyundai antes up $1B for AV startup Motional and Elon unplugs the Tesla Supercharger team
TechCrunch Mobility is moving to Thursdays! Sign up here for free — just click TechCrunch Mobility! EV startup Fisker laid off more employees to "preserve cash" as bankruptcy inches ever closer; ride-hailing company Ola cut about 180 jobs and ousted its chief executive, Hemant Bakshi, merely four months after appointing him to the post; and lidar company Luminar slashed its 700-person workforce by 20% as part of a restructuring to adopt an "asset light" business model.
TechCrunch
Jack Dorsey departs Bluesky board
Bluesky’s most prominent backer has left its board. On Saturday, Jack Dorsey posted on X about grants for open protocols from his philanthropic Start Small initiative. This prompted someone to ask Dorsey if he was still on the Bluesky board, and he responded with a terse “no.” Dorsey did not answer any of the follow-up posts asking him to explain his departure.
Autoblog
Lamborghini, Aston Martin, Mercedes-AMG, Porsche and Koenigsegg Lego sets coming this summer
Lego announces new sets including Lambo V12 Vision GT, Aston Martin AMR23, Mercedes-AMG G63 & SL63, Porsche GT4 e-Performance, and Koenigsegg Jesko.
Yahoo Sports
Russell Westbrook says report about him wanting to leave Clippers 'has likely been fabricated'
Russell Westbrook is trying to clear up reports about him wanting to leave the Clippers.
Yahoo Life Shopping
I shop sales for a living, and these are the Wayfair Way Day home deals I'm eyeing — save up to 80%
I'm eyeing a set of cooling queen-sized sheets for just $18, a Staub iron skillet that's 60% off and an outdoor patio set for $150, to name a few.
TechCrunch
Why NASA is betting on a 36-pixel camera
NASA's James Webb Space Telescope is making strides in astronomy with its 122-megapixel primarily infrared photos taken 1.5 million kilometers away from Earth. The space agency's newest sky-peeper takes a different approach, however, performing groundbreaking space science with 36 pixels. The X-ray Imaging and Spectroscopy Mission (XRISM), pronounced “crism,” is a collaboration between NASA and the Japan Aerospace Exploration Agency (JAXA).
Yahoo Life Shopping
I'm an interior designer, and these are my favorite deals from Wayfair's Way Day rug sale— save up to 80% off
Score deeply discounted floor coverings from some of my favorite brands — Kelly Clarkson, Magnolia Home by Joanna Gaines and Loloi.
Yahoo Life Shopping
Mother's Day gold: 'Festive' solar torch lights to make her garden glow — save 40%
Get eight of these flickering-flame beauties for $30 and 'make spring/summer a little more fun.'

News

Life

Entertainment

Finance

Sports

New on Yahoo

Microsoft's new Phi-3 is one of the smallest AI models available — and it performs better than its larger rivals

What is Microsoft Phi-3?

Why do we need Phi-3?

More from Tom's Guide

Recommended Stories

Microsoft's lightweight Phi-3 Mini model can run on smartphones

Google Gemini: Everything you need to know about the new generative AI platform

Too many models

Meta releases Llama 3, claims it's among the best open models available

Meta adds its AI chatbot, powered by Llama 3, to the search bar across its apps

OpenAI opens Tokyo hub, adds GPT-4 model optimized for Japanese

The Oracle of Omaha warns about AI: What Buffett said at Berkshire

Kyle Larson beats Chris Buescher at Kansas in closest finish in NASCAR history

Shohei Ohtani punctuates Dodgers sweep of Braves with 2 home runs to tie MLB lead

Twins' 12-game win streak ends with 9–2 loss to Red Sox

Women in AI: Catherine Breslin helps companies develop AI strategies

The NBA Loser Lineup: Paolo Banchero, Magic on the rise in fantasy and reality despite playoff exit

Hyundai antes up $1B for AV startup Motional and Elon unplugs the Tesla Supercharger team

Jack Dorsey departs Bluesky board

Lamborghini, Aston Martin, Mercedes-AMG, Porsche and Koenigsegg Lego sets coming this summer

Russell Westbrook says report about him wanting to leave Clippers 'has likely been fabricated'

I shop sales for a living, and these are the Wayfair Way Day home deals I'm eyeing — save up to 80%

Why NASA is betting on a 36-pixel camera

I'm an interior designer, and these are my favorite deals from Wayfair's Way Day rug sale— save up to 80% off

Mother's Day gold: 'Festive' solar torch lights to make her garden glow — save 40%