Are LLMs About To Hit A Wall? | Commentary

Each new generation of large language model (LLM) consumes a staggering amount of resources.

Meta, for instance, trained its new Llama 3 models with about 10 times more data and 100 times more compute than Llama 2. Amid a chip shortage, it used two 24,000-GPU clusters, with each chip running around the price of a luxury car. It employed so much data in its AI work that it considered buying the publishing house Simon & Schuster to find more.

Afterward, even its executives wondered aloud if the pace was sustainable.

“It is unclear whether we need to continue scaling or whether we need more innovation on post-training,” Ahmad Al-Dahle, Meta’s VP of GenAI, told me in an interview last week. “Is the infrastructure investment unsustainable over the long run? I don’t think we know.”

For Meta — and its counterparts running large language models — the question of whether throwing more data, compute, and energy at the problem will keep delivering gains looms large. Since LLMs entered the popular imagination, the best path to exponential improvement seemed to be combining these ingredients and letting the magic happen. But with the upper bound of all three potentially in sight, the industry will need newer techniques, more efficient training, and custom-built hardware to progress. Without advances in these areas, LLMs may indeed hit a wall.

The path of continued scale probably starts with better methods to train and run LLMs, some of which are already in motion. “We are starting to see new kinds of architectures that are going to change how these models scale in the future,” Swami Sivasubramanian, VP of AI and Data at Amazon Web Services, told me in an interview Thursday night. Sivasubramanian said researchers at Stanford and elsewhere are getting models to learn faster with the same amount of data, and to run inference 10 times more cheaply. “I’m actually very optimistic about the future when it comes to novel model architectures, which has the potential to disrupt the space,” he said.

Already, new methods of training these models seem to be paying off. “The smallest Llama 3 is basically as powerful as the biggest Llama 2,” Mark Zuckerberg said on the Dwarkesh Patel podcast last week.

To fuel these models — and get around the potential bottleneck of exhausting real-world data — synthetic data created by AI is playing a key role. Though not fully proven yet, this data has already made its way into model training. “Our coding abilities on Llama 3 is exceptionally high,” Meta’s Al-Dahle said. “Part of that was really being innovative and pushing on our ability to leverage models to generate synthetic data.”

Along with finding better models, LLM progress likely depends on building better chips that can train and run these models faster and more efficiently than traditional chips. While NVIDIA GPUs are exceptionally useful for large language models, they aren’t purpose-built for them. Now some chips designed specifically for generative AI are showing promise. Researchers like Andrew Ng have praised Groq, one buzzy name, whose chips run fast enough to take generative AI to the next level, especially as the field pushes toward agents.

Meanwhile, companies like Amazon, Intel, Google, and others are building “accelerators,” or custom chips that can run AI processes fast. At Amazon, Sivasubramanian said, the company’s purpose-built Trainium chips are “designed with the sole purpose of being able to train these large language models” and are already four times faster than the first generation.

Given the need and the opportunity ahead, it’s no wonder OpenAI CEO Sam Altman is reportedly raising a lot of money to build chips powerful enough to achieve his aims.

The one LLM constraint that’s been little discussed is energy, and it may be the most important. “There’s a capital question of — at what point does it stop being worth it to put the capital in? — but I actually think before we hit that, you’re going to run into energy constraints,” Zuckerberg told Patel. He floated the idea of building a 1-gigawatt data center to advance AI, something approximating the output of a meaningful nuclear power plant. But given regulatory approvals and the complexity of the build-out, it could take years to complete. “I think it will happen,” he said. “This is only a matter of time.”

Until we get to such massive energy allocation, it may be difficult to say how much room LLMs have left to improve. But it seems like sooner or later, we will find out. “I am not thinking about it myself,” Sivasubramanian said with a laugh, of a nuclear-level plant to run AI models, “but I can’t speak to my infra team.”
