We’re rapidly moving towards a world where any image of a person will be easily remixed to make a compelling new video

Still from a sample AI video generated by OpenAI's Sora app.

If you witnessed the first attempts at Generative AI video a year ago, including ‘Will Smith Eating Spaghetti’, I share your horror. If you haven’t, trust me, don’t Google it. As with all things AI, the technology has experienced a generational leap in fidelity in the year since those early demonstrations.

People who are part of the Hugging Face AI community will have had a heads-up about what was coming, but many members of the general public lost their minds when OpenAI took to X (formerly Twitter) in February to share SORA, its text-to-video generative AI model.

The set of videos, each created from a text prompt, showed a selection of scenes, actions, and art styles ranging from nine seconds to a minute in length. Of course, these demonstrations were cherry-picked, but they were simply astonishing.

Using the prompt: ‘A movie trailer featuring the adventures of a 30-year-old spaceman wearing a red wool knitted motorcycle helmet, blue sky, salt desert, cinematic style, shot on 35mm film, vivid colors’, SORA was able to generate a video (above) with multiple scenes, realistic cinematic lighting and camera movements, and human-like characters moving within each scene.

In another video (below), SORA created a night scene in Tokyo, with a lady in a red dress, leather jacket and sunglasses walking down the street as the camera tracks backwards, capturing her movements. The wet street reflects the neon signs and the shadows of passers-by. The main character’s movement is a little off, but it’s an impressive demonstration of something generated from text without any editing or corrections. It’s difficult to communicate exactly how compelling each of these videos is, so check them out via the links to the original X (Twitter) posts.

Since the worldwide reveal of SORA, other leaders and innovators in the generative AI space have begun sharing demonstrations of generative video that go further than OpenAI’s impressive model. EMO (Emote Portrait Alive), an audio-to-video diffusion model created by the Institute for Intelligent Computing at Alibaba Group, is one of the most mind-blowing models I’ve seen.

EMO can take a still image and animate it based on an audio source. This would be impressive enough if the model simply moved the subject’s mouth to portray a realistic lip sync, but EMO goes further. Through a two-stage process that trains the model to faithfully map motion to the face of the subject, EMO understands the context of the subject’s facial expression, maintaining its emotion and tone while animating the face to deliver the source audio.

We’re rapidly moving into a world where any image of a person, whether still or in motion, will be easily remixed to produce new, compelling content. The optimist in me wonders what opportunities we may have to revisit old content and animate it with audio-to-video and text-to-video AI models. There will, of course, be concerns around authenticity and deception, as content produced using these models and their offspring will soon become indistinguishable from reality.

Are you inspired or terrified? I’m a little bit of both.

Read more of Jon Devo's Scanning Ahead blogs