BlogLab

Sora: OpenAI will now let you create videos from verbal cues

CNN  — 

Artificial intelligence leader OpenAI introduced a new AI model called Sora which it claims can create “realistic” and “imaginative” 60-second videos from quick text prompts.

In a blog post on Wednesday, the company said Sora is capable of generating videos up to 60 seconds in length from text instructions, with the ability to serve up scenes with multiple characters, specific types of motion, and detailed background details.

“The model understands not only what the user has asked for in the prompt, but also how those things exist in the physical world,” the blog post said.

OpenAI said it intends to train the AI models so it can “help people solve problems that require real-world interaction.”

U.S. Securities and Exchange Commission (SEC) chairman Gary Gensler attends a meeting of the Financial Stability Oversight Council at the U.S. Department of Treasury on December 14, 2023 in Washington, DC. The group has published their 2023 annual report, which takes a look at the past year in climate, banking, cybersecurity, artificial intelligence, cryptocurrency and other issues. Drew Angerer/Getty Images

This is the latest effort from the company behind the viral chatbot ChatGPT, which continues to push the generative AI movement forward. Although “multi-modal models” are not new and text-to-video models already exist, what sets this apart is the length and accuracy that OpenAI claims Sora to have, according to Reece Hayden, a senior analyst at market research firm ABI Research.

Hayden said these types of AI models could have a big impact on digital entertainment markets with new personalized content being streamed across channels.

“One obvious use case is within TV; creating short scenes to support narratives,” Hayden said. “The model is still limited though, but it shows the direction of the market.”

At the same time, OpenAI said Sora is still a work in progress with clear “weaknesses,” particularly when it comes to spatial details of a prompt – mixing up left and right – and cause and effect. It gave the example of creating a video of someone taking a bite out of a cookie but it not having a bite mark right after.

For now, OpenAI’s messaging remains focused on safety. The company said it plans to work with a team of experts to test the latest model and look closely at various areas including misinformation, hateful content and bias. The company said it is also building tools to help detect misleading information.

BEVERLY HILLS, CALIFORNIA - JANUARY 07: Taylor Swift attends the 81st Annual Golden Globe Awards at The Beverly Hilton on January 07, 2024 in Beverly Hills, California. (Photo by Axelle/Bauer-Griffin/FilmMagic) Axelle/Bauer-Griffin/FilmMagic/Getty Images/FILE

Sora will first be made available to cybersecurity professors, called “red teamers,” who can assess the product for harms or risks. It is also granting access to a number of visual artists, designers and filmmakers to collect feedback on how creative professionals could use it.

The latest update comes as OpenAI continues to advance ChatGPT.

Earlier this week, the company said it is testing a feature in which users can control ChatGPT’s memory, allowing them to ask the platform to remember chats to make future conversations more personalized or tell it to forget what was previously discussed.

ncG1vNJzZmivp6x7pLrNZ5qopV9nfXOAjmlpaGllZMGmr8dopqmdnpa2bsDEsatmrJ9iw6qwxKhkrKeilnyqusOer2egpKK5

Fernande Dalal

Update: 2024-07-14