OpenAI Launches New Tool Sora, Can Turn Text Into Videos

LetterGetter

Editor

posted on 1 year ago — updated on 1 second ago

216
views

OpenAI’s Sora is a new artificial intelligence tool that uses text prompts to create realistic 60-second videos.

OpenAI, the company behind ChatGPT, has unveiled a new type of artificial intelligence (AI) that generates realistic video based on text prompts, sparking stunned reactions online. Sora, the text-to-video model, has “a deep understanding of language” and can generate “compelling characters that express vibrant emotions,” according to a blog post published by OpenAI on February 15.

The organisation has shared a slew of posts explaining how Sora works. You could, for instance, use this prompt to convincingly depict what Tokyo is all about. And we must admit, it seems pretty convincing.

“Beautiful, snowy Tokyo city is bustling. The camera moves through the bustling city street, following several people enjoying the beautiful snowy weather and shopping at nearby stalls. Gorgeous sakura petals are flying through the wind along with snowflakes,” one of the posts read.

Watch the video (prompt) here:

Introducing Sora, our text-to-video model.Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions. https://t.co/7j2JN27M3W
Prompt: “Beautiful, snowy… pic.twitter.com/ruTEWn87vf
— OpenAI (@OpenAI) February 15, 2024

Since its release, the video has received 45.3 million views, and the numbers are only increasing. The clip has also received numerous comments from users.

According to the Microsoft-backed startup, Sora, which means “sky” in Japanese, is “able to generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background.” It is a significant step forward in the field of AI, following the release of tools such as ChatGPT and DALL-E.

Watch other videos of the new AI tool here:

Prompt: “Animated scene features a close-up of a short fluffy monster kneeling beside a melting red candle. the art style is 3d and realistic, with a focus on lighting and texture. the mood of the painting is one of wonder and curiosity, as the monster gazes at the flame with… pic.twitter.com/aLMgJPI0y6— OpenAI (@OpenAI) February 15, 2024

Prompt: “A stylish woman walks down a Tokyo street filled with warm glowing neon and animated city signage. she wears a black leather jacket, a long red dress, and black boots, and carries a black purse. she wears sunglasses and red lipstick. she walks confidently and casually.… pic.twitter.com/cjIdgYFaWq— OpenAI (@OpenAI) February 15, 2024

Prompt: “A gorgeously rendered papercraft world of a coral reef, rife with colorful fish and sea creatures.” pic.twitter.com/gzEE8SwP81— OpenAI (@OpenAI) February 15, 2024

Meanwhile, Sam Altman, the CEO of OpenAI, asked users to recommend prompts for Sora on X. He then shared the results, which included lifelike clips of two golden retrievers podcasting atop a mountain, a grandmother preparing gnocchi and marine life competing in an ocean-top bicycle race.

Prompt: “A movie trailer featuring the adventures of the 30 year old space man wearing a red wool knitted motorcycle helmet, blue sky, salt desert, cinematic style, shot on 35mm film, vivid colors.” pic.twitter.com/0JzpwPUGPB— OpenAI (@OpenAI) February 15, 2024

Prompt: “Several giant wooly mammoths approach treading through a snowy meadow, their long wooly fur lightly blows in the wind as they walk, snow covered trees and dramatic snow capped mountains in the distance, mid afternoon light with wispy clouds and a sun high in the distance… pic.twitter.com/Um5CWI18nS— OpenAI (@OpenAI) February 15, 2024

Sora’s popularity skyrocketed after CEO Altman invited social media users to send out written prompts. One user commented, “This one’s actually the most impressive video so far from a semantics and fidelity standpoint.” “Such a powerful tool and it has already spread the magic all over the world,” wrote another. “Wow, impressive,” shared someone. “This is insane,” noted a person.

About Sora

OpenAI describes Sora as a text-to-video model that produces one-minute videos while “maintaining the visual quality and adherence to the user’s prompt.”

According to OpenAI, Sora is capable of creating complex scenes with multiple characters, specific types of motion, and precise subject and background details. As per the company, the model understands not only what the user prompts, but also how these things will manifest in the real world. According to reports, OpenAI is also collaborating with visual artists, designers, and filmmakers to gather model feedback.