OpenAI introduces AI model that turns text into video

OpenAI introduces AI model that turns text into video
FILE PHOTO: OpenAI logo is seen in this illustration taken, on February 3, 2023. REUTERS/Dado Ruvic/Illustration/File Photo

Microsoft-backed OpenAI is working on a software that can generate minute-long videos based on text prompts, the company said Thursday.

The software, called Sora, is currently available for red teaming, which helps identify flaws in the AI system, as well as for use by visual artists, designers and filmmakers to gain feedback on the model, the company said in a statement.

“Sora is able to generate complex scenes with multiple characters, specific types of motion, and accurate details of the subject and background,” the statement said, adding that it can create multiple shots within a single video.

READ: AI agent that controls devices is OpenAI’s next project

Apart from generating videos from text prompts, Sora can animate a still image, the company said in a blogpost.

The video generation software follows OpenAI’s ChatGPT chatbot, which was released in late 2022 and created a buzz around GenAI with its ability to compose emails and write codes and poems.

Earlier this week, OpenAI was reportedly planning to turn ChatGPT into an AI agent.

Gizmodo said The Information first reported on OpenAI’s allegedly upcoming project. The news outlet said the AI firm will build agent software that could take over a device and complete tasks for people.

The AI assistant would work with multiple computer programs, letting it click, move the cursor, and type.

It is seen to transform how everyone uses their computers as it can craft slides, add images, and include captions.

Also, this AI agent will reportedly have web browsing capabilities that can perform research on behalf of an individual. Gizmodo said it is part of CEO Sam Altman’s goal of turning his chatbot into a “supermart personal assistant.” 

READ: OpenAI explains how to teach with AI

Social media giant Meta Platforms beefed up its image generation model Emu last year to add two AI-based features that can edit and generate videos from text prompts. The Facebook parent company is also looking to compete with Microsoft, Alphabet’s Google, and Amazon in the rapidly transforming generative AI universe.

Sora is a work-in-progress, with the company adding that the model may confuse the spatial details of a prompt, and have difficulty in following a specific camera trajectory.

OpenAI said it was also developing tools which can discern if a video was generated by Sora.

Read more...