News

New Google Images tease the future of AI-generated movies with the DeepMind tool that even creates soundtracks for videos

Share on facebook
Share on twitter
Share on linkedin
Share on pinterest
Share on telegram
Share on email
Share on reddit
Share on whatsapp
Share on telegram


The next generation of AI-powered videos is about to go public as Google announces a new tool that can automatically create unique soundtracks.

Several AI-generated video creators have impressed users for years, such as OpenAI’s Sora, Runway Gen-3 Alpha, and Luma AI’s Dream Machine.

3

Google announced new video-to-audio tool for its DeepMind AI generator on MondayCredit: AP
The V2A tool will produce music that works with character dialogue and other tonal elements to create the right auditory atmosphere

3

The V2A tool will produce music that works with character dialogue and other tonal elements to create the right auditory atmosphereCredit: Google
DeepMind's V2A can also generate an unlimited number of soundtrack ideas

3

DeepMind’s V2A can also generate an unlimited number of soundtrack ideasCredit: Google

But none of these magic makers have managed to generate a decent soundtrack to accompany the videos – until now.

Google announced the new video to audio tool to its DeepMind AI generator on Monday.

“Video generation models are advancing at an incredible pace, but many current systems can only generate silent results. One of the next main steps to bring the generated films to life is to create soundtracks for these silent videos,” Google wrote.

“Today, we’re sharing progress on our video to audio (V2A) technologywhich enables synchronized audiovisual generation.”

“V2A combines video pixels with natural images language text prompts to generate rich soundscapes for the on-screen action,” they explained.

The tool can be combined with video generation templates like Veo to create dramatic soundtracks that align perfectly with any scene.

The AI ​​will produce music that works with character dialogue and other tonal elements to create the right auditory atmosphere.

“It can also generate soundtracks for a variety of traditional footage, including archival material, silent films and more – opening up a wider range of creative opportunities,” said DeepMind.

Google shared impressive examples of the new technology in action, including clips of a Western-style soundtrack that accompanied a cowboy on horseback and a wild wolf howling at the moon.

COMPLETE CREATIVE CONTROL

Google’s new V2A tool will give creators the power to let AI generate a soundtrack based on the clip’s visual input and language instructions, or to create a soundtrack themselves.

‘Oh God, this shouldn’t exist,’ scream viewers as ‘insane’ AI-made video is revealed – can you see signs the man isn’t real?

Users can provide instructions and editing tips to the tool to guide its output in the desired direction.

One set of instructions read: “Audio request: cinematic, suspense, horror film, music, tension, ambiance, footsteps on concrete.”

The scene shows a man walking through a destroyed building before ending with a vision of the same man on a mysterious bridge.

The AI ​​creates a suitable soundtrack for the clip that matches the tone and pace of the narrative.

ENDLESS SOUNDTRACK OPTIONS

DeepMind’s V2A can also generate an unlimited number of soundtrack ideas.

An example prompt read: “Audio prompt: Spaceship hurtles through the vastness of space, stars passing by, high speed, science fiction.”

The video showed a spacecraft flying through the vast opening of space with the light of a star shining in the distance.

The first soundtrack generated by the V2A tool was an uplifting orchestral piece that matched the image and stimulus.

A second AI-produced soundtrack from the same prompt was darker and slower.

What is Google DeepMind?

Google’s DeepMind project was born in 2010.

“Google DeepMind brings together two of the world’s leading AI labs – Google Brain and DeepMind – into a single, focused team, led by our CEO Demis Hassabis,” according to Google.

“Over the past decade, both teams have been responsible for some of the greatest advances in AI research, many of which underpin the thriving AI industry we see today.”

The organization aims to bring to light the enormous potential of AI for everyone.

“We are a team of scientists, engineers, ethicists, and more, working to build the next generation of AI systems safely and responsibly,” they wrote.

“By solving some of the toughest scientific and engineering challenges of our time, we are working to create innovative technologies that can advance science, transform work, serve diverse communities – and improve the lives of billions of people.”

SOURCE: GOOGLE DEEPMIND

Using “Audio Prompt: Ethereal Cello Atmosphere” changed things even more.

This third soundtrack immediately established a sadder, more thoughtful tone.

JUST IMPROVING

Google said these updates were just the latest attempt to update its full suite of AI-generated content providers.

They hope to improve some issues in the next versions.

“Because the quality of the audio output depends on the quality of the video input, artifacts or distortions in the video, which are outside the model’s training distribution, can lead to a noticeable drop in audio quality,” Google said.

“We are also improving lip sync for videos that involve speech. V2A attempts to generate speech from the input transcripts and synchronize it with the characters’ lip movements.”

“But the paired video generation model may not be conditional on transcriptions. This creates a mismatch, often resulting in awkward lip syncing as the video model does not generate mouth movements that match the transcription,” they added.



This story originally appeared on The-sun.com read the full story

Support fearless, independent journalism

We are not owned by a billionaire or shareholders – our readers support us. Donate any amount over $2. BNC Global Media Group is a global news organization that delivers fearless investigative journalism to discerning readers like you! Help us to continue publishing daily.

Support us just once

We accept support of any size, at any time – you name it for $2 or more.

Related

More

1 2 3 6,013

Don't Miss

Final A-Rod vs. NBA West Cubana, a Study of Contrasting Team Sales

The Minnesota Timberwolves and Dallas Mavericks entered the NBA Western

A French priest accused of sexually assaulting children in the Canadian Arctic has died

TORONTO – Joannes Rivoire, a French priest who was accused