Ever wanted to hear a saxophone bark? Nvidia just made the ‘world’s most flexible sound machine’ that uses AI to blend music, voices and sounds


  • Nvidia has announced its new Fugatto generative AI audio tool
  • It can create and mix audio in all kinds of ways, but isn’t out yet
  • Fugatto promises to create unique sounds, audio mixes, speech, and more

Nvidia has announced a new generative AI audio tool called Fugatto, which it’s describing as the “world’s most flexible sound machine” – capable of producing all kinds of music, speech, and other audio, and even unique sounds that have never been heard before.

Fugatto, which is short for Foundational Generative Audio Transformer Opus 1, can work with text prompts and audio samples. You can simply describe what you want to hear, or get the AI model to modify or combine existing audio clips.

For example, you can have the sound of a train transform into a lush orchestral arrangement, or mix a banjo melody with the sounds of rainfall. You can hear the sound of a saxophone barking, or a flute meowing, just by typing in a prompt.

Fugatto can also isolate vocals from tracks, and change the vocal delivery style, as well as generate speech from scratch. Feed in an existing melody, and you can have it played on whatever instrument you like, in any kind of style.

The bad news – it’s not available yet

Video: Audio AI Fugatto Generates Sound from Text | NVIDIA Research

So how can you try out this impressive new AI technology? You can’t, for the time being: you’ll have to make do with the demo video and audio samples Nvidia has published. There’s no word yet on when Fugatto will be available for public testing.

Some of the samples published by Nvidia include the sound of a female voice barking, a factory machine screaming, a typewriter whispering, and a cello shouting with anger. Together they show the wide variety of audio effects that are possible.

Nvidia has also demonstrated how the AI engine is able to produce spoken word clips, which can then be delivered with a range of different emotions (from ****** to happy) and even with different accents applied.

“We wanted to create a model that understands and generates sound like humans do,” said one of the Fugatto team. “Fugatto is our first step toward a future where unsupervised multitask learning in audio synthesis and transformation emerges from data and model scale.”
