Forum: TFSI

Ever wanted to hear a saxophone bark? Nvidia just made the 'world

From TechnologyDaily@1337:1/100 to All on Tue Nov 26 12:15:05 2024

Ever wanted to hear a saxophone bark? Nvidia just made the 'world's most flexible sound machine' that uses AI to blend music, voices and sounds

Date:
Tue, 26 Nov 2024 12:01:20 +0000

Description:
Meet Fugatto, which can produce all kinds of music and audio from your
prompts with the help of AI.

FULL STORY
======================================================================

- Nvidia has announced its new Fugatto generative AI audio tool
- It can create and mix audio in all kinds of ways, but isn't out yet
- Fugatto promises to create unique sounds, audio mixes, speech, and more

Nvidia has announced a new generative AI audio tool called Fugatto, which
it's describing as the "world's most flexible sound machine" capable of producing all kinds of music, speech, and other audio, and even unique sounds that have never been heard before.

Fugatto, which is short for Foundational Generative Audio Transformer Opus 1, can work with text prompts and audio samples. You can simply describe what
you want to hear, or get the AI model to modify or combine existing audio clips.

For example, you can have the sound of a train transform into a lush orchestral arrangement, or mix a banjo melody with the sounds of rainfall.
You can hear the sound of a saxophone barking, or a flute meowing, just by typing in a prompt.

Fugatto can also isolate vocals from tracks, and change the vocal delivery style, as well as generate speech from scratch. Feed in an existing melody, and you can have it played on whatever instrument you like, in any kind of style.

The bad news: it's not available yet

So how can you try out this impressive new AI technology? You can't, for the time being: you'll have to make do with Nvidia's promo video and a website of samples. There's no word yet on when Fugatto will be available for public testing.

Some of the samples published by Nvidia include the sound of a female voice barking, a factory machine screaming, a typewriter whispering, and a cello shouting with anger. You can see the wide variety of audio effects that are possible.

Nvidia has also demonstrated how the AI engine is able to produce spoken word clips, which can then be delivered with a range of different emotions (from angry to happy) and even with different accents applied.

"We wanted to create a model that understands and generates sound like humans do," says Nvidia's Rafael Valle, one of the Fugatto team. "Fugatto is our first step toward a future where unsupervised multitask learning in audio synthesis and transformation emerges from data and model scale."

======================================================================
Link to news story: https://www.techradar.com/computing/artificial-intelligence/ever-wanted-to-hear-a-saxophone-bark-nvidia-just-made-the-worlds-most-flexible-sound-machine-that-uses-ai-to-blend-music-voices-and-sounds

--- Mystic BBS v1.12 A47 (Linux/64)
* Origin: tqwNet Technology News (1337:1/100)
