Nvidia has unveiled Fugatto, an innovative AI-powered music and sound editor that pushes the boundaries of audio creation. Described as a "creative breakthrough," Fugatto uses text and audio prompts to generate sounds, music, and speech that it has not been explicitly trained on. This unique capability allows for the creation of unexpected and fascinating audio experiences, such as a trumpet that meows or a saxophone that howls and barks.
The tool enables users to craft audio compositions from even the wildest prompts. For instance, one of the example tracks generated by Fugatto is titled, "Create a saxophone howling, barking, then electronic music with dogs barking." The AI can also create intricate soundscapes, such as “deep, rumbling bass pulses paired with high-pitched digital chirps, like the sound of a massive sentient machine waking up.” This flexibility opens up entirely new possibilities for sound design, offering an expansive range of creative options.
Fugatto’s functionality goes beyond music creation, offering features that include:
To develop Fugatto, Nvidia’s team compiled a dataset of millions of audio samples. The model was built using advanced instruction-based techniques that allow it to expand its range of capabilities, learning new tasks without requiring additional data. This dataset also included contributions from various sound libraries, including the BBC, enhancing Fugatto’s versatility and breadth.
Nvidia highlights Fugatto’s potential as a game-changer for artists, filmmakers, and sound designers, giving them unprecedented creative control over audio production. While other companies, including Stability AI, OpenAI, and Google DeepMind, have also ventured into AI-driven audio tools, Nvidia argues that Fugatto stands apart by enabling the generation of entirely novel sounds. Most existing AI tools rely on pre-trained datasets, creating derivative outputs, but Fugatto offers a fresh dimension of originality, allowing users to generate audio that defies conventional expectations.
Despite the excitement surrounding AI in music creation, the rise of these tools has sparked controversy. Some startups have already faced copyright lawsuits over AI-generated music, and Nvidia itself has faced scrutiny for reportedly training models using subtitles from thousands of YouTube videos. While Nvidia has not disclosed how it will address licensing or copyright concerns, Fugatto’s ability to generate entirely unique sounds may help it navigate the legal challenges that often surround AI-generated music.
Fugatto represents a significant leap forward in the potential of AI to transform the music and sound production industry, offering users the ability to create truly novel audio experiences that were previously unimaginable.