Nonhuman Vocal Algorithms

View More

The DeepMind AI Can Say Sentences and Play Piano Without a Sound Library

As algorithmic voices become a more common part of daily life, with Siri, Alexa and Cortana feeling as familiar as friends, the DeepMind AI's WaveNet program is focusing on creating an authentic sounding voice that is completely original. In other words, while Siri, Alexa, Cortana and others rely on processing recordings of a real human's voice, DeepMind AI's WaveNet creates an original voice by learning from a host of recordings.

The difference between WaveNet and other common systems is that WaveNet doesn't just put together sounds from a database. Rather, it learns speech sounds and puts those sounds together on its own. Because of this learning process, the DeepMind AI program can be used for anything that involves sound, including piano music (a sampling of which Google included on its blog.)
Trend Themes
1. Algorithmic Voices - Opportunity for businesses to develop original and authentic algorithmic voices for virtual assistants and other applications.
2. Voice Learning - Disruptive innovation opportunity to create AI programs that can learn speech sounds and generate original voices.
3. Sound Generation AI - Emerging trend of AI programs capable of creating sounds, including piano music, without relying on pre-existing sound libraries.
Industry Implications
1. Virtual Assistant Technology - Potential for disruption in the virtual assistant industry with the development of algorithmic voices that are more realistic and personalized.
2. Speech Recognition and Synthesis - Opportunity for advancements in speech recognition and synthesis technologies as AI programs learn to generate original voices.
3. Music Composition and Production - Disruption in the music industry with the emergence of AI programs capable of generating original music compositions without the need for pre-existing sound libraries.

Related Ideas

Similar Ideas
VIEW FULL ARTICLE & IMAGES