648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip

2023/1/27
Super Data Science: ML & AI Podcast with Jon Krohn

Text-to-speech gets a groundbreaking update with Microsoft’s VALL-E. On this Five-Minute Friday, Jon Krohn investigates how the Microsoft team modeled their tool to replicate natural human speech using just three seconds of a person’s voice.

Additional materials: www.superdatascience.com/648

Interested in sponsoring a SuperDataScience Podcast episode? Visit JonKrohn.com/podcast for sponsorship information.