Microsoft’s VALL-E AI Is Able to Imitate a Human Voice in a Three-Second Pattern

Microsoft engineers have introduced an AI (artificial intelligence) model for text-to-speech called VALL-E. It is able to imitate a human voice, relying only on a three-second sound sample. The developers claim that VALL-E can synthesize audio, where the “learned” voice says something, while retaining even the emotional coloring. You might also be interested in our… Continue reading Microsoft’s VALL-E AI Is Able to Imitate a Human Voice in a Three-Second Pattern