vidby, a Swiss company that offers AI-powered software for rapid and accurate video translation and is a YouTube preferred vendor, has unveiled cross-lingual voice cloning technology. The new feature is aimed at users who need to translate and dub videos in their own voices.
To access the new service, customers placing an order through the site will be asked to provide a voice recording of at least 5 minutes in any language. The training data must be clean audio from a single speaker, free of background noise, music, or other effects. Once the sample is submitted, vidby’s processing creates the digital voice clone within minutes. The cloned voice can then be used for video dubbing. The platform currently offers cloned voice dubbing in 28 languages, including widely spoken ones such as English, German, Spanish, Mandarin, and Hindi.
“There are plenty of voice cloning solutions on the market for the same language, but there aren’t many for cross-language cloning because it’s a difficult task. Because each language has its own distinct sounds that may not exist in other languages, poor interlingual cloning might result in a strong accent. This isn’t simply a machine issue; humans are affected as well. Consider when someone speaks a language that is not their native tongue. That is why mastering interlingual cloning requires not just duplicating the tone and sound of the voice, but also mastering the language’s pronunciation without that distinguishing accent,” explains Alexander Konovalov, CEO and co-founder of vidby. “You can easily dub videos in multiple languages with the appropriate voice without sacrificing quality. All you need is access to our platform and a speaker audio recording. We’ll handle the rest.”