Clone a Voice from 15 Seconds of Audio! OpenAI Gives a First Look at Its Voice AI, Voice Engine!

iiiMichael 外贸赶路人 2024-06-30


A video walkthrough is included at the end of the article.


What is Voice Engine?

Today we are sharing preliminary insights and results from a small-scale preview of a model called Voice Engine, which uses text input and a single 15-second audio sample to generate natural-sounding speech that closely resembles the original speaker.
Source: OpenAI (official)


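Voice Engine in brief: the user supplies a reference recording of about 15 seconds and the text to be spoken, and the model generates brand-new audio that sounds almost identical to the original voice.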


Early use cases (three examples)
Reading assistance (language learning)
Providing reading assistance to non-readers and children through natural-sounding, emotive voices representing a wider range of speakers than what's possible with preset voices. Age of Learning, an education technology company dedicated to the academic success of children, has been using this to generate pre-scripted voice-over content. They also use Voice Engine and GPT-4 to create real-time, personalized responses to interact with students. With this technology, Age of Learning has been able to create more content for a wider audience.

Multilingual audio generation or speech translation
Translating content, like videos and podcasts, so creators and businesses can reach more people around the world, fluently and in their own voices. One early adopter of this is HeyGen, an AI visual storytelling platform that works with their enterprise customers to create custom, human-like avatars for a variety of content, from product marketing to sales demos. They use Voice Engine for video translation, so they can translate a speaker's voice into multiple languages and reach a global audience. When used for translation, Voice Engine preserves the native accent of the original speaker: for example generating English with an audio sample from a French speaker would produce speech with a French accent.

Restoring impaired voices
Helping patients recover their voice, for those suffering from sudden or degenerative speech conditions. The Norman Prince Neurosciences Institute at Lifespan, a not-for-profit health system that serves as the primary teaching affiliate of Brown University's medical school, is exploring uses of AI in clinical contexts. They've been piloting a program offering Voice Engine to individuals with oncologic or neurologic etiologies for speech impairment. Since Voice Engine requires such a short audio sample, doctors Fatima Mirza, Rohaid Ali and Konstantina Svokos were able to restore the voice of a young patient who lost her fluent speech due to a vascular brain tumor, using audio from a video recorded for a school project.

See the video walkthrough for details.
