    • 7. Invention application
    • VOICE FONT SPEAKER AND PROSODY INTERPOLATION
    • WO2015130581A1
    • 2015-09-03
    • PCT/US2015/017002
    • 2015-02-23
    • MICROSOFT TECHNOLOGY LICENSING, LLC
    • LUAN, Jian; HE, Lei; LEUNG, Max
    • G10L13/08; G10L13/033
    • G10L13/0335; G06F3/0482; G06F3/04847; G10L13/02; G10L13/033; G10L13/08
    • Multi-voice font interpolation is provided. A multi-voice font interpolation engine allows the production of computer-generated speech with a wide variety of speaker characteristics and/or prosody by interpolating speaker characteristics and prosody from existing fonts. Using prediction models from multiple voice fonts, the multi-voice font interpolation engine predicts values for the parameters that influence speaker characteristics and/or prosody for the phoneme sequence obtained from the text to be spoken. For each parameter, additional parameter values are generated by a weighted interpolation from the predicted values. Modifying an existing voice font with the interpolated parameters changes the style and/or emotion of the speech while retaining the base sound qualities of the original voice. The multi-voice font interpolation engine allows the speaker characteristics and/or prosody to be transplanted from one voice font to another, or entirely new speaker characteristics and/or prosody to be generated for an existing voice font.
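The weighted interpolation step in the abstract above can be illustrated with a minimal sketch. This is not the patented implementation: the function name `interpolate_parameters`, the font labels, the weights, and the pitch values are all hypothetical, and normalizing by the sum of the weights is an assumption made here so that the weights need not sum to one.

```python
from typing import Dict, List


def interpolate_parameters(
    predictions: Dict[str, List[float]],
    weights: Dict[str, float],
) -> List[float]:
    """Blend one parameter (e.g. pitch per phoneme) across voice fonts.

    predictions maps a voice font name to the values its model predicted
    for each phoneme; weights maps the same names to interpolation weights.
    Both structures are illustrative assumptions, not taken from the patent.
    """
    if not predictions:
        raise ValueError("at least one voice font is required")
    if set(predictions) != set(weights):
        raise ValueError("predictions and weights must name the same fonts")
    lengths = {len(values) for values in predictions.values()}
    if len(lengths) != 1:
        raise ValueError("all fonts must predict the same number of phonemes")
    total = sum(weights.values())
    if total == 0:
        raise ValueError("weights must not sum to zero")
    count = lengths.pop()
    # Weighted average of the predicted values, phoneme by phoneme.
    return [
        sum(weights[font] * predictions[font][i] for font in predictions) / total
        for i in range(count)
    ]


# Hypothetical pitch predictions (Hz) for a three-phoneme sequence.
f0 = {
    "font_a": [100.0, 120.0, 110.0],
    "font_b": [200.0, 180.0, 210.0],
}
blended = interpolate_parameters(f0, {"font_a": 0.75, "font_b": 0.25})
```

Moving the weights toward `font_b` would pull the blended contour toward that font's predictions, which is the sense in which the abstract speaks of transplanting prosody from one voice font to another.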
    • 9. Invention application
    • TEXT-TO-SPEECH WITH EMOTIONAL CONTENT
    • WO2016040209A1
    • 2016-03-17
    • PCT/US2015/048755
    • 2015-09-07
    • MICROSOFT TECHNOLOGY LICENSING, LLC
    • LUAN, Jian; HE, Lei; LEUNG, Max
    • G10L13/033
    • G10L13/027; G10L13/033
    • Techniques are described for converting text to speech having emotional content. In an aspect, an emotionally neutral acoustic trajectory is predicted for a script using a neutral model, and an emotion-specific acoustic trajectory adjustment is independently predicted using an emotion-specific model. The neutral trajectory and emotion-specific adjustments are combined to generate a transformed speech output having emotional content. In another aspect, state parameters of a statistical parametric model for neutral voice are transformed by emotion-specific factors that vary across contexts and states. The emotion-dependent adjustment factors may be clustered and stored using an emotion-specific decision tree or other clustering scheme distinct from a decision tree used for the neutral voice model.
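The first aspect of the abstract above, combining a neutral trajectory with an independently predicted emotion-specific adjustment, can be sketched as follows. The abstract does not say how the two are combined; the additive combination, the `strength` scaling factor, the function name, and the sample values are all assumptions made for illustration only.

```python
from typing import List


def apply_emotion_adjustment(
    neutral: List[float],
    adjustment: List[float],
    strength: float = 1.0,
) -> List[float]:
    """Combine a neutral acoustic trajectory with an emotion-specific one.

    Assumes an additive combination, frame by frame; strength is a
    hypothetical knob that scales how much of the adjustment is applied.
    """
    if len(neutral) != len(adjustment):
        raise ValueError("trajectories must have the same number of frames")
    return [n + strength * a for n, a in zip(neutral, adjustment)]


# Hypothetical pitch trajectory (Hz) from a neutral model, and the
# per-frame offsets predicted separately by an emotion-specific model.
neutral_f0 = [110.0, 115.0, 112.0]
happy_delta = [10.0, 20.0, 8.0]
emotional_f0 = apply_emotion_adjustment(neutral_f0, happy_delta)
```

Keeping the adjustment separate from the neutral prediction mirrors the abstract's point that the two are predicted independently; setting `strength` to 0 in this sketch returns the neutral trajectory unchanged.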