会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • System and Method for Automatic Prediction of Speech Suitability for Statistical Modeling
    • 自动预测统计建模语音适用性的系统与​​方法
    • US20140074468A1
    • 2014-03-13
    • US13606618
    • 2012-09-07
    • Alexander SorinSlava ShechtmanVincent Pollet
    • Alexander SorinSlava ShechtmanVincent Pollet
    • G10L15/00
    • G10L25/48G10L13/04G10L25/18
    • An embodiment according to the invention provides a capability of automatically predicting how favorable a given speech signal is for statistical modeling, which is advantageous in a variety of different contexts. In Multi-Form Segment (MFS) synthesis, for example, an embodiment according to the invention uses prediction capability to provide an automatic acoustic driven template versus model decision maker with an output quality that is high, stable and depends gradually on the system footprint. In speaker selection for a statistical Text-to-Speech synthesis (TTS) system build, as another example context, an embodiment according to the invention enables a fast selection of the most appropriate speaker among several available ones for the full voice dataset recording and preparation, based on a small amount of recorded speech material.
    • 根据本发明的实施例提供了一种自动预测给定语音信号对于统计建模有利的能力,这在各种不同的上下文中是有利的。 在多格段(MFS)合成中,例如,根据本发明的实施例使用预测能力来提供具有高,稳定的输出质量的自动声驱动模板与模型决策者,并逐渐依赖于系统占用。 在用于统计文本到语音合成(TTS)系统构建的说话者选择中,作为另一示例性上下文,根据本发明的实施例使得能够在完整语音数据集记录和准备中的几个可用的扬声器中快速选择最合适的说话者 ,基于少量的录音材料。
    • 2. 发明授权
    • System and method for automatic prediction of speech suitability for statistical modeling
    • 自动预测语音适用性的统计建模系统和方法
    • US09484045B2
    • 2016-11-01
    • US13606618
    • 2012-09-07
    • Alexander SorinSlava ShechtmanVincent Pollet
    • Alexander SorinSlava ShechtmanVincent Pollet
    • G10L13/06G10L13/04G10L19/00G10L25/48G10L25/18
    • G10L25/48G10L13/04G10L25/18
    • An embodiment according to the invention provides a capability of automatically predicting how favorable a given speech signal is for statistical modeling, which is advantageous in a variety of different contexts. In Multi-Form Segment (MFS) synthesis, for example, an embodiment according to the invention uses prediction capability to provide an automatic acoustic driven template versus model decision maker with an output quality that is high, stable and depends gradually on the system footprint. In speaker selection for a statistical Text-to-Speech synthesis (TTS) system build, as another example context, an embodiment according to the invention enables a fast selection of the most appropriate speaker among several available ones for the full voice dataset recording and preparation, based on a small amount of recorded speech material.
    • 根据本发明的实施例提供了一种自动预测给定语音信号对于统计建模有利的能力,这在各种不同的上下文中是有利的。 在多格段(MFS)合成中,例如,根据本发明的实施例使用预测能力来提供具有高,稳定的输出质量的自动声驱动模板与模型决策者,并逐渐依赖于系统占用。 在用于统计文本到语音合成(TTS)系统构建的说话者选择中,作为另一示例性上下文,根据本发明的实施例使得能够在完整语音数据集记录和准备中的几个可用的扬声器中快速选择最合适的说话者 ,基于少量的录音材料。