会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Speech synthesis with fuzzy heteronym prediction using decision trees
    • 使用决策树进行模糊异词预测的语音合成
    • US09058811B2
    • 2015-06-16
    • US13402602
    • 2012-02-22
    • Xi WangXiaoyan LouJian Li
    • Xi WangXiaoyan LouJian Li
    • G10L13/02G10L13/08G06N7/02
    • G10L13/08
    • According to one embodiment, a method, apparatus for synthesizing speech, and a method for training acoustic model used in speech synthesis is provided. The method for synthesizing speech may include determining data generated by text analysis as fuzzy heteronym data, performing fuzzy heteronym prediction on the fuzzy heteronym data to output a plurality of candidate pronunciations of the fuzzy heteronym data and probabilities thereof, generating fuzzy context feature labels based on the plurality of candidate pronunciations and probabilities thereof, determining model parameters for the fuzzy context feature labels based on acoustic model with fuzzy decision tree, generating speech parameters from the model parameters, and synthesizing the speech parameters via synthesizer as speech.
    • 根据一个实施例,提供了一种用于合成语音的方法,装置,以及用于语音合成中使用的用于训练声学模型的方法。 用于合成语音的方法可以包括通过文本分析产生的数据作为模糊异词数据,对模糊异词数据执行模糊异词预测以输出模糊异词数据的多个候选发音及其概率,基于 多个候选发音和概率,基于具有模糊决策树的声学模型确定模糊上下文特征标签的模型参数,从模型参数生成语音参数,并通过合成器将语音参数合成为语音。
    • 2. 发明申请
    • METHOD, APPARATUS FOR SYNTHESIZING SPEECH AND ACOUSTIC MODEL TRAINING METHOD FOR SPEECH SYNTHESIS
    • 方法,用于合成语音的装置和用于语音合成的声学模型训练方法
    • US20120221339A1
    • 2012-08-30
    • US13402602
    • 2012-02-22
    • Xi WangXiaoyan LouJian Li
    • Xi WangXiaoyan LouJian Li
    • G10L13/08
    • G10L13/08
    • According to one embodiment, a method, apparatus for synthesizing speech, and a method for training acoustic model used in speech synthesis is provided. The method for synthesizing speech may include determining data generated by text analysis as fuzzy heteronym data, performing fuzzy heteronym prediction on the fuzzy heteronym data to output a plurality of candidate pronunciations of the fuzzy heteronym data and probabilities thereof, generating fuzzy context feature labels based on the plurality of candidate pronunciations and probabilities thereof, determining model parameters for the fuzzy context feature labels based on acoustic model with fuzzy decision tree, generating speech parameters from the model parameters, and synthesizing the speech parameters via synthesizer as speech.
    • 根据一个实施例,提供了一种用于合成语音的方法,装置,以及用于语音合成中使用的用于训练声学模型的方法。 用于合成语音的方法可以包括通过文本分析产生的数据作为模糊异词数据,对模糊异词数据执行模糊异词预测以输出模糊异词数据的多个候选发音及其概率,基于 多个候选发音和概率,基于具有模糊决策树的声学模型确定模糊上下文特征标签的模型参数,从模型参数生成语音参数,并通过合成器将语音参数合成为语音。