会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 38. 发明授权
    • Voice personalization of speech synthesizer
    • 语音合成器的语音个性化
    • US06970820B2
    • 2005-11-29
    • US09792928
    • 2001-02-26
    • Jean-Claude JunquaFlorent PerronninRoland KuhnPatrick Nguyen
    • Jean-Claude JunquaFlorent PerronninRoland KuhnPatrick Nguyen
    • G10L13/08G10L13/02G10L13/04G10L13/06G10L21/00G10L13/00
    • G10L13/04G10L2021/0135
    • The speech synthesizer is personalized to sound like or mimic the speech characteristics of an individual speaker. The individual speaker provides a quantity of enrollment data, which can be extracted from a short quantity of speech, and the system modifies the base synthesis parameters to more closely resemble those of the new speaker. More specifically, the synthesis parameters may be decomposed into speaker dependent parameters, such as context-independent parameters, and speaker independent parameters, such as context dependent parameters. The speaker dependent parameters are adapted using enrollment data from the new speaker. After adaptation, the speaker dependent parameters are combined with the speaker independent parameters to provide a set of personalized synthesis parameters. To adapt the parameters with a small amount of enrollment data, an eigenspace is constructed and used to constrain the position of the new speaker so that context independent parameters not provided by the new speaker may be estimated.
    • 语音合成器被个性化以发音或模仿单个扬声器的语音特征。 单个扬声器提供一定数量的登记数据,其可以从短语言中提取,并且系统将基本合成参数修改为更接近于新说话者的参考数据。 更具体地,合成参数可以被分解为与扬声器相关的参数,诸如与上下文无关的参数,以及与扬声器无关的参数,诸如与上下文相关的参数。 使用来自新扬声器的注册数据来调整与扬声器相关的参数。 在适应之后,将扬声器依赖参数与扬声器独立参数组合以提供一组个性化合成参数。 为了使参数具有少量的注册数据,构造本征空间并用于约束新的说话者的位置,以便可以估计不能由新发言者提供的上下文独立参数。
    • 39. 发明申请
    • Pattern matching for large vocabulary speech recognition with packed distribution and localized trellis access
    • 用于大量词汇语音识别的模式匹配,具有打包分发和本地化网格访问
    • US20050159952A1
    • 2005-07-21
    • US10512354
    • 2003-03-19
    • Patrick NguyenLuca Rigazio
    • Patrick NguyenLuca Rigazio
    • G10L15/08G10L15/10G10L15/28G10L15/00
    • G10L15/08G10L15/10G10L15/285G10L15/30G10L15/34
    • A method is provided for improving pattern matching in a speech recognition system having a plurality of acoustic models (20). Similarity measures for acoustic feature vectors (54) are determined in groups that are then buffered into cache memory (59). To further reduce computational processing, the acoustic data may be partitioned amongst a plurality of processing nodes (66, 67, 68). In addition, a priori knowledge of the spoken order may be used to establish the access order (124) used to copy records from the main speech parameter table (120, 200) into a sub-table (130, 204). The sub-table is processed such that the entries are in contiguous memory locations (206) and sorted according to the processing order (208). The speech processing algorithm is then directed to operate upon the sub-table (210) which causes the processor to load the sub-table into high speed cache memory (104, 212).
    • 提供了一种用于改进具有多个声学模型(20)的语音识别系统中的模式匹配的方法。 以随后缓冲到高速缓存存储器(59)中的组确定声学特征向量(54)的相似性度量。 为了进一步减少计算处理,可以在多个处理节点(66,67,68)之间划分声学数据。 此外,可以使用口语顺序的先验知识来建立用于将记录从主语音参数表(120,200)复制到子表(130,204)中的访问顺序(124)。 处理子表使得条目在连续存储器位置(206)中并根据处理顺序(208)进行排序。 语音处理算法随后被引导以对子表(210)进行操作,这使得处理器将子表加载到高速缓存存储器(104,212)中。