会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明申请
    • METHOD AND APPARATUS FOR FUSING VOICED PHONEME UNITS IN TEXT-TO-SPEECH
    • 用于在语音中填充声音单元的方法和装置
    • US20110320199A1
    • 2011-12-29
    • US13183667
    • 2011-07-15
    • Jian LuanJian Li
    • Jian LuanJian Li
    • G10L15/26
    • G10L13/06
    • According to one embodiment, an apparatus for fusing voiced phoneme units in Text-To-Speech, includes a reference unit selection module configured to select a reference unit from the plurality of units based on pitch cycle information of the each unit and the number of pitch cycles of the target segment. The apparatus includes a template creation module configured to create a template based on the reference unit selected by the reference unit selection module and the number of pitch cycles of the target segment, wherein the number of pitch cycles of the template is same with that of pitch cycles of the target segment. The apparatus includes a pitch cycle alignment module configured to align pitch cycles of each unit of the plurality of units except the reference unit with pitch cycles of the template by using a dynamic programming algorithm.
    • 根据一个实施例,一种用于在文本到语音中融合浊音音素单元的装置包括:参考单元选择模块,被配置为基于每个单元的音调周期信息和音调数量从多个单元中选择参考单元 目标段的周期。 该装置包括:模板创建模块,被配置为基于由参考单元选择模块选择的参考单元和目标段的音调周期数来创建模板,其中模板的音调周期数与节距的相同 目标段的周期。 该装置包括音调周期对准模块,其被配置为通过使用动态规划算法来将参考单元之外的多个单元的每个单元的音调周期与模板的音调周期对准。
    • 3. 发明授权
    • Method and apparatus for verification of speaker authentication
    • 用于验证扬声器认证的方法和装置
    • US07809561B2
    • 2010-10-05
    • US11692470
    • 2007-03-28
    • Jian LuanJie Hao
    • Jian LuanJie Hao
    • G10L15/12
    • G10L17/08G10L17/24
    • The present invention provides a method and apparatus for verification of speaker authentication. A method for verification of speaker authentication, comprising: inputting an utterance containing a password that is spoken by a speaker; extracting an acoustic feature vector sequence from said inputted utterance; DTW-matching said extracted acoustic feature vector sequence and a speaker template enrolled by an enrolled speaker; calculating each of a plurality of local distances between said DTW-matched acoustic feature vector sequence and said speaker template; nonlinear-transforming said each local distance calculated to give more weights on small local distances; calculating a DTW-matching score based on said plurality of local distances nonlinear-transformed; and comparing said matching score with a predefined discriminating threshold to determine whether said inputted utterance is an utterance containing a password spoken by the enrolled speaker.
    • 本发明提供了用于验证扬声器认证的方法和装置。 一种用于验证扬声器认证的方法,包括:输入包含由扬声器说出的口令的话语; 从所述输入的话语中提取声学特征向量序列; DTW匹配所述提取的声学特征向量序列和由注册的说话者登记的说话者模板; 计算所述DTW匹配的声学特征向量序列和所述说话者模板之间的多个局部距离中的每一个; 非线性变换表示计算的每个局部距离,以便在较小的局部距离上给出更多的权重; 基于所述多个局部距离非线性变换来计算DTW匹配分数; 以及将所述匹配分数与预定义的识别阈值进行比较,以确定所述输入的话语是否是包含由所登记的说话者所说的口令的话语。
    • 5. 发明授权
    • Dynamic long-distance dependency with conditional random fields
    • 动态长距离依赖条件随机场
    • US09037460B2
    • 2015-05-19
    • US13433186
    • 2012-03-28
    • Jian LuanLinfang WangHairong XiaSheng ZhaoDaniela Braga
    • Jian LuanLinfang WangHairong XiaSheng ZhaoDaniela Braga
    • G10L15/26G10L15/08G10L13/08G10L15/197
    • G10L15/083G10L13/08G10L15/197
    • Dynamic features are utilized with CRFs to handle long-distance dependencies of output labels. The dynamic features present a probability distribution involved in explicit distance from/to a special output label that is pre-defined according to each application scenario. Besides the number of units in the segment (from the previous special output label to the current unit), the dynamic features may also include the sum of any basic features of units in the segment. Since the added dynamic features are involved in the distance from the previous specific label, the searching lattice associated with Viterbi searching is expanded to distinguish the nodes with various distances. The dynamic features may be used in a variety of different applications, such as Natural Language Processing, Text-To-Speech and Automatic Speech Recognition. For example, the dynamic features may be used to assist in prosodic break and pause prediction.
    • CRF利用动态特征来处理输出标签的长距离依赖性。 动态特征呈现出根据每个应用场景预定义的特定输出标签的显式距离所涉及的概率分布。 除了段中的单位数(从前一个特殊输出标签到当前单位),动态特征还可以包括段中单位的任何基本特征的总和。 由于添加的动态特征涉及到与先前特定标签的距离,因此扩展了与维特比搜索相关联的搜索点,以区分不同距离的节点。 动态特征可用于各种不同的应用,如自然语言处理,文本到语音和自动语音识别。 例如,动态特征可以用于辅助韵律休息和暂停预测。
    • 6. 发明授权
    • Method and apparatus for enrollment and verification of speaker authentication
    • 扬声器认证注册和验证的方法和装置
    • US07877254B2
    • 2011-01-25
    • US11692397
    • 2007-03-28
    • Jian LuanPei DingLei HeJie Hao
    • Jian LuanPei DingLei HeJie Hao
    • G10L17/00G10L19/00G10L15/06
    • G10L17/04
    • The present invention provides a method and apparatus for enrollment and verification of speaker authentication. The method for enrollment of speaker authentication, comprising: extracting an acoustic feature vector sequence from an enrollment utterance of a speaker; and generating a speaker template using the acoustic feature vector sequence; wherein said step of extracting an acoustic feature vector sequence comprises: generating a filter-bank for the enrollment utterance of the speaker for filtering locations and energies of formants in the spectrum of the enrollment utterance based on the enrollment utterance; filtering the spectrum of the enrollment utterance by the generated filter-bank; and generating the acoustic feature vector sequence from the filtered enrollment utterance.
    • 本发明提供了一种用于注册和验证扬声器认证的方法和装置。 用于注册说话人认证的方法,包括:从扬声器的登记话音中提取声学特征向量序列; 以及使用所述声学特征向量序列生成扬声器模板; 其中所述提取声学特征向量序列的步骤包括:基于所述登记话语,生成用于所述扬声器的登记话语的滤波器组,用于过滤所述登记话音频谱中的共振峰的位置和能量; 通过生成的滤波器组过滤入场语音的频谱; 以及从所述滤波的登记话音生成所述声学特征向量序列。
    • 8. 发明授权
    • Method and apparatus for enrollment and evaluation of speaker authentification
    • 扬声器鉴定的注册和评估方法和设备
    • US07962336B2
    • 2011-06-14
    • US11859358
    • 2007-09-21
    • Jian LuanJie Hao
    • Jian LuanJie Hao
    • G10L15/00
    • G10L17/04
    • The present invention provides a method and apparatus for enrollment and evaluation of speaker authentication. The method for enrollment of speaker authentication, comprising: generating a plurality of acoustic feature vector sequences respectively based on a plurality of utterances of the same content spoken by a speaker; generating a reference template from said plurality of acoustic feature vector sequences; generating a corresponding pseudo-impostor feature vector sequence for each of said plurality of acoustic feature vector sequences based on a code book that includes a plurality of codes and their corresponding feature vectors; and selecting an optimal acoustic feature subset based on said plurality of acoustic feature vector sequences, said reference template and said plurality of pseudo-impostor feature vector sequences.
    • 本发明提供了一种用于注册和评估扬声器认证的方法和装置。 用于注册说话人认证的方法,包括:分别基于由说话者说出的相同内容的多个话语产生多个声学特征向量序列; 从所述多个声学特征向量序列生成参考模板; 基于包括多个代码及其对应的特征向量的代码簿,为所述多个声学特征向量序列中的每一个产生相应的伪冒号特征向量序列; 以及基于所述多个声学特征向量序列,所述参考模板和所述多个伪冒号特征向量序列来选择最佳声学特征子集。
    • 9. 发明申请
    • METHOD AND APPARATUS FOR ENROLLMENT AND VERIFICATION OF SPEAKER AUTHENTICATION
    • 声音认证的加密和验证方法和设备
    • US20070239451A1
    • 2007-10-11
    • US11692397
    • 2007-03-28
    • Jian LuanPei DingLei HeJie Hao
    • Jian LuanPei DingLei HeJie Hao
    • G10L17/00
    • G10L17/04
    • The present invention provides a method and apparatus for enrollment and verification of speaker authentication. The method for enrollment of speaker authentication, comprising: extracting an acoustic feature vector sequence from an enrollment utterance of a speaker; and generating a speaker template using the acoustic feature vector sequence; wherein said step of extracting an acoustic feature vector sequence comprises: generating a filter-bank for the enrollment utterance of the speaker for filtering locations and energies of formants in the spectrum of the enrollment utterance based on the enrollment utterance; filtering the spectrum of the enrollment utterance by the generated filter-bank; and generating the acoustic feature vector sequence from the filtered enrollment utterance.
    • 本发明提供了一种用于注册和验证扬声器认证的方法和装置。 用于注册说话人认证的方法,包括:从扬声器的登记话音中提取声学特征向量序列; 以及使用所述声学特征向量序列生成扬声器模板; 其中所述提取声学特征向量序列的步骤包括:基于所述登记话语,生成用于所述扬声器的登记话语的滤波器组,用于过滤所述登记话音频谱中的共振峰的位置和能量; 通过生成的滤波器组过滤入场语音的频谱; 以及从所述滤波的登记话音生成所述声学特征向量序列。
    • 10. 发明申请
    • DYNAMIC LONG-DISTANCE DEPENDENCY WITH CONDITIONAL RANDOM FIELDS
    • 动态长距离依赖于条件随机场
    • US20130262105A1
    • 2013-10-03
    • US13433186
    • 2012-03-28
    • Jian LuanLinfang WangHairong XiaSheng ZhaoDaniela Braga
    • Jian LuanLinfang WangHairong XiaSheng ZhaoDaniela Braga
    • G10L15/26
    • G10L15/083G10L13/08G10L15/197
    • Dynamic features are utilized with CRFs to handle long-distance dependencies of output labels. The dynamic features present a probability distribution involved in explicit distance from/to a special output label that is pre-defined according to each application scenario. Besides the number of units in the segment (from the previous special output label to the current unit), the dynamic features may also include the sum of any basic features of units in the segment. Since the added dynamic features are involved in the distance from the previous specific label, the searching lattice associated with Viterbi searching is expanded to distinguish the nodes with various distances. The dynamic features may be used in a variety of different applications, such as Natural Language Processing, Text-To-Speech and Automatic Speech Recognition. For example, the dynamic features may be used to assist in prosodic break and pause prediction.
    • CRF利用动态特征来处理输出标签的长距离依赖关系。 动态特征呈现出根据每个应用场景预定义的特定输出标签的显式距离所涉及的概率分布。 除了段中的单位数(从前一个特殊输出标签到当前单位),动态特征还可以包括段中单位的任何基本特征的总和。 由于添加的动态特征涉及到与先前特定标签的距离,因此扩展了与维特比搜索相关联的搜索点,以区分不同距离的节点。 动态特征可用于各种不同的应用,如自然语言处理,文本到语音和自动语音识别。 例如,动态特征可以用于辅助韵律休息和暂停预测。