会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明申请
    • Noise adaptation system of speech model, noise adaptation method, and noise adaptation program for speech recognition
    • 语音模型噪声适应系统,噪声适应方法和语音识别噪声适应程序
    • US20050080623A1
    • 2005-04-14
    • US10920461
    • 2004-08-18
    • Sadaoki FuruiZhipeng ZhangTsutomu HorikoshiToshiaki Sugimura
    • Sadaoki FuruiZhipeng ZhangTsutomu HorikoshiToshiaki Sugimura
    • G10L15/02G10L15/06G10L15/14G10L15/20G10L15/00
    • G10L15/065G10L21/0216
    • An object of the present invention is to facilitate dealing with noisy speech with varying SNR and save calculation costs by generating a speech model with a single-tree-structure and using the model for speech recognition. Every piece of noise data stored in a noise database is used under every SNR condition to calculate the distance between all noise models with the SNR conditions and the noise-added speech is clustered. Based on the result of the clustering, a single-tree-structure model space into which the noise and SNR are integrated is generated (steps S1 to S5). At a noise extraction step (step S6), inputted noisy speech to be recognized is analyzed to extract a feature parameter string and the likelihoods of HMMs are compared one another to select an optimum model from the tree-structure noisy speech model space (step S7). Linear transformation is applied to the selected noisy speech model space so that the likelihood is maximized (step S8).
    • 本发明的一个目的是便于处理具有变化的SNR的噪声语音,并且通过用单树结构生成语音模型并使用语音识别模型来节省计算成本。 在每个SNR条件下使用存储在噪声数据库中的每条噪声数据,以计算所有噪声模型与SNR条件之间的距离,并且将噪声添加语音聚类。 基于聚类的结果,生成其中积分噪声和SNR的单树结构模型空间(步骤S1至S5)。 在噪声提取步骤(步骤S6)中,分析输入的要识别的噪声语音以提取特征参数串,并将HMM的可能性彼此进行比较,以从树结构噪声语音模型空间中选择最佳模型(步骤S7 )。 将线性变换应用于所选择的噪声语音模型空间,使得似然性最大化(步骤S8)。
    • 6. 发明授权
    • Noise adaptation system of speech model, noise adaptation method, and noise adaptation program for speech recognition
    • 语音模型噪声适应系统,噪声适应方法和语音识别噪声适应程序
    • US07424426B2
    • 2008-09-09
    • US10920461
    • 2004-08-18
    • Sadaoki FuruiZhipeng ZhangTsutomu HorikoshiToshiaki Sugimura
    • Sadaoki FuruiZhipeng ZhangTsutomu HorikoshiToshiaki Sugimura
    • G10L15/00
    • G10L15/065G10L21/0216
    • An object of the present invention is to facilitate dealing with noisy speech with varying SNR and save calculation costs by generating a speech model with a single-tree-structure and using the model for speech recognition.Every piece of noise data stored in a noise database is used under every SNR condition to calculate the distance between all noise models with the SNR conditions and the noise-added speech is clustered. Based on the result of the clustering, a single-tree-structure model space into which the noise and SNR are integrated is generated (steps S1 to S5). At a noise extraction step (step S6), inputted noisy speech to be recognized is analyzed to extract a feature parameter string and the likelihoods of HMMs are compared one another to select an optimum model from the tree-structure noisy speech model space (step S7). Linear transformation is applied to the selected noisy speech model space so that the likelihood is maximized (step S8).
    • 本发明的一个目的是便于处理具有变化的SNR的噪声语音,并且通过用单树结构生成语音模型并使用语音识别模型来节省计算成本。 在每个SNR条件下使用存储在噪声数据库中的每条噪声数据,以计算所有噪声模型与SNR条件之间的距离,并且将噪声添加语音聚类。 基于聚类的结果,生成其中积分噪声和SNR的单树结构模型空间(步骤S1至S5)。 在噪声提取步骤(步骤S6),分析输入的要识别的噪声语音以提取特征参数串,并将HMM的可能性彼此进行比较,以从树结构噪声语音模型空间中选择最佳模型(步骤 S 7)。 将线性变换应用于所选择的噪声语音模型空间,使得似然性最大化(步骤S 8)。
    • 7. 发明授权
    • Voice recognition server, telephone equipment, voice recognition system, and voice recognition method
    • 语音识别服务器,电话设备,语音识别系统和语音识别方法
    • US08238525B2
    • 2012-08-07
    • US12693796
    • 2010-01-26
    • Zhipeng ZhangHirotaka Furukawa
    • Zhipeng ZhangHirotaka Furukawa
    • H04M1/64G10L15/26
    • H04M3/42153G10L15/183G10L15/30G10L2015/228H04M2201/40H04M2201/405H04M2250/66
    • A voice recognition server 200 has a voice reception unit 202 which receives a voice from a telephone equipment 100, a model storage unit 208 which stores at least one acoustic model and at least one language model used for converting the voice received by the voice reception unit 202, to character data, a number decision unit 204 which decides a current calling number and a second number of the telephone equipment 100, a model selection unit 206 which selects an acoustic model stored in the model storage unit 208, based on the current calling number and the second number, and which selects a language model stored in the model storage unit 208, based on the current calling number, and a voice recognition unit 210 which converts the voice received by the voice reception unit 202, to character data, based on the acoustic model and the language model selected by the model selection unit 206.
    • 语音识别服务器200具有从电话设备100接收语音的语音接收单元202,存储至少一个声学模型的模型存储单元208和用于转换由语音接收单元接收的语音的至少一种语言模型 202,对于字符数据,确定当前主叫号码的号码决定单元204和电话设备100的第二号码,选择存储在模型存储单元208中的声学模型的模型选择单元206,基于当前呼叫 基于当前主叫号码选择存储在模型存储单元208中的语言模型和将由语音接收单元202接收的语音转换为基于字符数据的语音识别单元210 在由模型选择单元206选择的声学模型和语言模型上。
    • 9. 发明授权
    • Noise adaptation system of speech model, noise adaptation method, and noise adaptation program for speech recognition
    • 语音模型噪声适应系统,噪声适应方法和语音识别噪声适应程序
    • US07552049B2
    • 2009-06-23
    • US10796283
    • 2004-03-10
    • Zhipeng ZhangKiyotaka OtsujiToshiaki SugimuraSadaoki Furui
    • Zhipeng ZhangKiyotaka OtsujiToshiaki SugimuraSadaoki Furui
    • G10L15/00
    • G10L15/20
    • An object of the present invention is to enable optimal clustering for many types of noise data and to improve the accuracy of estimation of a speech model sequence of input speech. Noise is added to speech in accordance with noise-to-signal ratio conditions to generate noise-added speech (step S1), the mean value of speech cepstral is subtracted from the generated, noise-added speech (step 2), a Gaussian distribution model of each piece of noise-added speech is created (step S3), the likelihoods of the pieces of noise-added speech are calculated to generate a likelihood matrix (step S4) to obtain a clustering result. An optimum model is selected (step S7) and linear transformation is performed to provide a maximized likelihood (step S8). Because noise-added speech is consistently used both in clustering and model learning, clustering for many types of noise data and an accurate estimation of a speech model sequence can be achieved.
    • 本发明的一个目的是为多种类型的噪声数据实现最佳聚类,并提高输入语音的语音模型序列的估计精度。 根据噪声信号比率条件将噪声添加到语音中以产生噪声添加语音(步骤S1),从产生的噪声添加语音(步骤2)中减去语音倒频谱的平均值,高斯分布 创建每个噪声添加语音的模型(步骤S3),计算噪声添加语音片段的可能性,以生成似然矩阵(步骤S4)以获得聚类结果。 选择最佳模型(步骤S7),并执行线性变换以提供最大似然(步骤S8)。 因为在聚类和模型学习中始终使用增加噪音的语音,所以可以实现许多类型的噪声数据的聚类和语音模型序列的精确估计。