会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Continuous speech recognition method and system using inter-word phonetic information
    • 连续语音识别方法和系统使用字间语音信息
    • US07299178B2
    • 2007-11-20
    • US10784173
    • 2004-02-24
    • Su-yeon YoonIn-jeong ChoiNam-hoon Kim
    • Su-yeon YoonIn-jeong ChoiNam-hoon Kim
    • G10L15/00
    • G10L15/187G10L15/02
    • A continuous speech recognition method and system are provided. The continuous speech recognition method includes constructing a pronunciation dictionary database including at least one pronunciation representation for each word which is influenced by applying phonological rules, wherein the pronunciation representation for the coda of a first word or the pronunciation representation for the onset of a second word following the first word is additionally indexed with an identifier if it does not match the phonetic pronunciation of its spelling, forming inter-word phonetic information in matrix form by combination of a number of all probable phonetic pairs, each of which is basically comprised of the coda of a first word and the onset of a second word following the first word, wherein the coda of the first word or the onset of the second word is indexed with an identifier if they undergo phonological changes and performing speech recognition on feature vectors extracted from an input speech signal with reference to the pronunciation dictionary database and the inter-word phonetic information.
    • 提供连续的语音识别方法和系统。 所述连续语音识别方法包括:构建发音词典数据库,其包括受应用语音规则影响的每个单词的至少一个发音表示,其中用于第一个单词的节奏的发音表示或用于开始第二个单词的发音表示 如果第一个单词与其拼写的语音发音不匹配,则另外用第一个单词进行索引,通过组合多个所有可能的语音对,以矩阵形式形成词间语音信息,其中每一个基本上由 第一个单词的开头和第一个单词之后的第二个单词的开始,其中如果第一个单词的第一个单词的起始点或者第二个单词的开始是经过语音改变并且对从 参考发音字典数据库的输入语音信号 ase和词间语音信息。
    • 2. 发明申请
    • Apparatus and method for recognizing voice
    • 用于识别语音的装置和方法
    • US20070083371A1
    • 2007-04-12
    • US11475963
    • 2006-06-28
    • Sang-bae JeongNam-hoon KimJeong-su KimIn-jeong ChoiIck-sang Han
    • Sang-bae JeongNam-hoon KimJeong-su KimIn-jeong ChoiIck-sang Han
    • G10L15/14
    • G10L15/142
    • An apparatus and method for recognizing voice. The apparatus includes a feature vector extraction unit dividing an input voice signal into predetermined unit regions, and extracting feature vectors corresponding to each of the unit regions; a predicted node extraction unit extracting a list of second nodes whose travels to a first node corresponding to the extracted feature vectors are predicted, with reference to a network of one or more nodes; a single waveform similarity calculation unit calculating degrees of single waveform similarity of the first node and the second nodes of the list by substituting the extracted feature vectors into single waveform probability distributions that constitute voice signals corresponding to the second nodes; a multiple waveform similarity calculation unit calculating degrees of multiple waveform similarity by substituting the extracted feature vectors into multiple waveform probability distributions that constitute single waveform probability distributions usable to calculate the degrees of single waveform similarity in a preset range; and an output unit outputting a function-performing signal corresponding to a multiple waveform probability distribution that enables calculation of a highest of the calculated degrees of multiple waveform similarity.
    • 用于识别语音的装置和方法。 该装置包括:特征向量提取单元,将输入的语音信号划分为预定的单位区域;提取与每个单位区域对应的特征向量; 参考一个或多个节点的网络,预测提取与对应于所提取的特征向量的对第一节点的行进的第二节点的列表的预测节点提取单元; 单个波形相似度计算单元,通过将提取的特征向量代入构成对应于第二节点的语音信号的单波形概率分布来计算第一节点和列表的第二节点的单波形相似度的度数; 多波形相似度计算单元,通过将所提取的特征向量代入构成单个波形概率分布的多个波形概率分布来计算多个波形相似度,以计算预设范围内的单一波形相似度; 以及输出单元,输出与多波形概率分布相对应的功能执行信号,能够计算所计算出的多重波形相似度的最高值。
    • 3. 发明授权
    • Apparatus and method for recognizing voice
    • 用于识别语音的装置和方法
    • US08140334B2
    • 2012-03-20
    • US11475963
    • 2006-06-28
    • Sang-bae JeongNam-hoon KimJeong-su KimIn-jeong ChoiIck-sang Han
    • Sang-bae JeongNam-hoon KimJeong-su KimIn-jeong ChoiIck-sang Han
    • G10L15/14G10L15/00
    • G10L15/142
    • An apparatus and method for recognizing voice. The apparatus includes a feature vector extraction unit dividing an input voice signal into predetermined unit regions, and extracting feature vectors corresponding to each of the unit regions; a predicted node extraction unit extracting a list of second nodes whose travels to a first node corresponding to the extracted feature vectors are predicted, with reference to a network of one or more nodes; a single waveform similarity calculation unit calculating degrees of single waveform similarity of the first node and the second nodes of the list by substituting the extracted feature vectors into single waveform probability distributions that constitute voice signals corresponding to the second nodes; a multiple waveform similarity calculation unit calculating degrees of multiple waveform similarity by substituting the extracted feature vectors into multiple waveform probability distributions that constitute single waveform probability distributions usable to calculate the degrees of single waveform similarity in a preset range; and an output unit outputting a function-performing signal corresponding to a multiple waveform probability distribution that enables calculation of a highest of the calculated degrees of multiple waveform similarity.
    • 用于识别语音的装置和方法。 该装置包括:特征向量提取单元,将输入的语音信号划分为预定的单位区域;提取与每个单位区域对应的特征向量; 参考一个或多个节点的网络,预测提取与对应于所提取的特征向量的对第一节点的行进的第二节点的列表的预测节点提取单元; 单个波形相似度计算单元,通过将提取的特征向量代入构成对应于第二节点的语音信号的单波形概率分布来计算第一节点和列表的第二节点的单波形相似度的度数; 多波形相似度计算单元,通过将所提取的特征向量代入构成单个波形概率分布的多个波形概率分布来计算多个波形相似度,以计算预设范围内的单一波形相似度; 以及输出单元,输出与多波形概率分布相对应的功能执行信号,能够计算所计算出的多重波形相似度的最高值。
    • 6. 发明申请
    • Method and apparatus for discriminative estimation of parameters in maximum a posteriori (MAP) speaker adaptation condition and voice recognition method and apparatus including these
    • 最大后验(MAP)说话者适应条件中的参数的鉴别估计方法和装置以及包括这些参数的语音识别方法和装置
    • US20050065793A1
    • 2005-03-24
    • US10898382
    • 2004-07-26
    • In-jeong ChoiSang-ryong Kim
    • In-jeong ChoiSang-ryong Kim
    • G10L15/07G10L15/12G10L19/12
    • G10L15/07
    • A method and apparatus for discriminative estimation of parameters in a maximum a posteriori (MAP) speaker adaptation condition, and a voice recognition apparatus having the apparatus and a voice recognition method using the method are provided. The method for discriminative estimation of parameters in a maximum a posteriori (MAP) speaker adaptation condition, in which at least speaker-independent model parameters and prior density parameters, which are standards in recognizing a speaker's voice, are obtained as the result of model training after fetching training sets on a plurality of speakers from a training database, has the steps of (a) classifying adaptation data among training sets for respective speakers; (b) obtaining model parameters adapted from adaptation data on each speaker by using the initial values of the parameters; (c) searching a plurality of candidate hypotheses on each uttered sentence of training sets by using the adapted model parameters, and calculating gradients of speaker-independent model parameters by measuring the degree of errors on each training sentence; and (d) when training sets of all speakers are adapted, updating parameters, which were set at the initial stage, based on the calculated gradients.
    • 提供了一种用于鉴别性估计最大后验(MAP)说话者适应条件中的参数的方法和装置,以及具有使用该方法的装置和语音识别方法的语音识别装置。 作为模型训练的结果,获得最大后验(MAP)说话者适应条件中的参数的辨别性估计的方法,其中至少与说话者独立的模型参数和作为识别说话者的声音的标准的先前密度参数被获得 在从训练数据库获取多个扬声器上的训练集之后,具有以下步骤:(a)在适用于各个扬声器的训练集之间对适配数据进行分类; (b)通过使用参数的初始值从每个说话者的适应数据中获得适应的模型参数; (c)通过使用适应的模型参数来搜索训练集的每个发音句子上的多个候选假设,以及通过测量每个训练句子的错误程度来计算与说话者无关的模型参数的梯度; 和(d)当适应所有发言者的训练集时,根据计算的梯度更新在初始阶段设定的参数。
    • 7. 发明授权
    • Apparatus, method, and medium for dialogue speech recognition using topic domain detection
    • 使用主题域检测的对话语音识别的装置,方法和介质
    • US08301450B2
    • 2012-10-30
    • US11589165
    • 2006-10-30
    • Jae-won LeeIn-jeong Choi
    • Jae-won LeeIn-jeong Choi
    • G06F17/27G10L15/00G10L15/04G10L17/00G10L15/18
    • G10L15/1822G10L15/1815
    • An apparatus, method, and medium for dialogue speech recognition using topic domain detection are disclosed. An apparatus includes a forward search module performing a forward search in order to create a word lattice similar to a feature vector, which is extracted from an input voice signal, with reference to a global language model database, a pronunciation dictionary database and an acoustic model database, which have been previously established, a topic-domain-detection module detecting a topic domain by inferring a topic based on meanings of vocabularies contained in the word lattice using information of the word lattice created as a result of the forward search, and a backward-decoding module performing a backward decoding of the detected topic domain with reference to a specific topic domain language model database, which has been previously established, thereby outputting a speech recognition result for an input voice signal in text form. Accuracy and efficiency for a dialogue sentence are improved.
    • 公开了一种使用主题域检测进行对话语音识别的装置,方法和介质。 一种装置,包括执行前向搜索的前向搜索模块,以便参考全局语言模型数据库,发音词典数据库和声学模型来创建类似于从输入语音信号提取的特征向量的单词格 数据库,其已经建立,主题域检测模块通过使用由作为前向搜索的结果创建的单词格点的信息,基于包含在单词格中的词汇的含义来推断主题来检测主题领域,以及 后向解码模块参照已经建立的特定主题域语言模型数据库执行所检测到的主题域的反向解码,从而以文本形式输出用于输入语音信号的语音识别结果。 提高对话句子的​​准确性和效率。
    • 8. 发明授权
    • Method and apparatus for discriminative estimation of parameters in maximum a posteriori (MAP) speaker adaptation condition and voice recognition method and apparatus including these
    • 最大后验(MAP)说话者适应条件中的参数的鉴别估计方法和装置以及包括这些参数的语音识别方法和装置
    • US07324941B2
    • 2008-01-29
    • US10898382
    • 2004-07-26
    • In-jeong ChoiSang-ryong Kim
    • In-jeong ChoiSang-ryong Kim
    • G10L15/28
    • G10L15/07
    • A method and apparatus for discriminative estimation of parameters in a maximum a posteriori (MAP) speaker adaptation condition, and a voice recognition apparatus having the apparatus and a voice recognition method using the method are provided. The method for discriminative estimation of parameters in a maximum a posteriori (MAP) speaker adaptation condition, in which at least speaker-independent model parameters and prior density parameters, which are standards in recognizing a speaker's voice, are obtained as the result of model training after fetching training sets on a plurality of speakers from a training database, has the steps of (a) classifying adaptation data among training sets for respective speakers; (b) obtaining model parameters adapted from adaptation data on each speaker by using the initial values of the parameters; (c) searching a plurality of candidate hypotheses on each uttered sentence of training sets by using the adapted model parameters, and calculating gradients of speaker-independent model parameters by measuring the degree of errors on each training sentence; and (d) when training sets of all speakers are adapted, updating parameters, which were set at the initial stage, based on the calculated gradients.
    • 提供了一种用于鉴别性估计最大后验(MAP)说话者适应条件中的参数的方法和装置,以及具有使用该方法的装置和语音识别方法的语音识别装置。 作为模型训练的结果,获得最大后验(MAP)说话者适应条件中的参数的辨别性估计的方法,其中至少与说话者独立的模型参数和作为识别说话者的声音的标准的先前密度参数被获得 在从训练数据库获取多个扬声器上的训练集之后,具有以下步骤:(a)在适用于各个扬声器的训练集之间对适配数据进行分类; (b)通过使用参数的初始值从每个说话者的适应数据中获得适应的模型参数; (c)通过使用适应的模型参数来搜索训练集的每个发音句子上的多个候选假设,以及通过测量每个训练句子的错误程度来计算与说话者无关的模型参数的梯度; 和(d)当适应所有发言者的训练集时,根据计算的梯度更新在初始阶段设定的参数。
    • 9. 发明申请
    • Speech recognition method, apparatus and navigation system
    • 语音识别方法,装置和导航系统
    • US20060100871A1
    • 2006-05-11
    • US11253641
    • 2005-10-20
    • In-jeong ChoiJeong-su KimKwang-il Hwang
    • In-jeong ChoiJeong-su KimKwang-il Hwang
    • G10L15/04
    • G01C21/3629G01C21/3608G01C21/3664G10L15/22
    • A speech recognition method and apparatus and a navigation system having the speech recognition apparatus are provided. The speech recognition method includes capturing speech as speech signal and extracting features from the speech signal, selecting candidates of a subword among subwords of the word based on the extracted features and displaying the candidate subwords for the subword, selecting candidates of a next subword following the subword based on the selected candidates of the subword and displaying the candidates of the next subword, and determining whether the user has selected one of the candidates of the next subword and, if not, selecting candidates of subwords following the next subword based on the series of subwords that have been previously selected by the user and displaying the selected candidates of the next subword.
    • 提供具有语音识别装置的语音识别方法和装置以及导航系统。 语音识别方法包括将语音作为语音信号进行采集,并从语音信号中提取特征,基于所提取的特征选择单词的子词中的子词的候选,并显示子词的候选词,选择下一个子词的候选 基于所选择的子词的候选者的子字,并显示下一个子词的候选,并且确定用户是否已经选择了下一个子词的候选中的一个,如果不是,则基于该系列选择下一个子词后的子词的候选 的以前由用户选择并显示下一个子词的选定候选者的子词。