专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US07299178B2 Continuous speech recognition method and system using inter-word phonetic information 有权
标题翻译：连续语音识别方法和系统使用字间语音信息
公开(公告)号：US07299178B2
公开(公告)日：2007-11-20
申请号：US10784173
申请日：2004-02-24
申请人： Su-yeon Yoon , In-jeong Choi , Nam-hoon Kim
发明人： Su-yeon Yoon , In-jeong Choi , Nam-hoon Kim
IPC分类号： G10L15/00
CPC分类号： G10L15/187 , G10L15/02
摘要： A continuous speech recognition method and system are provided. The continuous speech recognition method includes constructing a pronunciation dictionary database including at least one pronunciation representation for each word which is influenced by applying phonological rules, wherein the pronunciation representation for the coda of a first word or the pronunciation representation for the onset of a second word following the first word is additionally indexed with an identifier if it does not match the phonetic pronunciation of its spelling, forming inter-word phonetic information in matrix form by combination of a number of all probable phonetic pairs, each of which is basically comprised of the coda of a first word and the onset of a second word following the first word, wherein the coda of the first word or the onset of the second word is indexed with an identifier if they undergo phonological changes and performing speech recognition on feature vectors extracted from an input speech signal with reference to the pronunciation dictionary database and the inter-word phonetic information.
摘要翻译：提供连续的语音识别方法和系统。所述连续语音识别方法包括：构建发音词典数据库，其包括受应用语音规则影响的每个单词的至少一个发音表示，其中用于第一个单词的节奏的发音表示或用于开始第二个单词的发音表示如果第一个单词与其拼写的语音发音不匹配，则另外用第一个单词进行索引，通过组合多个所有可能的语音对，以矩阵形式形成词间语音信息，其中每一个基本上由第一个单词的开头和第一个单词之后的第二个单词的开始，其中如果第一个单词的第一个单词的起始点或者第二个单词的开始是经过语音改变并且对从参考发音字典数据库的输入语音信号 ase和词间语音信息。

2. 发明申请

US20070083371A1 Apparatus and method for recognizing voice 有权
标题翻译：用于识别语音的装置和方法
公开(公告)号：US20070083371A1
公开(公告)日：2007-04-12
申请号：US11475963
申请日：2006-06-28
申请人： Sang-bae Jeong , Nam-hoon Kim , Jeong-su Kim , In-jeong Choi , Ick-sang Han
发明人： Sang-bae Jeong , Nam-hoon Kim , Jeong-su Kim , In-jeong Choi , Ick-sang Han
IPC分类号： G10L15/14
CPC分类号： G10L15/142
摘要： An apparatus and method for recognizing voice. The apparatus includes a feature vector extraction unit dividing an input voice signal into predetermined unit regions, and extracting feature vectors corresponding to each of the unit regions; a predicted node extraction unit extracting a list of second nodes whose travels to a first node corresponding to the extracted feature vectors are predicted, with reference to a network of one or more nodes; a single waveform similarity calculation unit calculating degrees of single waveform similarity of the first node and the second nodes of the list by substituting the extracted feature vectors into single waveform probability distributions that constitute voice signals corresponding to the second nodes; a multiple waveform similarity calculation unit calculating degrees of multiple waveform similarity by substituting the extracted feature vectors into multiple waveform probability distributions that constitute single waveform probability distributions usable to calculate the degrees of single waveform similarity in a preset range; and an output unit outputting a function-performing signal corresponding to a multiple waveform probability distribution that enables calculation of a highest of the calculated degrees of multiple waveform similarity.
摘要翻译：用于识别语音的装置和方法。该装置包括：特征向量提取单元，将输入的语音信号划分为预定的单位区域;提取与每个单位区域对应的特征向量; 参考一个或多个节点的网络，预测提取与对应于所提取的特征向量的对第一节点的行进的第二节点的列表的预测节点提取单元; 单个波形相似度计算单元，通过将提取的特征向量代入构成对应于第二节点的语音信号的单波形概率分布来计算第一节点和列表的第二节点的单波形相似度的度数; 多波形相似度计算单元，通过将所提取的特征向量代入构成单个波形概率分布的多个波形概率分布来计算多个波形相似度，以计算预设范围内的单一波形相似度; 以及输出单元，输出与多波形概率分布相对应的功能执行信号，能够计算所计算出的多重波形相似度的最高值。

3. 发明授权

US08140334B2 Apparatus and method for recognizing voice 有权
标题翻译：用于识别语音的装置和方法
公开(公告)号：US08140334B2
公开(公告)日：2012-03-20
申请号：US11475963
申请日：2006-06-28
申请人： Sang-bae Jeong , Nam-hoon Kim , Jeong-su Kim , In-jeong Choi , Ick-sang Han
发明人： Sang-bae Jeong , Nam-hoon Kim , Jeong-su Kim , In-jeong Choi , Ick-sang Han
IPC分类号： G10L15/14 , G10L15/00
CPC分类号： G10L15/142
摘要： An apparatus and method for recognizing voice. The apparatus includes a feature vector extraction unit dividing an input voice signal into predetermined unit regions, and extracting feature vectors corresponding to each of the unit regions; a predicted node extraction unit extracting a list of second nodes whose travels to a first node corresponding to the extracted feature vectors are predicted, with reference to a network of one or more nodes; a single waveform similarity calculation unit calculating degrees of single waveform similarity of the first node and the second nodes of the list by substituting the extracted feature vectors into single waveform probability distributions that constitute voice signals corresponding to the second nodes; a multiple waveform similarity calculation unit calculating degrees of multiple waveform similarity by substituting the extracted feature vectors into multiple waveform probability distributions that constitute single waveform probability distributions usable to calculate the degrees of single waveform similarity in a preset range; and an output unit outputting a function-performing signal corresponding to a multiple waveform probability distribution that enables calculation of a highest of the calculated degrees of multiple waveform similarity.
摘要翻译：用于识别语音的装置和方法。该装置包括：特征向量提取单元，将输入的语音信号划分为预定的单位区域;提取与每个单位区域对应的特征向量; 参考一个或多个节点的网络，预测提取与对应于所提取的特征向量的对第一节点的行进的第二节点的列表的预测节点提取单元; 单个波形相似度计算单元，通过将提取的特征向量代入构成对应于第二节点的语音信号的单波形概率分布来计算第一节点和列表的第二节点的单波形相似度的度数; 多波形相似度计算单元，通过将所提取的特征向量代入构成单个波形概率分布的多个波形概率分布来计算多个波形相似度，以计算预设范围内的单一波形相似度; 以及输出单元，输出与多波形概率分布相对应的功能执行信号，能够计算所计算出的多重波形相似度的最高值。

4. 发明申请

US20060173673A1 Speech recognition method and apparatus using lexicon group tree 有权
标题翻译：使用词汇组树的语音识别方法和装置
公开(公告)号：US20060173673A1
公开(公告)日：2006-08-03
申请号：US11342701
申请日：2006-01-31
申请人： Sang-bae Jeong , In-jeong Choi , Ick-sang Han , Jeong-su Kim
发明人： Sang-bae Jeong , In-jeong Choi , Ick-sang Han , Jeong-su Kim
IPC分类号： G06F17/27
CPC分类号： G06F17/2765 , G10L15/197
摘要： A method and an apparatus for selecting a vocabulary closest to an input speech from among lexicons stored in memory, wherein a centroid lexicon representing lexicons belonging to a predetermined lexicon group is generated. Two lexicons, having a longest distance therebetween in the lexicon group, are selected using the centroid lexicon from the lexicon group, and a node indicating the lexicon group branches based on the two selected lexicons. A node having low group similarity is selected from among current terminal nodes, including branch nodes, and the above procedure is repeatedly performed on a lexicon group indicated by the selected node.
摘要翻译：一种用于从存储在存储器中的词典中选择最接近输入语音的词汇的方法和装置，其中生成表示属于预定词典组的词典的质心词典。在词典组中具有最长距离的两个词典使用来自词典组的质心词典进行选择，并且指示词典组的节点基于两个选定的词典进行分支。从包括分支节点的当前终端节点中选择具有低组相似性的节点，并且对由所选节点指示的词典组重复执行上述过程。

5. 发明授权

US07953594B2 Speech recognition method and apparatus using lexicon group tree 有权
标题翻译：使用词汇组树的语音识别方法和装置
公开(公告)号：US07953594B2
公开(公告)日：2011-05-31
申请号：US11342701
申请日：2006-01-31
申请人： Sang-bae Jeong , In-jeong Choi , Ick-sang Han , Jeong-su Kim
发明人： Sang-bae Jeong , In-jeong Choi , Ick-sang Han , Jeong-su Kim
IPC分类号： G10L11/06
CPC分类号： G06F17/2765 , G10L15/197
摘要： A method and an apparatus for selecting a vocabulary closest to an input speech from among lexicons stored in memory, wherein a centroid lexicon representing lexicons belonging to a predetermined lexicon group is generated. Two lexicons, having a longest distance therebetween in the lexicon group, are selected using the centroid lexicon from the lexicon group, and a node indicating the lexicon group branches based on the two selected lexicons. A node having low group similarity is selected from among current terminal nodes, including branch nodes, and the above procedure is repeatedly performed on a lexicon group indicated by the selected node.
摘要翻译：一种用于从存储在存储器中的词典中选择最接近输入语音的词汇的方法和装置，其中生成表示属于预定词典组的词典的质心词典。在词典组中具有最长距离的两个词典使用来自词典组的质心词典进行选择，并且指示词典组的节点基于两个选定的词典进行分支。从包括分支节点的当前终端节点中选择具有低组相似性的节点，并且对由所选节点指示的词典组重复执行上述过程。

6. 发明申请

US20050065793A1 Method and apparatus for discriminative estimation of parameters in maximum a posteriori (MAP) speaker adaptation condition and voice recognition method and apparatus including these 失效
标题翻译：最大后验（MAP）说话者适应条件中的参数的鉴别估计方法和装置以及包括这些参数的语音识别方法和装置
公开(公告)号：US20050065793A1
公开(公告)日：2005-03-24
申请号：US10898382
申请日：2004-07-26
申请人： In-jeong Choi , Sang-ryong Kim
发明人： In-jeong Choi , Sang-ryong Kim
IPC分类号： G10L15/07 , G10L15/12 , G10L19/12
CPC分类号： G10L15/07
摘要： A method and apparatus for discriminative estimation of parameters in a maximum a posteriori (MAP) speaker adaptation condition, and a voice recognition apparatus having the apparatus and a voice recognition method using the method are provided. The method for discriminative estimation of parameters in a maximum a posteriori (MAP) speaker adaptation condition, in which at least speaker-independent model parameters and prior density parameters, which are standards in recognizing a speaker's voice, are obtained as the result of model training after fetching training sets on a plurality of speakers from a training database, has the steps of (a) classifying adaptation data among training sets for respective speakers; (b) obtaining model parameters adapted from adaptation data on each speaker by using the initial values of the parameters; (c) searching a plurality of candidate hypotheses on each uttered sentence of training sets by using the adapted model parameters, and calculating gradients of speaker-independent model parameters by measuring the degree of errors on each training sentence; and (d) when training sets of all speakers are adapted, updating parameters, which were set at the initial stage, based on the calculated gradients.
摘要翻译：提供了一种用于鉴别性估计最大后验（MAP）说话者适应条件中的参数的方法和装置，以及具有使用该方法的装置和语音识别方法的语音识别装置。作为模型训练的结果，获得最大后验（MAP）说话者适应条件中的参数的辨别性估计的方法，其中至少与说话者独立的模型参数和作为识别说话者的声音的标准的先前密度参数被获得在从训练数据库获取多个扬声器上的训练集之后，具有以下步骤：（a）在适用于各个扬声器的训练集之间对适配数据进行分类; （b）通过使用参数的初始值从每个说话者的适应数据中获得适应的模型参数; （c）通过使用适应的模型参数来搜索训练集的每个发音句子上的多个候选假设，以及通过测量每个训练句子的错误程度来计算与说话者无关的模型参数的梯度; 和（d）当适应所有发言者的训练集时，根据计算的梯度更新在初始阶段设定的参数。

7. 发明授权

US08301450B2 Apparatus, method, and medium for dialogue speech recognition using topic domain detection 有权
标题翻译：使用主题域检测的对话语音识别的装置，方法和介质
公开(公告)号：US08301450B2
公开(公告)日：2012-10-30
申请号：US11589165
申请日：2006-10-30
申请人： Jae-won Lee , In-jeong Choi
发明人： Jae-won Lee , In-jeong Choi
IPC分类号： G06F17/27 , G10L15/00 , G10L15/04 , G10L17/00 , G10L15/18
CPC分类号： G10L15/1822 , G10L15/1815
摘要： An apparatus, method, and medium for dialogue speech recognition using topic domain detection are disclosed. An apparatus includes a forward search module performing a forward search in order to create a word lattice similar to a feature vector, which is extracted from an input voice signal, with reference to a global language model database, a pronunciation dictionary database and an acoustic model database, which have been previously established, a topic-domain-detection module detecting a topic domain by inferring a topic based on meanings of vocabularies contained in the word lattice using information of the word lattice created as a result of the forward search, and a backward-decoding module performing a backward decoding of the detected topic domain with reference to a specific topic domain language model database, which has been previously established, thereby outputting a speech recognition result for an input voice signal in text form. Accuracy and efficiency for a dialogue sentence are improved.
摘要翻译：公开了一种使用主题域检测进行对话语音识别的装置，方法和介质。一种装置，包括执行前向搜索的前向搜索模块，以便参考全局语言模型数据库，发音词典数据库和声学模型来创建类似于从输入语音信号提取的特征向量的单词格数据库，其已经建立，主题域检测模块通过使用由作为前向搜索的结果创建的单词格点的信息，基于包含在单词格中的词汇的含义来推断主题来检测主题领域，以及后向解码模块参照已经建立的特定主题域语言模型数据库执行所检测到的主题域的反向解码，从而以文本形式输出用于输入语音信号的语音识别结果。提高对话句子的准确性和效率。

8. 发明授权

US07324941B2 Method and apparatus for discriminative estimation of parameters in maximum a posteriori (MAP) speaker adaptation condition and voice recognition method and apparatus including these 失效
标题翻译：最大后验（MAP）说话者适应条件中的参数的鉴别估计方法和装置以及包括这些参数的语音识别方法和装置
公开(公告)号：US07324941B2
公开(公告)日：2008-01-29
申请号：US10898382
申请日：2004-07-26
申请人： In-jeong Choi , Sang-ryong Kim
发明人： In-jeong Choi , Sang-ryong Kim
IPC分类号： G10L15/28
CPC分类号： G10L15/07
摘要： A method and apparatus for discriminative estimation of parameters in a maximum a posteriori (MAP) speaker adaptation condition, and a voice recognition apparatus having the apparatus and a voice recognition method using the method are provided. The method for discriminative estimation of parameters in a maximum a posteriori (MAP) speaker adaptation condition, in which at least speaker-independent model parameters and prior density parameters, which are standards in recognizing a speaker's voice, are obtained as the result of model training after fetching training sets on a plurality of speakers from a training database, has the steps of (a) classifying adaptation data among training sets for respective speakers; (b) obtaining model parameters adapted from adaptation data on each speaker by using the initial values of the parameters; (c) searching a plurality of candidate hypotheses on each uttered sentence of training sets by using the adapted model parameters, and calculating gradients of speaker-independent model parameters by measuring the degree of errors on each training sentence; and (d) when training sets of all speakers are adapted, updating parameters, which were set at the initial stage, based on the calculated gradients.
摘要翻译：提供了一种用于鉴别性估计最大后验（MAP）说话者适应条件中的参数的方法和装置，以及具有使用该方法的装置和语音识别方法的语音识别装置。作为模型训练的结果，获得最大后验（MAP）说话者适应条件中的参数的辨别性估计的方法，其中至少与说话者独立的模型参数和作为识别说话者的声音的标准的先前密度参数被获得在从训练数据库获取多个扬声器上的训练集之后，具有以下步骤：（a）在适用于各个扬声器的训练集之间对适配数据进行分类; （b）通过使用参数的初始值从每个说话者的适应数据中获得适应的模型参数; （c）通过使用适应的模型参数来搜索训练集的每个发音句子上的多个候选假设，以及通过测量每个训练句子的错误程度来计算与说话者无关的模型参数的梯度; 和（d）当适应所有发言者的训练集时，根据计算的梯度更新在初始阶段设定的参数。

9. 发明申请

US20060100871A1 Speech recognition method, apparatus and navigation system 审中-公开
标题翻译：语音识别方法，装置和导航系统
公开(公告)号：US20060100871A1
公开(公告)日：2006-05-11
申请号：US11253641
申请日：2005-10-20
申请人： In-jeong Choi , Jeong-su Kim , Kwang-il Hwang
发明人： In-jeong Choi , Jeong-su Kim , Kwang-il Hwang
IPC分类号： G10L15/04
CPC分类号： G01C21/3629 , G01C21/3608 , G01C21/3664 , G10L15/22
摘要： A speech recognition method and apparatus and a navigation system having the speech recognition apparatus are provided. The speech recognition method includes capturing speech as speech signal and extracting features from the speech signal, selecting candidates of a subword among subwords of the word based on the extracted features and displaying the candidate subwords for the subword, selecting candidates of a next subword following the subword based on the selected candidates of the subword and displaying the candidates of the next subword, and determining whether the user has selected one of the candidates of the next subword and, if not, selecting candidates of subwords following the next subword based on the series of subwords that have been previously selected by the user and displaying the selected candidates of the next subword.
摘要翻译：提供具有语音识别装置的语音识别方法和装置以及导航系统。语音识别方法包括将语音作为语音信号进行采集，并从语音信号中提取特征，基于所提取的特征选择单词的子词中的子词的候选，并显示子词的候选词，选择下一个子词的候选基于所选择的子词的候选者的子字，并显示下一个子词的候选，并且确定用户是否已经选择了下一个子词的候选中的一个，如果不是，则基于该系列选择下一个子词后的子词的候选的以前由用户选择并显示下一个子词的选定候选者的子词。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式