专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明申请

US20170301352A1 RE-RECOGNIZING SPEECH WITH EXTERNAL DATA SOURCES 审中-公开
公开(公告)号：US20170301352A1
公开(公告)日：2017-10-19
申请号：US15637526
申请日：2017-06-29
申请人： Google Inc.
发明人： Trevor D. Strohman , Johan Schalkwyk , Gleb Skobeltsyn
IPC分类号： G10L15/32 , G10L15/22 , G10L15/19 , G10L25/51 , G10L15/02
CPC分类号： G10L15/32 , G10L15/02 , G10L15/183 , G10L15/19 , G10L15/22 , G10L15/26 , G10L25/51 , G10L2015/025
摘要： Methods, including computer programs encoded on a computer storage medium, for improving speech recognition based on external data sources. In one aspect, a method includes obtaining an initial candidate transcription of an utterance using an automated speech recognizer and identifying, based on a language model that is not used by the automated speech recognizer in generating the initial candidate transcription, one or more terms that are phonetically similar to one or more terms that do occur in the initial candidate transcription. Additional actions include generating one or more additional candidate transcriptions based on the identified one or more terms and selecting a transcription from among the candidate transcriptions.

2. 发明授权

US08959020B1 Discovery of problematic pronunciations for automatic speech recognition systems 有权
标题翻译：发现自动语音识别系统的有问题的发音
公开(公告)号：US08959020B1
公开(公告)日：2015-02-17
申请号：US13853150
申请日：2013-03-29
申请人： Google Inc.
发明人： Brian Strope , Francoise Beaufays , Trevor D. Strohman
IPC分类号： G10L15/18 , G06F17/24
CPC分类号： G10L15/187
摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for discovery of problematic pronunciations for automatic speech recognition systems. One of the methods includes determining a frequency of occurrences of one or more n-grams in transcribed text and a frequency of occurrences of the n-grams in typed text and classifying a system pronunciation of a word included in the n-grams as correct or incorrect based on the frequencies. The n-grams may comprise one or more words and at least one of the words is classified as incorrect based on the frequencies. The frequencies of the specific n-grams may be determined across a domain using one or more n-grams that typically appear adjacent to the specific n-grams.
摘要翻译：方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于发现用于自动语音识别系统的有问题的发音。其中一种方法包括确定转录文本中一个或多个n克的出现频率和类型文本中出现的n-gram的频率，并将包含在n-gram中的单词的系统发音分类为正确或基于频率不正确。 n克可以包括一个或多个单词，并且基于频率将这些单词中的至少一个分类为不正确的。可以使用通常出现在特定n-gram附近的一个或多个n克来跨域确定特定n克的频率。

3. 发明授权

US09741339B2 Data driven word pronunciation learning and scoring with crowd sourcing based on the word's phonemes pronunciation scores 有权
公开(公告)号：US09741339B2
公开(公告)日：2017-08-22
申请号：US13930495
申请日：2013-06-28
申请人： Google Inc.
发明人： Fuchun Peng , Francoise Beaufays , Brian Strope , Xin Lei , Pedro J. Moreno Mengibar , Trevor D. Strohman
IPC分类号： G10L15/00 , G09B5/00 , G10L15/14 , G10L15/18 , G10L13/08 , G10L15/06 , G09B17/00
CPC分类号： G10L15/18 , G09B17/006 , G10L13/08 , G10L15/06
摘要： Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining pronunciations for particular terms. The methods, systems, and apparatus include actions of obtaining audio samples of speech corresponding to a particular term and obtaining candidate pronunciations for the particular term. Further actions include generating, for each candidate pronunciation for the particular term and audio sample of speech corresponding to the particular term, a score reflecting a level of similarity between of the candidate pronunciation and the audio sample, wherein the said score for the particular term is obtained by using a minimum of individual scores of phonemes comprising the term. Additional actions include aggregating the scores for each candidate pronunciation and adding one or more candidate pronunciations for the particular term to a pronunciation lexicon based on the aggregated scores for the candidate pronunciations.

4. 发明申请

US20170229124A1 RE-RECOGNIZING SPEECH WITH EXTERNAL DATA SOURCES 审中-公开
公开(公告)号：US20170229124A1
公开(公告)日：2017-08-10
申请号：US15016609
申请日：2016-02-05
申请人： Google Inc.
发明人： Trevor D. Strohman , Johan Schalkwyk , Gleb Skobeltsyn
IPC分类号： G10L15/32 , G10L15/02 , G10L25/51 , G10L15/26 , G10L15/183
CPC分类号： G10L15/32 , G10L15/02 , G10L15/183 , G10L15/19 , G10L15/22 , G10L15/26 , G10L25/51 , G10L2015/025
摘要： Methods, including computer programs encoded on a computer storage medium, for improving speech recognition based on external data sources. In one aspect, a method includes obtaining an initial candidate transcription of an utterance using an automated speech recognizer and identifying, based on a language model that is not used by the automated speech recognizer in generating the initial candidate transcription, one or more terms that are phonetically similar to one or more terms that do occur in the initial candidate transcription. Additional actions include generating one or more additional candidate transcriptions based on the identified one or more terms and selecting a transcription from among the candidate transcriptions.

5. 发明申请

US20150006178A1 DATA DRIVEN PRONUNCIATION LEARNING WITH CROWD SOURCING 有权
标题翻译：数据驱动公开学习与CROWD采购
公开(公告)号：US20150006178A1
公开(公告)日：2015-01-01
申请号：US13930495
申请日：2013-06-28
申请人： Google Inc.
发明人： Fuchun Peng , Francoise Beaufays , Brian Strope , Xin Lei , Pedro J. Moreno Mengibar , Trevor D. Strohman
IPC分类号： G10L15/18
CPC分类号： G10L15/18 , G09B17/006 , G10L13/08 , G10L15/06
摘要： Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining pronunciations for particular terms. The methods, systems, and apparatus include actions of obtaining audio samples of speech corresponding to a particular term and obtaining candidate pronunciations for the particular term. Further actions include generating, for each candidate pronunciation for the particular term and audio sample of speech corresponding to the particular term, a score reflecting a level of similarity between of the candidate pronunciation and the audio sample. Additional actions include aggregating the scores for each candidate pronunciation and adding one or more candidate pronunciations for the particular term to a pronunciation lexicon based on the aggregated scores for the candidate pronunciations.
摘要翻译：方法，系统和装置，包括在计算机存储介质上编码的计算机程序，用于确定特定术语的发音。方法，系统和装置包括获得与特定术语相对应的语音样本的动作，并获得特定术语的候选发音。进一步的动作包括针对特定术语的每个候选发音和对应于特定术语的语音样本生成反映候选发音和音频样本之间的相似程度的分数。附加动作包括聚合每个候选发音的分数，并且基于候选发音的聚合分数，将特定术语的一个或多个候选发音添加到发音词典。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式