    • 2. Granted invention patent
    • Discovery of problematic pronunciations for automatic speech recognition systems
    • US08959020B1
    • 2015-02-17
    • US13853150
    • 2013-03-29
    • Google Inc.
    • Brian Strope; Francoise Beaufays; Trevor D. Strohman
    • G10L15/18; G06F17/24
    • G10L15/187
    • Methods, systems, and apparatus, including computer programs encoded on computer storage media, for discovery of problematic pronunciations for automatic speech recognition systems. One of the methods includes determining a frequency of occurrences of one or more n-grams in transcribed text and a frequency of occurrences of the n-grams in typed text and classifying a system pronunciation of a word included in the n-grams as correct or incorrect based on the frequencies. The n-grams may comprise one or more words and at least one of the words is classified as incorrect based on the frequencies. The frequencies of the specific n-grams may be determined across a domain using one or more n-grams that typically appear adjacent to the specific n-grams.
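The frequency comparison described in the abstract above can be illustrated with a minimal sketch. The abstract only states that classification is "based on the frequencies"; the specific rule used here (flag an n-gram when its relative frequency in transcribed text falls below a fixed fraction of its relative frequency in typed text) is an assumption, as are the function names and the `ratio_threshold` parameter.

```python
from collections import Counter


def ngram_counts(sentences, n=1):
    """Count word n-grams (as tuples) across a list of sentences."""
    counts = Counter()
    for sentence in sentences:
        words = sentence.lower().split()
        for i in range(len(words) - n + 1):
            counts[tuple(words[i:i + n])] += 1
    return counts


def relative_freq(counts):
    """Convert raw counts to relative frequencies."""
    total = sum(counts.values())
    return {gram: c / total for gram, c in counts.items()} if total else {}


def classify_pronunciations(transcribed, typed, n=1, ratio_threshold=0.5):
    """Label each typed n-gram 'correct' or 'incorrect'.

    Assumed rule (not taken from the patent): an n-gram that people type
    much more often than the recognizer transcribes it is a hint that the
    system pronunciation of one of its words is wrong.
    """
    typed_freq = relative_freq(ngram_counts(typed, n))
    spoken_freq = relative_freq(ngram_counts(transcribed, n))
    labels = {}
    for gram, f_typed in typed_freq.items():
        ratio = spoken_freq.get(gram, 0.0) / f_typed
        labels[gram] = "correct" if ratio >= ratio_threshold else "incorrect"
    return labels


# Toy data: "tsipras" is typed but never shows up in recognizer output.
typed = ["call tsipras now", "call john now"]
transcribed = ["call john now", "call john now"]
labels = classify_pronunciations(transcribed, typed)
```

With this toy data, `labels[("tsipras",)]` is `"incorrect"` while the other unigrams are `"correct"`. A real system would need far larger corpora and, per the abstract, could restrict the counts to a domain using neighboring n-grams.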
    • 5. Invention patent application
    • DATA DRIVEN PRONUNCIATION LEARNING WITH CROWD SOURCING
    • US20150006178A1
    • 2015-01-01
    • US13930495
    • 2013-06-28
    • Google Inc.
    • Fuchun Peng; Francoise Beaufays; Brian Strope; Xin Lei; Pedro J. Moreno Mengibar; Trevor D. Strohman
    • G10L15/18
    • G10L15/18; G09B17/006; G10L13/08; G10L15/06
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining pronunciations for particular terms. The methods, systems, and apparatus include actions of obtaining audio samples of speech corresponding to a particular term and obtaining candidate pronunciations for the particular term. Further actions include generating, for each candidate pronunciation for the particular term and audio sample of speech corresponding to the particular term, a score reflecting a level of similarity between the candidate pronunciation and the audio sample. Additional actions include aggregating the scores for each candidate pronunciation and adding one or more candidate pronunciations for the particular term to a pronunciation lexicon based on the aggregated scores for the candidate pronunciations.
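The score, aggregate, and add steps in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the patented method: aggregation by arithmetic mean, the `min_score` cutoff, and the `max_added` limit are all assumed, and `toy_score` is a hypothetical stand-in that compares phoneme strings because real acoustic scoring is out of scope here.

```python
from difflib import SequenceMatcher
from statistics import mean


def toy_score(candidate, sample):
    """Hypothetical similarity score in [0, 1].

    A real system would compare a pronunciation against audio; here each
    "audio sample" is just a phoneme string so the sketch stays runnable.
    """
    return SequenceMatcher(None, candidate, sample).ratio()


def update_lexicon(lexicon, term, candidates, audio_samples, score_fn,
                   min_score=0.6, max_added=1):
    """Score every candidate against every sample, aggregate, update lexicon.

    Returns the aggregated score per candidate. Aggregation by mean and the
    threshold values are assumptions made for this sketch.
    """
    aggregated = {
        cand: mean([score_fn(cand, sample) for sample in audio_samples])
        for cand in candidates
    }
    ranked = sorted(aggregated, key=aggregated.get, reverse=True)
    chosen = [c for c in ranked if aggregated[c] >= min_score][:max_added]
    entries = lexicon.setdefault(term, [])
    for cand in chosen:
        if cand not in entries:
            entries.append(cand)
    return aggregated


# Toy data: two of three crowd-sourced samples match the first candidate.
candidates = ["t ow m ey t ow", "t ow m aa t ow"]
samples = ["t ow m ey t ow", "t ow m ey t ow", "t ow m aa t ow"]
lexicon = {}
scores = update_lexicon(lexicon, "tomato", candidates, samples, toy_score)
```

After the call, `lexicon["tomato"]` contains only the first candidate, since it matches the majority of samples and `max_added` is 1. Raising `max_added` would let the lexicon keep several accepted pronunciations for the same term, which the abstract explicitly allows.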