会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 81. 发明申请
    • Discriminative Training of Document Transcription System
    • 文件转录系统的歧视性培训
    • US20140343939A1
    • 2014-11-20
    • US14244053
    • 2014-04-03
    • MModal IP LLC
    • Lambert MathiasGirija YegnanarayananJuergen Fritsch
    • G10L15/06G10L15/26
    • G10L15/063G06F17/271G06F17/2775G06F17/28G10L15/02G10L15/183G10L15/193G10L15/26G10L2015/0631G10L2015/0633
    • A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model using discriminative training techniques, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.
    • 提供用于训练用于语音识别的声学模型的系统。 特别地,这样的系统可以用于基于口语音频流和口头音频流的非文字转录来执行训练。 这样的系统可以识别表示具有多个口头形式的概念的非文字记录中的文本。 该系统可以尝试在音频流中识别在非文字转录中产生相应文本的音频流中的实际语音形式,从而产生更准确地表示语音音频流的经修改的脚本。 可以使用修改和更准确的抄本来使用辨别性训练技术训练声学模型,从而产生比使用直接基于原始非文字誊本进行训练的常规技术产生的更好的声学模型。
    • 82. 发明申请
    • NAME RECOGNITION SYSTEM
    • 名称识别系统
    • US20130332164A1
    • 2013-12-12
    • US13492720
    • 2012-06-08
    • Devang K. Nalk
    • Devang K. Nalk
    • G10L15/06
    • G10L15/187G10L15/30G10L2015/025G10L2015/0633
    • A speech recognition system uses, in one embodiment, an extended phonetic dictionary that is obtained by processing words in a user's set of databases, such as a user's contacts database, with a set of pronunciation guessers. The speech recognition system can use a conventional phonetic dictionary and the extended phonetic dictionary to recognize speech inputs that are user requests to use the contacts database, for example, to make a phone call, etc. The extended phonetic dictionary can be updated in response to changes in the contacts database, and the set of pronunciation guessers can include pronunciation guessers for a plurality of locales, each locale having its own pronunciation guesser.
    • 在一个实施例中,语音识别系统使用通过在用户的一组数据库(例如用户的联系人数据库)中处理单词与一组发音猜测器来获得的扩展语音字典。 语音识别系统可以使用传统的语音字典和扩展语音字典来识别作为用户请求使用联系人数据库的语音输入,例如进行电话呼叫等。扩展的语音字典可以响应于 联系人数据库中的变化和发音猜测器的集合可以包括多个语言环境的发音猜测器,每个语言环境具有其自己的发音猜测器。
    • 83. 发明申请
    • Discriminative Training of Document Transcription System
    • 文件转录系统的歧视性培训
    • US20130166297A1
    • 2013-06-27
    • US13773928
    • 2013-02-22
    • MULTIMODAL TECHNOLOGIES, LLC
    • Lambert MathiasGirija YegnanarayananJuergen Fritsch
    • G10L15/06
    • G10L15/063G06F17/271G06F17/2775G06F17/28G10L15/02G10L15/183G10L15/193G10L15/26G10L2015/0631G10L2015/0633
    • A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model using discriminative training techniques, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.
    • 提供用于训练用于语音识别的声学模型的系统。 特别地,这样的系统可以用于基于口语音频流和口头音频流的非文字转录来执行训练。 这样的系统可以识别表示具有多个口头形式的概念的非文字记录中的文本。 该系统可以尝试在音频流中识别在非文字转录中产生相应文本的音频流中的实际语音形式,从而产生更准确地表示语音音频流的经修改的脚本。 可以使用修改和更准确的抄本来使用辨别性训练技术训练声学模型,从而产生比使用直接基于原始非文字誊本进行训练的常规技术产生的更好的声学模型。
    • 88. 发明授权
    • Expanding an effective vocabulary of a speech recognition system
    • 扩展语音识别系统的有效词汇
    • US07120582B1
    • 2006-10-10
    • US09390370
    • 1999-09-07
    • Jonathan H. YoungHaakon L. ChevalierLaurence S. GillickToffee A. AlbinaMarlboro B. Moore, IIIPaul E. RensingJonathan P. Yamron
    • Jonathan H. YoungHaakon L. ChevalierLaurence S. GillickToffee A. AlbinaMarlboro B. Moore, IIIPaul E. RensingJonathan P. Yamron
    • G10L15/00G10L15/06
    • G10L15/063G10L2015/0633G10L2015/0635G10L2015/0636
    • The invention provides techniques for creating and using fragmented word models to increase the effective size of an active vocabulary of a speech recognition system. The active vocabulary represents all words and word fragments that the speech recognition system is able to recognize. Each word may be represented by a combination of acoustic models. As such, the active vocabulary represents the combinations of acoustic models that the speech recognition system may compare to a user's speech to identify acoustic models that best match the user's speech. The effective size of the active vocabulary may be increased by dividing words into constituent components or fragments (for example, prefixes, suffixes, separators, infixes, and roots) and including each component as a separate entry in the active vocabulary. Thus, for example, a list of words and their plural forms (for example, “book, books, cook, cooks, hook, hooks, look and looks”) may be represented in the active vocabulary using the words (for example, “book, cook, hook and look”) and an entry representing the suffix that makes the words plural (for example, “+s”, where the “+” preceding the “s” indicates that “+s” is a suffix). For a large list of words, and ignoring the entry associated with the suffix, this technique may reduce the number of vocabulary entries needed to represent the list of words considerably.
    • 本发明提供了用于创建和使用分割词模型以增加语音识别系统的活跃词汇表的有效大小的技术。 活动词汇表示语音识别系统能够识别的所有单词和单词片段。 每个单词可以由声学模型的组合来表示。 因此,活动词汇表示声学模型的组合,语音识别系统可以与用户的语音进行比较,以识别与用户的语音最匹配的声学模型。 活动词汇表的有效大小可以通过将单词划分成组成组件或片段(例如,前缀,后缀,分隔符,中缀和根)并将每个组件作为活动词汇表中的单独条目来增加。 因此,例如,可以在活动词汇表中使用单词(例如,“书籍,书籍,烹饪,烹饪,钩子,钩子,外观和外观”)的单词列表及其复数形式 书签,烹饪,钩子和外观“)和表示使单词复数的后缀的条目(例如,”+ s“,其中”+“之前的”+“表示”+ s“是后缀)。 对于大量单词列表,忽略与后缀相关联的条目,这种技术可能会大大减少用于表示单词列表所需的词汇表数量。