专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

31. 发明授权

US07505911B2 Combined speech recognition and sound recording 有权
标题翻译：组合语音识别和录音
公开(公告)号：US07505911B2
公开(公告)日：2009-03-17
申请号：US11005568
申请日：2004-12-05
申请人： Daniel L. Roth , Jordan R. Cohen , David F. Johnston , Edward W. Porter
发明人： Daniel L. Roth , Jordan R. Cohen , David F. Johnston , Edward W. Porter
IPC分类号： G01L21/06
CPC分类号： G10L15/22 , G10L15/26 , G10L2015/225
摘要： A handheld device with both large-vocabulary speech recognition and audio recoding allows users to switch between at least two of the following three modes: (1) recording audio without corresponding speech recognition; (2) recording with speech recognition; and (3) speech recognition without audio recording. A handheld device with both large-vocabulary speech recognition and audio recoding enables a user to select a portion of previously recorded sound and have speech recognition performed upon it. A system enables a user to search for a text label associated with portions of unrecognized recorded sound by uttering the label's words. A large-vocabulary system allows users to switch between playing back recorded audio and speech recognition with a single input, with successive audio playbacks automatically starting slightly before the end of prior playback. And a cell phone that allows both large-vocabulary speech recognition and audio recording and playback.
摘要翻译：具有大词汇语音识别和音频重新编码的手持设备允许用户在以下三种模式中的至少两种之间进行切换：（1）记录没有相应语音识别的音频; （2）用语音识别录音; 和（3）没有录音的语音识别。具有大词汇语音识别和音频重新编码的手持设备使得用户能够选择先前记录的声音的一部分并且对其进行语音识别。系统使用户能够通过发出标签的单词来搜索与未被识别的记录声音的部分相关联的文本标签。大词汇系统允许用户使用单个输入在回放记录的音频和语音识别之间切换，连续的音频播放在先前播放结束之前自动开始。和一个手机，允许大词汇语音识别和音频录音和播放。

32. 发明申请

US20080153465A1 VOICE SEARCH-ENABLED MOBILE DEVICE 审中-公开
标题翻译：语音搜索启用移动设备
公开(公告)号：US20080153465A1
公开(公告)日：2008-06-26
申请号：US11673341
申请日：2007-02-09
申请人： Gunnar Evermann , Daniel L. Roth , Laurence S. Gillick , James Coughlin
发明人： Gunnar Evermann , Daniel L. Roth , Laurence S. Gillick , James Coughlin
IPC分类号： G10L21/00 , H04Q7/22
CPC分类号： H04M3/4931 , G06F16/957 , G10L15/30 , H04M1/72522 , H04M1/72561 , H04M7/0036 , H04M2250/74
摘要： Methods and devices for providing a user of a mobile communications device with mobile voice-mediated search capability. The methods and devices involve receiving an utterance from a user of the mobile device, the utterance including a search request; using the speech recognition functionality to recognize that the utterance includes a search request; as a result of recognizing that the utterance includes a search request, establishing a wireless data connection to a remote server; sending a representation of the search request to the remote server over the wireless data connection; receiving search results that are responsive to the search request; and presenting the search results on the mobile device.
摘要翻译：用于向移动通信设备的用户提供具有移动语音媒介搜索能力的方法和设备。所述方法和设备涉及从所述移动设备的用户接收话语，所述话语包括搜索请求; 使用所述语音识别功能来识别所述话语包括搜索请求; 作为识别出话语包括搜索请求的结果，建立到远程服务器的无线数据连接; 通过无线数据连接向远程服务器发送搜索请求的表示; 接收响应于该搜索请求的搜索结果; 并在移动设备上呈现搜索结果。

33. 发明授权

US07313526B2 Speech recognition using selectable recognition modes 有权
标题翻译：使用可选识别模式进行语音识别
公开(公告)号：US07313526B2
公开(公告)日：2007-12-25
申请号：US10950092
申请日：2004-09-24
申请人： Daniel L. Roth , Jordan R. Cohen , David F. Johnston , Manfred G. Grabherr
发明人： Daniel L. Roth , Jordan R. Cohen , David F. Johnston , Manfred G. Grabherr
IPC分类号： G10L11/00 , G10L15/28 , G10L15/04 , G06F3/00
CPC分类号： G10L15/22 , G10L15/19
摘要： The present invention relates to speech recognition using selectable recognition modes. This includes innovations such as: large vocabulary speech recognition programming that supplies recognized words to external program as they are recognized, and allows a user to select between large vocabulary recognition of an utterance with and without language context from the prior utterance independently of state of the external program; allowing a user to select between continuous and discrete speech recognition that use substantially the same vocabulary; allowing a user to select between continuous and discrete large-vocabulary speech recognition modes; allowing a user to select between at least two different alphabetic entry speech recognition modes; and allowing a user to select from among four or more of the following recognitions modes when creating text: a large-vocabulary mode, an alphabetic entry mode, a number entry mode, and a punctuation entry mode.
摘要翻译：本发明涉及使用可选择识别模式的语音识别。这包括创新，例如：大量词汇语音识别程序，在识别出外部程序时，将识别的词提供给外部程序，并允许用户在与先前的语言无关的语言语境的大量词汇识别与非语言语境之间进行选择外部程序; 允许用户在使用基本相同词汇的连续和离散语音识别之间进行选择; 允许用户在连续和离散的大词汇语音识别模式之间进行选择; 允许用户在至少两个不同的字母进入语音识别模式之间进行选择; 并且允许用户在创建文本时从四种或更多种以下识别模式中进行选择：大词汇模式，字母输入模式，数字输入模式和标点输入模式。

34. 发明授权

US07225130B2 Methods, systems, and programming for performing speech recognition 有权
标题翻译：用于执行语音识别的方法，系统和编程
公开(公告)号：US07225130B2
公开(公告)日：2007-05-29
申请号：US10227653
申请日：2002-09-06
申请人： Daniel L. Roth , Jordan R. Cohen , David F. Johnston , Manfred G. Grabherr
发明人： Daniel L. Roth , Jordan R. Cohen , David F. Johnston , Manfred G. Grabherr
IPC分类号： G10L11/00 , G10L15/28 , G10L15/04 , G06F3/00
CPC分类号： G10L15/19 , G10L15/22
摘要： The present invention relates to: speech recognition using selectable recognition modes; using choice lists in large-vocabulary speech recognition; enabling users to select word transformations; speech recognition that automatically turns recognition off in one or more specified ways; phone key control of large-vocabulary speech recognition; speech recognition using phone key alphabetic filtering and spelling: speech recognition that enables a user to perform re-utterance recognition; the combination of speech recognition and text-to-speech (TTS) generation; the combination of speech recognition with handwriting and/or character recognition; and the combination of large-vocabulary speech recognition with audio recording and playback.
摘要翻译：本发明涉及：使用可选择识别模式的语音识别; 在大词汇语音识别中使用选择列表; 使用户能够选择字变换; 以一种或多种指定方式自动转移识别的语音识别; 电话键控大词汇语音识别; 使用手机密钥字母过滤和拼写的语音识别：语音识别，使得用户能够执行重新发音识别; 语音识别和文本到语音（TTS）生成的组合; 语音识别与手写和/或字符识别的组合; 以及大词汇语音识别与音频录制和播放的组合。

35. 发明授权

US07133827B1 Training speech recognition word models from word samples synthesized by Monte Carlo techniques 有权
标题翻译：通过蒙特卡罗技术合成的单词样本训练语音识别词模型
公开(公告)号：US07133827B1
公开(公告)日：2006-11-07
申请号：US10361154
申请日：2003-02-06
申请人： Laurence S. Gillick , Donald R. McAllaster , Daniel L. Roth
发明人： Laurence S. Gillick , Donald R. McAllaster , Daniel L. Roth
IPC分类号： G10L15/06
CPC分类号： G10L15/063
摘要： A new word model is trained from synthetic word samples derived by Monte Carlo techniques from one or more prior word models. The prior word model can be a phonetic word model and the new word model can be a non-phonetic, whole-word, word model. The prior word model can be trained from data that has undergone a first channel normalization and the synthesized word samples from which the new word model is trained can undergo a different channel normalization similar to that to be used in a given speech recognition context. The prior word model can have a first model structure and the new word model can have a second, different, model structure. These differences in model structure can include, for example, differences of model topology; differences of model complexity; and differences in the type of basis function used in a description of such probability distributions.
摘要翻译：从一个或多个先前的单词模型通过蒙特卡罗技术衍生的合成词样本训练新的单词模型。先验词模型可以是语音模型，新词模型可以是非语音，全字，单词模型。可以从已经经历第一信道规范化的数据训练现有单词模型，并且从其中训练新单词模型的合成单词样本可以经历与在给定语音识别上下文中使用相似的不同信道规范化。先验词模型可以具有第一模型结构，并且新词模型可以具有第二，不同的模型结构。模型结构的这些差异可以包括例如模型拓扑的差异; 模型复杂性差异; 以及在这种概率分布的描述中使用的基函数的类型的差异。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式