会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • Voice recognition system
    • 语音识别系统
    • US4829576A
    • 1989-05-09
    • US921625
    • 1986-10-21
    • Edward W. Porter
    • Edward W. Porter
    • G06F3/16G10L15/00
    • G10L15/00G06F3/167
    • A text locating system recognizes spoken utterances, uses the recognized words as a search string, and searches text for words matching that search string. The probability that a given vocabulary word is selected as a search word is altered both by limiting the recognizable vocabulary to words in the text to the searched, and by altering the probability that individual recognizable words will be selected as a function of the number of time they occur in that text. The system performs incremental searches by adding successively recognized words to the search string and searching for the next occurrence of the string in response to each such addition. The invention can be used in a text editing system which enables a user to switch between a dictation mode, which inserts recognized words into text, and a search mode, which uses them to search for new cursor locations. Broadly speaking, the invention provides a computer system which recognizes spoken words, which has a data structure representing words; which uses that data structure for a purpose other than speech recognition; and which alters the probability that a given vocabulary word will be recognized as a function of the frequency of that word in the data structure.
    • 文本定位系统识别口语,使用识别的单词作为搜索字符串,并搜索与该搜索字符串匹配的单词。 通过将文本中的可识别词汇限制到搜索词,并且通过改变将被选择个体可识别词作为时间数的函数的概率来改变给定词汇单词被选择为搜索词的概率 它们出现在该文本中。 系统通过向搜索字符串中添加连续识别的单词并响应于每个这样的添加来搜索字符串的下一个出现来执行增量搜索。 本发明可以用在文本编辑系统中,该系统使得用户能够在将识别的单词插入到文本中的听写模式和使用它们来搜索新的光标位置的搜索模式之间进行切换。 广义地说,本发明提供了一种识别具有表示单词的数据结构的口语单词的计算机系统; 其将该数据结构用于除语音识别之外的目的; 并且其改变给定词汇单词将被识别为数据结构中该单词的频率的函数的概率。
    • 3. 发明授权
    • Combined speech and handwriting recognition
    • 综合言语和手写识别
    • US07467089B2
    • 2008-12-16
    • US11005633
    • 2004-12-05
    • Daniel L. RothEdward W. Porter
    • Daniel L. RothEdward W. Porter
    • G10L21/00
    • G10L15/22G10L15/19
    • The invention relates to the combination of speech recognition with handwriting and/or character recognition. This includes the innovation of selecting one or more best-scoring recognition candidates as a function of recognition of both handwritten and spoken representations of a sequence of one or more words to be recognized. It also includes the innovation of using character or handwriting recognition of one or more letters to alphabetically filter speech recognition of one or more words. It also includes the innovations of using speech recognition of one or more letter-identifying words to alphabetically filter handwriting recognition, and of using speech recognition to correct handwriting recognition of one or more words.
    • 本发明涉及语音识别与手写和/或字符识别的组合。 这包括选择一个或多个最佳得分识别候选者作为识别要被识别的一个或多个单词的序列的手写和口头表示的功能的创新。 它还包括使用一个或多个字母的字符或手写识别来对一个或多个字的语音识别进行字母表过滤的创新。 它还包括使用一个或多个字母识别字的语音识别来按字母顺序地过滤手写识别的创新,以及使用语音识别来纠正一个或多个词的手写识别。
    • 4. 发明授权
    • Combined speech recognition and text-to-speech generation
    • 组合语音识别和文本到语音生成
    • US07577569B2
    • 2009-08-18
    • US10949991
    • 2004-09-24
    • Daniel L. RothJordan R. CohenDavid F. JohnstonManfred G. GrabherrEdward W. Porter
    • Daniel L. RothJordan R. CohenDavid F. JohnstonManfred G. GrabherrEdward W. Porter
    • G10L13/08
    • G10L13/08G10L15/187G10L15/19
    • Text-to-speech (TTS) generation is used in conjunction with large vocabulary speech recognition to say words selected by the speech recognition. The software for performing the large vocabulary speech recognition can share speech modeling data with the TTS software. TTS or recorded audio can be used to automatically say both recognized text and the names of recognized commands after their recognition. The TTS can automatically repeats text recognized by the speech recognition after each of a succession of end of utterance detections. A user can move a cursor back or forward in recognized text, and the TTS can speak one or more words at the cursor location after each such move. The speech recognition can be used to produces a choice list of possible recognition candidates and the TTS can be used to provide spoken output of one or more of the candidates on the choice list.
    • 文本到语音(TTS)生成与大词汇语音识别结合使用来说出由语音识别选择的单词。 用于执行大词汇语音识别的软件可以与TTS软件共享语音建模数据。 TTS或录制音频可以用于在识别后自动说出识别的文本和识别的命令的名称。 TTS可以在每次连续的话语检测结束后自动重复通过语音识别识别的文本。 用户可以在识别的文本中向后或向前移动光标,并且在每次这样的移动之后,TTS可以在光标位置说一个或多个单词。 语音识别可用于产生可能的识别候选者的选择列表,并且TTS可以用于在选择列表上提供一个或多个候选者的口语输出。
    • 5. 发明授权
    • Combined speech recognition and sound recording
    • 组合语音识别和录音
    • US07505911B2
    • 2009-03-17
    • US11005568
    • 2004-12-05
    • Daniel L. RothJordan R. CohenDavid F. JohnstonEdward W. Porter
    • Daniel L. RothJordan R. CohenDavid F. JohnstonEdward W. Porter
    • G01L21/06
    • G10L15/22G10L15/26G10L2015/225
    • A handheld device with both large-vocabulary speech recognition and audio recoding allows users to switch between at least two of the following three modes: (1) recording audio without corresponding speech recognition; (2) recording with speech recognition; and (3) speech recognition without audio recording. A handheld device with both large-vocabulary speech recognition and audio recoding enables a user to select a portion of previously recorded sound and have speech recognition performed upon it. A system enables a user to search for a text label associated with portions of unrecognized recorded sound by uttering the label's words. A large-vocabulary system allows users to switch between playing back recorded audio and speech recognition with a single input, with successive audio playbacks automatically starting slightly before the end of prior playback. And a cell phone that allows both large-vocabulary speech recognition and audio recording and playback.
    • 具有大词汇语音识别和音频重新编码的手持设备允许用户在以下三种模式中的至少两种之间进行切换:(1)记录没有相应语音识别的音频; (2)用语音识别录音; 和(3)没有录音的语音识别。 具有大词汇语音识别和音频重新编码的手持设备使得用户能够选择先前记录的声音的一部分并且对其进行语音识别。 系统使用户能够通过发出标签的单词来搜索与未被识别的记录声音的部分相关联的文本标签。 大词汇系统允许用户使用单个输入在回放记录的音频和语音识别之间切换,连续的音频播放在先前播放结束之前自动开始。 和一个手机,允许大词汇语音识别和音频录音和播放。
    • 7. 发明授权
    • Innovations for the display of web pages
    • 用于显示网页的创新
    • US07219309B2
    • 2007-05-15
    • US10389445
    • 2003-03-14
    • Sampo J. KaasilaEdward W. Porter
    • Sampo J. KaasilaEdward W. Porter
    • G06F3/00
    • G06F17/30905G06T3/4015
    • Web pages are displayed with a simultaneous overview and magnified view. An indicator can show the portion of the overview in the magnified view. Both views can be shown, one above the other, across the full width of the same screen. A user can select between such a split view and another view, including an overview-only view, a magnified-only view, and a view in which selected text is laid out to fit the width of the magnified view. Navigational input can directly move the layout in the magnified view or the cursor, and can scroll both the overview and magnified view. The magnified view can display text with antialiased fonts designed for its resolution. The magnified view can be made to function like a magnifying glass. The width of text in multicolumn layouts can be limited to fit the width of a view window, such as the magnified-view.
    • 网页显示同时进行概览和放大视图。 指示器可以显示放大视图中概述的部分。 两个视图都可以在同一屏幕的整个宽度上显示。 用户可以在这样的分割视图和另一视图之间进行选择,包括仅概览视图,仅放大视图,以及放置所选文本以适合放大视图宽度的视图。 导航输入可以在放大视图或光标中直接移动布局,并可滚动概览和放大视图。 放大视图可以显示专门为其分辨率设计的抗锯齿字体的文本。 放大视图可以像放大镜一样起作用。 可以将多列布局中的文本宽度限制为适合视图窗口的宽度,例如放大视图。
    • 8. 发明授权
    • Word recognition using choice lists
    • 使用选择列表的Word识别
    • US07809574B2
    • 2010-10-05
    • US10950074
    • 2004-09-24
    • Daniel L. RothJordan R. CohenDavid F. JohnstonEdward W. Porter
    • Daniel L. RothJordan R. CohenDavid F. JohnstonEdward W. Porter
    • G10L21/00G06F3/16G06F3/14
    • G10L15/14
    • One aspect of the invention involves word recognition that uses scrollable choice lists in which choices are listed in character-order. Another aspect relates to a scrollable, visually-displayed word recognition choice list, where the recognition candidates on the choice list are each associated with a choice-selecting symbol the user can use to select a desired recognition candidate by pressing an associated button, and where the same choice-selecting symbol is used for different choices displayed on the display at different times as a result of scrolling. Another aspect of the invention relates to providing a choice list of best scoring characters for a particular character position in the spelling of a filter that is used to filter word recognition. Another aspect of the invention relates to a choice list used in word recognition in which the choice list can be scrolled horizontally.
    • 本发明的一个方面涉及使用可滚动选择列表的单词识别,其中以字符顺序列出选择。 另一方面涉及可滚动的,视觉上显示的词识别选择列表,其中选择列表上的识别候选者各自与选择选择符号相关联,用户可以通过按下相关联的按钮来选择期望的识别候选,并且其中 相同的选择选择符号用于在不同时间显示在显示器上作为滚动的结果的不同选择。 本发明的另一方面涉及提供用于滤波器的拼写中用于过滤词识别的特定字符位置的最佳评分字符的选择列表。 本发明的另一方面涉及用于字识别中的选择列表,其中选择列表可以水平滚动。
    • 10. 发明授权
    • Method for interactive speech recognition and training
    • 交互式语音识别和训练方法
    • US5027406A
    • 1991-06-25
    • US280700
    • 1988-12-06
    • Jed RobertsJames K. BakerEdward W. Porter
    • Jed RobertsJames K. BakerEdward W. Porter
    • G10L15/06G10L15/22
    • G10L15/063G10L15/22
    • A method for creating word models for a large vocabulary, natural language dictation system. A user with limited typing skills can create documents with little or no advance training of word models. As the user is dictating, the user speaks a word which may or may not already be in the active vocabulary. The system displays a list of the words in the active vocabulary which best match the spoken word. By keyboard or voice command, the user may choose the correct word from the list or may choose to edit a similar word if the correct word is not on the list. Alternately, the user may type or speak the initial letters of the word. Then the recognition algorithm is called again satisfying the initial letters, and the choices displayed again. A word list is then also displayed from a large backup vocabulary. The best words to display from the backup vocabulary are chosen using a statistical language model and optionally word models derived from a phonemic dictionary. When the correct word is chosen by the user, the speech sample is used to create or update an acoustic model for the word, without further intervention by the user. As the system is used, it also constantly updates its statistical language model. The system gets more and more word models and keeps improving its performance the more it is used. The system may be used for connected speech as well as for discrete utterances.