专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US5634086A Method and apparatus for voice-interactive language instruction 失效
标题翻译：语音交互语言指令的方法和装置
公开(公告)号：US5634086A
公开(公告)日：1997-05-27
申请号：US529376
申请日：1995-09-18
申请人： Dimitry Rtischev , Jared C. Bernstein , George T. Chen , John W. Butzberger
发明人： Dimitry Rtischev , Jared C. Bernstein , George T. Chen , John W. Butzberger
IPC分类号： G09B19/04 , G09B7/04 , G09B19/06 , G10L15/14 , G10L15/183 , G10L15/193 , G10L15/22 , G10L3/00 , G10L5/06 , G10L9/00
CPC分类号： G10L15/193 , G09B19/06 , G10L15/183
摘要： Spoken-language instruction method and apparatus employ context-based speech recognition for instruction and evaluation, particularly language instruction and language fluency evaluation. A system can administer a lesson, and particularly a language lesson, and evaluate performance in a natural interactive manner while tolerating strong foreign accents, and produce as an output a reading quality score. A finite state grammar set corresponding to the range of word sequence patterns in the lesson is employed as a constraint on a hidden Markov model (HMM) search apparatus in an HMM speech recognizer which includes a set of hidden Markov models of target-language narrations produced by native speakers of the target language. The invention is preferably based on use of a linguistic context-sensitive speech recognizer. The invention includes a system with an interactive decision mechanism which employs at least three levels of error tolerance to simulate a natural level of patience in human-based interactive instruction. A system for a reading phase is implemented through a finite state machine having at least four states which recognizes reading error at any position in a script and which employs a first set of actions. A related system for an interactive question phase is implemented through a finite state machine, but which recognizes reading errors as well as incorrect answers while invoking a second set of actions. A linguistically-sensitive utterance endpoint detector is provided for judging termination of a spoken utterance to simulate human turn-taking in conversational speech.
摘要翻译：语言指导方法和装置采用基于语境的语音识别来进行指导和评估，特别是语言指导和语言流畅性评估。系统可以管理课程，特别是语言课程，并以自然的交互方式评估表现，同时容忍强大的外国口音，并产生读数质量得分。对应于课程中的单词序列模式的范围的有限状态语法集合被用作HMM语音识别器中的隐马尔可夫模型（HMM）搜索装置的约束，其包括产生的目标语言叙述的一组隐马尔可夫模型以母语为母语的目标语言。本发明优选地基于使用语言上下文敏感语音识别器。本发明包括具有交互式决策机制的系统，其采用至少三个误差容限级别来模拟基于人的交互式指令的自然级别的耐心。用于读取阶段的系统通过具有至少四个状态的有限状态机来实现，该状态识别脚本中任何位置处的读取错误并且采用第一组动作。用于交互式问题阶段的相关系统通过有限状态机实现，但是在调用第二组动作时识别读取错误以及不正确的答案。提供语言敏感的话语端点检测器，用于判断语音话语的终止以模拟会话语音中的人转向。

2. 发明授权

US5581655A Method for recognizing speech using linguistically-motivated hidden Markov models 失效
标题翻译：使用语言学动机的隐马尔可夫模型识别语音的方法
公开(公告)号：US5581655A
公开(公告)日：1996-12-03
申请号：US589432
申请日：1996-01-22
申请人： Michael H. Cohen , Mitchel Weintraub , Patti J. Price , Hy Murveit , Jared C. Bernstein
发明人： Michael H. Cohen , Mitchel Weintraub , Patti J. Price , Hy Murveit , Jared C. Bernstein
IPC分类号： G10L15/06 , G09B19/04 , G10L15/14 , G10L15/18 , G10L5/06
CPC分类号： G10L15/187 , G09B19/04 , G10L15/142
摘要： An automatic speech recognition methodology, wherein words are modeled as probabilistic networks of allophones, collects nodes in the probabilistic network into equivalence classes when those nodes have the same allophonic choices governed by the same phonological rules. The allophonic choices allow for representation of dialectic pronunciation variations between different speakers. Training data is shared among nodes in an equivalence class so that accurate pronunciation probabilities may be determined even for words for which there is only a limited amount of training data. A method is used to determine probabilities for each of a multitude of pronunciation models for each word in the vocabulary, based on automatic extraction of linguistic knowledge from sets of phonological rules, in order to robustly and accurately model dialectal variation.
摘要翻译：一种自动语音识别方法，其中单词被建模为异常的概率网络，当这些节点具有由相同语音规则控制的相同的等式选择时，将概率网络中的节点收集成等价类。不平衡选择允许在不同说话者之间表达辩证的发音变化。培训数据在等价类中的节点之间共享，使得甚至对于只有有限量的训练数据的单词也可以确定准确的发音概率。基于语言规则集合中语言知识的自动提取，为了强化和准确地模拟方言的变化，使用一种方法来确定词汇表中每个单词的多个发音模型的概率。

3. 发明授权

US6157913A Method and apparatus for estimating fitness to perform tasks based on linguistic and other aspects of spoken responses in constrained interactions 有权
标题翻译：用于估计适合度的方法和装置，用于基于在受约束的交互中的语音响应的语言和其他方面执行任务
公开(公告)号：US6157913A
公开(公告)日：2000-12-05
申请号：US184804
申请日：1998-11-02
申请人： Jared C. Bernstein
发明人： Jared C. Bernstein
IPC分类号： G09B7/02 , G10L17/00 , G10L15/22
CPC分类号： G09B7/02 , G10L17/26
摘要： Linguistic and/or extra-linguistic information is extracted from speech signals to provide measures that may then be compared to expected norms, individual baselines or other nominal or numeric criteria (according to particular psychomotor, perceptual, cognitive or emotional constructs) that are required for satisfactory performance of particular tasks, or that indicate a user's psychological or physical state. The user produces the speech signals in the context of a constrained voice-interactive dialog that utilizes prompts chosen such that the expected range of responses will exhibit low linguistic entropy. For example, the prompts may be interpreted by the user as requests for information, requests to read or repeat or paraphrase a word, sentence, or larger linguistic unit, requests to draw an inference, requests to complete, or identify elements in graphic or verbal aggregates (e.g., pictures or discourses), as examples to imitate, or any similar graphical or verbal presentation that conventionally serves as a prompt to speak. The display is presented though a device either integral or peripheral to a computer system, such as a local or remote video display terminal or telephone.
摘要翻译：从语音信号中提取语言和/或语言外信息，以提供可能与预期规范，个人基线或其他名义或数字标准（根据特定精神运动，知觉，认知或情感结构）进行比较的措施，这些标准或数学标准是特定任务的令人满意的表现，或指示用户的心理或身体状态。用户在受限语音交互对话的上下文中产生语音信号，该对话使用所选择的提示，使得响应的预期范围将呈现低语言熵。例如，提示可以由用户解释为请求信息，请求阅读或重复或改写单词，句子或更大的语言单元，绘制推论的请求，要求完成或识别图形或语言元素聚合（例如，图片或话语），作为模仿的示例，或任何类似的图形或口头表达，其通常作为提示说话。显示器通过集成或外围设备呈现给计算机系统，例如本地或远程视频显示终端或电话。

4. 发明授权

US5870709A Method and apparatus for combining information from speech signals for adaptive interaction in teaching and testing 失效
标题翻译：用于在教学和测试中组合用于自适应交互的语音信号的信息的方法和装置
公开(公告)号：US5870709A
公开(公告)日：1999-02-09
申请号：US753580
申请日：1996-11-25
申请人： Jared C. Bernstein
发明人： Jared C. Bernstein
IPC分类号： G09B5/04 , G06F1/00 , G06F3/16 , G09B7/00 , G09B7/04 , G09B19/00 , G09B19/04 , G09B19/06 , G10L15/00 , G10L15/10 , G10L15/22 , G10L15/28 , G01L5/06
CPC分类号： G09B7/04 , G09B19/04 , G10L15/22
摘要： A computer system with a speech recognition component provides a method and apparatus for instructing and evaluating the proficiency of human users in skills that can be exhibited through speaking. The computer system tracks linguistic, indexical and paralinguistic characteristics of the spoken input of users, and implements games, data access, instructional systems, and tests. The computer system combines characteristics of the spoken input automatically to select appropriate material and present it in a manner suitable for the user. In one embodiment, the computer system measures the response latency and speaking rate of the user and presents its next spoken display at an appropriate speaking rate. In other embodiments, the computer system identifies the gender and native language of the user, and combines that information with the relative accuracy of the linguistic content of the user's utterance to select and display material that may be easier or more challenging for speakers with these characteristics.
摘要翻译：具有语音识别部件的计算机系统提供了一种方法和装置，用于指导和评估人类用户在通过说话展现的技能中的能力。计算机系统跟踪用户口语输入的语言，索引和辅助特征，实现游戏，数据访问，教学系统和测试。计算机系统自动组合口头输入的特征以选择合适的材料并以适合用户的方式呈现。在一个实施例中，计算机系统测量用户的响应延迟和说话率，并以适当的讲话率显示其下一个口语显示。在其他实施例中，计算机系统识别用户的性别和母语，并且将该信息与用户话语的语言内容的相对准确度相结合，以选择和显示对于具有这些特征的扬声器可能更容易或更具挑战性的材料。

5. 发明授权

US5268990A Method for recognizing speech using linguistically-motivated hidden Markov models 失效
标题翻译：使用语言学动机的隐马尔可夫模型识别语音的方法
公开(公告)号：US5268990A
公开(公告)日：1993-12-07
申请号：US648097
申请日：1991-01-31
申请人： Michael H. Cohen , Mitchel Weintraub , Patti J. Price , Hy Murveit , Jared C. Bernstein
发明人： Michael H. Cohen , Mitchel Weintraub , Patti J. Price , Hy Murveit , Jared C. Bernstein
IPC分类号： G10L15/06 , G09B19/04 , G10L15/14 , G10L15/18 , G10L3/00 , G10L9/00
CPC分类号： G10L15/187 , G09B19/04 , G10L15/142
摘要： An automatic speech recognition methodology takes advantage of linguistic constraints wherein words are modeled as probabilistic networks of phonetic segments (herein phones), and each phone is represented as a context-independent hidden Markov phone model mixed with a number of context-dependent phone models. Recognition is based on use of methods to design phonological rule sets based on measures of coverage and overgeneration of pronunciations which achieves high coverage of pronunciations with compact representations. Further, a method estimates probabilities of the different possible pronunciations of words. A further method models cross-word coarticulatory effects. In a specific embodiment of the system, a specific method determines the single most-likely pronunciation of words. In further specific embodiments of the system, methods generate speaker-dependent pronunciation networks.
摘要翻译：自动语音识别方法利用语言约束，其中词被建模为语音段（这里是电话）的概率网络，并且每个电话被表示为与多个上下文相关的电话模型混合的与上下文无关的隐马尔可夫手机模型。识别是基于使用方法来设计语音规则集，其基于覆盖的测度和发音的过度生成，其实现具有紧凑表示的发音的高覆盖。此外，一种方法估计词的不同可能发音的概率。另一种方法建立跨词coarticulatory效应。在系统的具体实施例中，特定方法确定单词最可能的发音。在系统的进一步具体实施例中，方法产生说话者相关的发音网络。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式