    • 1. Granted invention patent
    • Method and apparatus for voice-interactive language instruction
    • US5634086A
    • 1997-05-27
    • US529376
    • 1995-09-18
    • Dimitry Rtischev; Jared C. Bernstein; George T. Chen; John W. Butzberger
    • Dimitry Rtischev; Jared C. Bernstein; George T. Chen; John W. Butzberger
    • G09B19/04; G09B7/04; G09B19/06; G10L15/14; G10L15/183; G10L15/193; G10L15/22; G10L3/00; G10L5/06; G10L9/00
    • G10L15/193; G09B19/06; G10L15/183
    • Spoken-language instruction method and apparatus employ context-based speech recognition for instruction and evaluation, particularly language instruction and language fluency evaluation. A system can administer a lesson, and particularly a language lesson, and evaluate performance in a natural interactive manner while tolerating strong foreign accents, and produce as an output a reading quality score. A finite state grammar set corresponding to the range of word sequence patterns in the lesson is employed as a constraint on a hidden Markov model (HMM) search apparatus in an HMM speech recognizer which includes a set of hidden Markov models of target-language narrations produced by native speakers of the target language. The invention is preferably based on use of a linguistic context-sensitive speech recognizer. The invention includes a system with an interactive decision mechanism which employs at least three levels of error tolerance to simulate a natural level of patience in human-based interactive instruction. A system for a reading phase is implemented through a finite state machine having at least four states which recognizes reading error at any position in a script and which employs a first set of actions. A related system for an interactive question phase is implemented through a finite state machine, but which recognizes reading errors as well as incorrect answers while invoking a second set of actions. A linguistically-sensitive utterance endpoint detector is provided for judging termination of a spoken utterance to simulate human turn-taking in conversational speech.
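The abstract of US5634086A mentions an interactive decision mechanism with at least three levels of error tolerance, but does not say what each level does. The following is only a minimal sketch of how such an escalation could look: the class `PatienceTracker`, the action names in `ACTIONS`, and the rule "one level per consecutive error" are all assumptions made for illustration, not details taken from the patent.

```python
from dataclasses import dataclass

# Hypothetical actions, one per tolerance level; the abstract does not name them.
ACTIONS = ("repeat_prompt", "model_the_phrase", "move_on")


@dataclass
class PatienceTracker:
    """Escalates through three tolerance levels on consecutive errors."""

    consecutive_errors: int = 0

    def observe(self, correct: bool) -> str:
        """Return the action to take after one student response."""
        if correct:
            # A correct response resets the tracker to full patience.
            self.consecutive_errors = 0
            return "advance"
        self.consecutive_errors += 1
        # Cap at the last level so further errors keep the final action.
        level = min(self.consecutive_errors, len(ACTIONS))
        return ACTIONS[level - 1]
```

Calling `observe(False)` repeatedly walks through the three actions in order and then stays on the last one, while any `observe(True)` resets the count.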
    • 3. Granted invention patent
    • Method and apparatus for estimating fitness to perform tasks based on linguistic and other aspects of spoken responses in constrained interactions
    • US6157913A
    • 2000-12-05
    • US184804
    • 1998-11-02
    • Jared C. Bernstein
    • Jared C. Bernstein
    • G09B7/02; G10L17/00; G10L15/22
    • G09B7/02; G10L17/26
    • Linguistic and/or extra-linguistic information is extracted from speech signals to provide measures that may then be compared to expected norms, individual baselines or other nominal or numeric criteria (according to particular psychomotor, perceptual, cognitive or emotional constructs) that are required for satisfactory performance of particular tasks, or that indicate a user's psychological or physical state. The user produces the speech signals in the context of a constrained voice-interactive dialog that utilizes prompts chosen such that the expected range of responses will exhibit low linguistic entropy. For example, the prompts may be interpreted by the user as requests for information, requests to read or repeat or paraphrase a word, sentence, or larger linguistic unit, requests to draw an inference, requests to complete, or identify elements in graphic or verbal aggregates (e.g., pictures or discourses), as examples to imitate, or any similar graphical or verbal presentation that conventionally serves as a prompt to speak. The display is presented through a device either integral or peripheral to a computer system, such as a local or remote video display terminal or telephone.
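The abstract of US6157913A relies on prompts whose expected responses have "low linguistic entropy" but does not define how that entropy is measured. One common reading, assumed here, is the Shannon entropy of the distribution of responses a prompt elicits. The sketch below estimates it from a sample of transcribed responses; the function name `response_entropy` and the choice to compare whole normalized strings are illustrative assumptions.

```python
import math
from collections import Counter


def response_entropy(responses):
    """Shannon entropy, in bits, of the empirical distribution of responses.

    Responses are compared as whole strings after trimming and lowercasing,
    which is an assumption made for this sketch.
    """
    counts = Counter(r.strip().lower() for r in responses)
    total = sum(counts.values())
    # H = -sum(p * log2(p)) over the distinct responses observed.
    return -sum((c / total) * math.log2(c / total) for c in counts.values())
```

Under this reading, a prompt such as "repeat the word seven" that almost always draws the same reply scores near 0 bits, while an open-ended question whose replies all differ scores high, so the constrained prompts described in the abstract would sit at the low end.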
    • 4. Granted invention patent
    • Method and apparatus for combining information from speech signals for adaptive interaction in teaching and testing
    • US5870709A
    • 1999-02-09
    • US753580
    • 1996-11-25
    • Jared C. Bernstein
    • Jared C. Bernstein
    • G09B5/04; G06F1/00; G06F3/16; G09B7/00; G09B7/04; G09B19/00; G09B19/04; G09B19/06; G10L15/00; G10L15/10; G10L15/22; G10L15/28; G01L5/06
    • G09B7/04; G09B19/04; G10L15/22
    • A computer system with a speech recognition component provides a method and apparatus for instructing and evaluating the proficiency of human users in skills that can be exhibited through speaking. The computer system tracks linguistic, indexical and paralinguistic characteristics of the spoken input of users, and implements games, data access, instructional systems, and tests. The computer system combines characteristics of the spoken input automatically to select appropriate material and present it in a manner suitable for the user. In one embodiment, the computer system measures the response latency and speaking rate of the user and presents its next spoken display at an appropriate speaking rate. In other embodiments, the computer system identifies the gender and native language of the user, and combines that information with the relative accuracy of the linguistic content of the user's utterance to select and display material that may be easier or more challenging for speakers with these characteristics.
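One embodiment in the abstract of US5870709A measures the user's response latency and speaking rate and then presents the next spoken prompt at an appropriate rate. The abstract gives no formulas, so the sketch below is only one plausible reading: latency is taken as the gap between the end of the prompt and the start of speech, rate as words per second, and the next prompt simply matches the user's rate within fixed bounds. The `Response` fields, the function names, and the bounds `slowest` and `fastest` are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Response:
    prompt_end: float    # seconds, when the prompt finished playing
    speech_start: float  # seconds, when the user began speaking
    speech_end: float    # seconds, when the user stopped speaking
    word_count: int      # number of words recognized in the response


def response_latency(r: Response) -> float:
    """Seconds between the end of the prompt and the start of speech."""
    return r.speech_start - r.prompt_end


def speaking_rate(r: Response) -> float:
    """Words per second over the spoken portion of the response."""
    duration = r.speech_end - r.speech_start
    if duration <= 0:
        raise ValueError("speech_end must be later than speech_start")
    return r.word_count / duration


def next_prompt_rate(r: Response, slowest: float = 1.5, fastest: float = 3.5) -> float:
    """Match the user's speaking rate, clamped to assumed bounds."""
    return max(slowest, min(fastest, speaking_rate(r)))
```

For example, `Response(prompt_end=10.0, speech_start=11.5, speech_end=15.5, word_count=8)` has a latency of 1.5 seconds and a rate of 2.0 words per second, so the next prompt would be delivered at 2.0 words per second under these assumptions.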