会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 21. 发明申请
    • Enabling Natural Language Understanding In An X+V Page Of A Multimodal Application
    • 在多模式应用程序的X + V页面中启用自然语言理解
    • US20080208586A1
    • 2008-08-28
    • US11679292
    • 2007-02-27
    • Soonthorn AtivanichayaphongCharles W. CrossGerald M. McCobb
    • Soonthorn AtivanichayaphongCharles W. CrossGerald M. McCobb
    • G10L21/00
    • G10L2015/228
    • Enabling natural language understanding using an X+V page of a multimodal application implemented with a statistical language model (‘SLM’) grammar of the multimodal application in an automatic speech recognition (‘ASR’) engine, with the multimodal application operating in a multimodal browser on a multimodal device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to the ASR engine through a VoiceXML interpreter, including: receiving, in the ASR engine from the multimodal application, a voice utterance; generating, by the ASR engine according to the SLM grammar, at least one recognition result for the voice utterance; determining, by an action classifier for the VoiceXML interpreter, an action identifier in dependence upon the recognition result, the action identifier specifying an action to be performed by the multimodal application; and interpreting, by the VoiceXML interpreter, the multimodal application in dependence upon the action identifier.
    • 通过使用自动语音识别(“ASR”)引擎中的多模式应用程序的统计语言模型(“SLM”)语法实现的多模式应用程序的X + V页面,实现自然语言理解,多模式应用程序在多模态下运行 浏览器支持包括语音模式和一个或多个非语音模式的多种交互模式的多模式设备,所述多模式应用通过VoiceXML解释器可操作地耦合到ASR引擎,包括:在多模式应用的ASR引擎中, 一个声音说话; 由ASR引擎根据SLM语法生成语音话语的至少一个识别结果; 通过所述VoiceXML解释器的动作分类器确定依赖于所述识别结果的动作标识符,所述动作标识符指定要由所述多模式应用执行的动作; 并且由VoiceXML解释器根据动作标识符解释多模式应用。
    • 23. 发明申请
    • ORAL MODIFICATION OF AN ASR LEXICON OF AN ASR ENGINE
    • ASR发动机的ASR LEXICON的ORAL修改
    • US20070288241A1
    • 2007-12-13
    • US11423711
    • 2006-06-13
    • Charles W. CrossFrank L. JaniaJames R. Lewis
    • Charles W. CrossFrank L. JaniaJames R. Lewis
    • G10L21/00
    • G10L15/22G10L15/06G10L2015/0631
    • Methods, apparatus, and computer program products are described for providing oral modification of an ASR lexicon of an ASR engine that include receiving, in the ASR engine from a user through a multimodal application, speech for recognition, where the ASR engine includes an ASR lexicon of words capable of recognition by the ASR engine, and the ASR lexicon does not contain at least one word of the speech for recognition; indicating by the ASR engine through the multimodal application to the user that the ASR lexicon does not contain the word; receiving by the ASR engine from the user through the multimodal application an oral instruction to add the word to the ASR lexicon, where the oral instruction is accompanied by an oral spelling of the word from the user; and executing the instruction by the ASR engine.
    • 描述了用于提供ASR引擎的ASR词汇的口头修改的方法,装置和计算机程序产品,其包括在用户通过多模式应用的ASR引擎中接收用于识别的语音,其中ASR引擎包括ASR词典 能够被ASR引擎识别的字,并且ASR词典不包含用于识别的言语中的至少一个单词; 由ASR引擎通过多模态应用向用户指示ASR词典不包含该词; 由ASR引擎从用户通过多模式应用程序接收口头指令,将该单词添加到ASR词典中,其中口头指令伴随着来自用户的单词的口头拼写; 并执行ASR引擎的指令。
    • 26. 发明申请
    • Indexing Digitized Speech With Words Represented In The Digitized Speech
    • 用数字化语言代表的数字化语音索引
    • US20080235021A1
    • 2008-09-25
    • US11688331
    • 2007-03-20
    • Charles W. CrossFrank L. Jania
    • Charles W. CrossFrank L. Jania
    • G10L15/00
    • G10L15/19G10L15/183G10L15/193G10L15/197G10L15/22G10L21/06G10L2015/228
    • Indexing digitized speech with words represented in the digitized speech, with a multimodal digital audio editor operating on a multimodal device supporting modes of user interaction, the modes of user interaction including a voice mode and one or more non-voice modes, the multimodal digital audio editor operatively coupled to an ASR engine, including providing by the multimodal digital audio editor to the ASR engine digitized speech for recognition; receiving in the multimodal digital audio editor from the ASR engine recognized user speech including a recognized word, also including information indicating where, in the digitized speech, representation of the recognized word begins; and inserting by the multimodal digital audio editor the recognized word, in association with the information indicating where, in the digitized speech, representation of the recognized word begins, into a speech recognition grammar, the speech recognition grammar voice enabling user interface commands of the multimodal digital audio editor.
    • 在数字化语音中表示的词索引数字化语音,多模数字音频编辑器在支持用户交互模式的多模式设备上操作,包括语音模式和一种或多种非语音模式的用户交互模式,多模式数字音频 编辑器可操作地耦合到ASR引擎,包括由多模数字音频编辑器提供给ASR引擎的数字化语音进行识别; 在多模式数字音频编辑器中从包含识别字的ASR引擎识别的用户语音接收信息,还包括指示在数字化语音中识别字词的表示何处开始的信息; 并且通过多模式数字音频编辑器将识别的词与表示数字化语音在识别字的表示开始的位置的信息相关联地插入到语音识别语法中,使语音识别语法语音启用多模态的用户界面命令 数字音频编辑器。
    • 27. 发明申请
    • Effecting Functions On A Multimodal Telephony Device
    • 多功能电话设备上的功能
    • US20080208594A1
    • 2008-08-28
    • US11679312
    • 2007-02-27
    • Charles W. CrossFrank L. JaniaDarren M. Shaw
    • Charles W. CrossFrank L. JaniaDarren M. Shaw
    • G10L11/00
    • G10L15/26
    • Methods, apparatus, and computer program products are described for effecting functions on a multimodal telephony device, implemented with the multimodal application operating on a multimodal telephony device supporting multiple modes of interaction including a voice mode and one or more non-voice modes, the multimodal application operatively coupled to an automated speech recognition engine. Embodiments include receiving the speech of a telephone call; identifying with the automated speech recognition engine action keywords in the speech of the telephone call; selecting a function of the multimodal telephony device in dependence upon the action keywords; identifying parameters for the function of the multimodal telephony device; and executing the function of the multimodal telephony device using the identified parameters.
    • 描述了用于在多模式电话设备上实现功能的方法,装置和计算机程序产品,该多模式电话设备通过在支持包括语音模式和一个或多个非语音模式的多种交互模式的多模式电话设备上操作的多模式应用程序来实现,多模态 可操作地耦合到自动语音识别引擎的应用。 实施例包括接收电话呼叫的语音; 用电话语音中的自动语音识别引擎动作关键字识别; 根据动作关键词选择多模式电话设备的功能; 识别用于多模式电话设备的功能的参数; 以及使用所识别的参数来执行所述多模式电话设备的功能。
    • 28. 发明申请
    • Altering Behavior Of A Multimodal Application Based On Location
    • 改变基于位置的多模态应用的行为
    • US20080208593A1
    • 2008-08-28
    • US11679301
    • 2007-02-27
    • Soonthorn AtivanichayaphongCharles W. CrossIgor R. JablokovGerald M. McCobb
    • Soonthorn AtivanichayaphongCharles W. CrossIgor R. JablokovGerald M. McCobb
    • G10L21/00
    • G10L15/22G10L15/24
    • Methods, apparatus, and products are disclosed for altering behavior of a multimodal application based on location. The multimodal application operates on a multimodal device supporting multiple modes of user interaction with the multimodal application, including a voice mode and one or more non-voice modes. The voice mode of user interaction with the multimodal application is supported by a voice interpreter. Altering behavior of a multimodal application based on location includes: receiving a location change notification in the voice interpreter from a device location manager, the device location manager operatively coupled to a position detection component of the multimodal device, the location change notification specifying a current location of the multimodal device; updating, by the voice interpreter, location-based environment parameters for the voice interpreter in dependence upon the current location of the multimodal device; and interpreting, by the voice interpreter, the multimodal application in dependence upon the location-based environment parameters.
    • 公开了基于位置改变多模式应用的行为的方法,装置和产品。 多模式应用程序在多模式设备上运行,支持与多模式应用程序的多种用户交互模式,包括语音模式和一种或多种非语音模式。 与多模式应用程序的用户交互的语音模式由语音解释器支持。 基于位置改变多模式应用的行为包括:从设备位置管理器在语音解释器中接收位置改变通知,该设备位置管理器可操作地耦合到多模态设备的位置检测组件,位置变化通知指定当前位置 的多模式设备; 语音解释器根据多模式设备的当前位置更新语音解释器的基于位置的环境参数; 并且由语音解释器根据基于位置的环境参数来解释多模式应用。