会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 6. 发明授权
    • Records disambiguation in a multimodal application operating on a multimodal device
    • 记录在多模式设备上运行的多模式应用程序中的歧义
    • US09349367B2
    • 2016-05-24
    • US12109167
    • 2008-04-24
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.Pradeep P. Mansey
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.Pradeep P. Mansey
    • G10L15/00G10L15/08G10L15/183G10L15/22
    • G10L15/22G10L15/00G10L15/08G10L15/183
    • Methods, apparatus, and products are disclosed for record disambiguation in a multimodal application operating on a multimodal device, the multimodal device supporting multiple modes of interaction including at least a voice mode and a visual mode, that include: prompting, by the multimodal application, a user to identify a particular record among a plurality of records; receiving, by the multimodal application in response to the prompt, a voice utterance from the user; determining, by the multimodal application, that the voice utterance ambiguously identifies more than one of the plurality of records; generating, by the multimodal application, a user interaction to disambiguate the records ambiguously identified by the voice utterance in dependence upon record attributes of the records ambiguously identified by the voice utterance; and selecting, by the multimodal application for further processing, one of the records ambiguously identified by the voice utterance in dependence upon the user interaction.
    • 公开了用于在多模式设备上操作的多模式应用中的记录消歧的方法,装置和产品,所述多模式设备支持包括至少语音模式和视觉模式的多种交互模式,其包括:由多模式应用提示, 用户识别多个记录中的特定记录; 由多模式应用程序响应于该提示,接收来自用户的语音发声; 由所述多模式应用程序确定所述语音发音含糊地识别所述多​​个记录中的多于一个的记录; 由多模式应用程序产生用户交互,以消除由声音话语模糊识别的记录,依赖于由语音话语模糊识别的记录的记录属性; 以及通过多模式应用程序进行进一步处理,根据用户交互,通过语音话语模糊识别的记录之一。
    • 7. 发明申请
    • TESTING A GRAMMAR USED IN SPEECH RECOGNITION FOR RELIABILITY IN A PLURALITY OF OPERATING ENVIRONMENTS HAVING DIFFERENT BACKGROUND NOISE
    • 测试在具有不同背景噪声的多种操作环境中可靠性的语音识别中使用的灰度
    • US20120053934A1
    • 2012-03-01
    • US13289233
    • 2011-11-04
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, JR.Michael H. Mirt
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, JR.Michael H. Mirt
    • G10L15/20
    • G10L15/01
    • Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.
    • 用于在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性的方法,系统和产品,包括:为所述多个操作环境中的每一个接收记录的背景噪声; 产生语音识别引擎使用语法进行识别的测试语音语音; 将测试语音发音与每个记录的背景噪声混合,导致多个混合测试语音话语,每个混合测试语音话语具有不同的背景噪声; 对于每个混合测试语音话语,使用语法和混合测试语音话语进行语音识别,导致每个混合测试语音话语的语音识别结果; 并且对于每个记录的背景噪声,根据具有记录的背景噪声的混合测试语音话语的语音识别结果,评估语法的语音识别可靠性。
    • 8. 发明授权
    • Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise
    • 在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性
    • US08082148B2
    • 2011-12-20
    • US12109204
    • 2008-04-24
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.Michael H. Mirt
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.Michael H. Mirt
    • G10L15/20
    • G10L15/01
    • Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.
    • 用于在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性的方法,系统和产品,包括:为所述多个操作环境中的每一个接收记录的背景噪声; 产生语音识别引擎使用语法进行识别的测试语音语音; 将测试语音发音与每个记录的背景噪声混合,导致多个混合测试语音话语,每个混合测试语音话语具有不同的背景噪声; 对于每个混合测试语音话语,使用语法和混合测试语音话语进行语音识别,导致每个混合测试语音话语的语音识别结果; 并且对于每个记录的背景噪声,根据具有记录的背景噪声的混合测试语音话语的语音识别结果来评估语法的语音识别可靠性。
    • 9. 发明申请
    • Testing A Grammar Used In Speech Recognition For Reliability In A Plurality Of Operating Environments Having Different Background Noise
    • 在具有不同背景噪声的多种操作环境中测试用于语音识别中的可用性的语法
    • US20090271189A1
    • 2009-10-29
    • US12109204
    • 2008-04-24
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, JR.Michael H. Mirt
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, JR.Michael H. Mirt
    • G10L15/00
    • G10L15/01
    • Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.
    • 用于在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性的方法,系统和产品,包括:为所述多个操作环境中的每一个接收记录的背景噪声; 产生语音识别引擎使用语法进行识别的测试语音语音; 将测试语音发音与每个记录的背景噪声混合,导致多个混合测试语音话语,每个混合测试语音话语具有不同的背景噪声; 对于每个混合测试语音话语,使用语法和混合测试语音话语进行语音识别,导致每个混合测试语音话语的语音识别结果; 并且对于每个记录的背景噪声,根据具有记录的背景噪声的混合测试语音话语的语音识别结果,评估语法的语音识别可靠性。
    • 10. 发明授权
    • Improving speech capabilities of a multimodal application
    • 提高多模式应用程序的语音能力
    • US08380513B2
    • 2013-02-19
    • US12468166
    • 2009-05-19
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.
    • G10L11/00
    • G10L15/22G10L15/187G10L15/19G10L2015/228
    • Improving speech capabilities of a multimodal application including receiving, by the multimodal browser, a media file having a metadata container; retrieving, by the multimodal browser, from the metadata container a speech artifact related to content stored in the media file for inclusion in the speech engine available to the multimodal browser; determining whether the speech artifact includes a grammar rule or a pronunciation rule; if the speech artifact includes a grammar rule, modifying, by the multimodal browser, the grammar of the speech engine to include the grammar rule; and if the speech artifact includes a pronunciation rule, modifying, by the multimodal browser, the lexicon of the speech engine to include the pronunciation rule.
    • 改善多模式应用的语音能力,包括由多模式浏览器接收具有元数据容器的媒体文件; 由所述多模式浏览器从所述元数据容器检索与存储在所述媒体文件中的内容相关的语音伪像,以包括在所述多模式浏览器中可用的语音引擎中; 确定语音伪影是否包括语法规则或发音规则; 如果语音工件包括语法规则,则由多模式浏览器修改语音引擎的语法以包括语法规则; 并且如果语音伪影包括发音规则,则由多模式浏览器修改语音引擎的词典以包括发音规则。