会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明授权
    • Improving speech capabilities of a multimodal application
    • 提高多模式应用程序的语音能力
    • US08380513B2
    • 2013-02-19
    • US12468166
    • 2009-05-19
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.
    • G10L11/00
    • G10L15/22G10L15/187G10L15/19G10L2015/228
    • Improving speech capabilities of a multimodal application including receiving, by the multimodal browser, a media file having a metadata container; retrieving, by the multimodal browser, from the metadata container a speech artifact related to content stored in the media file for inclusion in the speech engine available to the multimodal browser; determining whether the speech artifact includes a grammar rule or a pronunciation rule; if the speech artifact includes a grammar rule, modifying, by the multimodal browser, the grammar of the speech engine to include the grammar rule; and if the speech artifact includes a pronunciation rule, modifying, by the multimodal browser, the lexicon of the speech engine to include the pronunciation rule.
    • 改善多模式应用的语音能力,包括由多模式浏览器接收具有元数据容器的媒体文件; 由所述多模式浏览器从所述元数据容器检索与存储在所述媒体文件中的内容相关的语音伪像,以包括在所述多模式浏览器中可用的语音引擎中; 确定语音伪影是否包括语法规则或发音规则; 如果语音工件包括语法规则,则由多模式浏览器修改语音引擎的语法以包括语法规则; 并且如果语音伪影包括发音规则,则由多模式浏览器修改语音引擎的词典以包括发音规则。
    • 5. 发明授权
    • Records disambiguation in a multimodal application operating on a multimodal device
    • 记录在多模式设备上运行的多模式应用程序中的歧义
    • US09349367B2
    • 2016-05-24
    • US12109167
    • 2008-04-24
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.Pradeep P. Mansey
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.Pradeep P. Mansey
    • G10L15/00G10L15/08G10L15/183G10L15/22
    • G10L15/22G10L15/00G10L15/08G10L15/183
    • Methods, apparatus, and products are disclosed for record disambiguation in a multimodal application operating on a multimodal device, the multimodal device supporting multiple modes of interaction including at least a voice mode and a visual mode, that include: prompting, by the multimodal application, a user to identify a particular record among a plurality of records; receiving, by the multimodal application in response to the prompt, a voice utterance from the user; determining, by the multimodal application, that the voice utterance ambiguously identifies more than one of the plurality of records; generating, by the multimodal application, a user interaction to disambiguate the records ambiguously identified by the voice utterance in dependence upon record attributes of the records ambiguously identified by the voice utterance; and selecting, by the multimodal application for further processing, one of the records ambiguously identified by the voice utterance in dependence upon the user interaction.
    • 公开了用于在多模式设备上操作的多模式应用中的记录消歧的方法,装置和产品,所述多模式设备支持包括至少语音模式和视觉模式的多种交互模式,其包括:由多模式应用提示, 用户识别多个记录中的特定记录; 由多模式应用程序响应于该提示,接收来自用户的语音发声; 由所述多模式应用程序确定所述语音发音含糊地识别所述多​​个记录中的多于一个的记录; 由多模式应用程序产生用户交互,以消除由声音话语模糊识别的记录,依赖于由语音话语模糊识别的记录的记录属性; 以及通过多模式应用程序进行进一步处理,根据用户交互,通过语音话语模糊识别的记录之一。
    • 6. 发明授权
    • Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise
    • 在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性
    • US08082148B2
    • 2011-12-20
    • US12109204
    • 2008-04-24
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.Michael H. Mirt
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.Michael H. Mirt
    • G10L15/20
    • G10L15/01
    • Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.
    • 用于在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性的方法,系统和产品,包括:为所述多个操作环境中的每一个接收记录的背景噪声; 产生语音识别引擎使用语法进行识别的测试语音语音; 将测试语音发音与每个记录的背景噪声混合,导致多个混合测试语音话语,每个混合测试语音话语具有不同的背景噪声; 对于每个混合测试语音话语,使用语法和混合测试语音话语进行语音识别,导致每个混合测试语音话语的语音识别结果; 并且对于每个记录的背景噪声,根据具有记录的背景噪声的混合测试语音话语的语音识别结果来评估语法的语音识别可靠性。
    • 7. 发明授权
    • Speech enabled media sharing in a multimodal application
    • 在多模式应用程序中启用语音启用媒体共享
    • US08510117B2
    • 2013-08-13
    • US12500029
    • 2009-07-09
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.
    • G10L21/00
    • G06F17/30923G06F17/30861G10L15/26
    • Speech enabled media sharing in a multimodal application including parsing, by a multimodal browser, one or more markup documents of a multimodal application; identifying, by the multimodal browser, in the one or more markup documents a web resource for display in the multimodal browser; loading, by the multimodal browser, a web resource sharing grammar that includes keywords for modes of resource sharing and keywords for targets for receipt of web resources; receiving, by the multimodal browser, an utterance matching a keyword for the web resource, a keyword for a mode of resource sharing and a keyword for a target for receipt of the web resource in the web resource sharing grammar thereby identifying the web resource, a mode of resource sharing, and a target for receipt of the web resource; and sending, by the multimodal browser, the web resource to the identified target for the web resource using the identified mode of resource sharing.
    • 在多模式应用程序中启用语音启用媒体共享,包括通过多模式浏览器解析多模式应用程序的一个或多个标记文档; 由多模式浏览器在一个或多个标记文档中识别用于在多模式浏览器中显示的网络资源; 由多模式浏览器加载包括资源共享模式的关键字和用于接收网络资源的目标的关键字的网络资源共享语法; 通过多模式浏览器接收与web资源匹配的关键词,用于资源共享模式的关键字和用于在web资源共享语法中接收web资源的目标的关键字,从而识别web资源, 资源共享模式,以及Web资源接收目标; 以及使用所识别的资源共享模式,将多个模式浏览器将web资源发送到所识别的web资源的目标。
    • 9. 发明授权
    • Dynamically publishing directory information for a plurality of interactive voice response systems
    • 动态地发布多个交互式语音应答系统的目录信息
    • US08638909B2
    • 2014-01-28
    • US13527355
    • 2012-06-19
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.Fang Wang
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.Fang Wang
    • H04M1/64
    • H04M3/493
    • Some example embodiments include a method of dynamically publishing directory information for a plurality of interactive voice response (‘IVR’) systems. The method includes receiving, by the IVR directory service on behalf of one of the IVR systems, a web services update request. The method includes determining, by the IVR directory service in response to the web services update request, updated directory information for the IVR system. The method includes updating the IVR system directory with the updated directory information for the IVR system. The method includes generating an updated voice mode user interface to reflect the updated IVR system directory with the updated directory information for the IVR system. The generating includes creating one more voice dialogs in accordance with the directory information, the one or more voice dialogs specifying a call flow defining the interaction between a caller and the IVR directory service.
    • 一些示例性实施例包括动态地发布用于多个交互式语音响应(“IVR”)系统的目录信息的方法。 该方法包括代表IVR系统之一的IVR目录服务接收web服务更新请求。 该方法包括响应于Web服务更新请求,通过IVR目录服务确定用于IVR系统的更新的目录信息。 该方法包括用IVR系统更新的目录信息更新IVR系统目录。 该方法包括生成更新的语音模式用户界面,以使用IVR系统的更新的目录信息来反映更新的IVR系统目录。 生成包括根据目录信息创建另外一个语音对话,所述一个或多个语音对话框指定定义呼叫者和IVR目录服务之间的交互的呼叫流。
    • 10. 发明授权
    • Multimodal teleconferencing
    • 多模式电话会议
    • US08416714B2
    • 2013-04-09
    • US12535923
    • 2009-08-05
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.
    • Ciprian AgapiWilliam K. BodinCharles W. Cross, Jr.
    • H04L12/16H04L12/18
    • H04L12/413G10L15/26G10L17/00
    • Multimodal teleconferencing including receiving, by a multimodal teleconferencing module, a speech utterance from one of a plurality of participants in the multimodal teleconference; identifying the participant making the speech utterance as a current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to the current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to one or more other participants in the multimodal teleconference; providing, by the multimodal teleconferencing module to a multimodal teleconferencing client for display to the current speaker, an identification of the speaker and the content retrieved for the speaker; and providing, by the multimodal teleconferencing module to one or more of multimodal teleconferencing clients for display to the other participants, an identification of the current speaker with the content retrieved for the one or more other participants in the multimodal teleconference.
    • 多模式电话会议包括由多模式电话会议模块接收来自多模式电话会议中的多个参与者之一的演讲话语; 将作为演讲话语的参与者识别为当前的演讲者; 由多模式电话会议模块从当前说话者的帐户检索用于显示给当前说话者的内容; 由多模式电话会议模块从当前说话者的帐户中检索用于向多模式电话会议中的一个或多个其他参与者显示的内容; 由多模式电话会议模块向多模式电话会议客户端提供用于向当前扬声器显示的扬声器的标识和为扬声器检索的内容; 以及由所述多模式电话会议模块向一个或多个多模式电话会议客户端提供用于向所述其他参与者显示的当前说话者的识别,所述内容是为所述多模式电话会议中的所述一个或多个其他参与者检索的内容。