专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US09076454B2 Adjusting a speech engine for a mobile computing device based on background noise 有权
标题翻译：基于背景噪声调整移动计算设备的语音引擎
公开(公告)号：US09076454B2
公开(公告)日：2015-07-07
申请号：US13358097
申请日：2012-01-25
申请人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Paritosh D. Patel
发明人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Paritosh D. Patel
IPC分类号： G10L15/20 , G10L21/0208
CPC分类号： G10L21/0208 , G10L15/20
摘要： Methods, apparatus, and products are disclosed for adjusting a speech engine for a mobile computing device based on background noise, the mobile computing device operatively coupled to a microphone, that include: sampling, through the microphone, background noise for a plurality of operating environments in which the mobile computing device operates; generating, for each operating environment, a noise model in dependence upon the sampled background noise for that operating environment; and configuring the speech engine for the mobile computing device with the noise model for the operating environment in which the mobile computing device currently operates.
摘要翻译：公开了用于基于背景噪声调整用于移动计算设备的语音引擎的方法，装置和产品，该移动计算设备可操作地耦合到麦克风，其包括：通过麦克风对多个操作环境的背景噪声进行采样其中移动计算设备运行; 根据所述操作环境的采样背景噪声，为每个操作环境产生噪声模型; 以及为移动计算设备当前操作的操作环境的噪声模型配置移动计算设备的语音引擎。

2. 发明授权

US08121837B2 Adjusting a speech engine for a mobile computing device based on background noise 有权
标题翻译：基于背景噪声调整移动计算设备的语音引擎
公开(公告)号：US08121837B2
公开(公告)日：2012-02-21
申请号：US12109151
申请日：2008-04-24
申请人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Paritosh D. Patel
发明人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Paritosh D. Patel
IPC分类号： G10L15/20
CPC分类号： G10L21/0208 , G10L15/20
摘要： Methods, apparatus, and products are disclosed for adjusting a speech engine for a mobile computing device based on background noise, the mobile computing device operatively coupled to a microphone, that include: sampling, through the microphone, background noise for a plurality of operating environments in which the mobile computing device operates; generating, for each operating environment, a noise model in dependence upon the sampled background noise for that operating environment; and configuring the speech engine for the mobile computing device with the noise model for the operating environment in which the mobile computing device currently operates.
摘要翻译：公开了用于基于背景噪声调整用于移动计算设备的语音引擎的方法，装置和产品，该移动计算设备可操作地耦合到麦克风，其包括：通过麦克风对多个操作环境的背景噪声进行采样其中移动计算设备运行; 根据所述操作环境的采样背景噪声，为每个操作环境产生噪声模型; 以及为移动计算设备当前操作的操作环境的噪声模型配置移动计算设备的语音引擎。

3. 发明授权

US07548977B2 Client / server application task allocation based upon client resources 有权
标题翻译：基于客户端资源的客户/服务器应用程序任务分配
公开(公告)号：US07548977B2
公开(公告)日：2009-06-16
申请号：US11056493
申请日：2005-02-11
申请人： Ciprian Agapi , Charles W. Cross, Jr. , Nicolae D. Metianu , Paritosh D. Patel
发明人： Ciprian Agapi , Charles W. Cross, Jr. , Nicolae D. Metianu , Paritosh D. Patel
IPC分类号： G06F15/173
CPC分类号： G06F9/505 , G06F9/5055
摘要： A software method for allocating application tasks between a client and a server can include the step of detecting client-based computing resources for executing at least one application task. At least one indicator of the detected client-based computing resources can be conveyed to a remotely located application server, the application server can determine whether to allocate at least one application task to the client or to a server component based upon at least one indicator.
摘要翻译：用于在客户机和服务器之间分配应用任务的软件方法可以包括检测基于客户机的计算资源以执行至少一个应用任务的步骤。检测到的基于客户端的计算资源的至少一个指示符可以被传送到位于远程的应用服务器，所述应用服务器可以基于至少一个指示符来确定是否向客户端或服务器组件分配至少一个应用任务。

4. 发明授权

US08380513B2 Improving speech capabilities of a multimodal application 有权
标题翻译：提高多模式应用程序的语音能力
公开(公告)号：US08380513B2
公开(公告)日：2013-02-19
申请号：US12468166
申请日：2009-05-19
申请人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr.
发明人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr.
IPC分类号： G10L11/00
CPC分类号： G10L15/22 , G10L15/187 , G10L15/19 , G10L2015/228
摘要： Improving speech capabilities of a multimodal application including receiving, by the multimodal browser, a media file having a metadata container; retrieving, by the multimodal browser, from the metadata container a speech artifact related to content stored in the media file for inclusion in the speech engine available to the multimodal browser; determining whether the speech artifact includes a grammar rule or a pronunciation rule; if the speech artifact includes a grammar rule, modifying, by the multimodal browser, the grammar of the speech engine to include the grammar rule; and if the speech artifact includes a pronunciation rule, modifying, by the multimodal browser, the lexicon of the speech engine to include the pronunciation rule.
摘要翻译：改善多模式应用的语音能力，包括由多模式浏览器接收具有元数据容器的媒体文件; 由所述多模式浏览器从所述元数据容器检索与存储在所述媒体文件中的内容相关的语音伪像，以包括在所述多模式浏览器中可用的语音引擎中; 确定语音伪影是否包括语法规则或发音规则; 如果语音工件包括语法规则，则由多模式浏览器修改语音引擎的语法以包括语法规则; 并且如果语音伪影包括发音规则，则由多模式浏览器修改语音引擎的词典以包括发音规则。

5. 发明授权

US09349367B2 Records disambiguation in a multimodal application operating on a multimodal device 有权
标题翻译：记录在多模式设备上运行的多模式应用程序中的歧义
公开(公告)号：US09349367B2
公开(公告)日：2016-05-24
申请号：US12109167
申请日：2008-04-24
申请人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Pradeep P. Mansey
发明人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Pradeep P. Mansey
IPC分类号： G10L15/00 , G10L15/08 , G10L15/183 , G10L15/22
CPC分类号： G10L15/22 , G10L15/00 , G10L15/08 , G10L15/183
摘要： Methods, apparatus, and products are disclosed for record disambiguation in a multimodal application operating on a multimodal device, the multimodal device supporting multiple modes of interaction including at least a voice mode and a visual mode, that include: prompting, by the multimodal application, a user to identify a particular record among a plurality of records; receiving, by the multimodal application in response to the prompt, a voice utterance from the user; determining, by the multimodal application, that the voice utterance ambiguously identifies more than one of the plurality of records; generating, by the multimodal application, a user interaction to disambiguate the records ambiguously identified by the voice utterance in dependence upon record attributes of the records ambiguously identified by the voice utterance; and selecting, by the multimodal application for further processing, one of the records ambiguously identified by the voice utterance in dependence upon the user interaction.
摘要翻译：公开了用于在多模式设备上操作的多模式应用中的记录消歧的方法，装置和产品，所述多模式设备支持包括至少语音模式和视觉模式的多种交互模式，其包括：由多模式应用提示，用户识别多个记录中的特定记录; 由多模式应用程序响应于该提示，接收来自用户的语音发声; 由所述多模式应用程序确定所述语音发音含糊地识别所述多个记录中的多于一个的记录; 由多模式应用程序产生用户交互，以消除由声音话语模糊识别的记录，依赖于由语音话语模糊识别的记录的记录属性; 以及通过多模式应用程序进行进一步处理，根据用户交互，通过语音话语模糊识别的记录之一。

6. 发明授权

US08082148B2 Testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise 有权
标题翻译：在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性
公开(公告)号：US08082148B2
公开(公告)日：2011-12-20
申请号：US12109204
申请日：2008-04-24
申请人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Michael H. Mirt
发明人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Michael H. Mirt
IPC分类号： G10L15/20
CPC分类号： G10L15/01
摘要： Methods, systems, and products for testing a grammar used in speech recognition for reliability in a plurality of operating environments having different background noise that include: receiving recorded background noise for each of the plurality of operating environments; generating a test speech utterance for recognition by a speech recognition engine using a grammar; mixing the test speech utterance with each recorded background noise, resulting in a plurality of mixed test speech utterances, each mixed test speech utterance having different background noise; performing, for each of the mixed test speech utterances, speech recognition using the grammar and the mixed test speech utterance, resulting in speech recognition results for each of the mixed test speech utterances; and evaluating, for each recorded background noise, speech recognition reliability of the grammar in dependence upon the speech recognition results for the mixed test speech utterance having that recorded background noise.
摘要翻译：用于在具有不同背景噪声的多个操作环境中测试用于语音识别中的语法的可靠性的方法，系统和产品，包括：为所述多个操作环境中的每一个接收记录的背景噪声; 产生语音识别引擎使用语法进行识别的测试语音语音; 将测试语音发音与每个记录的背景噪声混合，导致多个混合测试语音话语，每个混合测试语音话语具有不同的背景噪声; 对于每个混合测试语音话语，使用语法和混合测试语音话语进行语音识别，导致每个混合测试语音话语的语音识别结果; 并且对于每个记录的背景噪声，根据具有记录的背景噪声的混合测试语音话语的语音识别结果来评估语法的语音识别可靠性。

7. 发明授权

US08510117B2 Speech enabled media sharing in a multimodal application 有权
标题翻译：在多模式应用程序中启用语音启用媒体共享
公开(公告)号：US08510117B2
公开(公告)日：2013-08-13
申请号：US12500029
申请日：2009-07-09
申请人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr.
发明人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr.
IPC分类号： G10L21/00
CPC分类号： G06F17/30923 , G06F17/30861 , G10L15/26
摘要： Speech enabled media sharing in a multimodal application including parsing, by a multimodal browser, one or more markup documents of a multimodal application; identifying, by the multimodal browser, in the one or more markup documents a web resource for display in the multimodal browser; loading, by the multimodal browser, a web resource sharing grammar that includes keywords for modes of resource sharing and keywords for targets for receipt of web resources; receiving, by the multimodal browser, an utterance matching a keyword for the web resource, a keyword for a mode of resource sharing and a keyword for a target for receipt of the web resource in the web resource sharing grammar thereby identifying the web resource, a mode of resource sharing, and a target for receipt of the web resource; and sending, by the multimodal browser, the web resource to the identified target for the web resource using the identified mode of resource sharing.
摘要翻译：在多模式应用程序中启用语音启用媒体共享，包括通过多模式浏览器解析多模式应用程序的一个或多个标记文档; 由多模式浏览器在一个或多个标记文档中识别用于在多模式浏览器中显示的网络资源; 由多模式浏览器加载包括资源共享模式的关键字和用于接收网络资源的目标的关键字的网络资源共享语法; 通过多模式浏览器接收与web资源匹配的关键词，用于资源共享模式的关键字和用于在web资源共享语法中接收web资源的目标的关键字，从而识别web资源，资源共享模式，以及Web资源接收目标; 以及使用所识别的资源共享模式，将多个模式浏览器将web资源发送到所识别的web资源的目标。

8. 发明授权

US08214242B2 Signaling correspondence between a meeting agenda and a meeting discussion 失效
标题翻译：会议议程和会议讨论之间的信号通信
公开(公告)号：US08214242B2
公开(公告)日：2012-07-03
申请号：US12109227
申请日：2008-04-24
申请人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Brian D. Goodman , Frank L. Jania , Darren M. Shaw
发明人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Brian D. Goodman , Frank L. Jania , Darren M. Shaw
IPC分类号： G06Q10/00
CPC分类号： G06Q10/109 , G06Q10/1095
摘要： Signaling correspondence between a meeting agenda and a meeting discussion includes: receiving a meeting agenda specifying one or more topics for a meeting; analyzing, for each topic, one or more documents to identify topic keywords for that topic; receiving meeting discussions among participants for the meeting; identifying a current topic for the meeting in dependence upon the meeting agenda; determining a correspondence indicator in dependence upon the meeting discussions and the topic keywords for the current topic, the correspondence indicator specifying the correspondence between the meeting agenda and the meeting discussion; and rendering the correspondence indicator to the participants of the meeting.
摘要翻译：会议议程和会议讨论之间的信号通信包括：收到会议议程，列出会议的一个或多个主题; 为每个主题分析一个或多个文档以标识该主题的主题关键字; 接受会议与会者的会议讨论; 根据会议议程确定会议目前的议题; 根据会议讨论和当前主题的主题关键词确定一个通信指标，指定会议议程和会议讨论之间的对应关系的对应指标; 并将通信指标提交给会议与会者。

9. 发明授权

US08638909B2 Dynamically publishing directory information for a plurality of interactive voice response systems 失效
标题翻译：动态地发布多个交互式语音应答系统的目录信息
公开(公告)号：US08638909B2
公开(公告)日：2014-01-28
申请号：US13527355
申请日：2012-06-19
申请人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Fang Wang
发明人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr. , Fang Wang
IPC分类号： H04M1/64
CPC分类号： H04M3/493
摘要： Some example embodiments include a method of dynamically publishing directory information for a plurality of interactive voice response (‘IVR’) systems. The method includes receiving, by the IVR directory service on behalf of one of the IVR systems, a web services update request. The method includes determining, by the IVR directory service in response to the web services update request, updated directory information for the IVR system. The method includes updating the IVR system directory with the updated directory information for the IVR system. The method includes generating an updated voice mode user interface to reflect the updated IVR system directory with the updated directory information for the IVR system. The generating includes creating one more voice dialogs in accordance with the directory information, the one or more voice dialogs specifying a call flow defining the interaction between a caller and the IVR directory service.
摘要翻译：一些示例性实施例包括动态地发布用于多个交互式语音响应（“IVR”）系统的目录信息的方法。该方法包括代表IVR系统之一的IVR目录服务接收web服务更新请求。该方法包括响应于Web服务更新请求，通过IVR目录服务确定用于IVR系统的更新的目录信息。该方法包括用IVR系统更新的目录信息更新IVR系统目录。该方法包括生成更新的语音模式用户界面，以使用IVR系统的更新的目录信息来反映更新的IVR系统目录。生成包括根据目录信息创建另外一个语音对话，所述一个或多个语音对话框指定定义呼叫者和IVR目录服务之间的交互的呼叫流。

10. 发明授权

US08416714B2 Multimodal teleconferencing 有权
标题翻译：多模式电话会议
公开(公告)号：US08416714B2
公开(公告)日：2013-04-09
申请号：US12535923
申请日：2009-08-05
申请人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr.
发明人： Ciprian Agapi , William K. Bodin , Charles W. Cross, Jr.
IPC分类号： H04L12/16 , H04L12/18
CPC分类号： H04L12/413 , G10L15/26 , G10L17/00
摘要： Multimodal teleconferencing including receiving, by a multimodal teleconferencing module, a speech utterance from one of a plurality of participants in the multimodal teleconference; identifying the participant making the speech utterance as a current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to the current speaker; retrieving, by the multimodal teleconferencing module from accounts for the current speaker, content for display to one or more other participants in the multimodal teleconference; providing, by the multimodal teleconferencing module to a multimodal teleconferencing client for display to the current speaker, an identification of the speaker and the content retrieved for the speaker; and providing, by the multimodal teleconferencing module to one or more of multimodal teleconferencing clients for display to the other participants, an identification of the current speaker with the content retrieved for the one or more other participants in the multimodal teleconference.
摘要翻译：多模式电话会议包括由多模式电话会议模块接收来自多模式电话会议中的多个参与者之一的演讲话语; 将作为演讲话语的参与者识别为当前的演讲者; 由多模式电话会议模块从当前说话者的帐户检索用于显示给当前说话者的内容; 由多模式电话会议模块从当前说话者的帐户中检索用于向多模式电话会议中的一个或多个其他参与者显示的内容; 由多模式电话会议模块向多模式电话会议客户端提供用于向当前扬声器显示的扬声器的标识和为扬声器检索的内容; 以及由所述多模式电话会议模块向一个或多个多模式电话会议客户端提供用于向所述其他参与者显示的当前说话者的识别，所述内容是为所述多模式电话会议中的所述一个或多个其他参与者检索的内容。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式