会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 31. 发明授权
    • Method and system for voice-enabled autofill
    • 语音自动填充的方法和系统
    • US07953597B2
    • 2011-05-31
    • US11199672
    • 2005-08-09
    • Soonthorn AtivanichayaphongCharles W. Cross, Jr.Gerald M. McCobb
    • Soonthorn AtivanichayaphongCharles W. Cross, Jr.Gerald M. McCobb
    • G10L15/26G06F17/00G10L15/00
    • G06F17/243G10L15/193G10L15/26H04M3/4938
    • A computer-implemented method and system are provided for filling a graphic-based form field in response to a speech utterance. The computer-implemented method includes generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string. The method further includes creating an auto-fill event based upon the at least one grammar and responsive to the speech utterance, the auto-fill event causing the filling of the form field with data corresponding to the user profile. The system includes a grammar-generating module for generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string. The system also includes an event module for creating an auto-fill event based upon the at least one grammar and responsive to the speech utterance, the event causing the filling of the form field with data corresponding to the user profile.
    • 提供了一种计算机实现的方法和系统,用于响应于语音说话填充基于图形的表单字段。 计算机实现的方法包括生成对应于表单域的语法,语法基于用户简档并且包括语义解释字符串。 所述方法还包括基于所述至少一个语法并且响应于所述语音话语来创建自动填充事件,所述自动填充事件导致用与所述用户简档对应的数据填写所述表单域。 该系统包括用于生成对应于表单域的语法的语法生成模块,所述语法基于用户简档并且包括语义解释字符串。 该系统还包括一个事件模块,用于基于该至少一个语法创建一个自动填充事件,并且响应于语音话语,该事件导致用对应于用户简档的数据填写表单域。
    • 33. 发明授权
    • Method and system for voice-enabled autofill
    • 语音自动填充的方法和系统
    • US07739117B2
    • 2010-06-15
    • US10945112
    • 2004-09-20
    • Soonthorn AtivanichayaphongCharles W. Cross, Jr.Gerald M. McCobb
    • Soonthorn AtivanichayaphongCharles W. Cross, Jr.Gerald M. McCobb
    • G10L11/00G10L15/00G06F3/00G06F11/30
    • G06F17/243G10L15/193G10L15/26H04M3/4938
    • A computer-implemented method and system are provided for filling a graphic-based form field in response to a speech utterance. The computer-implemented method includes generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string. The method further includes creating an auto-fill event based upon the at least one grammar and responsive to the speech utterance, the auto-fill event causing the filling of the form field with data corresponding to the user profile. The system includes a grammar-generating module for generating a grammar corresponding to the form field, the grammar being based on a user profile and comprising a semantic interpretation string. The system also includes an event module for creating an auto-fill event based upon the at least one grammar and responsive to the speech utterance, the event causing the filling of the form field with data corresponding to the user profile.
    • 提供了一种计算机实现的方法和系统,用于响应于语音说话填充基于图形的表单字段。 计算机实现的方法包括生成对应于表单域的语法,语法基于用户简档并且包括语义解释字符串。 所述方法还包括基于所述至少一个语法并且响应于所述语音话语来创建自动填充事件,所述自动填充事件导致用与所述用户简档对应的数据填写所述表单域。 该系统包括用于生成对应于表单域的语法的语法生成模块,所述语法基于用户简档并且包括语义解释字符串。 该系统还包括一个事件模块,用于基于该至少一个语法创建一个自动填充事件,并且响应于语音话语,该事件导致用对应于用户简档的数据填写表单域。
    • 34. 发明申请
    • Speech-Enabled Content Navigation And Control Of A Distributed Multimodal Browser
    • 分布式多模态浏览器的语音启用内容导航和控制
    • US20080255851A1
    • 2008-10-16
    • US11734445
    • 2007-04-12
    • Soonthorn AtivanichayaphongCharles W. CrossGerald M. McCobb
    • Soonthorn AtivanichayaphongCharles W. CrossGerald M. McCobb
    • G10L21/00
    • G10L15/265G06F3/16G06F3/167G10L15/26
    • Speech-enabled content navigation and control of a distributed multimodal browser is disclosed, the browser providing an execution environment for a multimodal application, the browser including a graphical user agent (‘GUA’) and a voice user agent (‘VUA’), the GUA operating on a multimodal device, the VUA operating on a voice server, that includes: transmitting, by the GUA, a link message to the VUA, the link message specifying voice commands that control the browser and an event corresponding to each voice command; receiving, by the GUA, a voice utterance from a user, the voice utterance specifying a particular voice command; transmitting, by the GUA, the voice utterance to the VUA for speech recognition by the VUA; receiving, by the GUA, an event message from the VUA, the event message specifying a particular event corresponding to the particular voice command; and controlling, by the GUA, the browser in dependence upon the particular event.
    • 公开了一种分布式多模式浏览器的语音启用内容导航和控制,浏览器为多模式应用提供执行环境,浏览器包括图形用户代理(“GUA”)和语音用户代理(“VUA”), GUA在多模式设备上操作,VUA在语音服务器上操作,其包括:由GUA向VUA发送链接消息,指定控制浏览器的语音命令的链接消息和与每个语音命令相对应的事件; 由GUA接收来自用户的语音发音,指定特定语音命令的语音话语; 通过GUA向VUA发送语音识别语音识别语音; 由GUA接收来自VUA的事件消息,事件消息指定与特定语音命令对应的特定事件; 并由GUA根据特定事件控制浏览器。
    • 35. 发明授权
    • Altering behavior of a multimodal application based on location
    • 基于位置改变多模式应用程序的行为
    • US09208783B2
    • 2015-12-08
    • US11679301
    • 2007-02-27
    • Soonthorn AtivanichayaphongCharles W. Cross, Jr.Igor R. JablokovGerald M. McCobb
    • Soonthorn AtivanichayaphongCharles W. Cross, Jr.Igor R. JablokovGerald M. McCobb
    • G10L21/00G10L25/00G10L15/22G10L15/24
    • G10L15/22G10L15/24
    • Methods, apparatus, and products are disclosed for altering behavior of a multimodal application based on location. The multimodal application operates on a multimodal device supporting multiple modes of user interaction with the multimodal application, including a voice mode and one or more non-voice modes. The voice mode of user interaction with the multimodal application is supported by a voice interpreter. Altering behavior of a multimodal application based on location includes: receiving a location change notification in the voice interpreter from a device location manager, the device location manager operatively coupled to a position detection component of the multimodal device, the location change notification specifying a current location of the multimodal device; updating, by the voice interpreter, location-based environment parameters for the voice interpreter in dependence upon the current location of the multimodal device; and interpreting, by the voice interpreter, the multimodal application in dependence upon the location-based environment parameters.
    • 公开了基于位置改变多模式应用的行为的方法,装置和产品。 多模式应用程序在多模式设备上运行,支持与多模式应用程序的多种用户交互模式,包括语音模式和一种或多种非语音模式。 与多模式应用程序的用户交互的语音模式由语音解释器支持。 基于位置改变多模式应用的行为包括:从设备位置管理器在语音解释器中接收位置改变通知,该设备位置管理器可操作地耦合到多模态设备的位置检测组件,位置变化通知指定当前位置 的多模式设备; 语音解释器根据多模式设备的当前位置更新语音解释器的基于位置的环境参数; 并且由语音解释器根据基于位置的环境参数来解释多模式应用。
    • 38. 发明授权
    • Enabling global grammars for a particular multimodal application
    • 启用特定多模式应用程序的全局语法
    • US07809575B2
    • 2010-10-05
    • US11679279
    • 2007-02-27
    • Soonthorn AtivanichayaphongCharles W. Cross, Jr.Gerald M. McCobb
    • Soonthorn AtivanichayaphongCharles W. Cross, Jr.Gerald M. McCobb
    • G10L21/00G10L11/00G10L15/18
    • G10L15/19
    • Methods, apparatus, and computer program products are described for enabling global grammars for a particular multimodal application according to the present invention by loading a multimodal web page; determining whether the loaded multimodal web page is one of a plurality of multimodal web pages of the particular multimodal application. If the loaded multimodal web page is one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes loading any currently unloaded global grammars of the particular multimodal application identified in the multimodal web page and maintaining any previously loaded global grammars. If the loaded multimodal web page is not one of the plurality of multimodal web pages of the particular multimodal application, enabling global grammars typically includes unloading any currently loaded global grammars.
    • 描述了方法,装置和计算机程序产品,用于通过加载多模式网页来实现根据本发明的特定多模式应用的全局语法; 确定加载的多模式网页是否是特定多模式应用的多个多模式网页之一。 如果加载的多模式网页是特定多模式应用程序的多个多模式网页之一,则启用全局语法通常包括加载在多模式网页中标识的特定多模式应用程序的任何当前未加载的全局语法,并维护任何先前加载的全局语法 。 如果加载的多模式网页不是特定多模式应用程序的多个多模式网页之一,则启用全局语法通常包括卸载任何当前加载的全局语法。
    • 39. 发明申请
    • PARTIALLY FILLING MIXED-INITIATIVE FORMS FROM UTTERANCES HAVING SUB-THRESHOLD CONFIDENCE SCORES BASED UPON WORD-LEVEL CONFIDENCE DATA
    • 根据词级信心数据,从具有亚阈值信心评分的新西兰部分地填充混合式主动式
    • US20080243502A1
    • 2008-10-02
    • US11692741
    • 2007-03-28
    • SOONTHORN ATIVANICHAYAPHONGGerald M. McCobbPARITOSH D. PATELMARC WHITE
    • SOONTHORN ATIVANICHAYAPHONGGerald M. McCobbPARITOSH D. PATELMARC WHITE
    • G10L15/26
    • G10L15/22G10L15/193
    • The invention discloses prompting for a spoken response that provides input for multiple elements. A single spoken utterance including content for multiple elements can be received, where each element is mapped to a data field. The spoken utterance can be speech-to-text converted to derive values for each of the multiple elements. An utterance level confidence score can be determined, which can fall below an associated certainty threshold. Element-level confidence scores for each of the derived elements can then be ascertained. A first set of the multiple elements can have element-level confidence scores above an associated certainty threshold and a second set can have scores below. Values can be stored in data fields mapped to the first set. A prompt for input for the second set can be played. Accordingly, data fields are partially filled in based upon the original speech utterance, where a second prompt for unfilled fields is played.
    • 本发明公开了一种为多个元素提供输入的口头响应的提示。 可以接收包括多个元素的内容的单个语音话语,其中每个元素被映射到数据字段。 讲话语音可以是语音到文本转换,以导出每个多个元素的值。 可以确定话语等级置信度得分,其可以低于相关的确定性阈值。 然后可以确定每个派生元素的元素级置信度得分。 多个元素的第一组可以具有高于相关确定性阈值的元素级置信度得分,而第二组可以具有下面的得分。 值可以存储在映射到第一组的数据字段中。 可以播放第二组的输入提示。 因此,基于原始语音话语部分地填充数据字段,其中播放未填充字段的第二提示。
    • 40. 发明申请
    • Altering Behavior Of A Multimodal Application Based On Location
    • 改变基于位置的多模态应用的行为
    • US20080208593A1
    • 2008-08-28
    • US11679301
    • 2007-02-27
    • Soonthorn AtivanichayaphongCharles W. CrossIgor R. JablokovGerald M. McCobb
    • Soonthorn AtivanichayaphongCharles W. CrossIgor R. JablokovGerald M. McCobb
    • G10L21/00
    • G10L15/22G10L15/24
    • Methods, apparatus, and products are disclosed for altering behavior of a multimodal application based on location. The multimodal application operates on a multimodal device supporting multiple modes of user interaction with the multimodal application, including a voice mode and one or more non-voice modes. The voice mode of user interaction with the multimodal application is supported by a voice interpreter. Altering behavior of a multimodal application based on location includes: receiving a location change notification in the voice interpreter from a device location manager, the device location manager operatively coupled to a position detection component of the multimodal device, the location change notification specifying a current location of the multimodal device; updating, by the voice interpreter, location-based environment parameters for the voice interpreter in dependence upon the current location of the multimodal device; and interpreting, by the voice interpreter, the multimodal application in dependence upon the location-based environment parameters.
    • 公开了基于位置改变多模式应用的行为的方法,装置和产品。 多模式应用程序在多模式设备上运行,支持与多模式应用程序的多种用户交互模式,包括语音模式和一种或多种非语音模式。 与多模式应用程序的用户交互的语音模式由语音解释器支持。 基于位置改变多模式应用的行为包括:从设备位置管理器在语音解释器中接收位置改变通知,该设备位置管理器可操作地耦合到多模态设备的位置检测组件,位置变化通知指定当前位置 的多模式设备; 语音解释器根据多模式设备的当前位置更新语音解释器的基于位置的环境参数; 并且由语音解释器根据基于位置的环境参数来解释多模式应用。