Quick Patent Search - fast search of global patents, free patent database for commercial use - IPRDB

1. Granted invention patent

US07181397B2 Speech dialog method and system (in force)
Publication No.: US07181397B2
Publication date: 2007-02-20
Application No.: US11118670
Filing date: 2005-04-29
Applicant(s): Changxue C. Ma, Yan M. Cheng, Chen Liu, Ted Mazurkiewicz, Steven J. Nowlan, James R. Talley, Yuan-Jun Wei
Inventor(s): Changxue C. Ma, Yan M. Cheng, Chen Liu, Ted Mazurkiewicz, Steven J. Nowlan, James R. Talley, Yuan-Jun Wei
IPC classification: G10L15/14
CPC classification: G10L17/26, G10L13/033, G10L15/22
Abstract: An electronic device (300) for speech dialog includes functions that receive (305, 105) a speech phrase that comprises a request phrase that includes an instantiated variable (215), generate (335, 115) pitch and voicing characteristics (315) of the instantiated variable, and perform speech recognition (319, 125) of the instantiated variable to determine a most likely set of acoustic states (235). The electronic device may generate (335, 140) a synthesized value of the instantiated variable using the most likely set of acoustic states and the pitch and voicing characteristics of the instantiated variable. The electronic device may use a table of previously entered values of variables that have been determined to be unique, and in which the values are associated with a most likely set of acoustic states and the pitch and voicing characteristics determined at the receipt of each value, to disambiguate (425, 430) a newly received instantiated variable.

2. Invention patent application

US20080080678A1 METHOD AND SYSTEM FOR PERSONALIZED VOICE DIALOGUE (pending, published)
Publication No.: US20080080678A1
Publication date: 2008-04-03
Application No.: US11536854
Filing date: 2006-09-29
Applicant(s): Changxue C. Ma, Yan Ming Cheng, Steven J. Nowlan, Dale W. Russell, Yuan-Jun Wei
Inventor(s): Changxue C. Ma, Yan Ming Cheng, Steven J. Nowlan, Dale W. Russell, Yuan-Jun Wei
IPC classification: H04M11/00
CPC classification: H04M3/4936, G10L2015/226
Abstract: A method (10) and system (200) for personalized voice dialogue can include tracking (12) a user's use of voice dialogue states or transitions and progressively offering (16) a user more efficient voice dialogue transitions or states, such as voice dialogue transitions or states having fewer and fewer words. The tracking of dialogue states or transitions can include tracking (14) of repeated use of the dialogue states or transitions. A user can be prompted to create a new transition or state. The prompting (18) and confirmation and verification (20) by the user of a new transition or state can be done using SCXML language. The method can further include instantiating (21) the new transition or state with voice tags or words and performing (22) speech recognition using the new transition or state. The method can again determine (23) if the new transition or state is a repeat transition or state.
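
Illustrative sketch, not taken from the patent: one way the tracking and shortcut steps described in this abstract could look in Python. The class name `TransitionTracker`, the `repeat_threshold` parameter, and the example menu path are all assumptions; the abstract specifies neither a threshold nor a data structure, and the SCXML prompting step is omitted.

```python
from collections import Counter


class TransitionTracker:
    """Counts how often a user follows each path of dialogue states."""

    def __init__(self, repeat_threshold=3):
        # repeat_threshold is an assumed tuning knob, not from the abstract.
        self.repeat_threshold = repeat_threshold
        self.counts = Counter()
        self.shortcuts = {}  # voice tag -> full path of dialogue states

    def record(self, path):
        """Record one traversal; return True once the path counts as repeated."""
        key = tuple(path)
        self.counts[key] += 1
        return self.counts[key] >= self.repeat_threshold

    def create_shortcut(self, voice_tag, path):
        """Store a new, shorter transition after the user has confirmed it."""
        self.shortcuts[voice_tag] = tuple(path)

    def resolve(self, utterance):
        """Map a recognized utterance to a stored path, or None if unknown."""
        return self.shortcuts.get(utterance)


tracker = TransitionTracker(repeat_threshold=3)
path = ["main menu", "messages", "voicemail", "play new"]
offers = [tracker.record(path) for _ in range(3)]
if offers[-1]:
    # In a real dialogue the user would be prompted and asked to confirm here.
    tracker.create_shortcut("new voicemail", path)
```

After the third traversal the tracker reports the path as repeated, and the phrase "new voicemail" then resolves directly to the four-step path.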

3. Invention patent application

US20080207125A1 Method and Apparatus to Facilitate Conforming a Wireless Personal Communications Device to a Local Social Standard (pending, published)
Publication No.: US20080207125A1
Publication date: 2008-08-28
Application No.: US11679704
Filing date: 2007-02-27
Applicant(s): Yuan-Jun Wei, Steven W. Albrecht, Changxue C. Ma
Inventor(s): Yuan-Jun Wei, Steven W. Albrecht, Changxue C. Ma
IPC classification: H04B7/00, H04B7/24, H04M1/00
CPC classification: H04M19/04, H04M1/72563, H04M19/045, H04W4/021, H04W4/029, H04W4/42, H04W28/18
Abstract: A wireless transmitter (201) transmits (102) a message intended for at least one wireless personal communications device (202). That message comprises content (203) configured and arranged to at least attempt to prompt a particular operability configuration for the wireless personal communications device that conforms to social standards as correspond to a given local venue (204). Such content can vary with the application setting with some relevant examples comprising, but not being limited to, information indicative of a degree to which the operability configuration comprises a required operability configuration (as versus a voluntary or merely suggested configuration), information indicative of at least one particular capability of the wireless personal communication device to which the operability configuration pertains, and/or information corresponding to a time frame during which the operability configuration is applicable, to note but a few.
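
Illustrative sketch, not taken from the patent: a possible Python shape for the three kinds of message content this abstract lists (required versus suggested, the capability concerned, and the applicable time frame). The `VenueMessage` fields, the `apply_message` helper, and the dictionary-based device configuration are hypothetical.

```python
from dataclasses import dataclass
from datetime import time


@dataclass
class VenueMessage:
    capability: str  # device capability the configuration pertains to, e.g. "ringer"
    setting: str     # requested setting, e.g. "silent"
    required: bool   # True if required, False if merely suggested
    start: time      # start of the time frame in which the message applies
    end: time        # end of that time frame


def apply_message(config, msg, now, user_accepts_suggestions=False):
    """Return a new device config that honours msg if it applies at `now`."""
    in_window = msg.start <= now <= msg.end
    if in_window and (msg.required or user_accepts_suggestions):
        return {**config, msg.capability: msg.setting}
    return dict(config)


config = {"ringer": "loud", "camera": "enabled"}
theatre = VenueMessage("ringer", "silent", True, time(19, 0), time(22, 0))
during_show = apply_message(config, theatre, time(20, 0))
after_show = apply_message(config, theatre, time(23, 0))
```

A suggested (non-required) message only changes the configuration when the user has opted in, which is one plausible reading of the "degree" information.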

4. Granted invention patent

US07584099B2 Method and system for interpreting verbal inputs in multimodal dialog system (in force)
Publication No.: US07584099B2
Publication date: 2009-09-01
Application No.: US11100185
Filing date: 2005-04-06
Applicant(s): Changxue C. Ma, Harry M. Bliss, Yan M. Cheng
Inventor(s): Changxue C. Ma, Harry M. Bliss, Yan M. Cheng
IPC classification: G10L15/00
CPC classification: G06F17/279, G06F3/038, G06F2203/0381, G10L15/1815
Abstract: A method, a system and a computer program product for interpreting a verbal input in a multimodal dialog system are provided. The method includes assigning (302) a confidence value to at least one word generated by a verbal recognition component. The method further includes generating (304) a semantic unit confidence score for the verbal input. The generation of a semantic unit confidence score is based on the confidence value of at least one word and at least one semantic confidence operator.
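
Illustrative sketch, not taken from the patent: combining per-word confidence values into a semantic unit confidence score. The abstract does not say which semantic confidence operators are used, so the three operators here (`min`, `mean`, `product`) and the example words are assumptions.

```python
import math

# Hypothetical semantic confidence operators; the patent may define others.
OPERATORS = {
    "min": min,
    "mean": lambda values: sum(values) / len(values),
    "product": math.prod,
}


def semantic_unit_confidence(word_confidences, operator="min"):
    """Combine word-level confidences (0..1) into one score for a semantic unit."""
    if not word_confidences:
        raise ValueError("a semantic unit needs at least one word confidence")
    return OPERATORS[operator](word_confidences)


# Word confidences as a recognizer might assign them to "call john mobile".
recognized = [("call", 0.9), ("john", 0.6), ("mobile", 0.8)]
confidences = [conf for _, conf in recognized]

score_min = semantic_unit_confidence(confidences, "min")
score_mean = semantic_unit_confidence(confidences, "mean")
score_product = semantic_unit_confidence(confidences, "product")
```

The `min` operator is the most conservative choice (one doubtful word drags the whole unit down), while `mean` tolerates a single weak word.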

5. Granted invention patent

US07533018B2 Tailored speaker-independent voice recognition system (in force)
Publication No.: US07533018B2
Publication date: 2009-05-12
Application No.: US10967957
Filing date: 2004-10-19
Applicant(s): Changxue C. Ma, Yan M. Cheng
Inventor(s): Changxue C. Ma, Yan M. Cheng
IPC classification: G10L15/06, G10L15/00
CPC classification: G10L15/063, G10L2015/0631
Abstract: A tailored speaker-independent voice recognition system has a speech recognition dictionary (360) with at least one word (371). That word (371) has at least two transcriptions (373), each transcription (373) having a probability factor (375) and an indicator (377) of whether the transcription is active. When a speech utterance is received (510), the voice recognition system determines (520, 530) the word signified by the speech utterance, evaluates (540) the speech utterance against the transcriptions of the correct word, updates (550) the probability factors for each transcription, and inactivates (570) any transcription that has an updated probability factor that is less than a threshold.
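
Illustrative sketch, not taken from the patent: a dictionary entry whose transcriptions carry a probability factor and an active indicator, updated after each utterance. The exponential-moving-average update rule, the `rate` and `threshold` values, and the example transcriptions are assumptions; the abstract only says that the factors are updated and that transcriptions falling below a threshold are inactivated.

```python
from dataclasses import dataclass


@dataclass
class Transcription:
    phonemes: str
    probability: float
    active: bool = True


def update_word(transcriptions, match_scores, rate=0.5, threshold=0.25):
    """Update probability factors from one utterance and inactivate weak entries.

    match_scores holds one non-negative score per transcription, saying how
    well the utterance matched it (assumed to come from the recognizer).
    """
    total = sum(match_scores)
    if total <= 0:
        return  # nothing matched; leave the dictionary entry unchanged
    for entry, score in zip(transcriptions, match_scores):
        observed = score / total
        # Assumed rule: blend the old factor with the newly observed share.
        entry.probability = (1 - rate) * entry.probability + rate * observed
        if entry.probability < threshold:
            entry.active = False


tomato = [
    Transcription("t ah m ey t ow", 0.5),
    Transcription("t ah m aa t ow", 0.5),
]
update_word(tomato, [0.9, 0.1])
active_after_first = [entry.active for entry in tomato]
update_word(tomato, [0.9, 0.1])
active_after_second = [entry.active for entry in tomato]
```

With these assumed numbers the second transcription survives one utterance but is inactivated after the second, tailoring the dictionary to how this speaker actually says the word.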

6. Invention patent application

US20080130699A1 CONTENT SELECTION USING SPEECH RECOGNITION (pending, published)
Publication No.: US20080130699A1
Publication date: 2008-06-05
Application No.: US11566832
Filing date: 2006-12-05
Applicant(s): Changxue C. Ma, Yan M. Cheng
Inventor(s): Changxue C. Ma, Yan M. Cheng
IPC classification: H01S5/00
CPC classification: G06F16/433
Abstract: Disclosed are a method and wireless device for selecting a content file using speech recognition. The method includes establishing a set of tagged text items wherein each tagged text item is uniquely associated with one content file of the set of content files. At least one audible utterance (226) is received (804) from a user. A phoneme lattice (302) is generated (808) based on the audible utterance (226). A phoneme lattice statistical model is generated (810) based on the phoneme lattice (302). A score is assigned (1008) to the tagged text items based on probabilistic estimates in the phoneme lattice statistical model. A list of high scoring tagged text items is presented (1014) so that a selection of a content file may be made. A word lattice (402) and a word lattice statistical model are also used in some embodiments.
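
Illustrative sketch, not taken from the patent: scoring tagged text items against a statistical model built from a phoneme lattice. The lattice is simplified here to a list of weighted phoneme paths, and the model is a bigram model with additive smoothing; both choices, along with the function names and the toy data, are assumptions standing in for whatever model the patent actually describes.

```python
import math
from collections import Counter


def bigram_model(lattice_paths):
    """Accumulate weighted phoneme bigram counts over the lattice paths."""
    bigrams, contexts = Counter(), Counter()
    for phonemes, weight in lattice_paths:
        padded = ["<s>"] + phonemes
        for prev, cur in zip(padded, padded[1:]):
            bigrams[(prev, cur)] += weight
            contexts[prev] += weight
    return bigrams, contexts


def score_item(item_phonemes, model, vocab_size, alpha=0.1):
    """Length-normalized log-probability of an item under the bigram model."""
    bigrams, contexts = model
    padded = ["<s>"] + item_phonemes
    log_prob = 0.0
    for prev, cur in zip(padded, padded[1:]):
        # Additive smoothing keeps unseen bigrams from scoring log(0).
        prob = (bigrams[(prev, cur)] + alpha) / (contexts[prev] + alpha * vocab_size)
        log_prob += math.log(prob)
    return log_prob / len(item_phonemes)


# Two competing paths through a toy lattice for one utterance.
lattice_paths = [(["h", "eh", "l", "ow"], 0.7), (["y", "eh", "l", "ow"], 0.3)]
# Tagged text items (already converted to phonemes), one per content file.
tagged_items = {
    "hello.mp3": ["h", "eh", "l", "ow"],
    "yellow.mp3": ["y", "eh", "l", "ow"],
    "goodbye.mp3": ["g", "uh", "d", "b", "ay"],
}
vocab = {p for phonemes in tagged_items.values() for p in phonemes}
model = bigram_model(lattice_paths)
ranked = sorted(
    tagged_items,
    key=lambda name: score_item(tagged_items[name], model, len(vocab)),
    reverse=True,
)
```

The ranked list is what would be presented to the user, with the best-scoring content file first.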

7. Invention patent application

US20080102817A1 METHOD AND SYSTEM FOR SHARING CELLULAR PHONES (pending, published)
Publication No.: US20080102817A1
Publication date: 2008-05-01
Application No.: US11553679
Filing date: 2006-10-27
Applicant(s): Daryoosh Shenassa, Changxue C. Ma, Deborah A. Matteo
Inventor(s): Daryoosh Shenassa, Changxue C. Ma, Deborah A. Matteo
IPC classification: H04Q7/20
CPC classification: H04L67/322, H04L45/00, H04L65/4061
Abstract: A method (40) and system (10 or 200) for sharing a cellular phone includes sending (41) a request to use a second cellular phone (12) as a server from a first cellular phone (11), exchanging (43) audio streams between the cellular phones, receiving (44) a dialing signal at the first cellular phone from the second cellular phone and forming (45) a call connection between the first cellular phone and a third party (13) via the second cellular phone. The step of sending the request can include sending an SMS message, sending a phone number, or sending a push-to-share request for nearby cellular phones having stronger signal strength. The push-to-share request can be a Bluetooth search of nearby cellular phones having stronger signal strength for their cellular network connection. The method can also include automatically (42) sending the push-to-share request upon detection of a signal strength below a predetermined threshold.
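
Illustrative sketch, not taken from the patent: the automatic push-to-share decision described at the end of this abstract. The function name, the dBm units, the `-105.0` threshold, and the rule of picking the single strongest neighbour are assumptions; the Bluetooth discovery itself is not modelled.

```python
def pick_server_phone(own_dbm, nearby_dbm, threshold_dbm=-105.0):
    """Return the nearby phone to ask for sharing, or None if no request is needed.

    nearby_dbm maps a phone name to the cellular signal strength it reports.
    """
    if own_dbm >= threshold_dbm:
        return None  # own signal is still acceptable, so no request is sent
    stronger = {name: dbm for name, dbm in nearby_dbm.items() if dbm > own_dbm}
    if not stronger:
        return None  # nobody nearby has a better connection
    return max(stronger, key=stronger.get)


nearby = {"phone B": -85.0, "phone C": -98.0, "phone D": -112.0}
choice_weak = pick_server_phone(-110.0, nearby)  # below the threshold
choice_ok = pick_server_phone(-90.0, nearby)     # above the threshold
```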

8. Invention patent application

US20080162472A1 METHOD AND APPARATUS FOR VOICE SEARCHING IN A MOBILE COMMUNICATION DEVICE (pending, published)
Publication No.: US20080162472A1
Publication date: 2008-07-03
Application No.: US11617134
Filing date: 2006-12-28
Applicant(s): Yan Ming Cheng, Changxue C. Ma, Theodore Mazurkiewicz, Paul C. Davis
Inventor(s): Yan Ming Cheng, Changxue C. Ma, Theodore Mazurkiewicz, Paul C. Davis
IPC classification: G06F17/30, G06F3/048
CPC classification: G06F3/16, G06F16/24522, G10L15/26, H04M1/271
Abstract: A method and apparatus for performing a voice search in a mobile communication device is disclosed. The method may include receiving a search query from a user of the mobile communication device, converting speech parts in the search query into linguistic representations, comparing the query linguistic representations to the linguistic representations of all items in the voice search database to find matches, wherein the voice search database has indexed all items that are associated with the device, displaying the matches to the user, receiving the user's selection from the displayed matches, and retrieving and executing the user's selection.
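
Illustrative sketch, not taken from the patent: comparing a query's linguistic representation with those of indexed device items. Phoneme lists stand in for the "linguistic representations", and Python's `difflib.SequenceMatcher` stands in for the comparison step; the `search` function, the `min_ratio` cut-off, and the sample index are assumptions.

```python
from difflib import SequenceMatcher


def search(query_phonemes, index, top_n=3, min_ratio=0.5):
    """Return up to top_n item names whose phonemes resemble the query."""
    matches = []
    for item, phonemes in index.items():
        ratio = SequenceMatcher(None, query_phonemes, phonemes).ratio()
        if ratio >= min_ratio:
            matches.append((ratio, item))
    # Best match first; ties are broken alphabetically for stable output.
    matches.sort(key=lambda match: (-match[0], match[1]))
    return [item for _, item in matches[:top_n]]


# Hypothetical voice search database covering different kinds of device items.
index = {
    "contact: John Smith": ["jh", "aa", "n", "s", "m", "ih", "th"],
    "song: Jump": ["jh", "ah", "m", "p"],
    "app: Calendar": ["k", "ae", "l", "ah", "n", "d", "er"],
}
# The final phoneme is deliberately misrecognized ("t" instead of "th").
query = ["jh", "aa", "n", "s", "m", "ih", "t"]
results = search(query, index)
```

The matches returned by `search` correspond to the list displayed to the user, who then picks the item to retrieve and execute.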

9. Invention patent application

US20080162125A1 METHOD AND APPARATUS FOR LANGUAGE INDEPENDENT VOICE INDEXING AND SEARCHING (pending, published)
Publication No.: US20080162125A1
Publication date: 2008-07-03
Application No.: US11617265
Filing date: 2006-12-28
Applicant(s): Changxue C. Ma, Feipeng Li
Inventor(s): Changxue C. Ma, Feipeng Li
IPC classification: G10L19/12
CPC classification: G10L15/02, G06F16/632, G06F16/685, G10L2015/025
Abstract: A method and apparatus for language independent voice searching in a mobile communication device is disclosed. The method may include receiving a search query from a user of the mobile communication device, converting speech parts in the search query into linguistic representations which cover at least one language, generating a search phoneme lattice based on the linguistic representations, extracting query features from the search phoneme lattice, generating query feature vectors based on the extracted features, performing a coarse search using the query feature vectors and the indexing feature vectors from the indexing database, performing a fine search using the results of the coarse search and the indexing phoneme lattices stored in the indexing database, and outputting the fine search results to a dialog manager.
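
Illustrative sketch, not taken from the patent: the two-stage (coarse, then fine) search described in this abstract. Bag-of-n-gram counts with cosine similarity stand in for the feature vectors, a single phoneme sequence stands in for each phoneme lattice, and `difflib.SequenceMatcher` stands in for the fine lattice comparison; all names and data are assumptions.

```python
import math
from collections import Counter
from difflib import SequenceMatcher


def features(phonemes):
    """Feature vector: counts of phoneme unigrams and bigrams."""
    return Counter(phonemes) + Counter(zip(phonemes, phonemes[1:]))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[key] * b[key] for key in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def coarse_to_fine(query, indexing_db, shortlist=2):
    """Coarse search on feature vectors, then fine search on the shortlist."""
    query_vec = features(query)
    coarse = sorted(
        indexing_db,
        key=lambda name: cosine(query_vec, indexing_db[name][0]),
        reverse=True,
    )[:shortlist]
    # The fine stage only inspects the few candidates the coarse stage kept.
    return sorted(
        coarse,
        key=lambda name: SequenceMatcher(None, query, indexing_db[name][1]).ratio(),
        reverse=True,
    )


recordings = {
    "memo 1": ["n", "iy", "h", "aw"],
    "memo 2": ["b", "ow", "n", "zh", "ur"],
    "memo 3": ["h", "ow", "l", "aa"],
}
# Index time: store the feature vector next to the phoneme sequence.
indexing_db = {name: (features(p), p) for name, p in recordings.items()}
results = coarse_to_fine(["n", "i", "h", "aw"], indexing_db)
```

The design point is cost: the cheap vector comparison runs over the whole index, while the more expensive sequence comparison runs only on the shortlist.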
