会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Speech dialog method and system
    • 语音对话方法和系统
    • US07181397B2
    • 2007-02-20
    • US11118670
    • 2005-04-29
    • Changxue C. MaYan M. ChengChen LiuTed MazurkiewiczSteven J. NowlanJames R. TalleyYuan-Jun Wei
    • Changxue C. MaYan M. ChengChen LiuTed MazurkiewiczSteven J. NowlanJames R. TalleyYuan-Jun Wei
    • G10L15/14
    • G10L17/26G10L13/033G10L15/22
    • An electronic device (300) for speech dialog includes functions that receive (305, 105) a speech phrase that comprises a request phrase that includes an instantiated variable (215), generate (335, 115) pitch and voicing characteristics (315) of the instantiated variable, and performs speech recognition (319, 125) of the instantiated variable to determine a most likely set of acoustic states (235). The electronic device may generate (335, 140) a synthesized value of the instantiated variable using the most likely set of acoustic states and the pitch and voicing characteristics of the instantiated variable. The electronic device may use a table of previously entered values of variables that have been determined to be unique, and in which the values are associated with a most likely set of acoustic states and the pitch and voicing characteristics determined at the receipt of each value to disambiguate (425, 430) a newly received instantiated variable.
    • 一种用于语音对话的电子设备(300)包括接收(305,105)语音短语的功能,该语音短语包括包含实例化变量(215)的请求短语,生成(335,115)音调和语音特征(315) 并且执行所述实例化变量的语音识别(319,125)以确定最可能的一组声学状态(235)。 电子设备可以使用最可能的声学状态集合和实例化变量的音调和语音特征来生成(335,140)实例化变量的合成值。 电子设备可以使用已经被确定为唯一的先前输入的变量值的表,并且其中值与最可能的一组声学状态相关联,并且在接收每个值时确定的音高和发声特性 消除歧义(425,430)一个新接收的实例变量。
    • 2. 发明申请
    • METHOD AND SYSTEM FOR PERSONALIZED VOICE DIALOGUE
    • 用于个性化语音对话的方法和系统
    • US20080080678A1
    • 2008-04-03
    • US11536854
    • 2006-09-29
    • Changxue C. MaYan Ming ChengSteven J. NowlanDale W. RussellYuan-Jun Wei
    • Changxue C. MaYan Ming ChengSteven J. NowlanDale W. RussellYuan-Jun Wei
    • H04M11/00
    • H04M3/4936G10L2015/226
    • A method (10) and system (200) for personalized voice dialogue can include tracking (12) a user's use of voice dialogue states or transitions and progressively offering (16) a user more efficient voice dialogue transitions or states such as voice dialogue transition or states having fewer and fewer words. The tracking of dialog states or transitions can include tracking (14) of repeated use of the dialogue states or transitions. A user can be prompted to create a new transition or state. The prompting (18) and confirmation and verification (20) by the user of a new transition or state can be done using SCXML language. The method can further include instantiating (21) the new transition or state with voice tags or words and performing (22) speech recognition using the new transition or state. The method can again determine (23) if the new transition or state is a repeat transition or state.
    • 用于个性化语音对话的方法(10)和系统(200)可以包括跟踪(12)用户对语音对话状态或转换的使用,并逐渐提供(16)用户更有效的语音对话转换或状态,例如语音对话转换或 状态越来越少的单词。 跟踪对话状态或转换可以包括跟踪(14)重复使用对话状态或转换。 可以提示用户创建新的转换或状态。 用户可以使用SCXML语言完成新的转换或状态的提示(18)和确认(20)。 该方法还可以包括使用语音标签或单词实例化(21)新的转换或状态,并使用新的转换或状态执行(22)语音识别。 该方法可以再次确定(23)如果新的转换或状态是重复转换或状态。
    • 5. 发明授权
    • Tailored speaker-independent voice recognition system
    • 量身定制的与扬声器无关的语音识别系统
    • US07533018B2
    • 2009-05-12
    • US10967957
    • 2004-10-19
    • Changxue C. MaYan M. Cheng
    • Changxue C. MaYan M. Cheng
    • G10L15/06G10L15/00
    • G10L15/063G10L2015/0631
    • A tailored speaker-independent voice recognition system has a speech recognition dictionary (360) with at least one word (371). That word (371) has at least two transcriptions (373), each transcription (373) having a probability factor (375) and an indicator (377) of whether the transcription is active. When a speech utterance is received (510), the voice recognition system determines (520, 530) the word signified by the speech utterance, evaluates (540) the speech utterance against the transcriptions of the correct word, updates (550) the probability factors for each transcription, and inactivates (570) any transcription that has an updated probability factor that is less than a threshold.
    • 定制的与扬声器无关的语音识别系统具有至少一个单词(371)的语音识别词典(360)。 该字(371)具有至少两个转录(373),每个转录(373)具有概率因子(375)和指示符(377)是否转录是活性的。 当接收到语音话语(510)时,语音识别系统确定(520,530)由语音发音表示的单词,根据正确单词的转录评估(540)语音发音,更新(550)概率因子 对于每个转录,并使(570)任何具有小于阈值的更新概率因子的转录失活。
    • 6. 发明申请
    • CONTENT SELECTION USING SPEECH RECOGNITION
    • 使用语音识别的内容选择
    • US20080130699A1
    • 2008-06-05
    • US11566832
    • 2006-12-05
    • Changxue C. MaYan M. Cheng
    • Changxue C. MaYan M. Cheng
    • H01S5/00
    • G06F16/433
    • Disclosed are a method and wireless device for selecting a content file using speech recognition. The method includes establishing a set of tagged text items wherein each tagged text item is uniquely associated with one content file of the set of content files. At least one audible utterance (226) is received (804) from a user. A phoneme lattice (302) is generated (808) based on the audible utterance (226). A phoneme lattice statistical model is generated (810) based on the phoneme lattice (302). A score is assigned (1008) to the tagged text items based on probabilistic estimates in the phoneme lattice statistical model. A list of high scoring tagged text items is presented (1014) so that a selection of a content file may be made. A word lattice (402) and a word lattice statistical model are also used in some embodiments
    • 公开了一种使用语音识别来选择内容文件的方法和无线装置。 该方法包括建立一组标记的文本项目,其中每个标记的文本项目与该组内容文件的一个内容文件唯一地相关联。 从用户接收至少一个听觉话语(226)(804)。 基于可听话语(226)产生音素格(302)(808)。 基于音素格(302)生成音素格子统计模型(810)。 基于音素格子统计模型中的概率估计,将得分(1008)分配给带标签的文本项目。 呈现高得分标签文本的列表(1014),以便可以进行内容文件的选择。 在一些实施例中也使用字格(402)和字格统计模型
    • 7. 发明申请
    • METHOD AND SYSTEM FOR SHARING CELLULAR PHONES
    • 用于共享蜂窝电话的方法和系统
    • US20080102817A1
    • 2008-05-01
    • US11553679
    • 2006-10-27
    • Daryoosh ShenassaChangxue C. MaDeborah A. Matteo
    • Daryoosh ShenassaChangxue C. MaDeborah A. Matteo
    • H04Q7/20
    • H04L67/322H04L45/00H04L65/4061
    • A method (40) and system (10 or 200) for sharing a cellular phone includes sending (41) a request to use a second cellular phone (12) as a server from a first cellular phone (11), exchanging (43) audio streams between the cellulars phone, receiving (44) a dialing signal at the first cellular phone from the second cellular phone and forming (45) a call connection between the first cellular phone and a third party (13) via the second cellular phone. The step of sending the request can include sending an SMS message, sending a phone number, or sending a push-to-share request for nearby cellular phones having stronger signal strength. The push-to-share request can be a Bluetooth search of nearby cellular phones having stronger signal strength for their cellular network connection. The method can also include automatically (42) sending the push-to-share request upon detection of a signal strength below a predetermined threshold.
    • 用于共享蜂窝电话的方法(40)和系统(10或200)包括从第一蜂窝电话(11)发送(41)使用第二蜂窝电话(12)作为服务器的请求,交换(43)音频 在蜂窝电话之间流动,从第二蜂窝电话接收(44)在第一蜂窝电话处的拨号信号,并且经由第二蜂窝电话形成(45)第一蜂窝电话和第三方(13)之间的呼叫连接。 发送请求的步骤可以包括发送SMS消息,发送电话号码或发送具有更强信号强度的附近蜂窝电话的分组请求。 推送分享请求可以是对于其蜂窝网络连接具有更强信号强度的附近蜂窝电话的蓝牙搜索。 该方法还可以在检测到低于预定阈值的信号强度时,自动(42)发送分组请求。
    • 9. 发明申请
    • METHOD AND APPARATUS FOR LANGUAGE INDEPENDENT VOICE INDEXING AND SEARCHING
    • 语言独立语音索引和搜索的方法和装置
    • US20080162125A1
    • 2008-07-03
    • US11617265
    • 2006-12-28
    • Changxue C. MaFeipeng Li
    • Changxue C. MaFeipeng Li
    • G10L19/12
    • G10L15/02G06F16/632G06F16/685G10L2015/025
    • A method and apparatus for language independent voice searching in a mobile communication device is disclosed. The method may include receiving a search query from a user of the mobile communication device, converting speech parts in the search query into linguistic representations which covers at least one languages, generating a search phoneme lattice based on the linguistic representations, extracting query features from the search phoneme lattice, generating query feature vectors based on the extracted features, performing a coarse search using the query feature vectors and the indexing feature vectors from the indexing database, performing a fine search using the results of the coarse search and the indexing phoneme lattices stored in the indexing database, and outputting the fine search results to a dialog manager.
    • 公开了一种在移动通信设备中用于语言独立语音搜索的方法和装置。 该方法可以包括从移动通信设备的用户接收搜索查询,将搜索查询中的语音部分转换成涵盖至少一种语言的语言表示,基于语言表示生成搜索音素格,从 搜索音素格,基于提取的特征生成查询特征向量,使用查询特征向量和来自索引数据库的索引特征向量执行粗略搜索,使用存储的粗搜索和索引音素格的结果执行精细搜索 在索引数据库中,并将精细搜索结果输出到对话管理器。