Quick Patent Search - Fast search of global patents, free patent database for commercial use - IPRDB

1. Granted invention patent

US06950796B2 Speech recognition by dynamical noise model adaptation [Status: Active]
Publication number: US06950796B2
Publication date: 2005-09-27
Application number: US10007886
Filing date: 2001-11-05
Applicants: Changxue Ma, Yuan-Jun Wei
Inventors: Changxue Ma, Yuan-Jun Wei
IPC classification: G10L15/12, G10L15/14, G10L15/20, G10L21/0216, G10L15/06, G10L21/02
CPC classification: G10L15/20, G10L2021/02168
Abstract: The invention provides a Hidden Markov Model (132) based automated speech recognition system (100) that dynamically adapts to changing background noise by detecting long pauses in speech, and for each pause processing background noise during the pause to extract a feature vector that characterizes the background noise, identifying a Gaussian mixture component of noise states that most closely matches the extracted feature vector, and updating the mean of the identified Gaussian mixture component so that it more closely matches the extracted feature vector, and consequently more closely matches the current noise environment. Alternatively, the process is also applied to refine the Gaussian mixtures associated with other emitting states of the Hidden Markov Model.
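The adaptation step described in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented method: the abstract does not say how "most closely matches" is measured or how far the mean is moved, so the Euclidean distance, the interpolation factor `rate`, and the function names are all hypothetical.

```python
import math

def closest_component(means, feature):
    """Return the index of the component mean nearest to the feature vector.

    Euclidean distance is an assumption; the abstract only says
    "most closely matches".
    """
    return min(range(len(means)), key=lambda i: math.dist(means[i], feature))

def adapt_noise_mean(means, feature, rate=0.2):
    """Move the closest component mean a fraction `rate` toward the feature.

    `means` is a list of mean vectors (one per Gaussian mixture component of
    the noise states) and is updated in place. `rate` is a hypothetical
    adaptation factor between 0 and 1.
    """
    k = closest_component(means, feature)
    means[k] = [m + rate * (f - m) for m, f in zip(means[k], feature)]
    return k
```

Calling `adapt_noise_mean` once per detected pause, with the feature vector extracted from that pause, nudges only the best-matching component and leaves the others unchanged.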

2. Invention application

US20060247921A1 Speech dialog method and system [Status: Active]
Publication number: US20060247921A1
Publication date: 2006-11-02
Application number: US11118670
Filing date: 2005-04-29
Applicants: Changxue Ma, Yan Cheng, Chen Liu, Ted Mazurkiewicz, Steven Nowlan, James Talley, Yuan-Jun Wei
Inventors: Changxue Ma, Yan Cheng, Chen Liu, Ted Mazurkiewicz, Steven Nowlan, James Talley, Yuan-Jun Wei
IPC classification: G10L11/04
CPC classification: G10L17/26, G10L13/033, G10L15/22
Abstract: An electronic device (300) for speech dialog includes functions that receive (305, 105) a speech phrase that comprises a request phrase that includes an instantiated variable (215), generate (335, 115) pitch and voicing characteristics (315) of the instantiated variable, and performs voice recognition (319, 125) of the instantiated variable to determine a most likely set of acoustic states (235). The electronic device may generate (335, 140) a synthesized value of the instantiated variable using the most likely set of acoustic states and the pitch and voicing characteristics of the instantiated variable. The electronic device may use a table of previously entered values of variables that have been determined to be unique, and in which the values are associated with a most likely set of acoustic states and the pitch and voicing characteristics determined at the receipt of each value to disambiguate (425, 430) a newly received instantiated variable.

3. Invention application

US20090259469A1 METHOD AND APPARATUS FOR SPEECH RECOGNITION [Status: Pending (published)]
Publication number: US20090259469A1
Publication date: 2009-10-15
Application number: US12102141
Filing date: 2008-04-14
Applicants: Changxue Ma, Yuan-Jun Wei
Inventors: Changxue Ma, Yuan-Jun Wei
IPC classification: G10L15/00, G10L15/02, G10L15/06
CPC classification: G10L15/02, G10L15/142
Abstract: A method and apparatus for performing speech recognition receives an audio signal, generates a sequence of frames of the audio signal, transforms each frame of the audio signal into a set of narrow band feature vectors using a narrow passband, couples the narrow band feature vectors to a speech model, and determines whether the audio signal is a wide band signal. When the audio signal is determined to be a wide band signal, a pass band parameter of each of one or more passbands that are outside the narrow passband is generated for each frame and the one or more band energy parameters are coupled to the speech model.

4. Granted invention patent

US08041700B2 Content item retrieval based on a free text entry [Status: Lapsed]
Publication number: US08041700B2
Publication date: 2011-10-18
Application number: US12419341
Filing date: 2009-04-07
Applicant: Changxue Ma
Inventor: Changxue Ma
IPC classification: G06F7/00, G06F17/30
CPC classification: G06F17/30675
Abstract: A method and apparatus for textual searching of a database is provided herein. During operation a user will input a letter into a search engine. The search engine will score words based on the letter and display results of the highest-scored words. Another letter will again be received and the process repeated. In situations where titles are returned to the user, additional steps of associating the words with a title and scoring the title take place. The highest-scored titles are provided to the user as the displayed results.
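The letter-by-letter loop in this abstract (score words, associate them with titles, show the highest-scored titles, repeat on the next letter) might look something like the sketch below. The abstract does not define the scoring function, so the prefix-based `score_word`, the "best word wins" title score, and all names here are assumptions made for illustration.

```python
def score_word(word, typed):
    """Hypothetical score: fraction of the typed letters that match the
    start of the word, compared case-insensitively."""
    word, typed = word.lower(), typed.lower()
    matched = 0
    for a, b in zip(word, typed):
        if a != b:
            break
        matched += 1
    return matched / len(typed) if typed else 0.0

def rank_titles(titles, typed, top_n=3):
    """Score each title by its best-matching word and return the top titles.

    Titles with a zero score are dropped; ties are broken alphabetically.
    """
    scored = []
    for title in titles:
        best = max((score_word(w, typed) for w in title.split()), default=0.0)
        scored.append((best, title))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [title for score, title in scored[:top_n] if score > 0]
```

Re-running `rank_titles` with the accumulated input after every keystroke reproduces the "receive another letter and repeat" behavior.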

5. Invention application

US20110071826A1 METHOD AND APPARATUS FOR ORDERING RESULTS OF A QUERY [Status: Pending (published)]
Publication number: US20110071826A1
Publication date: 2011-03-24
Application number: US12564968
Filing date: 2009-09-23
Applicants: Changxue Ma, Harry M. Bliss
Inventors: Changxue Ma, Harry M. Bliss
IPC classification: G10L15/26, G06F17/30
CPC classification: G10L15/083, G06F16/3343
Abstract: A method and apparatus for ordering results from a query is provided herein. During operation, a spoken query is received and converted to a textual representation, such as a word lattice. Search strings are then created from the word lattice. For example a set search strings may be created from the N-grams, such as unigrams and bigrams, of the word lattice. The search strings may be ordered and truncated based on confidence values assigned to the n-grams by the speech recognition system. The set of search strings are sent to at least one search engine, and search results are obtained. The search results are then re-arranged or reordered based on a semantic similarity between the search results and the word lattice.
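The step of building search strings from unigrams and bigrams, then ordering and truncating them by confidence, can be illustrated with a small sketch. It makes two simplifying assumptions that are not in the abstract: the word lattice is reduced to a single word sequence with one confidence value per word, and a bigram's confidence is taken as the product of its two word confidences. The function name and the `max_strings` parameter are hypothetical.

```python
def build_search_strings(words, max_strings=4):
    """Create unigram and bigram search strings ordered by confidence.

    `words` is a list of (word, confidence) pairs standing in for a word
    lattice. The result is truncated to the `max_strings` most confident
    strings.
    """
    grams = [(word, conf) for word, conf in words]
    grams += [
        (f"{w1} {w2}", c1 * c2)  # assumed bigram confidence: product
        for (w1, c1), (w2, c2) in zip(words, words[1:])
    ]
    grams.sort(key=lambda gram: -gram[1])  # most confident first
    return [text for text, _ in grams[:max_strings]]
```

For instance, `build_search_strings([("play", 0.9), ("jazz", 0.8), ("music", 0.5)], max_strings=3)` keeps the two most confident words and their bigram, and drops the low-confidence strings before anything is sent to a search engine.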

6. Invention application

US20090210226A1 Method and Apparatus for Voice Searching for Stored Content Using Uniterm Discovery [Status: Active]
Publication number: US20090210226A1
Publication date: 2009-08-20
Application number: US12032258
Filing date: 2008-02-15
Applicant: Changxue Ma
Inventor: Changxue Ma
IPC classification: G10L15/08
CPC classification: G10L15/26, G10L2015/025, G10L2015/088
Abstract: A method, system and communication device for enabling voice-to-voice searching and ordered content retrieval via audio tags assigned to individual content, which tags generate uniterms that are matched against components of a voice query. The method includes storing content and tagging at least one of the content with an audio tag. The method further includes receiving a voice query to retrieve content stored on the device. When the voice query is received, the method completes a voice-to-voice search utilizing uniterms of the audio tag, scored against the phoneme latent lattice model generated by the voice query to identify matching terms within the audio tags and corresponding stored content. The retrieved content(s) associated with the identified audio tags having uniterms that score within the phoneme lattice model are outputted in an order corresponding to an order in which the uniterms are structured within the voice query.

7. Invention application

US20090089059A1 METHOD AND APPARATUS FOR ENABLING MULTIMODAL TAGS IN A COMMUNICATION DEVICE [Status: Active]
Publication number: US20090089059A1
Publication date: 2009-04-02
Application number: US11863763
Filing date: 2007-09-28
Applicants: Changxue Ma, Harry M. Bliss
Inventors: Changxue Ma, Harry M. Bliss
IPC classification: G10L15/04
CPC classification: G10L15/24, G06F3/016, G06F3/0481, G10L15/06, G10L15/187, H04M1/72538, H04M1/72547, H04M2250/12, H04M2250/22, H04M2250/74
Abstract: A method and apparatus for enabling multimodal tags in a communication device is disclosed. The method comprises receiving a first training signal and receiving a second training signal in conjunction with the first training signal. A multimodal tag is created to represent a combination of the first training signal and the second training signal and a function is associated with the created multimodal tag.

8. Invention application

US20090066541A1 System and Method for Enabling a Mobile Device as a Portable Character Input Peripheral Device [Status: Active]
Publication number: US20090066541A1
Publication date: 2009-03-12
Application number: US11853912
Filing date: 2007-09-12
Applicants: Changxue Ma, Wei Lin, Li-Xin Zhen
Inventors: Changxue Ma, Wei Lin, Li-Xin Zhen
IPC classification: G06F3/023
CPC classification: G06F3/0231, G06F3/0237
Abstract: A portable electronic communication device, designed for voice and data communication is utilized as a peripheral input device for transmitting/providing character inputs, entered in the first device's touch input mechanism, to a second electronic device. The first device has a mode switching utility that switches the first device between a first standard communication mode and a second peripheral input device mode. When the first device is in the second peripheral input device mode, the first device operates as a peripheral input device for the second device. A character input recognition utility executes on the first device to provide the functions of: detecting an input on the touch screen input mechanism; generating an electronic representation of the input; establishing a communication link between the second communication transmitter and an identified second device; and forwarding the electronic representation of the character input to the communication transmitter for transmission to the identified second device.

9. Invention application

US20070239455A1 METHOD AND SYSTEM FOR MANAGING PRONUNCIATION DICTIONARIES IN A SPEECH APPLICATION [Status: Pending (published)]
Publication number: US20070239455A1
Publication date: 2007-10-11
Application number: US11278983
Filing date: 2006-04-07
Applicants: Michael Groble, Changxue Ma
Inventors: Michael Groble, Changxue Ma
IPC classification: G10L13/08
CPC classification: G10L15/187, G10L13/08
Abstract: A voice toolkit (100) and a method (700) for managing pronunciation dictionaries are provided. The visual toolkit can include a user-interface (110) for entering in a text and a corresponding spoken utterance, a text-to-speech system (120) for synthesizing a pronunciation from the text, a talking speech recognizer (132) for generating pronunciations of the spoken utterance, and a voice processor (130) for validating at least one pronunciation. A developer can type a text of a word into the toolkit and listen to the pronunciation to determine whether the pronunciation is acceptable. If the pronunciation is incorrect the developer can speak the word for providing a spoken utterance having a correct pronunciation.

10. Granted invention patent

US07983428B2 Noise reduction on wireless headset input via dual channel calibration within mobile phone [Status: Active]
Publication number: US07983428B2
Publication date: 2011-07-19
Application number: US11746455
Filing date: 2007-05-09
Applicants: Changxue Ma, Chen Liu
Inventors: Changxue Ma, Chen Liu
IPC classification: H04B15/00
CPC classification: H04M9/082, H04M1/6066
Abstract: A communication device includes: (1) a wireless adapter at which a wireless headset is communicatively connected to the communication device and at which is received a first acoustic input that includes a speech input and a first ambient noise input; (2) a microphone that receives a second acoustic input, which includes a second ambient noise input; and (3) a dual-channel adaptive noise canceller that utilizes the second ambient noise input to filter the first ambient noise input out of the first acoustic input to generate an acoustic output that primarily comprises the speech input.
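A dual-channel adaptive noise canceller of the general kind named in this abstract is often built around an LMS (least mean squares) filter. The abstract does not say which adaptation algorithm the patent uses, so the sketch below swaps in plain LMS purely as an illustration; the function name, the tap count `taps`, and the step size `mu` are assumptions.

```python
def lms_noise_cancel(primary, reference, taps=4, mu=0.05):
    """Subtract an adaptive estimate of the noise from the primary channel.

    primary:   samples of the first acoustic input (speech plus noise),
               e.g. from the headset.
    reference: samples of the second acoustic input (noise only),
               e.g. from the phone microphone.
    Returns the error signal, which approximates the speech once the
    filter weights have adapted.
    """
    weights = [0.0] * taps
    history = [0.0] * taps  # most recent reference samples, newest first
    output = []
    for d, x in zip(primary, reference):
        history = [x] + history[:-1]
        noise_estimate = sum(w * h for w, h in zip(weights, history))
        error = d - noise_estimate
        # LMS update: step each weight along the error gradient.
        weights = [w + mu * error * h for w, h in zip(weights, history)]
        output.append(error)
    return output
```

With `mu` chosen small relative to the reference signal power, the output converges toward the speech component because the speech is uncorrelated with the reference noise and cannot be predicted from it.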
