会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Speech recognition by dynamical noise model adaptation
    • 动态噪声模型适应的语音识别
    • US06950796B2
    • 2005-09-27
    • US10007886
    • 2001-11-05
    • Changxue MaYuan-Jun Wei
    • Changxue MaYuan-Jun Wei
    • G10L15/12G10L15/14G10L15/20G10L21/0216G10L15/06G10L21/02
    • G10L15/20G10L2021/02168
    • The invention provides a Hidden Markov Model (132) based automated speech recognition system (100) that dynamically adapts to changing background noise by detecting long pauses in speech, and for each pause processing background noise during the pause to extract a feature vector that characterizes the background noise, identifying a Gaussian mixture component of noise states that most closely matches the extracted feature vector, and updating the mean of the identified Gaussian mixture component so that it more closely matches the extracted feature vector, and consequently more closely matches the current noise environment. Alternatively, the process is also applied to refine the Gaussian mixtures associated with other emitting states of the Hidden Markov Model.
    • 本发明提供了一种基于隐马尔可夫模型(132)的自动化语音识别系统(100),其通过检测语音中的长暂停动态地适应变化的背景噪声,并且对于暂停期间的每个暂停处理背景噪声来提取表征 背景噪声,识别与提取的特征向量最紧密匹配的噪声状态的高斯混合分量,以及更新所识别的高斯混合分量的平均值,使得其与提取的特征向量更紧密地匹配,并且因此更紧密地匹配当前噪声环境 。 或者,该过程也被应用于改进与隐马尔可夫模型的其他发射状态相关联的高斯混合。
    • 2. 发明申请
    • Speech dialog method and system
    • 语音对话方法和系统
    • US20060247921A1
    • 2006-11-02
    • US11118670
    • 2005-04-29
    • Changxue MaYan ChengChen LiuTed MazurkiewiczSteven NowlanJames TalleyYuan-Jun Wei
    • Changxue MaYan ChengChen LiuTed MazurkiewiczSteven NowlanJames TalleyYuan-Jun Wei
    • G10L11/04
    • G10L17/26G10L13/033G10L15/22
    • An electronic device (300) for speech dialog includes functions that receive (305, 105) a speech phrase that comprises a request phrase that includes an instantiated variable (215), generate (335, 115) pitch and voicing characteristics (315) of the instantiated variable, and performs voice recognition (319, 125) of the instantiated variable to determine a most likely set of acoustic states (235). The electronic device may generate (335, 140) a synthesized value of the instantiated variable using the most likely set of acoustic states and the pitch and voicing characteristics of the instantiated variable. The electronic device may use a table of previously entered values of variables that have been determined to be unique, and in which the values are associated with a most likely set of acoustic states and the pitch and voicing characteristics determined at the receipt of each value to disambiguate (425, 430) a newly received instantiated variable.
    • 一种用于语音对话的电子设备(300)包括接收(305,105)语音短语的功能,该语音短语包括包含实例化变量(215)的请求短语,产生(335,115)音调和语音特征(315) 并且执行所述实例化变量的语音识别(319,125)以确定最可能的一组声学状态(235)。 电子设备可以使用最可能的声学状态集合和实例化变量的音调和语音特征来生成(335,140)实例化变量的合成值。 电子设备可以使用已经被确定为唯一的先前输入的变量值的表,并且其中值与最可能的一组声学状态相关联,并且在接收每个值时确定的音高和发声特性 消除歧义(425,430)一个新接收的实例变量。
    • 4. 发明授权
    • Content item retrieval based on a free text entry
    • 基于自由文本输入的内容项检索
    • US08041700B2
    • 2011-10-18
    • US12419341
    • 2009-04-07
    • Changxue Ma
    • Changxue Ma
    • G06F7/00G06F17/30
    • G06F17/30675
    • A method and apparatus for textual searching of a database is provided herein. During operation a user will input a letter into a search engine. The search engine will score words based on the letter and display results of the highest-scored words. Another letter will again be received and the process repeated. In situations where titles are returned to the user, additional steps of associating the words with a title and scoring the title take place. The highest-scored titles are provided to the user as the displayed results.
    • 本文提供了一种用于文本搜索数据库的方法和装置。 在操作期间,用户将输入一个字母到搜索引擎。 搜索引擎将根据最高得分字的字母和显示结果对单词进行分数。 再次收到另一封信,重复过程。 在将标题返回给用户的情况下,会发生将单词与标题相关联并对标题进行评分的其他步骤。 作为显示结果,向用户提供最高分的标题。
    • 5. 发明申请
    • METHOD AND APPARATUS FOR ORDERING RESULTS OF A QUERY
    • 用于订购查询结果的方法和装置
    • US20110071826A1
    • 2011-03-24
    • US12564968
    • 2009-09-23
    • Changxue MaHarry M. Bliss
    • Changxue MaHarry M. Bliss
    • G10L15/26G06F17/30
    • G10L15/083G06F16/3343
    • A method and apparatus for ordering results from a query is provided herein. During operation, a spoken query is received and converted to a textual representation, such as a word lattice. Search strings are then created from the word lattice. For example a set search strings may be created from the N-grams, such as unigrams and bigrams, of the word lattice. The search strings may be ordered and truncated based on confidence values assigned to the n-grams by the speech recognition system. The set of search strings are sent to at least one search engine, and search results are obtained. The search results are then re-arranged or reordered based on a semantic similarity between the search results and the word lattice.
    • 本文提供了一种用于排序查询结果的方法和装置。 在操作期间,接收到口语查询并将其转换为文本表示,例如单词格。 搜索字符串然后从单词格中创建。 例如,可以从单词格的N克(例如单字母和双字母)创建集合搜索字符串。 搜索字符串可以基于由语音识别系统分配给n-gram的置信度来排序和截断。 搜索字符串集合被发送到至少一个搜索引擎,并且获得搜索结果。 然后基于搜索结果和单词格之间的语义相似度重新排列或重新排序搜索结果。
    • 6. 发明申请
    • Method and Apparatus for Voice Searching for Stored Content Using Uniterm Discovery
    • 使用Uniterm发现的语音搜索存储内容的方法和装置
    • US20090210226A1
    • 2009-08-20
    • US12032258
    • 2008-02-15
    • Changxue Ma
    • Changxue Ma
    • G10L15/08
    • G10L15/26G10L2015/025G10L2015/088
    • A method, system and communication device for enabling voice-to-voice searching and ordered content retrieval via audio tags assigned to individual content, which tags generate uniterms that are matched against components of a voice query. The method includes storing content and tagging at least one of the content with an audio tag. The method further includes receiving a voice query to retrieve content stored on the device. When the voice query is received, the method completes a voice-to-voice search utilizing uniterms of the audio tag, scored against the phoneme latent lattice model generated by the voice query to identify matching terms within the audio tags and corresponding stored content. The retrieved content(s) associated with the identified audio tags having uniterms that score within the phoneme lattice model are outputted in an order corresponding to an order in which the uniterms are structured within the voice query.
    • 一种用于通过分配给各个内容的音频标签启用语音到语音搜索和排序内容检索的方法,系统和通信设备,该标签生成与语音查询的组件匹配的单位。 该方法包括存储内容并且将具有音频标签的内容中的至少一个标记。 该方法还包括接收语音查询以检索存储在设备上的内容。 当接收到语音查询时,该方法使用音频标签的单位完成语音到语音搜索,对由语音查询产生的音素潜在网格模型进行评分,以识别音频标签内的匹配项和对应的存储内容。 与所识别的音频标签相关联的检索到的内容,其具有在音素格子模型内得分的单位格式,其顺序与语音查询内的单元格结构的顺序相对应地输出。
    • 8. 发明申请
    • System and Method for Enabling a Mobile Device as a Portable Character Input Peripheral Device
    • 将移动设备用作便携式字符输入外围设备的系统和方法
    • US20090066541A1
    • 2009-03-12
    • US11853912
    • 2007-09-12
    • Changxue MaWei LinLi-Xin Zhen
    • Changxue MaWei LinLi-Xin Zhen
    • G06F3/023
    • G06F3/0231G06F3/0237
    • A portable electronic communication device, designed for voice and data communication is utilized as a peripheral input device for transmitting/providing character inputs, entered in the first device's touch input mechanism, to a second electronic device. The first device has a mode switching utility that switches the first device between a first standard communication mode and a second peripheral input device mode. When the first device is in the second peripheral input device mode, the first device operates as a peripheral input device for the second device. A character input recognition utility executes on the first device to provide the functions of: detecting an input on the touch screen input mechanism; generating an electronic representation of the input; establishing a communication link between the second communication transmitter and an identified second device; and forwarding the electronic representation of the character input to the communication transmitter for transmission to the identified second device.
    • 用于语音和数据通信的便携式电子通信设备被用作用于将输入到第一设备的触摸输入机构中的字符输入发送/提供给第二电子设备的外围输入设备。 第一设备具有模式切换实用程序,用于在第一标准通信模式和第二外围输入设备模式之间切换第一设备。 当第一设备处于第二外设输入设备模式时,第一设备作为第二设备的外围输入设备工作。 字符输入识别实用程序在第一设备上执行以提供以下功能:检测触摸屏输入机构上的输入; 生成输入的电子表示; 建立第二通信发射机和识别的第二设备之间的通信链路; 以及将所述字符输入的电子表示转发到所述通信发射机以便发送到所识别的第二设备。
    • 9. 发明申请
    • METHOD AND SYSTEM FOR MANAGING PRONUNCIATION DICTIONARIES IN A SPEECH APPLICATION
    • 用于管理语音应用中的发音词典的方法和系统
    • US20070239455A1
    • 2007-10-11
    • US11278983
    • 2006-04-07
    • Michael GrobleChangxue Ma
    • Michael GrobleChangxue Ma
    • G10L13/08
    • G10L15/187G10L13/08
    • A voice toolkit (100) and a method (700) for managing pronunciation dictionaries are provided. The visual toolkit can include a user-interface (110) for entering in a text and a corresponding spoken utterance, a text-to-speech system (120) for synthesizing a pronunciation from the text, a talking speech recognizer (132) for generating pronunciations of the spoken utterance, and a voice processor (130) for validating at least one pronunciation. A developer can type a text of a word into the toolkit and listen to the pronunciation to determine whether the pronunciation is acceptable. If the pronunciation is incorrect the developer can speak the word for providing a spoken utterance having a correct pronunciation.
    • 提供了一种用于管理发音词典的语音工具包(100)和方法(700)。 视觉工具包可以包括用于输入文本的用户界面(110)和对应的说话话语,用于从文本合成发音的文本到语音系统(120),用于生成语音识别器(132) 讲话语音的发音,以及用于验证至少一个发音的语音处理器(130)。 开发人员可以在工具包中输入单词的文字,并听发音来确定发音是否可以接受。 如果发音不正确,开发人员可以说出提供具有正确发音的口语发音。