会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明申请
    • SPEECH RECOGNITION USING VARIABLE-LENGTH CONTEXT
    • 使用可变长度语境进行语音识别
    • US20130006623A1
    • 2013-01-03
    • US13539284
    • 2012-06-29
    • Ciprian I. ChelbaPeng XuFernando Pereira
    • Ciprian I. ChelbaPeng XuFernando Pereira
    • G10L15/20
    • G10L15/187G10L15/063G10L15/14G10L15/34G10L2015/0631
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for recognizing speech using a variable length of context. Speech data and data identifying a candidate transcription for the speech data are received. A phonetic representation for the candidate transcription is accessed. Multiple test sequences are extracted for a particular phone in the phonetic representation. Each of the multiple test sequences includes a different set of contextual phones surrounding the particular phone. Data indicating that an acoustic model includes data corresponding to one or more of the multiple test sequences is received. From among the one or more test sequences, the test sequence that includes the highest number of contextual phones is selected. A score for the candidate transcription is generated based on the data from the acoustic model that corresponds to the selected test sequence.
    • 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于使用可变长度的上下文识别语音。 接收用于识别语音数据的候选转录的语音数据和数据。 访问候选转录的语音表示。 在语音表示中为特定电话提取多个测试序列。 多个测试序列中的每一个包括围绕特定电话的不同的上下文电话组。 指示声学模型包括与多个测试序列中的一个或多个对应的数据的数据被接收。 从一个或多个测试序列中,选择包括最多数量的上下文电话的测试序列。 基于来自对应于所选择的测试序列的声学模型的数据生成候选转录的得分。
    • 3. 发明授权
    • Speech recognition using variable-length context
    • 使用可变长度上下文的语音识别
    • US08494850B2
    • 2013-07-23
    • US13539284
    • 2012-06-29
    • Ciprian I. ChelbaPeng XuFernando Pereira
    • Ciprian I. ChelbaPeng XuFernando Pereira
    • G10L15/20
    • G10L15/187G10L15/063G10L15/14G10L15/34G10L2015/0631
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for recognizing speech using a variable length of context. Speech data and data identifying a candidate transcription for the speech data are received. A phonetic representation for the candidate transcription is accessed. Multiple test sequences are extracted for a particular phone in the phonetic representation. Each of the multiple test sequences includes a different set of contextual phones surrounding the particular phone. Data indicating that an acoustic model includes data corresponding to one or more of the multiple test sequences is received. From among the one or more test sequences, the test sequence that includes the highest number of contextual phones is selected. A score for the candidate transcription is generated based on the data from the acoustic model that corresponds to the selected test sequence.
    • 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于使用可变长度的上下文识别语音。 接收用于识别语音数据的候选转录的语音数据和数据。 访问候选转录的语音表示。 在语音表示中为特定电话提取多个测试序列。 多个测试序列中的每一个包括围绕特定电话的不同的上下文电话组。 指示声学模型包括与多个测试序列中的一个或多个对应的数据的数据被接收。 从一个或多个测试序列中,选择包括最多数量的上下文电话的测试序列。 基于来自对应于所选择的测试序列的声学模型的数据生成候选转录的得分。
    • 4. 发明申请
    • TRAINING ACOUSTIC MODELS
    • 训练声学模型
    • US20130006612A1
    • 2013-01-03
    • US13539225
    • 2012-06-29
    • Peng XuFernando PereiraCiprian I. Chelba
    • Peng XuFernando PereiraCiprian I. Chelba
    • G06F17/27
    • G10L15/187G10L15/063G10L15/14G10L15/34G10L2015/0631
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training acoustic models. Speech data and data identifying a transcription for the speech data are received. A phonetic representation for the transcription is accessed. Training sequences are identified for a particular phone in the phonetic representation. Each of the training sequences includes a different set of contextual phones surrounding the particular phone. A partitioning key is identified based on a sequence of phones that occurs in each of the training sequences. A processing module to which the identified partitioning key is assigned is selected. Data identifying the training sequences and a portion of the speech data are transmitted to the selected processing module.
    • 方法,系统和装置,包括在计算机存储介质上编码的用于训练声学模型的计算机程序。 接收用于识别语音数据的转录的语音数据和数据。 访问转录的语音表示。 在语音表示中为特定电话识别训练序列。 每个训练序列包括围绕特定电话的不同的上下文电话组。 基于在每个训练序列中出现的电话序列来识别分区密钥。 选择分配了所识别的分区键的处理模块。 识别训练序列和语音数据的一部分的数据被发送到所选择的处理模块。
    • 8. 发明授权
    • Adaptation of exponential models
    • 指数模型的适应
    • US07860314B2
    • 2010-12-28
    • US10977871
    • 2004-10-29
    • Ciprian I. ChelbaAlejandro Acero
    • Ciprian I. ChelbaAlejandro Acero
    • G06K9/00
    • G06F17/273G06K9/6297
    • A method and apparatus are provided for adapting an exponential probability model. In a first stage, a general-purpose background model is built from background data by determining a set of model parameters for the probability model based on a set of background data. The background model parameters are then used to define a prior model for the parameters of an adapted probability model that is adapted and more specific to an adaptation data set of interest. The adaptation data set is generally of much smaller size than the background data set. A second set of model parameters are then determined for the adapted probability model based on the set of adaptation data and the prior model.
    • 提供了一种适应指数概率模型的方法和装置。 在第一阶段,通过基于一组背景数据确定概率模型的一组模型参数,从背景数据构建通用背景模型。 背景模型参数然后用于定义适应性概率模型的参数的先验模型,其适应并且更具体于感兴趣的自适应数据集。 自适应数据集通常比背景数据集小得多的大小。 然后,基于适配数据集和先​​验模型,针对适应概率模型确定第二组模型参数。
    • 9. 发明授权
    • Selecting speech data for speech recognition vocabulary
    • 选择语音识别词汇的语音数据
    • US08515745B1
    • 2013-08-20
    • US13593703
    • 2012-08-24
    • Maryam GarrettCiprian I. Chelba
    • Maryam GarrettCiprian I. Chelba
    • G10L15/00G10L15/04G06F17/30
    • G06F17/30976G06F17/2735G10L15/01G10L15/063
    • Methods, systems, and apparatus for selecting training data. In an aspect, a method comprises: obtaining search session data comprising search sessions that include search queries, wherein each search query comprises words; determining a threshold out of vocabulary rate indicating a rate at which a word in a search query is not included in a vocabulary; determining a threshold session out of vocabulary rate, the session out of vocabulary rate indicating a rate at which search sessions have an out of vocabulary rate that meets the threshold out of vocabulary rate; selecting a vocabulary of words that, for a set of test data, has a session out of vocabulary rate that meets the threshold session out of vocabulary rate, the vocabulary of words being selected from the one or more words included in each of the search queries included in the search sessions.
    • 用于选择训练数据的方法,系统和装置。 一方面,一种方法包括:获得搜索会话数据,其包括包括搜索查询的搜索会话,其中每个搜索查询包括单词; 确定表示搜索查询中的单词不包括在词汇中的速率的词汇率的阈值; 从词汇率确定阈值会话,会话中的词汇率表示搜索会话具有超出词汇率的符合阈值的词汇率的速率; 选择词汇的词汇,对于一组测试数据,具有超出词汇率的符合阈值会话的词汇率的会话,从包括在每个搜索查询中的一个或多个单词中选择单词的词汇表 包含在搜索会话中。