    • 2. Invention patent application
    • SPEECH RECOGNITION USING VARIABLE-LENGTH CONTEXT
    • US20130006623A1
    • 2013-01-03
    • US13539284
    • 2012-06-29
    • Ciprian I. Chelba; Peng Xu; Fernando Pereira
    • Ciprian I. Chelba; Peng Xu; Fernando Pereira
    • G10L15/20
    • G10L15/187; G10L15/063; G10L15/14; G10L15/34; G10L2015/0631
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for recognizing speech using a variable length of context. Speech data and data identifying a candidate transcription for the speech data are received. A phonetic representation for the candidate transcription is accessed. Multiple test sequences are extracted for a particular phone in the phonetic representation. Each of the multiple test sequences includes a different set of contextual phones surrounding the particular phone. Data indicating that an acoustic model includes data corresponding to one or more of the multiple test sequences is received. From among the one or more test sequences, the test sequence that includes the highest number of contextual phones is selected. A score for the candidate transcription is generated based on the data from the acoustic model that corresponds to the selected test sequence.
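The selection step described in this abstract (extract several test sequences around a phone, then keep the one with the most contextual phones that the acoustic model knows about) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not taken from the patent: the function names, the symmetric context window, the dictionary standing in for an acoustic model, and the use of one fixed number per sequence as its score.

```python
from typing import Dict, List, Optional, Sequence, Tuple

# Hypothetical representation: (left context, central phone, right context).
TestSequence = Tuple[Tuple[str, ...], str, Tuple[str, ...]]


def extract_test_sequences(
    phones: Sequence[str], index: int, max_context: int
) -> List[TestSequence]:
    """Build test sequences for phones[index] with 1..max_context phones per side.

    Contexts are truncated at the edges of the transcription, and sequences
    that come out identical after truncation are kept only once.
    """
    sequences: List[TestSequence] = []
    for k in range(1, max_context + 1):
        left = tuple(phones[max(0, index - k):index])
        right = tuple(phones[index + 1:index + 1 + k])
        seq = (left, phones[index], right)
        if seq not in sequences:
            sequences.append(seq)
    return sequences


def context_size(seq: TestSequence) -> int:
    """Number of contextual phones surrounding the central phone."""
    return len(seq[0]) + len(seq[2])


def select_best_sequence(
    sequences: List[TestSequence], model: Dict[TestSequence, float]
) -> Optional[TestSequence]:
    """Among sequences present in the model, pick the one with the most context."""
    matched = [s for s in sequences if s in model]
    if not matched:
        return None
    return max(matched, key=context_size)


def score_transcription(
    phones: Sequence[str], model: Dict[TestSequence, float], max_context: int = 2
) -> float:
    """Sum the toy model values of the selected sequence for every phone.

    Assumption: phones with no matching sequence contribute nothing.
    """
    total = 0.0
    for i in range(len(phones)):
        best = select_best_sequence(
            extract_test_sequences(phones, i, max_context), model
        )
        if best is not None:
            total += model[best]
    return total
```

For the phone list `["s", "p", "iy", "ch"]` and the phone at index 2, the sketch produces one sequence with a single phone on each side and one with two phones on the left. If the toy model contains both, the longer one is selected; if it contains only the shorter one, the selection falls back to it.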
    • 3. Invention patent application
    • TRAINING ACOUSTIC MODELS
    • US20130006612A1
    • 2013-01-03
    • US13539225
    • 2012-06-29
    • Peng Xu; Fernando Pereira; Ciprian I. Chelba
    • Peng Xu; Fernando Pereira; Ciprian I. Chelba
    • G06F17/27
    • G10L15/187; G10L15/063; G10L15/14; G10L15/34; G10L2015/0631
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training acoustic models. Speech data and data identifying a transcription for the speech data are received. A phonetic representation for the transcription is accessed. Training sequences are identified for a particular phone in the phonetic representation. Each of the training sequences includes a different set of contextual phones surrounding the particular phone. A partitioning key is identified based on a sequence of phones that occurs in each of the training sequences. A processing module to which the identified partitioning key is assigned is selected. Data identifying the training sequences and a portion of the speech data are transmitted to the selected processing module.
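The routing idea in this abstract (derive a partitioning key from a phone sequence shared by all training sequences, then send the data to the processing module that owns that key) can be sketched as follows. This is a hedged illustration only. The abstract does not say which shared sequence forms the key or how keys are assigned to modules, so the choice of the central phone plus one neighbor per side, the SHA-256 hash with a modulo, and all function names are assumptions.

```python
import hashlib
from typing import Dict, List, Sequence, Tuple

# Hypothetical representation: (left context, central phone, right context).
TrainingSequence = Tuple[Tuple[str, ...], str, Tuple[str, ...]]


def training_sequences(
    phones: Sequence[str], index: int, max_context: int
) -> List[TrainingSequence]:
    """Build training sequences for phones[index] with 1..max_context phones per side."""
    seqs: List[TrainingSequence] = []
    for k in range(1, max_context + 1):
        left = tuple(phones[max(0, index - k):index])
        right = tuple(phones[index + 1:index + 1 + k])
        seq = (left, phones[index], right)
        if seq not in seqs:
            seqs.append(seq)
    return seqs


def partitioning_key(seqs: List[TrainingSequence]) -> str:
    """Return the central sequence (one neighbor per side) shared by all sequences.

    Assumption: this shared central sequence is what serves as the key.
    """
    keys = {
        "-".join(left[-1:] + (phone,) + right[:1]) for left, phone, right in seqs
    }
    if len(keys) != 1:
        raise ValueError("training sequences do not share a central sequence")
    return keys.pop()


def select_module(key: str, num_modules: int) -> int:
    """Map a key to a module index with a stable hash (assumed assignment scheme)."""
    digest = hashlib.sha256(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_modules


def route(
    phones: Sequence[str],
    index: int,
    frames: Sequence[float],
    num_modules: int,
    max_context: int = 2,
) -> Tuple[int, Dict[str, object]]:
    """Pick the module for one phone and build the payload that would be sent to it."""
    seqs = training_sequences(phones, index, max_context)
    key = partitioning_key(seqs)
    payload = {"key": key, "sequences": seqs, "frames": list(frames)}
    return select_module(key, num_modules), payload
```

Because the key depends only on the shared central sequence, every occurrence of the same phone in the same immediate context lands on the same module, whatever its longer-range context. A stable hash is used instead of Python's built-in `hash`, which is randomized per process for strings.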
    • 4. Granted invention patent
    • Speech recognition using variable-length context
    • US08494850B2
    • 2013-07-23
    • US13539284
    • 2012-06-29
    • Ciprian I. Chelba; Peng Xu; Fernando Pereira
    • Ciprian I. Chelba; Peng Xu; Fernando Pereira
    • G10L15/20
    • G10L15/187; G10L15/063; G10L15/14; G10L15/34; G10L2015/0631
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for recognizing speech using a variable length of context. Speech data and data identifying a candidate transcription for the speech data are received. A phonetic representation for the candidate transcription is accessed. Multiple test sequences are extracted for a particular phone in the phonetic representation. Each of the multiple test sequences includes a different set of contextual phones surrounding the particular phone. Data indicating that an acoustic model includes data corresponding to one or more of the multiple test sequences is received. From among the one or more test sequences, the test sequence that includes the highest number of contextual phones is selected. A score for the candidate transcription is generated based on the data from the acoustic model that corresponds to the selected test sequence.