会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 84. 发明授权
    • Two-stage implementation for phonetic recognition using a bi-directional target-filtering model of speech coarticulation and reduction
    • 使用语音合成和还原的双向目标滤波模型进行语音识别的两阶段实现
    • US07409346B2
    • 2008-08-05
    • US11069474
    • 2005-03-01
    • Alejandro AceroDong YuLi Deng
    • Alejandro AceroDong YuLi Deng
    • G10L15/10
    • G10L15/02G10L25/15G10L25/24G10L2015/025
    • A structured generative model of a speech coarticulation and reduction is described with a novel two-stage implementation. At the first stage, the dynamics of formants or vocal tract resonance (VTR) are generated using prior information of resonance targets in the phone sequence. Bi-directional temporal filtering with finite impulse response (FIR) is applied to the segmental target sequence as the FIR filter's input. At the second stage the dynamics of speech cepstra are predicted analytically based on the FIR filtered VTR targets. The combined system of these two stages thus generates correlated and causally related VTR and cepstral dynamics where phonetic reduction is represented explicitly in the hidden resonance space and implicitly in the observed cepstral space. The combined system also gives the acoustic observation probability given a phone sequence. Using this probability, different phone sequences can be compared and ranked in terms of their respective probability values. This then permits the use of the model for phonetic recognition.
    • 用新的两阶段实现来描述语音合成和简化的结构化生成模型。 在第一阶段,使用电话序列中共振目标的先前信息产生共振峰或声道共振(VTR)的动力学。 具有有限脉冲响应(FIR)的双向时间滤波作为FIR滤波器的输入应用于分段目标序列。 在第二阶段,基于FIR滤波的VTR目标,分析地预测语音cepstra的动力学。 这两个阶段的组合系统因此产生相关和因果相关的VTR和倒谱动力学,其中语音减少在隐藏共振空间中明确表示,并且隐含地在观察到的倒频谱空间中。 组合系统还给出了电话序列的声学观察概率。 使用这种概率,可以根据它们各自的概率值对不同的电话序列进行比较和排序。 这样就允许使用模型进行语音识别。
    • 85. 发明申请
    • Integrated speech recognition and semantic classification
    • 综合语音识别和语义分类
    • US20080177547A1
    • 2008-07-24
    • US11655703
    • 2007-01-19
    • Sibel YamanLi DengDong YuYe-Yi WangAlejandro Acero
    • Sibel YamanLi DengDong YuYe-Yi WangAlejandro Acero
    • G10L15/18
    • G10L15/1815
    • A novel system integrates speech recognition and semantic classification, so that acoustic scores in a speech recognizer that accepts spoken utterances may be taken into account when training both language models and semantic classification models. For example, a joint association score may be defined that is indicative of a correspondence of a semantic class and a word sequence for an acoustic signal. The joint association score may incorporate parameters such as weighting parameters for signal-to-class modeling of the acoustic signal, language model parameters and scores, and acoustic model parameters and scores. The parameters may be revised to raise the joint association score of a target word sequence with a target semantic class relative to the joint association score of a competitor word sequence with the target semantic class. The parameters may be designed so that the semantic classification errors in the training data are minimized.
    • 一种新颖的系统集成了语音识别和语义分类,从而在训练语言模型和语义分类模型时,可以考虑接受讲话语音的语音识别器中的声学分数。 例如,可以定义联合关联分数,其表示声学信号的语义类别和单词序列的对应关系。 联合关联分数可以包括参数,例如声信号的信号到类建模的加权参数,语言模型参数和分数,以及声学模型参数和分数。 可以修改参数以相对于具有目标语义类的竞争者词序列的联合关联分数来提高具有目标语义类别的目标词序列的联合关联分数。 可以设计参数,使得训练数据中的语义分类误差最小化。
    • 87. 发明授权
    • Method of noise reduction using correction and scaling vectors with partitioning of the acoustic space in the domain of noisy speech
    • 使用校正和缩放矢量进行噪声降低的方法,其中噪声语音领域的声学空间分割
    • US07254536B2
    • 2007-08-07
    • US11059036
    • 2005-02-16
    • Li DengXuedong HuangAlejandro Acero
    • Li DengXuedong HuangAlejandro Acero
    • G10L21/02
    • G10L21/0208
    • A method and apparatus are provided for reducing noise in a training signal and/or test signal. The noise reduction technique uses a stereo signal formed of two channel signals, each channel containing the same pattern signal. One of the channel signals is “clean” and the other includes additive noise. Using feature vectors from these channel signals, a collection of noise correction and scaling vectors is determined. When a feature vector of a noisy pattern signal is later received, it is multiplied by the best scaling vector for that feature vector and the best correction vector is added to the product to produce a noise reduced feature vector. Under one embodiment, the best scaling and correction vectors are identified by choosing an optimal mixture component for the noisy feature vector. The optimal mixture component being selected based on a distribution of noisy channel feature vectors associated with each mixture component.
    • 提供了一种用于减少训练信号和/或测试信号中的噪声的方法和装置。 噪声降低技术使用由两个信道信号形成的立体声信号,每个信道包含相同的模式信号。 一个通道信号是“干净的”,另一个包括加性噪声。 使用来自这些信道信号的特征向量,确定噪声校正和缩放向量的集合。 当稍后接收到噪声模式信号的特征向量时,将其乘以该特征向量的最佳缩放向量,并将最佳校正向量加到乘积以产生降噪特征向量。 在一个实施例中,通过为噪声特征向量选择最佳混合分量来识别最佳缩放和校正矢量。 基于与每个混合物组分相关联的噪声通道特征向量的分布来选择最佳混合物组分。