会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • State-dependent speaker clustering for speaker adaptation
    • 用于说话者适应的状态依赖的扬声器聚类
    • US5787394A
    • 1998-07-28
    • US572223
    • 1995-12-13
    • Lalit Rai BahlPonani GopalakrishnanDavid NahamooMukund Padmanabhan
    • Lalit Rai BahlPonani GopalakrishnanDavid NahamooMukund Padmanabhan
    • G10L15/06G10L5/06
    • G10L15/07G10L2015/0631
    • A system and method for adaptation of a speaker independent speech recognition system for use by a particular user. The system and method gather acoustic characterization data from a test speaker and compare the data with acoustic characterization data generated for a plurality of training speakers. A match score is computed between the test speaker's acoustic characterization for a particular acoustic subspace and each training speaker's acoustic characterization for the same acoustic subspace. The training speakers are ranked for the subspace according to their scores and a new acoustic model is generated for the test speaker based upon the test speaker's acoustic characterization data and the acoustic characterization data of the closest matching training speakers. The process is repeated for each acoustic subspace.
    • 一种适用于特定用户使用的独立于说话者的语音识别系统的系统和方法。 该系统和方法从测试扬声器收集声学表征数据,并将数据与为多个训练说话者生成的声学特征数据进行比较。 在特定声学子空间的测试扬声器的声学特性与相同声学子空间的每个训练说话者的声学特性之间计算匹配分数。 训练演讲者根据其分数对子空间进行排名,并且基于测试讲者的声学表征数据和最接近的匹配训练说话者的声学表征数据为测试说话者生成新的声学模型。 对于每个声学子空间重复该过程。
    • 3. 发明授权
    • System and method for partitioning the feature space of a classifier in
a pattern classification system
    • US6058205A
    • 2000-05-02
    • US781574
    • 1997-01-09
    • Lalit Rai BahlPeter Vincent deSouzaDavid NahamooMukund Padmanabhan
    • Lalit Rai BahlPeter Vincent deSouzaDavid NahamooMukund Padmanabhan
    • G06K9/62G06F17/20
    • G06K9/6282
    • A system and method are provided which partition the feature space of a classifier by using hyperplanes to construct a binary decision tree or hierarchical data structure for obtaining the class probabilities for a particular feature vector. One objective in the construction of the decision tree is to minimize the average entropy of the empirical class distributions at each successive node or subset, such that the average entropy of the class distributions at the terminal nodes is minimized. First, a linear discriminant vector is computed that maximally separates the classes at any particular node. A threshold is then chosen that can be applied on the value of the projection onto the hyperplane such that all feature vectors that have a projection onto the hyperplane that is less than the threshold are assigned to a child node (say, left child node) and the feature vectors that have a projection greater than or equal to the threshold are assigned to a right child node. The above two steps are then repeated for each child node until the data at a node falls below a predetermined threshold and the node is classified as a terminal node (leaf of the decision tree). After all non-terminal nodes have been processed, the final step is to store a class distribution associated with each terminal node. The class probabilities for a particular feature vector can then be obtained by traversing the decision tree in a top-down fashion until a terminal node is identified which corresponds to the particular feature vector. The information provided by the decision tree is that, in computing the class probabilities for the particular feature vector, only the small number of classes associated with that particular terminal node need be considered. Alternatively, the required class probabilities can be obtained simply by taking the stored distribution of the terminal node associated with the particular feature vector.
    • 4. 发明授权
    • Method and apparatus for estimating phone class probabilities
a-posteriori using a decision tree
    • 用于使用决策树估计电话类概率的方法和装置
    • US5680509A
    • 1997-10-21
    • US312584
    • 1994-09-27
    • Ponani S. GopalakrishnanDavid NahamooMukund PadmanabhanMichael Alan Picheny
    • Ponani S. GopalakrishnanDavid NahamooMukund PadmanabhanMichael Alan Picheny
    • G10L15/06G10L15/08G10L5/06
    • G10L15/063G10L15/08
    • A method and apparatus for estimating the probability of phones, a-posteriori, in the context of not only the acoustic feature at that time, but also the acoustic features in the vicinity of the current time, and its use in cutting down the search-space in a speech recognition system. The method constructs and uses a decision tree, with the predictors of the decision tree being the vector-quantized acoustic feature vectors at the current time, and in the vicinity of the current time. The process starts with an enumeration of all (predictor, class) events in the training data at the root node, and successively partitions the data at a node according to the most informative split at that node. An iterative algorithm is used to design the binary partitioning. After the construction of the tree is completed, the probability distribution of the predicted class is stored at all of its terminal leaves. The decision tree is used during the decoding process by tracing a path down to one of its leaves, based on the answers to binary questions about the vector-quantized acoustic feature vector at the current time and its vicinity.
    • 在不仅在当时的声学特征以及当前时间附近的声学特征的上下文中估计电话的概率的方法和装置,以及其用于减少搜索 - 语音识别系统中的空间。 该方法构造并使用决策树,其中决策树的预测变量是当前时间和当前时间附近的矢量量化的声学特征向量。 该过程从在根节点的训练数据中的所有(预测器,类)事件的枚举开始,并且根据该节点处的最多信息拆分在节点处依次划分数据。 迭代算法用于设计二进制分区。 树完成后,预测类的概率分布存储在其所有终端叶上。 基于对当前时间及其附近的向量量化声学特征向量的二进制问题的答案,在解码过程中使用决策树通过跟踪到其叶子之一的路径。
    • 5. 发明授权
    • Reduction of search space in speech recognition using phone boundaries
and phone ranking
    • 使用手机边界和手机排名减少语音识别中的搜索空间
    • US5729656A
    • 1998-03-17
    • US347013
    • 1994-11-30
    • David NahamooMukund Padmanabhan
    • David NahamooMukund Padmanabhan
    • G10L15/00G10L15/02G10L15/04G10L15/14G10L5/06
    • G10L15/04G10L15/142G10L2015/085
    • A method for estimating the probability of phone boundaries and the accuracy of the acoustic modelling in reducing a search-space in a speech recognition system. The accuracy of the acoustic modelling is quantified by the rank of the correct phone. The system includes a microphone for converting an utterance into an electrical signal, which is processed by an acoustic processor and label match which finds the best-matched acoustic label prototype. A probability distribution on phone boundaries is produced for every time frame using a first decision tree. These probabilities are compared to a threshold and some time frames are identified as boundaries between phones. An acoustic score is computed for all phones between every given pair of hypothesized boundaries, and the phones are ranked on the basis of this score. A second decision tree is traversed for every time frame to obtain the worst case rank of the correct phone at that time, and a short list of allowed phones is made for every time frame. A fast acoustic word match processor matches the label string from the acoustic processor to produce an utterance signal which includes at least one word. From recognition candidates produced by the fast acoustic match and the language model, the detailed acoustic match matches the label string from the acoustic processor against acoustic word models and outputs a word string corresponding to an utterance.
    • 一种用于在减少语音识别系统中的搜索空间中估计电话边界的概率和声学建模的准确度的方法。 声学建模的准确度由正确的手机的等级来量化。 该系统包括用于将发音转换成电信号的麦克风,该电信号由声学处理器处理,并且标签匹配找到最佳匹配的声学标签原型。 使用第一决策树为每个时间帧产生电话边界上的概率分布。 将这些概率与阈值进行比较,并且将一些时间帧识别为电话之间的边界。 对于所有给定的一对假设边界之间的所有电话,计算声学得分,并且手机基于该分数进行排名。 每个时间帧都会遍历第二个决策树,以获得当时正确的电话的最差情况等级,并为每个时间帧制作一个简短的允许电话列表。 快速声学词匹配处理器将来自声学处理器的标签串匹配以产生包括至少一个单词的话语信号。 从快速声学匹配和语言模型产生的识别候选中,详细的声匹配将来自声学处理器的标签串与声学词模型相匹配,并输出与发音对应的字串。