会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明授权
    • System and method for speech verification using a confidence measure
    • 使用置信度测量语音验证的系统和方法
    • US06473735B1
    • 2002-10-29
    • US09553985
    • 2000-04-20
    • Duanpei WuXavier Menendez-PidalLex OlorenshawRuxin Chen
    • Duanpei WuXavier Menendez-PidalLex OlorenshawRuxin Chen
    • G10L1506
    • G10L15/10G10L2015/085
    • The present invention comprises a system and method for speech verification using a confidence measure that includes a speech verifier which compares a differential score for a recognized word to a predetermined threshold value, where a recognized word is the word model that produced the highest recognition score. In one embodiment, a single threshold is used for each word in a vocabulary. In another embodiment, each word model has an associated threshold, so that a differential score for a recognized word is compared to a unique threshold associated with that word. In a further embodiment, pairs of confused words in the vocabulary are dealt with separately. If a confused word is the recognized word, the speech verifier compares the differential score to a threshold that depends on the word model that produced the next-highest recognition score. Different values for the various thresholds may maximize rejection accuracy or recognition accuracy. A trade-off between rejection accuracy and recognition accuracy may be made by utilizing an intermediate threshold value that is between a minimum threshold value and a maximum threshold value.
    • 本发明包括一种用于使用置信度测量的语音验证的系统和方法,所述置信度测量包括将识别的词的差分得分与预定阈值进行比较的语音验证器,其中识别词是产生最高识别分数的单词模型。 在一个实施例中,词汇中的每个单词使用单个阈值。 在另一个实施例中,每个单词模型具有相关联的阈值,使得将识别的单词的差分分数与与该单词相关联的唯一阈值进行比较。 在另一实施例中,词汇表中的混淆词对被单独处理。 如果一个混淆的单词是被识别的单词,语音验证器将差分分数与取决于产生下一最高识别分数的单词模型的阈值进行比较。 各种阈值的不同值可以最大化拒绝准确度或识别精度。 可以通过利用处于最小阈值和最大阈值之间的中间阈值来进行拒绝准确度和识别精度之间的折衷。
    • 5. 发明授权
    • Method for utilizing validity constraints in a speech endpoint detector
    • 用于在语音端点检测器中使用有效性约束的方法
    • US06718302B1
    • 2004-04-06
    • US09482396
    • 2000-01-12
    • Duanpei WuMiyuki TanakaRuxin ChenLex Olorenshaw
    • Duanpei WuMiyuki TanakaRuxin ChenLex Olorenshaw
    • G10L1102
    • G10L25/87
    • A method for utilizing validity constraints in a speech endpoint detector comprises a validity manager that may utilize a pulse width module to validate utterances that include a plurality of energy pulses during a certain time period. The validity manager also may utilize a minimum power module to ensure that speech energy below a pre-determined level is not classified as a valid utterance. In addition the validity manager may use a duration module to ensure that valid utterances fall within a specified duration. Finally, the validity manager may utilize a short-utterance minimum power module to specifically distinguish an utterance of short duration from background noise based on the energy level of the short utterance.
    • 一种用于在语音端点检测器中利用有限约束的方法包括有效性管理器,其可以利用脉冲宽度模块来在特定时间段期间验证包括多个能量脉冲的话语。 有效性管理器还可以利用最小功率模块来确保低于预定电平的语音能量不被分类为有效的话语。 此外,有效性管理器可以使用持续时间模块来确保有效的话语落在指定的持续时间内。 最后,有效性管理器可以利用短话语最小功率模块来基于短语的能量级别来特别地区分短时间的短时间与背景噪声的发音。
    • 6. 发明授权
    • Method and apparatus for a parameter sharing speech recognition system
    • 一种参数共享语音识别系统的方法和装置
    • US6006186A
    • 1999-12-21
    • US953026
    • 1997-10-16
    • Ruxin ChenMiyuki TanakaDuanpei WuLex S. Olorenshaw
    • Ruxin ChenMiyuki TanakaDuanpei WuLex S. Olorenshaw
    • G10L15/14G10L15/18G10L7/08
    • G10L15/142G10L15/148
    • A method and an apparatus for a parameter sharing speech recognition system are provided. Speech signals are received into a processor of a speech recognition system. The speech signals are processed using a speech recognition system hosting a shared hidden Markov model (HMM) produced by generating a number of phoneme models, some of which are shared. The phoneme models are generated by retaining as a separate phoneme model any triphone model having a number of trained frames available that exceeds a prespecified threshold. A shared phoneme model is generated to represent each of the groups of triphone phoneme models for which the number of trained frames having a common biphone exceed the prespecified threshold. A shared phoneme model is generated to represent each of the groups of triphone phoneme models for which the number of trained frames having an equivalent effect on a phonemic context exceed the prespecified threshold. A shared phoneme model is generated to represent each of the groups of triphone phoneme models having the same center context. The generated phoneme models are trained, and shared phoneme model states are generated that are shared among the phoneme models. Shared probability distribution functions are generated that are shared among the phoneme model states. Shared probability sub-distribution functions are generated that are shared among the phoneme model probability distribution functions. The shared phoneme model hierarchy is reevaluated for further sharing in response to the shared probability sub-distribution functions. Signals representative of the received speech signals are generated.
    • 提供了一种用于参数共享语音识别系统的方法和装置。 语音信号被接收到语音识别系统的处理器中。 语音信号使用一个语音识别系统进行处理,该语音识别系统承载通过生成许多音素模型而产生的共享隐马尔可夫模型(HMM),其中一些是共享的。 音素模型是通过保留作为单独音素模型的任何具有超过预定阈值的已训练帧数的三音模型而产生的。 生成共享音素模型以表示具有共同biphone的经过训练的帧的数量超过预定阈值的三音节音素模型组中的每一组。 生成共享音素模型以表示三音节音素模型中的每一组,其中对音素上下文具有等效影响的经过训练的帧的数量超过预先指定的阈值。 生成共享音素模型以表示具有相同中心上下文的三音节音素模型组中的每一组。 生成的音素模型被训练,并且生成在音素模型中共享的共享音素模型状态。 生成在音素模型状态之间共享的共享概率分布函数。 生成在音素模型概率分布函数中共享的共享概率子分布函数。 共享音素模型层次结构被重新评估以响应于共享概率子分布函数进一步共享。 生成表示接收到的语音信号的信号。
    • 10. 发明申请
    • ROBUSTNESS TO ENVIRONMENTAL CHANGES OF A CONTEXT DEPENDENT SPEECH RECOGNIZER
    • 对语境相关语音识别器的环境变化的鲁棒性
    • US20110288869A1
    • 2011-11-24
    • US12785375
    • 2010-05-21
    • Xavier Menendez-PidalRuxin Chen
    • Xavier Menendez-PidalRuxin Chen
    • G10L15/14G06F15/18G06F17/30
    • G10L15/144G10L15/187G10L2015/022G10L2015/0631
    • An apparatus to improve robustness to environmental changes of a context dependent speech recognizer for an application, that includes a training database to store sounds for speech recognition training, a dictionary to store words supported by the speech recognizer, and a speech recognizer training module to train a set of one or more multiple state Hidden Markov Models (HMMs) with use of the training database and the dictionary. The speech recognizer training module performs a non-uniform state clustering process on each of the states of each HMM, which includes using a different non-uniform cluster threshold for at least some of the states of each HMM to more heavily cluster and correspondingly reduce a number of observation distributions for those of the states of each HMM that are less empirically affected by one or more contextual dependencies.
    • 一种用于提高对应用的上下文相关语音识别器对环境变化的鲁棒性的装置,其包括用于存储用于语音识别训练的声音的训练数据库,用于存储由语音识别器支持的单词的词典和用于训练的语音识别器训练模块 使用训练数据库和字典的一组或多个多状态隐马尔可夫模型(HMM)。 语音识别器训练模块对每个HMM的每个状态执行不均匀的状态聚类处理,其包括对于每个HMM的至少一些状态使用不同的非均匀簇阈值来进行更大的聚类,并相应地减少 每个HMM的状态的观察分布的数量较少受一个或多个上下文相关性的经验影响。