    • 1. Granted invention patent
    • Method and apparatus for a parameter sharing speech recognition system
    • US6006186A
    • 1999-12-21
    • US953026
    • 1997-10-16
    • Ruxin Chen; Miyuki Tanaka; Duanpei Wu; Lex S. Olorenshaw
    • G10L15/14; G10L15/18; G10L7/08
    • G10L15/142; G10L15/148
    • A method and an apparatus for a parameter sharing speech recognition system are provided. Speech signals are received into a processor of a speech recognition system. The speech signals are processed using a speech recognition system hosting a shared hidden Markov model (HMM) produced by generating a number of phoneme models, some of which are shared. The phoneme models are generated by retaining as a separate phoneme model any triphone model having a number of trained frames available that exceeds a prespecified threshold. A shared phoneme model is generated to represent each of the groups of triphone phoneme models for which the number of trained frames having a common biphone exceed the prespecified threshold. A shared phoneme model is generated to represent each of the groups of triphone phoneme models for which the number of trained frames having an equivalent effect on a phonemic context exceed the prespecified threshold. A shared phoneme model is generated to represent each of the groups of triphone phoneme models having the same center context. The generated phoneme models are trained, and shared phoneme model states are generated that are shared among the phoneme models. Shared probability distribution functions are generated that are shared among the phoneme model states. Shared probability sub-distribution functions are generated that are shared among the phoneme model probability distribution functions. The shared phoneme model hierarchy is reevaluated for further sharing in response to the shared probability sub-distribution functions. Signals representative of the received speech signals are generated.
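The back-off order described in the abstract above (separate triphone model, then a shared model for a common biphone, then a shared model for the center phone) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented procedure: the function name `assign_models`, the `(left, center, right)` tuple layout, and the choice of the left-context biphone are hypothetical, and the grouping by "equivalent effect on a phonemic context" is omitted.

```python
from collections import defaultdict


def assign_models(frame_counts, threshold):
    """Map each triphone (left, center, right) to the model representing it.

    frame_counts: dict mapping a triphone tuple to its number of trained frames.
    threshold:    minimum number of frames a model needs to stand on its own.
    """
    # Pool the frames of triphones that are too sparse to stand alone,
    # grouped by the (left, center) biphone they share (an assumption).
    biphone_frames = defaultdict(int)
    for (left, center, _right), n in frame_counts.items():
        if n <= threshold:
            biphone_frames[(left, center)] += n

    assignment = {}
    for tri, n in frame_counts.items():
        left, center, _right = tri
        if n > threshold:
            # Enough data: keep a separate triphone model.
            assignment[tri] = ("triphone", tri)
        elif biphone_frames[(left, center)] > threshold:
            # Sparse, but the pooled biphone group has enough data.
            assignment[tri] = ("biphone", (left, center))
        else:
            # Last resort: share a model across the same center phone.
            assignment[tri] = ("center", center)
    return assignment
```

Sharing of states, distribution functions, and sub-distribution functions would then operate on the models selected here; that part is not sketched.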
    • 3. Granted invention patent
    • Method for utilizing validity constraints in a speech endpoint detector
    • US06718302B1
    • 2004-04-06
    • US09482396
    • 2000-01-12
    • Duanpei Wu; Miyuki Tanaka; Ruxin Chen; Lex Olorenshaw
    • G10L11/02
    • G10L25/87
    • A method for utilizing validity constraints in a speech endpoint detector comprises a validity manager that may utilize a pulse width module to validate utterances that include a plurality of energy pulses during a certain time period. The validity manager also may utilize a minimum power module to ensure that speech energy below a pre-determined level is not classified as a valid utterance. In addition the validity manager may use a duration module to ensure that valid utterances fall within a specified duration. Finally, the validity manager may utilize a short-utterance minimum power module to specifically distinguish an utterance of short duration from background noise based on the energy level of the short utterance.
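The minimum-power, duration, and short-utterance checks in the abstract above might be combined along these lines. This is a hedged sketch only: the function name `is_valid_utterance`, the per-frame energy representation, and every default value are assumptions chosen for illustration, and the pulse width module is left out.

```python
def is_valid_utterance(energies, frame_ms=10, min_power=40.0,
                       min_ms=100, max_ms=5000,
                       short_ms=300, short_min_power=55.0):
    """Return True if a candidate utterance passes all validity checks.

    energies: per-frame energy values of the candidate utterance
              (units and all default limits are hypothetical).
    """
    if not energies:
        return False

    duration_ms = len(energies) * frame_ms
    peak = max(energies)

    # Minimum power: speech energy below a preset level is never valid.
    if peak < min_power:
        return False

    # Duration: valid utterances must fall within a specified range.
    if not (min_ms <= duration_ms <= max_ms):
        return False

    # Short-utterance minimum power: a short burst must be louder than
    # usual to be distinguished from background noise.
    if duration_ms < short_ms and peak < short_min_power:
        return False

    return True
```

With these defaults, a 200 ms candidate needs a higher peak energy than a 500 ms one, which reflects the abstract's stricter treatment of short utterances.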
    • 6. Granted invention patent
    • System and method for speech verification using a confidence measure
    • US06473735B1
    • 2002-10-29
    • US09553985
    • 2000-04-20
    • Duanpei Wu; Xavier Menendez-Pidal; Lex Olorenshaw; Ruxin Chen
    • G10L15/06
    • G10L15/10; G10L2015/085
    • The present invention comprises a system and method for speech verification using a confidence measure that includes a speech verifier which compares a differential score for a recognized word to a predetermined threshold value, where a recognized word is the word model that produced the highest recognition score. In one embodiment, a single threshold is used for each word in a vocabulary. In another embodiment, each word model has an associated threshold, so that a differential score for a recognized word is compared to a unique threshold associated with that word. In a further embodiment, pairs of confused words in the vocabulary are dealt with separately. If a confused word is the recognized word, the speech verifier compares the differential score to a threshold that depends on the word model that produced the next-highest recognition score. Different values for the various thresholds may maximize rejection accuracy or recognition accuracy. A trade-off between rejection accuracy and recognition accuracy may be made by utilizing an intermediate threshold value that is between a minimum threshold value and a maximum threshold value.
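The differential-score test in the abstract above can be illustrated with a short sketch. The names `verify`, `word_thresholds`, and `default_threshold` are hypothetical, as is the use of `>=` for acceptance; the confused-word-pair embodiment, where the threshold depends on the runner-up word, is not covered.

```python
def verify(scores, word_thresholds=None, default_threshold=5.0):
    """Accept or reject the top recognition hypothesis.

    scores:          dict mapping each word model to its recognition score.
    word_thresholds: optional per-word thresholds (second embodiment);
                     words not listed fall back to default_threshold
                     (single-threshold embodiment).
    Returns (recognized_word, accepted).
    """
    if len(scores) < 2:
        raise ValueError("need at least two word scores to form a differential")

    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    (best_word, best_score), (_runner_up, second_score) = ranked[0], ranked[1]

    # Differential score: margin between the best and next-best word models.
    differential = best_score - second_score

    threshold = (word_thresholds or {}).get(best_word, default_threshold)
    return best_word, differential >= threshold
```

Raising a threshold rejects more utterances and lowering it accepts more, which is the trade-off between rejection accuracy and recognition accuracy that the abstract mentions.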
    • 7. Granted invention patent
    • Weighted frequency-channel background noise suppressor
    • US06826528B1
    • 2004-11-30
    • US09691878
    • 2000-10-18
    • Duanpei Wu; Miyuki Tanaka; Xavier Menendez-Pidal
    • G10L21/02
    • G10L21/0208; G10L21/0232; G10L25/18; G10L25/78
    • A method for implementing a noise suppressor in a speech recognition system comprises a filter bank for separating source speech data into discrete frequency sub-bands to generate filtered channel energy, and a noise suppressor for weighting the frequency sub-bands to improve the signal-to-noise ratio of the resultant noise-suppressed channel energy. The noise suppressor preferably includes a noise calculator for calculating background noise values, a speech energy calculator for calculating speech energy values for each channel of the filter bank, and a weighting module for applying calculated weighting values to the projected channel energy to generate the noise-suppressed channel energy.
    • 8. Granted invention patent
    • Speech detection with noise suppression based on principal components analysis
    • US06230122B1
    • 2001-05-08
    • US09176178
    • 1998-10-21
    • Duanpei Wu; Miyuki Tanaka; Mariscela Amador-Hernandez
    • G10L21/02
    • G10L21/0208; G10L21/0232
    • A method for effectively suppressing background noise in a speech detection system comprises a filter bank for separating source speech data into discrete frequency sub-bands to generate filtered channel energy, and a noise suppressor for weighting the frequency sub-bands to improve the signal-to-noise ratio of the resultant noise-suppressed channel energy. The noise suppressor preferably includes a subspace module for using a Karhunen-Loeve transformation to create a subspace based on the background noise, a projection module for generating projected channel energy by projecting the filtered channel energy onto the created subspace, and a weighting module for applying calculated weighting values to the projected channel energy to generate the noise-suppressed channel energy.
    • 9. Granted invention patent
    • Method for performing microphone conversions in a speech recognition system
    • US06751588B1
    • 2004-06-15
    • US09449424
    • 1999-11-23
    • Xavier Menendez-Pidal; Miyuki Tanaka; Duanpei Wu
    • G10L15/06
    • G10L15/065
    • A method for performing microphone conversions in a speech recognition system comprises a speech module that simultaneously captures an identical input signal using both an original microphone and a final microphone. The original microphone is also used to record an original training database. The final microphone is also used to capture input signals during normal use of the speech recognition system. A characterization module then analyzes the recorded identical input signal to generate characterization values that are subsequently utilized by a conversion module to convert the original training database into a final training database. A training program then uses the final training database to train a recognizer in the speech module in order to optimally perform a speech recognition process, in accordance with the present invention.
    • 10. Granted invention patent
    • Method for implementing a speech verification system for use in a noisy environment
    • US06272460B1
    • 2001-08-07
    • US09264288
    • 1999-03-08
    • Duanpei Wu; Miyuki Tanaka; Lex Olorenshaw
    • G10L19/00
    • G10L25/78
    • A method for implementing a speech verification system for use in a noisy environment comprises the steps of generating a confidence index for an utterance using a speech verifier, and controlling the speech verifier with a processor, wherein the utterance contains frames of sound energy. The speech verifier includes a noise suppressor, a pitch detector, and a confidence determiner. The noise suppressor suppresses noise in each frame in the utterance by summing a frequency spectrum for each frame with frequency spectra of a selected number of previous frames to produce a spectral sum. The pitch detector applies a spectral comb window to each spectral sum to produce correlation values for each frame in the utterance. The pitch detector also applies an alternate spectral comb window to each spectral sum to produce alternate correlation values for each frame in the utterance. The confidence determiner evaluates the correlation values to produce a frame confidence measure for each frame in the utterance. The confidence determiner then uses the frame confidence measures to generate the confidence index for the utterance, which indicates whether the utterance is or is not speech.
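The first two stages in the abstract above, summing each frame's spectrum with those of a few previous frames and then scoring the sum against a spectral comb, could look roughly like this. It is a simplified sketch under assumptions: the names `spectral_sums` and `comb_correlation` are hypothetical, the comb is taken to be a binary window at integer multiples of a bin spacing, and the "correlation" is approximated as the fraction of spectral energy that lands on the comb teeth.

```python
def spectral_sums(spectra, n_prev=2):
    """Sum each frame's magnitude spectrum with up to n_prev previous frames.

    Summing across frames reinforces stable harmonic peaks relative to
    noise that varies from frame to frame.
    """
    sums = []
    for i in range(len(spectra)):
        window = spectra[max(0, i - n_prev): i + 1]
        sums.append([sum(bins) for bins in zip(*window)])
    return sums


def comb_correlation(spectrum, spacing):
    """Fraction of spectral energy on bins at multiples of `spacing`.

    Values near 1.0 suggest a harmonic (voiced) frame whose fundamental
    matches the comb; values near 0.0 suggest no match.
    """
    total = sum(spectrum)
    if total == 0:
        return 0.0
    teeth = sum(spectrum[k] for k in range(spacing, len(spectrum), spacing))
    return teeth / total
```

A frame confidence measure could then be derived from the best correlation over a range of candidate spacings, and the per-frame measures combined into the utterance-level confidence index; neither of those steps is sketched here.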