    • 35. Granted invention patent
    • Method of speech modelling and a speech recognizer
    • Chinese title (translated): Method and apparatus for speech recognition using speech shaping
    • EP0590925B1
    • 1999-04-14
    • EP93307664.8
    • 1993-09-28
    • International Business Machines Corporation
    • Nishimura, Masafumi; Okochi, Masaaki
    • G10L5/06; G10L7/08; G10L9/06
    • G10L15/142; G10L15/187; G10L15/197; G10L2015/025; G10L2015/0631
    • Provided is a speech recognizer that efficiently represents various pronunciation variations through statistical combinations (N-grams) of a few types of hidden Markov models. A feature extractor 4 analyzes a word input from a speech input device 1 to obtain a feature vector sequence corresponding to the word, or a label sequence obtained by applying a further transformation in a labeler 8. A parameter table 18 retains phonemic hidden Markov models for each speech transformation candidate of the subwords constituting the word, each keeping an N-gram relationship (N being an integer greater than or equal to 2) with the speech transformation candidates of the preceding subwords in the word. A recognizer 16 then applies the hidden Markov models to each speech transformation candidate, in correspondence with the candidate words in a word pronunciation dictionary 13 and on the basis of the N-gram relationship, and joins the hidden Markov models of these speech transformation candidates in parallel among the subwords to compose a speech model. The recognizer determines, for each candidate word, the probability that its composed speech model outputs the label sequence or feature vector sequence input as speech, and outputs the candidate word whose speech model has the highest probability to a display 19.
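The composition described in the abstract above can be illustrated with a small sketch. It is not the patented implementation: it assumes N = 2 (a bigram between the variants of adjacent subwords), models every pronunciation variant as a hypothetical one-state discrete HMM, and uses made-up names and probabilities (`word_likelihood`, `recognize`, `p1`, `q1`, and so on). The variants of each subword sit in parallel, the bigram weights the transitions between them, and the forward algorithm gives the probability that each candidate word emits the label sequence.

```python
from typing import Dict, List, Tuple

Variant = str
# Assumption: one-state HMM per variant = (self-loop probability, label emissions).
VariantHMM = Tuple[float, Dict[str, float]]


def word_likelihood(
    subwords: List[str],
    variants: Dict[str, List[Variant]],
    hmms: Dict[Variant, VariantHMM],
    start: Dict[Variant, float],
    bigram: Dict[Tuple[Variant, Variant], float],
    labels: List[str],
) -> float:
    """Forward probability that the composed word model emits `labels`."""
    if not labels or not subwords:
        return 0.0

    def emit(v: Variant, label: str) -> float:
        return hmms[v][1].get(label, 0.0)

    # alpha[(i, v)]: probability of the labels so far, ending in variant v of subword i.
    alpha = {(0, v): start.get(v, 0.0) * emit(v, labels[0])
             for v in variants[subwords[0]]}
    for label in labels[1:]:
        nxt = {}
        for i, sub in enumerate(subwords):
            for v in variants[sub]:
                stay = alpha.get((i, v), 0.0) * hmms[v][0]
                enter = 0.0
                if i > 0:
                    # Leave a variant u of the previous subword, weighted by P(v | u).
                    for u in variants[subwords[i - 1]]:
                        enter += (alpha.get((i - 1, u), 0.0)
                                  * (1.0 - hmms[u][0])
                                  * bigram.get((u, v), 0.0))
                nxt[(i, v)] = (stay + enter) * emit(v, label)
        alpha = nxt
    last = len(subwords) - 1
    # A path must also exit the final subword.
    return sum(alpha.get((last, v), 0.0) * (1.0 - hmms[v][0])
               for v in variants[subwords[last]])


def recognize(dictionary, variants, hmms, start, bigram, labels):
    """Return the best-scoring candidate word and all word scores."""
    scores = {word: word_likelihood(subs, variants, hmms, start, bigram, labels)
              for word, subs in dictionary.items()}
    return max(scores, key=scores.get), scores


# Toy data, purely illustrative (not taken from the patent).
variants = {"p": ["p1", "p2"], "q": ["q1"]}
hmms = {"p1": (0.5, {"A": 1.0}), "p2": (0.5, {"B": 1.0}), "q1": (0.5, {"C": 1.0})}
start = {"p1": 0.6, "p2": 0.4, "q1": 1.0}
bigram = {("p1", "q1"): 1.0, ("p2", "q1"): 1.0, ("q1", "p1"): 0.6, ("q1", "p2"): 0.4}
dictionary = {"pq": ["p", "q"], "qp": ["q", "p"]}

best, scores = recognize(dictionary, variants, hmms, start, bigram, ["B", "C"])
```

With these toy values, the label sequence `["B", "C"]` can only be produced by the word `pq` through its second variant `p2`, so `qp` scores zero. A real system would use multi-state models and log probabilities to avoid underflow.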
    • 36. Published invention application
    • Method and apparatus for speech recognition performing noise adaptation
    • Chinese title (translated): Method and apparatus for noise adaptation in speech recognition
    • EP0847041A3
    • 1999-02-03
    • EP97309678.7
    • 1997-12-02
    • CANON KABUSHIKI KAISHA
    • Komori, Yasuhiro; Yamamoto, Hiroki
    • G10L5/06
    • G10L15/142; G10L15/144; G10L15/20
    • A speech processing apparatus includes a noise model production device for extracting a noise-speech interval from input speech data and producing a noise model by using the data of the extracted interval. The apparatus also includes a composite distribution production device for dividing the distributions of a speech model into a plurality of groups, producing a composite distribution of each group, and determining the positional relationship of each distribution within each group. In addition, the apparatus includes a memory for storing each composite distribution and the positional relationship of each distribution within the group, and a parallel model combination (PMC) conversion device for PMC-converting each produced composite distribution. Also provided is a noise-adaptive speech model production device for producing a noise-adaptive speech model on the basis of the composite distribution which is PMC-converted by the PMC conversion device and the positional relationship stored by the memory. Further, the apparatus includes an output device for determining and outputting a recognition result and a candidate with their likelihood for the input speech data by using the produced noise-adaptive speech model.
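The grouping idea in the abstract above can be sketched as follows. This is only an illustration under several assumptions that the abstract does not confirm: a single log-spectral dimension, the standard log-normal approximation for the PMC step, a moment-matched Gaussian as the composite distribution, and a "positional relationship" stored as each member's offset and variance relative to the composite. All function names are hypothetical. The point is that only one PMC conversion is done per group, and the members are then rebuilt around the converted composite.

```python
import math
from typing import List, Tuple

Gaussian = Tuple[float, float]  # (mean, variance) in one log-spectral dimension


def composite(group: List[Gaussian], weights: List[float]) -> Gaussian:
    """Moment-matched single Gaussian for a weighted group of Gaussians."""
    total = sum(weights)
    mean = sum(w * m for w, (m, _) in zip(weights, group)) / total
    second = sum(w * (v + m * m) for w, (m, v) in zip(weights, group)) / total
    return mean, second - mean * mean


def pmc_lognormal(speech: Gaussian, noise: Gaussian, gain: float = 1.0) -> Gaussian:
    """Combine speech and noise Gaussians with the log-normal PMC approximation."""
    def to_linear(g: Gaussian) -> Gaussian:
        m, v = g
        lin_mean = math.exp(m + v / 2.0)
        return lin_mean, lin_mean ** 2 * (math.exp(v) - 1.0)

    s_mean, s_var = to_linear(speech)
    n_mean, n_var = to_linear(noise)
    # Speech and noise are assumed additive and independent in the linear domain.
    mean = gain * s_mean + n_mean
    var = gain * gain * s_var + n_var
    log_var = math.log(var / (mean * mean) + 1.0)
    return math.log(mean) - log_var / 2.0, log_var


def adapt_group(group: List[Gaussian], weights: List[float],
                noise: Gaussian) -> List[Gaussian]:
    """PMC-convert only the composite, then restore each member around it."""
    comp = composite(group, weights)
    comp_adapted = pmc_lognormal(comp, noise)
    # Assumed positional relationship: offsets scale with the composite's std.
    scale = math.sqrt(comp_adapted[1] / comp[1])
    return [(comp_adapted[0] + (m - comp[0]) * scale, v * scale * scale)
            for m, v in group]


# Toy values, purely illustrative.
group = [(0.0, 1.0), (1.0, 1.0)]
adapted = adapt_group(group, [1.0, 1.0], noise=(0.0, 1.0))
```

The trade-off in this sketch is speed against accuracy: converting one composite per group costs far fewer PMC operations than converting every distribution, while the members only approximate what a full per-distribution conversion would give.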