会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Apparatus for speech recognition
    • 语音识别装置
    • US4736429A
    • 1988-04-05
    • US618368
    • 1984-06-07
    • Katsuyuki NiyadaIkuo InoueSatoru FujiiShuji Morii
    • Katsuyuki NiyadaIkuo InoueSatoru FujiiShuji Morii
    • G10L15/10G10L15/04G10L5/00
    • G10L15/04
    • Apparatus for speech recognition, having each phoneme as a fundamental recognition unit, recognizes input speech by discriminating phonemes in the input speech. The apparatus comprises a memory for storing phoneme standard patterns of phonemes or phoneme groups; a spectrum analyzer for obtaining parameters indicative of the input speech signal spectrum; a statistical distance measure similarity calculator calculates the degree of similarity between the output of the spectrum analyzer and standard patterns stored in the memory; a segmentation portion for segmenting by using time-dependent low- and high-frequency power variations of the input speech signal and results from the similarity calculator; and a phoneme discriminator for recognizing phonemes by using the results from the similarity calculator.
    • 具有每个音素作为基本识别单元的语音识别装置通过在输入语音中区分音素来识别输入语音。 该装置包括用于存储音素或音素组的音素标准模式的存储器; 频谱分析器,用于获得指示输入语音信号频谱的参数; 统计距离测量相似度计算器计算频谱分析仪的输出与存储在存储器中的标准模式之间的相似度; 分割部分,用于通过使用输入语音信号的时间相关的低频和高频功率变化和来自相似性计算器的结果来分割; 以及通过使用相似度计算器的结果来识别音素的音素鉴别器。
    • 2. 发明授权
    • Apparatus for speech recognition
    • 语音识别装置
    • US4885791A
    • 1989-12-05
    • US920785
    • 1986-10-20
    • Satoru FujiiKatsuyuki NiyadaShuji MoriiTaisuke Watanabe
    • Satoru FujiiKatsuyuki NiyadaShuji MoriiTaisuke Watanabe
    • G10L15/00
    • G10L15/00
    • Disclosed is a speech recognition apparatus comprising: a speech analysis portion for extracting parameters necessary for determination of spoken words; a speech period detecting portion for extracting one or more combinations of speech periods using the parameters; and a structure analysis portion for detecting feature points indicative of phoneme structure of each word and for determining a word through computation of similarity to proposed words in accordance with the presence and absence of the feature points. Therefore, erroneous recognition due to noise introduction or the like can be reduced by detecting one or more combinations of proposed speech periods by the speech period detecting portion. By extracting only necessary number of extracting points, which contribute to the distinguishment between words, with reference to analysis procedure provided for each word, the sharpness of determination is bettered. More stable operation than conventional apparatus has been achieved in connection with time base expansion/compression. Small numbers of parameters obtained through speech analysis are used to reduce the amount of computation, while the above-mentioned parameters are stable against difference in phenemes due to difference in speakers.
    • 公开了一种语音识别装置,包括:语音分析部分,用于提取确定口语所需的参数; 语音周期检测部分,用于使用参数提取语音周期的一个或多个组合; 以及结构分析部分,用于检测指示每个单词的音素结构的特征点,并且根据特征点的存在和不存在,通过计算与所提出的单词的相似度来确定单词。 因此,可以通过由语音周期检测部分检测所提出的语音周期的一个或多个组合来减少由噪声引入引起的错误识别等。 通过提取必要数量的提取点,这有助于区分词,参考为每个单词提供的分析程序,提高了确定的清晰度。 结合时基扩展/压缩已经实现了比传统装置更稳定的操作。 使用通过语音分析获得的少量参数来减少计算量,而上述参数由于扬声器的差异而对于差异性是稳定的。
    • 3. 发明授权
    • Method of and apparatus for speech recognition wherein decisions are
made based on phonemes
    • 用于语音识别的方法和装置,其中基于音素进行决定
    • US5131043A
    • 1992-07-14
    • US441225
    • 1989-11-20
    • Satoru FujiiKatsuyuki Niyada
    • Satoru FujiiKatsuyuki Niyada
    • G10L15/00
    • G10L15/10
    • Linear prediction coefficients of a speech signal including unknown words are derived for each of successive periodic frame intervals. For every frame over the duration of an individual phoneme of the speech signal, the degree of similarity of stored coefficients of known words and derived coefficients of the unknown words are calculated so that at the end of the individual phonemes, the degree of similarity is calculated. Phoneme segmentation data are derived in response to the speech signal and combined with the calculated degree of similarity over the individual phoneme to derive phoneme strings of the speech signal. The derived and stored phoneme strings are compared to indicate the words stored in a word dictionary having the greatest similarity with the derived phoneme strings.
    • 对于每个连续的周期性帧间隔导出包括未知字的语音信号的线性预测系数。 对于在语音信号的单个音素的持续时间内的每个帧,计算已知单词的存储系数和未知单词的导出系数的相似度,使得在单个音素的末尾,计算相似度 。 音素分割数据是响应于语音信号导出的,并且与计算的各个音素上的相似程度相结合以导出语音信号的音素串。 将导出和存储的音素字符串进行比较,以指示与导出的音素串具有最大相似性的词典中存储的词。
    • 4. 发明授权
    • Method for speech recognition
    • 语音识别方法
    • US4991216A
    • 1991-02-05
    • US501386
    • 1990-03-23
    • Satoru FujiiKatsuyuki Niyada
    • Satoru FujiiKatsuyuki Niyada
    • G10L15/00
    • G10L15/10
    • Phoneme standard patterns are prepared beforehand using speech sounds from a number of individual speakers. Unknown input speech sounds are divided into continuous frames and then some of these frames are extracted such that extracted frames include noncontinuous frames. LPC cepstrum coefficients are obtained for each of the frames and a mean value is obtained from the LPC cepstrum coefficients for each of the phonemes. A first standard pattern is then formed for each of the phonemes belonging to each group determined by characteristics of known speech sounds. Spectrum information of the unknown input speech sounds is produced using such extracted frames, and is then compared with the phoneme standard patterns prestored in a storage to determine and recognize phonemes of the unknown speech sounds by calculating similarity between the two using a statistical distance measure. When obtaining LPC cepstrum coefficients through LPC analysis, the order of LPC cepstrum coefficients is set to a value below the order of LPC analysis. The method according to the present invention reduces the amount of calculations necessary to perform speech recognition without deteriorating phoneme and word recognition rate.
    • 预先使用来自多个扬声器的讲话声音准备音素标准图案。 未知的输入语音被分为连续帧,然后提取这些帧中的一些,使得提取的帧包括非连续帧。 对于每个帧获得LPC倒谱系数,并且从每个音素的LPC倒频谱系数获得平均值。 然后,通过已知语音的特征确定属于每个组的每个音素,形成第一标准模式。 使用这种提取的帧产生未知输入语音的频谱信息,然后将其与预先存储在存储器中的音素标准模式进行比较,以通过使用统计距离测量来计算两者之间的相似度来确定和识别未知语音的音素。 通过LPC分析获得LPC倒谱系数时,LPC倒谱系数的顺序设置为低于LPC分析顺序的值。 根据本发明的方法减少了在不恶化音素和字识别率的情况下执行语音识别所需的计算量。