    • 2. Granted invention patent
    • Speech recognition apparatus using neural network and fuzzy logic
    • US5040215A
    • 1991-08-13
    • US400342
    • 1989-08-30
    • Akio Amano; Akira Ichikawa; Nobuo Hataoka
    • Akio Amano; Akira Ichikawa; Nobuo Hataoka
    • G06F3/16; G06F15/18; G06N3/00; G10L15/02; G10L15/08; G10L15/10; G10L15/16; G10L15/28
    • G10L15/16; Y10S706/90
    • A speech recognition apparatus has a speech input unit for inputting speech; a speech analysis unit for analyzing the inputted speech to output a time series of feature vectors; a candidate selection unit that takes the time series of feature vectors from the speech analysis unit and selects a plurality of recognition candidates from the speech categories; and a discrimination processing unit for discriminating among the selected candidates to obtain a final recognition result. The discrimination processing unit includes three components: a pair generation unit for generating all two-candidate combinations of the n candidates selected by said candidate selection unit; a pair discrimination unit for discriminating, for each of the ₙC₂ combinations (or pairs), which candidate of the pair is more certain, on the basis of the extracted acoustic features intrinsic to each of said candidate speeches; and a final decision unit for collecting the pair discrimination results obtained from the pair discrimination unit for all ₙC₂ combinations (or pairs) to decide the final result. The pair discrimination unit handles the extracted acoustic features intrinsic to each of the candidate speeches as fuzzy information and performs the discrimination on the basis of fuzzy logic algorithms, and the final decision unit likewise collects the results on the basis of fuzzy logic algorithms.
    • 4. Granted invention patent
    • Speech recognition apparatus using neural network and fuzzy logic
    • US5179624A
    • 1993-01-12
    • US727089
    • 1991-07-09
    • Akio Amano; Akira Ichikawa; Nobuo Hataoka
    • Akio Amano; Akira Ichikawa; Nobuo Hataoka
    • G10L15/16
    • G10L15/16; Y10S706/90
    • A speech recognition apparatus has: a speech input unit for inputting speech; a speech analysis unit for analyzing the inputted speech to output a time series of feature vectors; a candidate selection unit that takes the time series of feature vectors from the speech analysis unit and selects a plurality of recognition candidates from the speech categories; and a discrimination processing unit for discriminating among the selected candidates to obtain a final recognition result. The discrimination processing unit includes three components: a pair generation unit for generating all two-candidate combinations of the n candidates selected by said candidate selection unit; a pair discrimination unit for discriminating, for each of the ₙC₂ combinations (or pairs), which candidate of the pair is more certain, on the basis of the extracted acoustic features intrinsic to each of said candidate speeches; and a final decision unit for collecting the pair discrimination results obtained from the pair discrimination unit for all ₙC₂ combinations (or pairs) to decide the final result. The pair discrimination unit handles the extracted acoustic features intrinsic to each of the candidate speeches as fuzzy information and performs the discrimination on the basis of fuzzy logic algorithms, and the final decision unit likewise collects the results on the basis of fuzzy logic algorithms.
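The pair generation, pair discrimination, and final decision steps named in the abstracts of US5040215A and US5179624A can be illustrated with a minimal Python sketch. The abstracts say only that fuzzy logic is used, so the specific operators here are assumptions: each pair yields a membership degree in [0, 1], the losing side receives the fuzzy complement, each candidate's degrees are combined with a fuzzy AND (min), and the candidate with the highest combined degree wins. The names `generate_pairs`, `decide`, and `toy_pair_degree`, and the toy membership values, are hypothetical and do not come from the patents.

```python
from itertools import combinations


def generate_pairs(candidates):
    """Pair generation: all n-choose-2 unordered pairs of candidates."""
    return list(combinations(candidates, 2))


def decide(candidates, pair_degree):
    """Final decision over all pairwise results.

    pair_degree(a, b) returns a membership degree in [0, 1] that
    candidate a is more certain than candidate b.
    """
    support = {c: [] for c in candidates}
    for a, b in generate_pairs(candidates):
        mu = pair_degree(a, b)
        support[a].append(mu)
        support[b].append(1.0 - mu)  # fuzzy complement (assumption)
    # Fuzzy AND (min) over each candidate's pairwise degrees (assumption).
    scores = {c: min(v) if v else 1.0 for c, v in support.items()}
    # Pick the candidate with the largest combined degree.
    best = max(scores, key=scores.get)
    return best, scores


# Toy membership degrees standing in for a real acoustic-feature discriminator.
TOY_DEGREES = {("p", "t"): 0.8, ("p", "k"): 0.6, ("t", "k"): 0.3}


def toy_pair_degree(a, b):
    return TOY_DEGREES[(a, b)]


best, scores = decide(["p", "t", "k"], toy_pair_degree)
print(best, scores)
```

With three candidates the sketch evaluates three pairs; in general the number of pair discriminations grows as n(n-1)/2, which is why the candidate selection step keeps n small before this stage runs.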
    • 8. Granted invention patent
    • Speech recognition method
    • US4718095A
    • 1988-01-05
    • US554960
    • 1983-11-25
    • Yoshiaki Asakawa; Akio Komatsu; Nobuo Hataoka; Akira Ichikawa; Kiyoshi Nagasawa
    • Yoshiaki Asakawa; Akio Komatsu; Nobuo Hataoka; Akira Ichikawa; Kiyoshi Nagasawa
    • G10L11/00; G10L15/00; G10L15/12; G10L5/00
    • G10L15/12; G10L15/00
    • A speech recognition method improves the accuracy of recognizing input speech and can operate in real time. It generates from the input speech signal a difference signal that indicates, for each frame, whether the speech power of the input is increasing or decreasing. The similarity between the input speech and a standard pattern is then calculated for each frame, and this similarity is corrected on the basis of the generated difference signal and a difference signal for the standard pattern obtained from storage. The input speech is matched against the standard pattern using the corrected similarity, and the input speech is recognized from the result of this matching. Thus, a spectrum matching distance weighted by the power information of the speech can be obtained in real time.
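The idea in the abstract of US4718095A, a per-frame similarity corrected by whether speech power is rising or falling, can be sketched as follows. The abstract does not give the correction formula or the matching procedure, so several parts are assumptions: the difference signal is reduced to a sign (+1, -1, 0), the per-frame measure is a Euclidean distance, the correction multiplies that distance by a hypothetical `penalty` factor when the input and template trends disagree, and matching is done with a basic dynamic time warping recursion. All function names are illustrative.

```python
import math


def power_trend(powers):
    """Per-frame sign of the power change: +1 rising, -1 falling, 0 flat.

    The first frame has no predecessor and is marked 0.
    """
    trend = [0]
    for prev, cur in zip(powers, powers[1:]):
        trend.append((cur > prev) - (cur < prev))
    return trend


def corrected_distance(x, y, trend_x, trend_y, penalty=2.0):
    """Euclidean frame distance, scaled up when the power trends disagree.

    The multiplicative penalty is an assumption, not the patent's formula.
    """
    d = math.dist(x, y)
    return d * penalty if trend_x * trend_y < 0 else d


def dtw(in_frames, in_powers, ref_frames, ref_powers, penalty=2.0):
    """Accumulated corrected distance along the best warping path."""
    t_in, t_ref = power_trend(in_powers), power_trend(ref_powers)
    n, m = len(in_frames), len(ref_frames)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = corrected_distance(in_frames[i - 1], ref_frames[j - 1],
                                   t_in[i - 1], t_ref[j - 1], penalty)
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1],
                                 cost[i - 1][j - 1])
    return cost[n][m]


def recognize(in_frames, in_powers, templates):
    """Return the label of the stored template with the smallest distance.

    templates maps a label to a (frames, powers) tuple.
    """
    return min(templates,
               key=lambda k: dtw(in_frames, in_powers, *templates[k]))
```

A toy call might look like `recognize(frames, powers, {"up": (up_frames, up_powers), "down": (down_frames, down_powers)})`, where each frame is a tuple of spectral features and each power list has one value per frame. Because the correction only needs the current and previous frame's power, it adds constant work per frame pair, which is consistent with the abstract's real-time claim.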