会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 51. 发明申请
    • FEATURE EXTRACTING APPARATUS, COMPUTER PROGRAM PRODUCT, AND FEATURE EXTRACTION METHOD
    • 特征提取装置,计算机程序产品和特征提取方法
    • US20090048835A1
    • 2009-02-19
    • US12042018
    • 2008-03-04
    • Takashi Masuko
    • Takashi Masuko
    • G10L15/00
    • G10L15/02G10L17/02G10L25/90
    • A feature extracting apparatus includes a spectrum calculator that calculates a logarithmic frequency spectrum including frequency components obtained from an input speech signal at regular intervals on a logarithmic frequency scale of a frame; a function calculator that calculates a cross-correlation function between a logarithmic frequency spectrum of a time and a logarithmic frequency spectrum of one or plural times included in a certain temporal width before and after the time, from a sequence of the logarithmic frequency spectra calculated at each time; and a feature extractor that extracts a set of the cross-correlation functions as a local and relative fundamental-frequency pattern feature at the frame.
    • 特征提取装置包括频谱计算器,其计算包括以帧的对数频率刻度上的规则间隔从输入语音信号获得的频率分量的对数频谱; 功能计算器,根据从时间上计算的对数频谱的序列,计算包含在时间之前和之后的一定时间宽度的时间的对数频谱和对数频谱之间的互相关频谱的互相关函数 每一次; 以及特征提取器,其提取一组所述互相关函数作为所述帧处的局部和相对基频图案特征。
    • 53. 发明申请
    • Indexing apparatus and indexing method
    • 索引设备和索引方法
    • US20060058998A1
    • 2006-03-16
    • US11202155
    • 2005-08-12
    • Koichi YamamotoTakashi MasukoShinichi Tanaka
    • Koichi YamamotoTakashi MasukoShinichi Tanaka
    • G10L15/04
    • G10L17/00G06F16/95
    • An indexing apparatus includes an acquiring unit that acquires an acoustic signal; a dividing unit that divides the acoustic signal into a plurality of segments; an acoustic model producing unit that produces an acoustic model for each of the segments; a reliability determining unit that determines reliability of the acoustic model; a similarity vector producing unit that produces a similarity vector having elements that are the similarities between the acoustic model for a predetermined segment and the acoustic signal of each of the other segments, based on the reliability; a clustering unit that clusters similarity vectors produced by the similarity vector producing unit; and an indexing unit that indexes the acoustic signal based on the similarity vectors clustered.
    • 索引装置包括获取单元,其获取声信号; 分割单元,其将所述声信号分割成多个段; 声学模型生成单元,其为每个段产生声学模型; 确定声学模型的可靠性的可靠性确定单元; 相似度矢量产生单元,其基于可靠性产生具有作为预定段的声学模型与每个其它段的声学信号之间的相似度的元素的相似度向量; 聚类单元,其对相似矢量产生单元产生的相似性向量进行聚类; 以及索引单元,其基于聚类的相似矢量对声音信号进行索引。
    • 55. 发明授权
    • Apparatus, method and computer program product for feature extraction
    • 用于特征提取的装置,方法和计算机程序产品
    • US08073686B2
    • 2011-12-06
    • US12366037
    • 2009-02-05
    • Yusuke KidaTakashi Masuko
    • Yusuke KidaTakashi Masuko
    • G10L11/04G10L19/00
    • G10L15/02G10L25/06G10L25/90
    • A feature extraction apparatus includes a spectrum calculating unit that calculates, based on an input speech signal, a frequency spectrum having frequency components obtained at regular intervals on a logarithmic frequency scale for each of frames that are defined by regular time intervals, and thereby generates a time series of the frequency spectrum; a cross-correlation coefficients calculating unit that calculates, for each target frame of the frames, a cross-correlation coefficients between frequency spectra calculated for two different frames that are in vicinity of the target frame and a predetermined frame width apart from each other; and a shift amount predicting unit that predicts a shift amount of the frequency spectra on the logarithmic frequency scale with respect to the predetermined frame width by use of the cross-correlation coefficients.
    • 特征提取装置包括频谱计算单元,该频谱计算单元基于输入的语音信号,以规则的时间间隔以规律的间隔对每个由规则的时间间隔定义的帧进行频率分量的频谱,从而生成 时间序列的频谱; 互相关系数计算单元,对于帧的每个目标帧,计算针对在目标帧附近的两个不同帧计算的频谱与彼此分开的预定帧宽度的互相关系数; 以及偏移量预测单元,其通过使用互相关系数来预测相对于预定帧宽度的对数频谱上的频谱的偏移量。
    • 57. 发明授权
    • Apparatus and method for speech recognition using probability and mixed distributions
    • 使用概率和混合分布进行语音识别的装置和方法
    • US07921012B2
    • 2011-04-05
    • US11857104
    • 2007-09-18
    • Hiroshi FujimuraTakashi Masuko
    • Hiroshi FujimuraTakashi Masuko
    • G10L15/28
    • G10L15/32
    • A speech recognition apparatus includes a first storing unit configured to store a first acoustic model invariable regardless of speaker and environment, a second storing unit configured to store a classification model that has shared parameters and non-shared parameters with the first acoustic model to classify second acoustic models, a recognizing unit configured to calculate a first likelihood with regard to the input speech by applying the first acoustic model to the input speech and obtain calculation result on the shared parameter and a plurality of candidate words that have relatively large values as the first likelihood, and a calculating unit configured to calculate a second likelihood for each of the groups with regard to the input speech by use of the calculation result on the shared parameters and the non-shared parameters of the classification model.
    • 语音识别装置包括:第一存储单元,被配置为存储不管扬声器和环境如何不变的第一声学模型;第二存储单元,被配置为存储具有共享参数和非共享参数的分类模型与第一声学模型,以对第二声学模型进行分类 声学模型,识别单元,被配置为通过将第一声学模型应用于输入语音来计算关于输入语音的第一可能性,并且获得关于共享参数的计算结果和具有相对较大值的多个候选词作为第一 以及计算单元,被配置为通过使用关于共享参数的计算结果和分类模型的非共享参数来计算关于输入语音的每个组的第二似然。
    • 60. 发明申请
    • Apparatus and Method for Speech Recognition
    • 语音识别装置与方法
    • US20080201136A1
    • 2008-08-21
    • US11857104
    • 2007-09-18
    • Hiroshi FujimuraTakashi Masuko
    • Hiroshi FujimuraTakashi Masuko
    • G10L19/00G10L15/00
    • G10L15/32
    • A speech recognition apparatus includes a first storing unit configured to store a first acoustic model invariable regardless of speaker and environment, a second storing unit configured to store a classification model that has shared parameters and non-shared parameters with the first acoustic model to classify second acoustic models, a recognizing unit configured to calculate a first likelihood with regard to the input speech by applying the first acoustic model to the input speech and obtain calculation result on the shared parameter and a plurality of candidate words that have relatively large values as the first likelihood, and a calculating unit configured to calculate a second likelihood for each of the groups with regard to the input speech by use of the calculation result on the shared parameters and the non-shared parameters of the classification model.
    • 语音识别装置包括:第一存储单元,被配置为存储不管扬声器和环境如何不变的第一声学模型;第二存储单元,被配置为存储具有共享参数和非共享参数的分类模型与第一声学模型,以对第二声学模型进行分类 声学模型,识别单元,被配置为通过将第一声学模型应用于输入语音来计算关于输入语音的第一可能性,并且获得关于共享参数的计算结果和具有相对较大值的多个候选词作为第一 以及计算单元,被配置为通过使用关于共享参数的计算结果和分类模型的非共享参数来计算关于输入语音的每个组的第二似然。