会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • Speaker identifying apparatus and computer program product
    • 扬声器识别装置和计算机程序产品
    • US20070106511A1
    • 2007-05-10
    • US11527607
    • 2006-09-27
    • Parham MokhtariTatsuya KitamuraHironori TakemotoSeiji AdachiKiyoshi Honda
    • Parham MokhtariTatsuya KitamuraHironori TakemotoSeiji AdachiKiyoshi Honda
    • G10L15/00
    • G10L17/02
    • A speaker identifying apparatus includes: a module for performing a principal component analysis on predetermined vocal tract geometrical parameters of a plurality of speakers and calculating an average and principal component vectors representing speaker-dependent variation; a module for performing acoustic analysis on the speech data being uttered for each of the speakers to calculate cepstrum coefficients; a module for calculating principal component coefficients for approximating the vocal tract geometrical parameter of each of the plurality of speakers by a linear sum of principal component coefficients; a module for determining, by multiple regression analysis, a coefficient sequence for estimating principal component coefficients by a linear sum of the plurality of prescribed features, for each of the plurality of speakers; a module for calculating a plurality of features from speech data of the speaker to be identified, and estimating principal component coefficients for calculating the vocal tract geometrical parameter of the speaker to be identified, by a linear sum obtained by applying the coefficient sequence calculated by the regression analyzing module; and a module for identifying said speaker to be identified, by comparing the estimated principal component coefficients with the principal component coefficients calculated for each of the plurality of speakers by the principal component coefficient calculating module.
    • 扬声器识别装置包括:用于对多个扬声器的预定声道几何参数执行主分量分析并计算表示说话者相关变化的平均和主分量矢量的模块; 用于对每个扬声器发出的语音数据执行声学分析以计算倒谱系数的模块; 用于通过主分量系数的线性和来计算用于近似多个扬声器中的每一个的声道几何参数的主分量系数的模块; 用于通过多元回归分析确定用于对所述多个说话者中的每一个的所述多个规定特征的线性和估计主成分系数的系数序列的模块; 用于根据要识别的说话者的语音数据计算多个特征的模块,以及通过应用由所述识别的所述系统序列计算的系数序列来获得的线性和来估计用于计算要识别的说话者的声道几何参数的主成分系数 回归分析模块; 以及用于通过将所估计的主分量系数与由主成分系数计算模块为多个扬声器中的每一个计算的主分量系数进行比较来识别要识别的所述扬声器的模块。
    • 2. 发明申请
    • Syllabic kernel extraction apparatus and program product thereof
    • 音节提取仪器及其程序产品
    • US20050246168A1
    • 2005-11-03
    • US10514413
    • 2003-02-21
    • Nick CampbellParham Mokhtari
    • Nick CampbellParham Mokhtari
    • G10L11/06G10L11/00G10L13/06G10L15/02G10L15/10G10L21/06
    • G10L25/00G10L21/06
    • An apparatus enabling automatic determination of a portion that reliably represents a feature of a speech waveform includes: an acoustic/prosodic analysis unit (92) calculating, from data, distribution of an energy of a prescribed frequency range of the speech waveform on a time axis, and for extracting, among various syllables of the speech waveform, a range that is generated stably, based on the distribution and the pitch of the speech waveform; cepstral analysis unit (94) estimating, based on the spectral distribution of the speech waveform on the time axis, a range of the speech waveform of which change is well controlled by a speaker; and a pseudo-syllabic center extracting unit (96) extracting, as a portion of high reliability of the speech waveform, that range which has been estimated to be the stably generated range and of which change is estimated to be well controlled by the speaker.
    • 一种能够自动确定可靠地表示语音波形特征的部分的装置,包括:声/韵律分析部(92),从时间轴上计算语音波形的规定频率范围的能量的分布 并且基于语音波形的分布和音调,在语音波形的各个音节中提取稳定生成的范围; 倒谱分析单元(94)基于时间轴上的语音波形的频谱分布来估计由扬声器很好地控制变化的语音波形的范围; 以及伪音节中心提取单元(96)作为语音波形的高可靠性的一部分提取已经被估计为稳定产生的范围并且其改变被该扬声器良好地控制的范围。
    • 3. 发明授权
    • Apparatus and method for extracting syllabic nuclei
    • 提取音节核的装置和方法
    • US07627468B2
    • 2009-12-01
    • US10514413
    • 2003-02-21
    • Nick CampbellParham Mokhtari
    • Nick CampbellParham Mokhtari
    • G10L19/06G10L11/00G10L19/14G10L11/04G10L11/06G10L15/00G10L15/20
    • G10L25/00G10L21/06
    • An apparatus enabling automatic determination of a portion that reliably represents a feature of a speech waveform includes: an acoustic/prosodic analysis unit calculating, from data, distribution of an energy of a prescribed frequency range of the speech waveform on a time axis, and for extracting, among various syllables of the speech waveform, a range that is generated stably, based on the distribution and the pitch of the speech waveform; cepstral analysis unit estimating, based on the spectral distribution of the speech waveform on the time axis, a range of the speech waveform of which change is well controlled by a speaker; and a pseudo-syllabic center extracting unit extracting, as a portion of high reliability of the speech waveform, that range which has been estimated to be the stably generated range and of which change is estimated to be well controlled by the speaker.
    • 能够自动确定可靠地表示语音波形特征的部分的装置包括:声/韵律分析单元,从时间轴上计算语音波形的规定频率范围的能量的分布, 在语音波形的各个音节中,基于语音波形的分布和音调提取稳定地生成的范围; 倒谱分析单元基于时间轴上的语音波形的频谱分布来估计由扬声器很好地控制变化的语音波形的范围; 以及伪音节中心提取单元,作为语音波形的高可靠性的一部分,提取已经被估计为稳定产生的范围并且其改变被该扬声器良好地控制的范围。
    • 4. 发明授权
    • Speaker identifying apparatus and computer program product
    • 扬声器识别装置和计算机程序产品
    • US07617102B2
    • 2009-11-10
    • US11527607
    • 2006-09-27
    • Parham MokhtariTatsuya KitamuraHironori TakemotoSeiji AdachiKiyoshi Honda
    • Parham MokhtariTatsuya KitamuraHironori TakemotoSeiji AdachiKiyoshi Honda
    • G10L17/00
    • G10L17/02
    • A speaker identifying apparatus includes: a module for performing a principal component analysis on predetermined vocal tract geometrical parameters of a plurality of speakers and calculating an average and principal component vectors representing speaker-dependent variation; a module for performing acoustic analysis on the speech data being uttered for each of the speakers to calculate cepstrum coefficients; a module for calculating principal component coefficients for approximating the vocal tract geometrical parameter of each of the plurality of speakers by a linear sum of principal component coefficients; a module for determining, by multiple regression analysis, a coefficient sequence for estimating principal component coefficients by a linear sum of the plurality of prescribed features, for each of the plurality of speakers; a module for calculating a plurality of features from speech data of the speaker to be identified, and estimating principal component coefficients for calculating the vocal tract geometrical parameter of the speaker to be identified, by a linear sum obtained by applying the coefficient sequence calculated by the regression analyzing module; and a module for identifying said speaker to be identified, by comparing the estimated principal component coefficients with the principal component coefficients calculated for each of the plurality of speakers by the principal component coefficient calculating module.
    • 扬声器识别装置包括:用于对多个扬声器的预定声道几何参数执行主分量分析并计算表示说话者相关变化的平均和主分量矢量的模块; 用于对每个扬声器发出的语音数据执行声学分析以计算倒谱系数的模块; 用于通过主分量系数的线性和来计算用于近似多个扬声器中的每一个的声道几何参数的主分量系数的模块; 用于通过多元回归分析确定用于对所述多个说话者中的每一个的所述多个规定特征的线性和估计主成分系数的系数序列的模块; 用于根据要识别的说话者的语音数据计算多个特征的模块,以及通过应用由所述识别的所述系统序列计算的系数序列来获得的线性和来估计用于计算要识别的说话者的声道几何参数的主分量系数 回归分析模块; 以及用于通过将所估计的主分量系数与由主成分系数计算模块为多个扬声器中的每一个计算的主分量系数进行比较来识别要识别的所述扬声器的模块。