Quick Patent Search: fast search of global patents, free patent database for commercial use - IPRDB

1. Patent application

US20070106511A1 Speaker identifying apparatus and computer program product (Status: Active)
Publication No.: US20070106511A1
Publication date: 2007-05-10
Application No.: US11527607
Filing date: 2006-09-27
Applicants: Parham Mokhtari, Tatsuya Kitamura, Hironori Takemoto, Seiji Adachi, Kiyoshi Honda
Inventors: Parham Mokhtari, Tatsuya Kitamura, Hironori Takemoto, Seiji Adachi, Kiyoshi Honda
IPC classification: G10L15/00
CPC classification: G10L17/02
Abstract: A speaker identifying apparatus includes: a module for performing a principal component analysis on predetermined vocal tract geometrical parameters of a plurality of speakers and calculating an average and principal component vectors representing speaker-dependent variation; a module for performing acoustic analysis on the speech data being uttered for each of the speakers to calculate cepstrum coefficients; a module for calculating principal component coefficients for approximating the vocal tract geometrical parameter of each of the plurality of speakers by a linear sum of principal component coefficients; a module for determining, by multiple regression analysis, a coefficient sequence for estimating principal component coefficients by a linear sum of the plurality of prescribed features, for each of the plurality of speakers; a module for calculating a plurality of features from speech data of the speaker to be identified, and estimating principal component coefficients for calculating the vocal tract geometrical parameter of the speaker to be identified, by a linear sum obtained by applying the coefficient sequence calculated by the regression analyzing module; and a module for identifying said speaker to be identified, by comparing the estimated principal component coefficients with the principal component coefficients calculated for each of the plurality of speakers by the principal component coefficient calculating module.
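
Note: the pipeline described in this abstract (principal component analysis of vocal tract geometry, multiple regression from acoustic features to principal component coefficients, then identification by comparing coefficients) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patented implementation: the data are synthetic, the function names (fit_pca, fit_regression, identify) are hypothetical, a single regression is pooled over all speakers, and "comparison" is taken to mean nearest Euclidean distance.

```python
import numpy as np

def fit_pca(shapes, n_components):
    """PCA on vocal tract geometry vectors (one row per speaker)."""
    mean = shapes.mean(axis=0)
    centered = shapes - mean
    # Rows of vt are the principal component vectors.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    components = vt[:n_components]
    coeffs = centered @ components.T  # per-speaker PC coefficients
    return mean, components, coeffs

def fit_regression(features, coeffs):
    """Multiple linear regression from acoustic features to PC coefficients."""
    # Append a constant column so the regression has an intercept.
    design = np.hstack([features, np.ones((features.shape[0], 1))])
    weights, *_ = np.linalg.lstsq(design, coeffs, rcond=None)
    return weights

def identify(feature_vec, weights, enrolled_coeffs):
    """Estimate PC coefficients from features, return the nearest enrolled speaker."""
    estimated = np.append(feature_vec, 1.0) @ weights
    distances = np.linalg.norm(enrolled_coeffs - estimated, axis=1)
    return int(np.argmin(distances)), estimated

# Synthetic stand-ins (assumptions): 6 speakers, 10 geometry parameters each,
# and 4 "cepstrum-like" features that depend linearly on the PC coefficients.
rng = np.random.default_rng(0)
shapes = rng.normal(size=(6, 10))
mean, components, coeffs = fit_pca(shapes, n_components=2)
features = coeffs @ rng.normal(size=(2, 4))

weights = fit_regression(features, coeffs)
speaker, estimated = identify(features[3], weights, coeffs)
print(speaker)  # index of the enrolled speaker closest to the estimate
```

With real data, the features would come from cepstral analysis of recorded speech, and the regression fit would be approximate rather than exact, so the distance comparison would need a suitable decision rule.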

2. Patent application

US20050246168A1 Syllabic kernel extraction apparatus and program product thereof (Status: Lapsed)
Publication No.: US20050246168A1
Publication date: 2005-11-03
Application No.: US10514413
Filing date: 2003-02-21
Applicants: Nick Campbell, Parham Mokhtari
Inventors: Nick Campbell, Parham Mokhtari
IPC classification: G10L11/06, G10L11/00, G10L13/06, G10L15/02, G10L15/10, G10L21/06
CPC classification: G10L25/00, G10L21/06
Abstract: An apparatus enabling automatic determination of a portion that reliably represents a feature of a speech waveform includes: an acoustic/prosodic analysis unit (92) calculating, from data, distribution of an energy of a prescribed frequency range of the speech waveform on a time axis, and for extracting, among various syllables of the speech waveform, a range that is generated stably, based on the distribution and the pitch of the speech waveform; a cepstral analysis unit (94) estimating, based on the spectral distribution of the speech waveform on the time axis, a range of the speech waveform of which change is well controlled by a speaker; and a pseudo-syllabic center extracting unit (96) extracting, as a portion of high reliability of the speech waveform, that range which has been estimated to be the stably generated range and of which change is estimated to be well controlled by the speaker.
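
Note: the idea in this abstract, keeping only those frames that are both stably generated (sufficient band energy, voiced) and well controlled (little spectral change), can be sketched as an intersection of per-frame conditions. This is a minimal illustration under assumptions, not the patented method: the function names, the thresholds energy_ratio and max_delta, and the use of a delta-cepstrum magnitude as the measure of spectral change are all hypothetical, and the input contours are made-up numbers.

```python
def stable_ranges(flags):
    """Turn a per-frame boolean list into (start, end) ranges, end exclusive."""
    ranges, start = [], None
    for i, flag in enumerate(flags):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            ranges.append((start, i))
            start = None
    if start is not None:
        ranges.append((start, len(flags)))
    return ranges

def pseudo_syllabic_centers(energy, voiced, delta_cepstrum,
                            energy_ratio=0.5, max_delta=0.2):
    """Frames count as reliable when band energy is high, pitch is present
    (voiced), and the spectral change (delta cepstrum) is small."""
    peak = max(energy)
    reliable = [
        e >= energy_ratio * peak and v and d <= max_delta
        for e, v, d in zip(energy, voiced, delta_cepstrum)
    ]
    return stable_ranges(reliable)

# Made-up per-frame contours for a short utterance (assumption).
energy = [0.1, 0.6, 0.9, 1.0, 0.8, 0.2, 0.7, 0.9, 0.3]
voiced = [False, True, True, True, True, False, True, True, True]
delta = [0.5, 0.4, 0.1, 0.05, 0.1, 0.6, 0.3, 0.1, 0.1]

print(pseudo_syllabic_centers(energy, voiced, delta))  # [(2, 5), (7, 8)]
```

Each returned pair is a frame range that passes all three conditions; in practice the contours would be computed from the waveform by band-limited energy, pitch tracking, and cepstral analysis.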

3. Granted patent

US07627468B2 Apparatus and method for extracting syllabic nuclei (Status: Lapsed)
Publication No.: US07627468B2
Publication date: 2009-12-01
Application No.: US10514413
Filing date: 2003-02-21
Applicants: Nick Campbell, Parham Mokhtari
Inventors: Nick Campbell, Parham Mokhtari
IPC classification: G10L19/06, G10L11/00, G10L19/14, G10L11/04, G10L11/06, G10L15/00, G10L15/20
CPC classification: G10L25/00, G10L21/06
Abstract: An apparatus enabling automatic determination of a portion that reliably represents a feature of a speech waveform includes: an acoustic/prosodic analysis unit calculating, from data, distribution of an energy of a prescribed frequency range of the speech waveform on a time axis, and for extracting, among various syllables of the speech waveform, a range that is generated stably, based on the distribution and the pitch of the speech waveform; a cepstral analysis unit estimating, based on the spectral distribution of the speech waveform on the time axis, a range of the speech waveform of which change is well controlled by a speaker; and a pseudo-syllabic center extracting unit extracting, as a portion of high reliability of the speech waveform, that range which has been estimated to be the stably generated range and of which change is estimated to be well controlled by the speaker.

4. Granted patent

US07617102B2 Speaker identifying apparatus and computer program product (Status: Active)
Publication No.: US07617102B2
Publication date: 2009-11-10
Application No.: US11527607
Filing date: 2006-09-27
Applicants: Parham Mokhtari, Tatsuya Kitamura, Hironori Takemoto, Seiji Adachi, Kiyoshi Honda
Inventors: Parham Mokhtari, Tatsuya Kitamura, Hironori Takemoto, Seiji Adachi, Kiyoshi Honda
IPC classification: G10L17/00
CPC classification: G10L17/02
Abstract: A speaker identifying apparatus includes: a module for performing a principal component analysis on predetermined vocal tract geometrical parameters of a plurality of speakers and calculating an average and principal component vectors representing speaker-dependent variation; a module for performing acoustic analysis on the speech data being uttered for each of the speakers to calculate cepstrum coefficients; a module for calculating principal component coefficients for approximating the vocal tract geometrical parameter of each of the plurality of speakers by a linear sum of principal component coefficients; a module for determining, by multiple regression analysis, a coefficient sequence for estimating principal component coefficients by a linear sum of the plurality of prescribed features, for each of the plurality of speakers; a module for calculating a plurality of features from speech data of the speaker to be identified, and estimating principal component coefficients for calculating the vocal tract geometrical parameter of the speaker to be identified, by a linear sum obtained by applying the coefficient sequence calculated by the regression analyzing module; and a module for identifying said speaker to be identified, by comparing the estimated principal component coefficients with the principal component coefficients calculated for each of the plurality of speakers by the principal component coefficient calculating module.
