会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 5. 发明申请
    • FEATURE NORMALIZATION FOR SPEECH AND AUDIO PROCESSING
    • 特征用于语音和音频处理的标准化
    • US20100094622A1
    • 2010-04-15
    • US12564457
    • 2009-09-22
    • Peter S. CardilloMark A. Clements
    • Peter S. CardilloMark A. Clements
    • G10L19/14
    • G10L15/02
    • Systems, method, and apparatus for processing a speech utterance or audio record that includes receiving one or more feature vectors characterizing the speech utterance or audio record, each feature vector having a plurality of feature elements, each feature element being associated with a spectral representation of a characteristic of one of a plurality of sequential segments of the speech utterance or audio record; and processing the one or more feature vectors in a rank order filter to obtain one or more normalized feature vectors, each normalized feature vector having a plurality of normalized feature elements corresponding to the plurality of feature elements.
    • 用于处理语音发音或音频记录的系统,方法和装置,包括接收表征语音发音或音频记录的一个或多个特征向量,每个特征向量具有多个特征元素,每个特征元素与 语音发音或音频记录的多个连续片段之一的特征; 以及处理秩阶滤波器中的所述一个或多个特征向量以获得一个或多个归一化特征向量,每个归一化特征向量具有对应于所述多个特征元素的多个归一化特征元素。
    • 7. 发明授权
    • Apparatus and method for modifying a speech waveform to compensate for
recruitment of loudness
    • 用于修改语音波形以补偿响度招募的装置和方法
    • US5274711A
    • 1993-12-28
    • US436428
    • 1989-11-14
    • Janet C. RutledgeMark A. Clements
    • Janet C. RutledgeMark A. Clements
    • G10L21/00G10L21/02G10L5/00
    • G10L21/0364G10L2021/065G10L21/0264
    • An apparatus and method for modifying a speech waveform using sinusoidal speech model parameters, includes finding a net masked threshold for each sinusoid for a normal-hearing subject, and adding the effects of impairment and obtaining an impaired masked threshold. The method also includes finding gain needed for each sinusoid so that its distance above the impaired masked threshold is equal to the distance above normal masked threshold, and multiplying sinusoid amplitudes by the gain. The sinusoidal model is used to address the problem of spread of masking within internal speech components by determining the amount of masking that occurs between surrounding sinusoids. The masked threshold for each sinusoid is determined based on the additive effects of masking by other sinusoids in each frame. The method compensates for recruitment by a transformation to determine how much each sinusoidal amplitude must be amplified in order to maintain the loudness relationships between sinusoids and their masked threshold in the normal-hearing and hearing-impaired domains.
    • 一种用于使用正弦语音模型参数来修改语音波形的装置和方法,包括为正常听力对象找到每个正弦曲线的净屏蔽阈值,并添加损伤的影响并获得受损的屏蔽阈值。 该方法还包括找到每个正弦曲线所需的增益,使得其在受损屏蔽阈值之上的距离等于高于正常屏蔽阈值的距离,并将正弦波幅度乘以增益。 正弦模型用于通过确定周围正弦波之间发生的掩蔽量来解决内部语音分量内的掩蔽扩散的问题。 每个正弦曲线的掩蔽阈值是基于每帧中其他正弦波掩蔽的加法效应来确定的。 该方法通过变换来补偿招募,以确定每个正弦波幅度必须被放大多少,以便在正常听力和听力受损区域中保持正弦曲线与其屏蔽阈值之间的响度关系。