    • 4. Granted invention patent
    • Speech signal enhancement using visual information
    • US09293151B2
    • 2016-03-22
    • US14352016
    • 2011-10-17
    • Tobias Herbig; Tobias Wolff; Markus Buck
    • Tobias Herbig; Tobias Wolff; Markus Buck
    • G10L25/27; G06K9/00; G06T7/00; H04M3/56; H04R3/00; G10L15/20; G10L17/00; H04N7/15; G10L25/78; G10L21/0208
    • G10L25/27; G06K9/00624; G06T7/73; G06T2207/30196; G10L15/20; G10L17/00; G10L25/78; G10L2021/02082; H04M3/568; H04N7/15
    • Visual information is used to alter or set an operating parameter of an audio signal processor, other than a beamformer. A digital camera captures visual information about a scene that includes a human speaker and/or a listener. The visual information is analyzed to ascertain information about acoustics of a room. A distance between the speaker and a microphone may be estimated, and this distance estimate may be used to adjust an overall gain of the system. Distances among, and locations of, the speaker, the listener, the microphone, a loudspeaker and/or a sound-reflecting surface may be estimated. These estimates may be used to estimate reverberations within the room and adjust aggressiveness of an anti-reverberation filter, based on an estimated ratio of direct to indirect (reverberated) sound energy expected to reach the microphone. In addition, orientation of the speaker or the listener, relative to the microphone or the loudspeaker, can also be estimated, and this estimate may be used to adjust frequency-dependent filter weights to compensate for uneven frequency propagation of acoustic signals from a mouth, or to a human ear, about a human head.
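As a rough illustration of the distance-based adjustments this abstract describes, the sketch below maps an estimated speaker-to-microphone distance to a gain offset and to a direct-to-reverberant energy ratio. It is not the patented method: the free-field 1/r level loss, the diffuse-field reverberation model, and the names `gain_db_for_distance`, `direct_to_reverberant_ratio`, `reference_m`, and `critical_distance_m` are all assumptions made for this example.

```python
import math


def gain_db_for_distance(distance_m, reference_m=1.0):
    """Gain in dB that offsets free-field 1/r level loss.

    Assumption: sound pressure falls off as 1/distance, so a talker at
    twice the reference distance needs roughly +6 dB of gain.
    """
    if distance_m <= 0 or reference_m <= 0:
        raise ValueError("distances must be positive")
    return 20.0 * math.log10(distance_m / reference_m)


def direct_to_reverberant_ratio(distance_m, critical_distance_m):
    """Direct/reverberant energy ratio under a diffuse-field assumption.

    Direct energy scales with 1/d^2 while reverberant energy is treated
    as constant, and the two are equal at the critical distance.
    """
    if distance_m <= 0 or critical_distance_m <= 0:
        raise ValueError("distances must be positive")
    return (critical_distance_m / distance_m) ** 2
```

A lower ratio would suggest a more aggressive anti-reverberation filter setting. How the ratio maps to actual filter parameters is not specified in the abstract and is left open here.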
    • 8. Invention patent application
    • Method for Adapting a Codebook for Speech Recognition
    • US20100138222A1
    • 2010-06-03
    • US12622717
    • 2009-11-20
    • Tobias Herbig; Franz Gerl
    • Tobias Herbig; Franz Gerl
    • G10L15/06
    • G10L15/065
    • A method for adapting a codebook for speech recognition, wherein the codebook is from a set of codebooks comprising a speaker-independent codebook and at least one speaker-dependent codebook is disclosed. A speech input is received and a feature vector based on the received speech input is determined. For each of the Gaussian densities, a first mean vector is estimated using an expectation process and taking into account the determined feature vector. For each of the Gaussian densities, a second mean vector using an Eigenvoice adaptation is determined taking into account the determined feature vector. For each of the Gaussian densities, the mean vector is set to a convex combination of the first and the second mean vector. Thus, this process allows for adaptation during operation and does not require a lengthy training phase.
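The final step in this abstract, setting each Gaussian's mean to a convex combination of the two estimates, can be sketched as follows. The abstract does not say how the weight is chosen, so `alpha` here is a hypothetical parameter, and the function name and list-based vectors are likewise assumptions for illustration.

```python
def combine_means(mean_expectation, mean_eigenvoice, alpha):
    """Convex combination of two mean-vector estimates for one Gaussian.

    alpha is a hypothetical weight in [0, 1]: alpha = 1 keeps only the
    expectation-based estimate, alpha = 0 keeps only the Eigenvoice one.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1] for a convex combination")
    if len(mean_expectation) != len(mean_eigenvoice):
        raise ValueError("mean vectors must have the same dimension")
    return [
        alpha * a + (1.0 - alpha) * b
        for a, b in zip(mean_expectation, mean_eigenvoice)
    ]
```

For example, `combine_means([0.0, 2.0], [4.0, 6.0], 0.25)` weights the Eigenvoice estimate three times as heavily as the expectation-based one.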
    • 9. Granted invention patent
    • Method for determining the presence of a wanted signal component
    • US09530432B2
    • 2016-12-27
    • US12507444
    • 2009-07-22
    • Tobias Herbig; Franz Gerl
    • Tobias Herbig; Franz Gerl
    • G10L25/78; G10L15/22
    • G10L25/78; G10L15/222; G10L2015/223
    • This invention provides a method for determining, in a speech dialog system issuing speech prompts, a score value as an indicator for the presence of a wanted signal component in an input signal stemming from a microphone, comprising the steps of: using a first likelihood function to determine a first likelihood value for the presence of the wanted signal component in the input signal, using a second likelihood function to determine a second likelihood value for the presence of a noise signal component in the input signal, and determining a score value based on the first and the second likelihood values, wherein the first likelihood function is based on a predetermined reference wanted signal, and the second likelihood function is based on a predetermined reference noise signal.
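One common way to turn two likelihood values into a single score is a log-likelihood ratio. The sketch below takes that route under stated assumptions: the abstract only says the score is based on both likelihood values, so the ratio form, the one-dimensional Gaussian reference models, and the names `gaussian_log_likelihood` and `wanted_signal_score` are illustrative choices, not the claimed method.

```python
import math


def gaussian_log_likelihood(x, mean, variance):
    """Log-density of a 1-D Gaussian with the given mean and variance."""
    if variance <= 0:
        raise ValueError("variance must be positive")
    return -0.5 * (math.log(2.0 * math.pi * variance) + (x - mean) ** 2 / variance)


def wanted_signal_score(x, wanted_model, noise_model):
    """Log-likelihood ratio of a feature value x.

    wanted_model and noise_model are (mean, variance) pairs standing in
    for the predetermined reference wanted signal and reference noise
    signal. A positive score favors the wanted signal component.
    """
    ll_wanted = gaussian_log_likelihood(x, *wanted_model)
    ll_noise = gaussian_log_likelihood(x, *noise_model)
    return ll_wanted - ll_noise
```

With equal variances, a feature value that sits on the wanted-signal mean scores positive, and one that sits on the noise mean scores negative by the same amount.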
    • 10. Granted invention patent
    • Method for adapting a codebook for speech recognition
    • US08346551B2
    • 2013-01-01
    • US12622717
    • 2009-11-20
    • Tobias Herbig; Franz Gerl
    • Tobias Herbig; Franz Gerl
    • G10L15/06
    • G10L15/065
    • A method for adapting a codebook for speech recognition is disclosed, wherein the codebook is from a set of codebooks comprising a speaker-independent codebook and at least one speaker-dependent codebook. A speech input is received and a feature vector based on the received speech input is determined. For each of the Gaussian densities, a first mean vector is estimated using an expectation process and taking into account the determined feature vector. For each of the Gaussian densities, a second mean vector using an Eigenvoice adaptation is determined taking into account the determined feature vector. For each of the Gaussian densities, the mean vector is set to a convex combination of the first and the second mean vector. Thus, this process allows for adaptation during operation and does not require a lengthy training phase.