会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 11. 发明授权
    • Speech signal enhancement using visual information
    • 使用视觉信息的语音信号增强
    • US09293151B2
    • 2016-03-22
    • US14352016
    • 2011-10-17
    • Tobias HerbigTobias WolffMarkus Buck
    • Tobias HerbigTobias WolffMarkus Buck
    • G10L25/27G06K9/00G06T7/00H04M3/56H04R3/00G10L15/20G10L17/00H04N7/15G10L25/78G10L21/0208
    • G10L25/27G06K9/00624G06T7/73G06T2207/30196G10L15/20G10L17/00G10L25/78G10L2021/02082H04M3/568H04N7/15
    • Visual information is used to alter or set an operating parameter of an audio signal processor, other than a beamformer. A digital camera captures visual information about a scene that includes a human speaker and/or a listener. The visual information is analyzed to ascertain information about acoustics of a room. A distance between the speaker and a microphone may be estimated, and this distance estimate may be used to adjust an overall gain of the system. Distances among, and locations of, the speaker, the listener, the microphone, a loudspeaker and/or a sound-reflecting surface may be estimated. These estimates may be used to estimate reverberations within the room and adjust aggressiveness of an anti-reverberation filter, based on an estimated ratio of direct to indirect (reverberated) sound energy expected to reach the microphone. In addition, orientation of the speaker or the listener, relative to the microphone or the loudspeaker, can also be estimated, and this estimate may be used to adjust frequency-dependent filter weights to compensate for uneven frequency propagation of acoustic signals from a mouth, or to a human ear, about a human head.
    • 视觉信息用于改变或设置除波束形成器之外的音频信号处理器的操作参数。 数码相机拍摄有关包含人类扬声器和/或听众的场景的视觉信息。 分析视觉信息以确定关于房间声学的信息。 可以估计扬声器和麦克风之间的距离,并且该距离估计可以用于调整系统的整体增益。 可以估计扬声器,收听者,麦克风,扬声器和/或声音反射表面之间的距离和位置。 这些估计可以用于估计房间内的混响,并基于估计达到麦克风的直接到间接(混响)声能的估计比例来调节反混响滤波器的积极性。 此外,还可以估计扬声器或收听者相对于麦克风或扬声器的取向,并且该估计可用于调整频率依赖的滤波器权重以补偿来自口的声信号的不均匀频率传播, 或人的耳朵,关于人的头部。
    • 16. 发明授权
    • Beamforming pre-processing for speaker localization
    • 演讲者本地化的波束成形预处理
    • US08660274B2
    • 2014-02-25
    • US12504333
    • 2009-07-16
    • Tobias WolffMarkus BuckGerhard Schmidt
    • Tobias WolffMarkus BuckGerhard Schmidt
    • H04R3/00
    • H04R3/005H04R2430/20H04R2430/23
    • Embodiments of the present invention relate to methods, systems, and computer program products for signal processing. A first plurality of microphone signals is obtained by a first microphone array. A second plurality of microphone signals is obtained by a second microphone array different from the first microphone array. The first plurality of microphone signals is beamformed by a first beamformer comprising beamforming weights to obtain a first beamformed signal. The second plurality of microphone signals is beamformed by a second beamformer comprising the same beamforming weights as the first beamformer to obtain a second beamformed signal. The beamforming weights are adjusted such that the power density of echo components and/or noise components present in the first and second plurality of microphone signals is substantially reduced.
    • 本发明的实施例涉及用于信号处理的方法,系统和计算机程序产品。 第一麦克风信号由第一麦克风阵列获得。 通过与第一麦克风阵列不同的第二麦克风阵列获得第二多个麦克风信号。 第一组多个麦克风信号由包括波束成形权重的第一波束形成器波束形成,以获得第一波束形成信号。 第二组麦克风信号由包括与第一波束形成器相同的波束形成权重的第二波束形成器波束形成,以获得第二波束形成信号。 调整波束成形权重使得第一和第二多个麦克风信号中存在的回波分量和/或噪声分量的功率密度显着降低。