    • 14. Invention application
    • Speech Signal Enhancement Using Visual Information
    • US20140337016A1
    • 2014-11-13
    • US14352016
    • 2011-10-17
    • NUANCE COMMUNICATIONS, INC.
    • Tobias Herbig; Tobias Wolff; Markus Buck
    • G10L25/27; G06K9/00
    • G10L25/27; G06K9/00624; G06T7/73; G06T2207/30196; G10L15/20; G10L17/00; G10L25/78; G10L2021/02082; H04M3/568; H04N7/15
    • Visual information is used to alter or set an operating parameter of an audio signal processor, other than a beamformer. A digital camera captures visual information about a scene that includes a human speaker and/or a listener. The visual information is analyzed to ascertain information about acoustics of a room. A distance between the speaker and a microphone may be estimated, and this distance estimate may be used to adjust an overall gain of the system. Distances among, and locations of, the speaker, the listener, the microphone, a loudspeaker and/or a sound-reflecting surface may be estimated. These estimates may be used to estimate reverberations within the room and adjust aggressiveness of an anti-reverberation filter, based on an estimated ratio of direct to indirect (reverberated) sound energy expected to reach the microphone. In addition, orientation of the speaker or the listener, relative to the microphone or the loudspeaker, can also be estimated, and this estimate may be used to adjust frequency-dependent filter weights to compensate for uneven frequency propagation of acoustic signals from a mouth, or to a human ear, about a human head.
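
The abstract mentions two adjustments driven by camera-derived distance estimates: an overall gain change, and a more or less aggressive anti-reverberation filter based on the expected ratio of direct to reverberated energy. The record gives no formulas, so the following Python sketch is only an illustration built on textbook room-acoustics assumptions: free-field 1/r level decay, a diffuse reverberant field, and the common critical-distance approximation 0.057·sqrt(V/RT60). All function names, default thresholds, and the linear mapping to an aggressiveness value are hypothetical and are not taken from the patent.

```python
import math


def gain_db_for_distance(distance_m, reference_m=1.0, max_gain_db=20.0):
    """Gain in dB that offsets free-field 1/r level loss relative to reference_m.

    Assumption: sound pressure falls by about 6 dB per doubling of distance.
    The result is clamped to +/- max_gain_db (a hypothetical safety limit).
    """
    if distance_m <= 0 or reference_m <= 0:
        raise ValueError("distances must be positive")
    gain = 20.0 * math.log10(distance_m / reference_m)
    return max(-max_gain_db, min(max_gain_db, gain))


def critical_distance_m(room_volume_m3, rt60_s):
    """Approximate critical distance for an omnidirectional source.

    Assumption: diffuse field, d_c ~= 0.057 * sqrt(V / RT60).
    """
    if room_volume_m3 <= 0 or rt60_s <= 0:
        raise ValueError("volume and RT60 must be positive")
    return 0.057 * math.sqrt(room_volume_m3 / rt60_s)


def direct_to_reverberant_db(distance_m, critical_m):
    """Expected direct-to-reverberant ratio in dB at distance_m.

    Direct energy decays as 1/r^2 while the diffuse reverberant energy is
    taken as constant, so the ratio is 0 dB at the critical distance.
    """
    if distance_m <= 0 or critical_m <= 0:
        raise ValueError("distances must be positive")
    return 20.0 * math.log10(critical_m / distance_m)


def dereverb_aggressiveness(drr_db, low_db=-10.0, high_db=10.0):
    """Map a DRR estimate to [0, 1]; 1 means most aggressive filtering.

    Hypothetical linear mapping: a DRR at or below low_db gives 1.0,
    a DRR at or above high_db gives 0.0.
    """
    t = (high_db - drr_db) / (high_db - low_db)
    return max(0.0, min(1.0, t))


if __name__ == "__main__":
    # Example: speaker estimated at 3 m from the microphone in a
    # 60 m^3 room with an assumed reverberation time of 0.6 s.
    d = 3.0
    d_c = critical_distance_m(60.0, 0.6)
    drr = direct_to_reverberant_db(d, d_c)
    print(f"gain: {gain_db_for_distance(d):.1f} dB")
    print(f"critical distance: {d_c:.2f} m, DRR: {drr:.1f} dB")
    print(f"dereverberation aggressiveness: {dereverb_aggressiveness(drr):.2f}")
```

In this sketch the camera-based distance estimate is the only runtime input; the room volume and reverberation time would themselves have to be estimated or configured, which the abstract leaves open.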