会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 6. 发明申请
    • Voice Activity Detector for Audio Signals
    • 语音信号检测器
    • US20150243300A1
    • 2015-08-27
    • US14701622
    • 2015-05-01
    • DOLBY LABORATORIES LICENSING CORPORATION
    • Hannes Muesch
    • G10L25/78G10L19/012
    • G10L25/78G10L19/012G10L19/018G10L21/02G10L21/0205G10L21/0364G10L25/93G10L2025/932G10L2025/937
    • According to one aspect, a method for detecting voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having an sample rate; dividing the frame into a plurality of subbands based on the sample rate, the plurality of subbands including at least a lowest subband and a highest subband; filtering the lowest subband with a moving average filter to reduce an energy of the lowest subband; estimating a noise level for each of the plurality of subbands; calculating a signal to noise ratio value for each of the plurality of subbands; and determining a speech activity level of the frame based on an average of the calculated signal to noise ratio values and a weighted average of an energy of each of the plurality of subbands. Other aspects include audio decoders that decode audio that was encoded using the methods described herein.
    • 根据一个方面,公开了一种用于检测语音活动的方法,所述方法包括接收输入音频信号的帧,所述输入音频信号具有采样率; 基于所述采样率将所述帧划分成多个子带,所述多个子带至少包括最低子带和最高子带; 用移动平均滤波器对最低子带进行滤波,以减少最低子带的能量; 估计所述多个子带中的每一个的噪声电平; 计算所述多个子带中的每一个的信噪比值; 以及基于所计算的信噪比值的平均值和所述多个子带中的每一个的能量的加权平均值来确定所述帧的语音活动水平。 其他方面包括解码使用本文描述的方法编码的音频的音频解码器。
    • 9. 发明授权
    • Voice activity detector for audio signals
    • 语音活动检测器,用于音频信号
    • US09418680B2
    • 2016-08-16
    • US14701622
    • 2015-05-01
    • Dolby Laboratories Licensing Corporation
    • Hannes Muesch
    • G10L25/78G10L21/02G10L21/0364G10L19/012
    • G10L25/78G10L19/012G10L19/018G10L21/02G10L21/0205G10L21/0364G10L25/93G10L2025/932G10L2025/937
    • According to one aspect, a method for detecting voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having an sample rate; dividing the frame into a plurality of subbands based on the sample rate, the plurality of subbands including at least a lowest subband and a highest subband; filtering the lowest subband with a moving average filter to reduce an energy of the lowest subband; estimating a noise level for each of the plurality of subbands; calculating a signal to noise ratio value for each of the plurality of subbands; and determining a speech activity level of the frame based on an average of the calculated signal to noise ratio values and a weighted average of an energy of each of the plurality of subbands. Other aspects include audio decoders that decode audio that was encoded using the methods described herein.
    • 根据一个方面,公开了一种用于检测语音活动的方法,所述方法包括接收输入音频信号的帧,所述输入音频信号具有采样率; 基于所述采样率将所述帧划分为多个子带,所述多个子带至少包括最低子带和最高子带; 用移动平均滤波器对最低子带进行滤波,以减少最低子带的能量; 估计所述多个子带中的每一个的噪声电平; 计算所述多个子带中的每一个的信噪比值; 以及基于所计算的信噪比值的平均值和所述多个子带中的每一个的能量的加权平均值来确定所述帧的语音活动水平。 其他方面包括解码使用本文描述的方法编码的音频的音频解码器。
    • 10. 发明申请
    • Method and System for Scaling Ducking of Speech-Relevant Channels in Multi-Channel Audio
    • 多通道音频语音相关通道缩小方法与系统
    • US20160071527A1
    • 2016-03-10
    • US14942706
    • 2015-11-16
    • Dolby Laboratories Licensing Corporation
    • Hannes Muesch
    • G10L21/0364G10L21/034
    • G10L21/0364G10L21/0208G10L21/0232G10L21/034H04S3/008H04S7/30H04S2400/09H04S2400/13
    • A method and system for filtering a multi-channel audio signal having a speech channel and at least one non-speech channel, to improve intelligibility of speech determined by the signal. In typical embodiments, the method includes steps of determining at least one attenuation control value indicative of a measure of similarity between speech-related content determined by the speech channel and speech-related content determined by the non-speech channel, and attenuating the non-speech channel in response to the at least one attenuation control value. Typically, the attenuating step includes scaling of a raw attenuation control signal (e.g., a ducking gain control signal) for the non-speech channel in response to the at least one attenuation control value. Some embodiments are a general or special purpose processor programmed with software or firmware and/or otherwise configured to perform filtering in accordance the invention.
    • 一种用于对具有语音信道和至少一个非语音信道的多声道音频信号进行滤波的方法和系统,以提高由信号确定的语音的可懂度。 在典型的实施例中,该方法包括以下步骤:确定指示由语音信道确定的语音相关内容与由非语音频道确定的语音相关内容之间的相似度的度量的至少一个衰减控制值, 响应于所述至少一个衰减控制值的语音信道。 通常,衰减步骤包括响应于至少一个衰减控制值缩放非语音信道的原始衰减控制信号(例如,下降增益控制信号)。 一些实施例是用软件或固件编程和/或以其他方式配置为根据本发明执行滤波的通用或专用处理器。