会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 21. 发明授权
    • Complexity scalable perceptual tempo estimation
    • 复杂性可扩展感知速度估计
    • US09466275B2
    • 2016-10-11
    • US13503136
    • 2010-10-26
    • Arijit BiswasDanilo HollosiMichael Schug
    • Arijit BiswasDanilo HollosiMichael Schug
    • G10H7/00G10H1/40
    • G10H1/40G10H2210/076G10H2230/015G10H2240/075
    • The present document relates to methods and systems for estimating the tempo of a media signal, such as audio or combined video/audio signal. In particular, the document relates to the estimation of tempo perceived by human listeners, as well as to methods and systems for tempo estimation at scalable computational complexity. A method and system for extracting tempo information of an audio signal from an encoded bit-stream of the audio signal comprising spectral band replication data is described. The method comprises the steps of determining a payload quantity associated with the amount of spectral band replication data comprised in the encoded bit-stream for a time interval of the audio signal; repeating the determining step for successive time intervals of the encoded bit-stream of the audio signal, thereby determining a sequence of payload quantities; identifying a periodicity in the sequence of payload quantities; and extracting tempo information of the audio signal from the identified periodicity.
    • 本文件涉及用于估计媒体信号(诸如音频或组合视频/音频信号)的速度的方法和系统。 特别地,该文件涉及人类听众感知的节奏的估计,以及用于以可缩放的计算复杂度进行速度估计的方法和系统。 描述用于从包括频谱带复制数据的音频信号的编码比特流中提取音频信号的速度信息的方法和系统。 该方法包括以下步骤:在音频信号的时间间隔中确定与包含在编码比特流中的频谱带复制数据量相关联的有效载荷数量; 重复音频信号的编码比特流的连续时间间隔的确定步骤,从而确定有效载荷量的序列; 识别有效载荷数量序列中的周期; 以及从所识别的周期中提取音频信号的速度信息。
    • 22. 发明授权
    • Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols
    • 音频编码方法和系统,用于通过实现不同解码协议的解码器生成统一的比特流解码
    • US09378743B2
    • 2016-06-28
    • US14009503
    • 2012-04-05
    • Jeffrey C. RiedmillerFarhad FarahaniMichael SchugRegunathan RadhakrishnanMark S. Vinton
    • Jeffrey C. RiedmillerFarhad FarahaniMichael SchugRegunathan RadhakrishnanMark S. Vinton
    • G10L19/008G10L19/002G10L19/16
    • G10L19/002G10L19/167
    • In a class of embodiments, an audio encoding system (typically, a perceptual encoding system that is configured to generate a single (“unified”) bitstream that is compatible with (i.e., decodable by) a first decoder configured to decode audio data encoded in accordance with a first encoding protocol (e.g., the multichannel Dolby Digital Plus, or DD+, protocol) and a second decoder configured to decode audio data encoded in accordance with a second encoding protocol (e.g., the stereo AAC, HE AAC v1, or HE AAC v2 protocol). The unified bitstream can include both encoded data (e.g., bursts of data) decodable by the first decoder (and ignored by the second decoder) and encoded data (e.g., other bursts of data) decodable by the second decoder (and ignored by the first decoder). In effect, the second encoding format is hidden within the unified bitstream when the bitstream is decoded by the first decoder, and the first encoding format is hidden within the unified bitstream when the bitstream is decoded by the second decoder. The format of the unified bitstream generated in accordance with the invention may eliminate the need for transcoding elements throughout an entire media chain and/or ecosystem. Other aspects of the invention are an encoding method performed by any embodiment of the inventive encoder, a decoding method performed by any embodiment of the inventive decoder, and a computer readable medium (e.g., disc) which stores code for implementing any embodiment of the inventive method.
    • 在一类实施例中,音频编码系统(通常是感知编码系统,其被配置为生成与第一解码器兼容的(即可解码的)单个(“统一”)比特流,第一解码器被配置为对 根据第一编码协议(例如,多频道杜比数字+或DD +协议)和被配置为对根据第二编码协议(例如立体声AAC,HE AAC v1或HE)编码的音频数据进行解码的第二解码器 统一比特流可以包括可由第一解码器解码(并由第二解码器忽略)的可编码数据(例如,数据突发)和由第二解码器解码的编码数据(例如,其他数据突发) 并且被第一解码器忽略),实际上,当第一解码器对比特流进行解码时,第二编码格式被隐藏在统一比特流内,并且当比特流中第一编码格式被隐藏在统一比特流内时 令牌由第二解码器解码。 根据本发明生成的统一比特流的格式可以消除在整个媒体链和/或生态系统中对代码转换元素的需要。 本发明的其他方面是由本发明编码器的任何实施例执行的编码方法,由本发明解码器的任何实施例执行的解码方法,以及存储用于实现本发明的任何实施例的代码的计算机可读介质(例如,盘) 方法。
    • 23. 发明申请
    • AUDIO ENCODING METHOD AND SYSTEM FOR GENERATING A UNIFIED BITSTREAM DECODABLE BY DECODERS IMPLEMENTING DIFFERENT DECODING PROTOCOLS
    • 音视频编码方法和系统,用于生成由解码器实现的不同解码协议解码的统一的双绞线
    • US20140358554A1
    • 2014-12-04
    • US14009503
    • 2012-04-05
    • Jeffrey C. RiedmillerFarhad FarahaniMichael SchugRegunathan RadhakrishnanMark S. Vinton
    • Jeffrey C. RiedmillerFarhad FarahaniMichael SchugRegunathan RadhakrishnanMark S. Vinton
    • G10L19/002
    • G10L19/002G10L19/167
    • In a class of embodiments, an audio encoding system (typically, a perceptual encoding system that is configured to generate a single (“unified”) bitstream that is compatible with (i.e., decodable by) a first decoder configured to decode audio data encoded in accordance with a first encoding protocol (e.g., the multichannel Dolby Digital Plus, or DD+, protocol) and a second decoder configured to decode audio data encoded in accordance with a second encoding protocol (e.g., the stereo AAC, HE AAC v1, or HE AAC v2 protocol). The unified bitstream can include both encoded data (e.g., bursts of data) decodable by the first decoder (and ignored by the second decoder) and encoded data (e.g., other bursts of data) decodable by the second decoder (and ignored by the first decoder). In effect, the second encoding format is hidden within the unified bitstream when the bitstream is decoded by the first decoder, and the first encoding format is hidden within the unified bitstream when the bitstream is decoded by the second decoder. The format of the unified bitstream generated in accordance with the invention may eliminate the need for transcoding elements throughout an entire media chain and/or ecosystem. Other aspects of the invention are an encoding method performed by any embodiment of the inventive encoder, a decoding method performed by any embodiment of the inventive decoder, and a computer readable medium (e.g., disc) which stores code for implementing any embodiment of the inventive method.
    • 在一类实施例中,音频编码系统(通常是感知编码系统,其被配置为生成与第一解码器兼容的(即可解码的)单个(“统一”)比特流,第一解码器被配置为对 根据第一编码协议(例如,多频道杜比数字+或DD +协议)和被配置为对根据第二编码协议(例如立体声AAC,HE AAC v1或HE)编码的音频数据进行解码的第二解码器 统一比特流可以包括可由第一解码器解码(并由第二解码器忽略)的可编码数据(例如,数据突发)和由第二解码器解码的编码数据(例如,其他数据突发) 并且被第一解码器忽略),实际上,当第一解码器对比特流进行解码时,第二编码格式被隐藏在统一比特流内,并且当比特流中第一编码格式被隐藏在统一比特流内时 令牌由第二解码器解码。 根据本发明生成的统一比特流的格式可以消除在整个媒体链和/或生态系统中对代码转换元素的需要。 本发明的其他方面是由本发明编码器的任何实施例执行的编码方法,由本发明解码器的任何实施例执行的解码方法,以及存储用于实现本发明的任何实施例的代码的计算机可读介质(例如,盘) 方法。
    • 24. 发明授权
    • Method and encoder for processing a digital stereo audio signal
    • 用于处理数字立体声音频信号的方法和编码器
    • US08891775B2
    • 2014-11-18
    • US14113362
    • 2012-05-07
    • Michael SchugHarald H. Mundt
    • Michael SchugHarald H. Mundt
    • H04S1/00G10L19/008G10L19/03
    • H04S1/007G10L19/008G10L19/03
    • The invention discloses a method and an encoder for processing a digital audio stereo signal. A digital audio encoder for coding such audio signal comprises a predictive Temporal Noise Shaping (TNS) filter, a Mid-/Side (M/S) coding unit, a control unit for determining a first prediction gain related to the unmodified L/R signal processed by the TNS filter and for determining a second prediction gain related to the M/S-coded L/R signal processed by the TNS filter, wherein the control unit is adapted to disable TNS-filtering—i.e. to bypass the TNS filter—for a current signal frame, if the first and second prediction gains differ by more than a pre-determined mismatch range. Preferably, the first and second prediction gains are determined from signal energy ratios calculated for each channel of the stereo signal including the signal energies of both the TNS-processed (unmodified) L- respectively (unmodified) R-signal and the TNS-processed M/S coded L- respectively M/S coded R-signal divided by the respective signal energies before TNS processing. Furthermore, the control unit is preferably adapted to overrule the disabling of the TNS filter, if the input signal is a near-mono audio signal exhibiting only low energy either in its M- or S-band. In that case, operation of the TNS filter on the stereo audio signal is maintained.
    • 本发明公开了一种用于处理数字音频立体声信号的方法和编码器。 用于对这种音频信号进行编码的数字音频编码器包括预测时间噪声整形(TNS)滤波器,中/侧(M / S)编码单元,用于确定与未修改的L / R信号相关的第一预测增益的控制单元 由TNS滤波器处理并确定与由TNS滤波器处理的M / S编码的L / R信号相关的第二预测增益,其中该控制单元用于禁用TNS滤波 如果第一和第二预测增益相差超过预定的不匹配范围,则绕过TNS滤波器以获得当前信号帧。 优选地,第一和第二预测增益是根据对包括TNS处理(未修改)L信号和TNS处理的M信号的两个信号能量的立体声信号的每个信道计算的信号能量比确定的 / S编码的L-分别M / S编码的R信号除以TNS处理之前的各个信号能量。 此外,如果输入信号是在其M波段或S波段中仅表现出低能量的近乎单声道的音频信号,则控制单元优选地适用于推翻TNS滤波器的禁用。 在这种情况下,维持TNS滤波器对立体声音频信号的操作。
    • 25. 发明申请
    • Method and System for Encoding Audio Data with Adaptive Low Frequency Compensation
    • 用自适应低频补偿编码音频数据的方法和系统
    • US20130179175A1
    • 2013-07-11
    • US13588890
    • 2012-08-17
    • Arijit BiswasVinay MelkoteMichael SchugGrant A. DavidsonMark S. Vinton
    • Arijit BiswasVinay MelkoteMichael SchugGrant A. DavidsonMark S. Vinton
    • G10L19/00
    • G10L19/028G10L19/0204G10L19/032G10L19/265
    • A method for determining mantissa bit allocation of frequency domain audio data to be encoded, including by performing adaptive low frequency compensation on each frequency band of a set of low frequency bands of the data. The low frequency compensation includes steps of: performing tonality detection on the audio data to generate compensation control data indicative of whether each frequency band in the set has prominent tonal content; and performing low frequency compensation on each frequency band in the set having prominent tonal content, including by correcting a preliminary masking value for each frequency band having prominent tonal content, but not performing low frequency compensation on the audio data in any other frequency band in the set. Other aspects are audio encoding methods including such tonality detection and low frequency compensation steps, and a system configured to perform any embodiment of the inventive method.
    • 一种用于确定要编码的频域音频数据的尾数位分配的方法,包括通过对数据的一组低频带的每个频带执行自适应低频补偿。 低频补偿包括以下步骤:对音频数据执行音调检测,以产生指示该组中的每个频带是否具有突出的音调内容的补偿控制数据; 并且对具有突出音调内容的集合中的每个频带执行低频补偿,包括通过校正具有突出音调内容的每个频带的初步屏蔽值,但不对低频补偿中的任何其他频带中的音频数据执行低频补偿 组。 其他方面是包括这种音调检测和低频补偿步骤的音频编码方法,以及被配置为执行本发明方法的任何实施例的系统。
    • 29. 发明申请
    • Device and method for analysing a decoded time signal
    • 用于分析解码时间信号的装置和方法
    • US20050175252A1
    • 2005-08-11
    • US10220651
    • 2001-02-16
    • Juergen HerreMartin DietzThomas SporerMichael SchugWolfgang Schildbach
    • Juergen HerreMartin DietzThomas SporerMichael SchugWolfgang Schildbach
    • H04N19/00H04N19/126H04N19/40H04N19/60G06K9/36
    • H04N19/00H04N19/126H04N19/40H04N19/60
    • An apparatus for analyzing an analysis time signal that has been generated from encoding and decoding an original time signal according to an encoding algorithm first, wherein first the encoding block raster underlying the analysis time signal used by the encoding algorithm is determined. Thereupon, the analysis time signal will be converted from its timely representation comprising a plurality of analysis spectral coefficients, to a spectral representation by using the established encoding block raster. Then, at least two analysis spectral coefficients or at least two spectral coefficients derived from the analysis spectral coefficients by multiplication of an encoding amplification factor or by multiplication with a compression function are grouped. Then, the greatest common divisor of the analysis spectral coefficients or the spectral coefficients derived from the analysis spectral coefficients will be calculated, corresponding to the quantization step width used when quantizing the encoding algorithm or an integer multiple of it. Then, in the case of an audio signal, the scale factor can easily be established for this group of spectral coefficients, i.e. for a scale factor band, from the quantization step width. Thus, all parameters used for the quantization of the original time signal are known, so that for quantizing the analysis time signal no longer full iteration loops have to be performed, which are, on the one hand, very computing time intensive and, on the other hand, introduce tandem encoding distortions.
    • 一种用于分析根据编码算法首先对原始时间信号进行编码和解码而产生的分析时间信号的装置,其中首先确定编码算法使用的分析时间信号下面的编码块光栅。 因此,分析时间信号将通过使用所建立的编码块光栅从包括多个分析频谱系数的及时表示转换成频谱表示。 然后,将通过编码放大因子的乘法或通过与压缩函数相乘而从分析频谱系数导出的至少两个分析频谱系数或至少两个频谱系数分组。 然后,对应于当量化编码算法或其整数倍时使用的量化步长,将计算分析频谱系数的最大公约数或从分析频谱系数导出的频谱系数。 然后,在音频信号的情况下,从量化步长可以容易地为该组频谱系数建立比例因子,即缩放因子频带。 因此,用于原始时间信号的量化的所有参数是已知的,使得对于分析时间信号的量化不再必须执行完整的迭代循环,这一方面一方面非常计算时间密集,并且在 另一方面,引入串联编码失真。
    • 30. 发明授权
    • Method and system for encoding audio data with adaptive low frequency compensation
    • 用自适应低频补偿编码音频数据的方法和系统
    • US08527264B2
    • 2013-09-03
    • US13588890
    • 2012-08-17
    • Arijit BiswasVinay MelkoteMichael SchugGrant Allen DavidsonMark Stuart Vinton
    • Arijit BiswasVinay MelkoteMichael SchugGrant Allen DavidsonMark Stuart Vinton
    • G10L19/00
    • G10L19/028G10L19/0204G10L19/032G10L19/265
    • A method for determining mantissa bit allocation of frequency domain audio data to be encoded, including by performing adaptive low frequency compensation on each frequency band of a set of low frequency bands of the data. The low frequency compensation includes steps of: performing tonality detection on the audio data to generate compensation control data indicative of whether each frequency band in the set has prominent tonal content; and performing low frequency compensation on each frequency band in the set having prominent tonal content, including by correcting a preliminary masking value for each frequency band having prominent tonal content, but not performing low frequency compensation on the audio data in any other frequency band in the set; wherein the frequency domain audio data comprises an exponent value for said each low frequency band of the set, and the tonality detection includes determining, for said each low frequency band of the set, a measure of difference between exponents and corresponding tented exponents of the audio data. Other aspects are audio encoding methods including such tonality detection and low frequency compensation steps, and a system configured to perform any embodiment of the inventive method.
    • 一种用于确定要编码的频域音频数据的尾数位分配的方法,包括通过对数据的一组低频带的每个频带执行自适应低频补偿。 低频补偿包括以下步骤:对音频数据执行音调检测,以产生指示该组中的每个频带是否具有突出的音调内容的补偿控制数据; 并且对具有突出音调内容的集合中的每个频带执行低频补偿,包括通过校正具有突出音调内容的每个频带的初步屏蔽值,但不对低频补偿中的任何其他频带中的音频数据执行低频补偿 组; 其中所述频域音频数据包括所述组的所述每个低频带的指数值,并且所述音调检测包括针对所述组的所述每个低频带确定所述音频的指数和对应的帐篷指数之间的差的度量 数据。 其他方面是包括这种音调检测和低频补偿步骤的音频编码方法,以及被配置为执行本发明方法的任何实施例的系统。