会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 33. 发明申请
    • SPATIAL AUDIO SIGNAL MANIPULATION
    • 空间音频信号处理
    • WO2016172254A1
    • 2016-10-27
    • PCT/US2016/028501
    • 2016-04-20
    • DOLBY LABORATORIES LICENSING CORPORATIONDOLBY INTERNATIONAL AB
    • BREEBAART, Dirk JeroenMATEOS SOLE, AntonioPURNHAGEN, HeikoTSINGOS, Nicolas R.
    • H04S7/00
    • H04S7/303H04R5/02H04S3/008H04S7/30H04S2400/11H04S2420/03
    • Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is received that is indicative of a position modification to be applied to the audio object in the audio environment (27). At step (38) in response to the position data, loudspeaker layout data and control data, rendering modification data is generated. Finally, at step (39) the audio signal (17) is rendered with the rendering modification data to output the audio signal (17) with the audio object at a modified object position that is between loudspeakers within the audio environment (27).
    • 这里描述了一种在由目标扬声器系统(23)定义的音频环境(27)中渲染用于回放的音频信号(17)的方法(30),该音频信号(17)包括与音频对象相关的音频数据, 指示对象位置的关联位置数据。 方法(30)包括接收音频信号(17)的初始步骤(31)。 在步骤(32),接收目标扬声器系统(23)的扬声器布局数据。 在步骤(33),接收指示要在音频环境(27)中应用于音频对象的位置修改的控制数据。 在步骤(38)响应于位置数据,扬声器布局数据和控制数据,生成渲染修改数据。 最后,在步骤(39)中,利用渲染修改数据渲染音频信号(17),以便在音频环境(27)内的扬声器之间的修改对象位置处以音频对象输出音频信号(17)。
    • 34. 发明申请
    • PARAMETRIC MIXING OF AUDIO SIGNALS
    • 音频信号的参数混合
    • WO2016066705A1
    • 2016-05-06
    • PCT/EP2015/075022
    • 2015-10-28
    • DOLBY INTERNATIONAL AB
    • VILLEMOES, LarsPURNHAGEN, HeikoLEHTONEN, Heidi-Maria
    • G10L19/008
    • H04S3/008G10L19/008H04S2400/01H04S2400/03H04S2420/03
    • In an encoding section (100), a downmix section (110) forms first and second channels ( L 1 , L 2 ) of a downmix signal as linear combinations of first and second groups (401, 402) of channels, respectively, of an M-channel audio signal; and an analysis section (120) determines upmix parameters (α LU ) for parametric reconstruction of the audio signal, and mixing parameters ( α LM ). In a decoding section (1200), a decorrelating section (1210) outputs a decorrelated signal ( D ) based on the downmix signal; and a mixing section (1220) determines mixing coefficients based on the mixing parameters or the upmix parameters, and forms a K -channel output signal ( L͂ 1 ,...,L͂ K ) as a linear combination of the downmix signal and the decorrelated signal in accordance with the mixing coefficients. The channels of the output signal approximate linear combinations of K groups (501-502, 1301-1303) of channels, respectively, of the audio signal. The K groups constitute a different partition of the audio signal than the first and second groups, and 2 ≤ K .
    • 在编码部分(100)中,下混合部分(110)分别形成下混合信号的第一和第二信道(L 1,L 2),作为信道的第一和第二组(401,402)的线性组合, M通道音频信号; 和分析部分(120)确定用于音频信号的参数重建的混合参数(αLU)和混合参数(αLM)。 在解码部分(1200)中,解相关部分(1210)基于下混合信号输出解相关信号(D); 和混合部分(1220)根据混合参数或上混参数确定混合系数,并形成K沟道输出信号(L 1,...,L K),作为降混信号和解相关信号 信号根据混合系数。 输出信号的通道分别近似于音频信号的K个组(501-502,1301-1303)的线性组合。 K组构成音频信号与第一组和第二组不同的分区,2≤K
    • 35. 发明申请
    • EXPLOITING METADATA REDUNDANCY IN IMMERSIVE AUDIO METADATA
    • 在无与伦比的音频元数据中实现元数据冗余
    • WO2015150480A1
    • 2015-10-08
    • PCT/EP2015/057231
    • 2015-04-01
    • DOLBY INTERNATIONAL AB
    • FERSCH, ChristofPURNHAGEN, HeikoPOPP, JensWOLTERS, Martin
    • G10L19/008
    • H04S7/30G10L19/008H04S3/008H04S2400/03H04S2400/11H04S2400/13
    • The present document relates to the field of encoding and decoding of audio. In particular, the present document relates to encoding and decoding of an audio scene comprising audio objects. A method (400) for encoding metadata relating to a plurality of audio objects (106a) of an audio scene (102) is described. The metadata comprises a first set (114, 314) of metadata and a second set (104) of metadata. The first and second sets (104, 114, 314) of metadata comprise one or more data elements which are indicative of a property of an audio object (106a) from the plurality of audio objects (106a) and/or of a downmix signal (112) derived from the plurality of audio objects (106a). The method (400) comprises identifying (401) a redundant data element which is common to the first and second sets (104, 114, 314) of metadata. Furthermore, the method comprises encoding (402) the redundant data element of the first set (114, 314) of metadata by referring to a redundant data element of a set (104) of metadata external for the first set (114, 314) of metadata.
    • 本文件涉及音频的编码和解码领域。 特别地,本文件涉及包括音频对象的音频场景的编码和解码。 描述了用于编码与音频场景(102)的多个音频对象(106a)有关的元数据的方法(400)。 元数据包括元数据的第一集合(114,314)和元数据的第二集合(104)。 元数据的第一和第二组(104,114,314)包括指示来自多个音频对象(106a)和/或降混信号(106a)的音频对象(106a)的属性的一个或多个数据元素 112)从多个音频对象(106a)导出。 方法(400)包括识别(401)元数据的第一和第二组(104,114,314)共有的冗余数据元素。 此外,该方法包括通过参考第一组(114,314)的外部元数据(104)的冗余数据元素(104)来编码(402)元数据的第一组(114,314)的冗余数据元素 元数据。
    • 36. 发明申请
    • TIME- ALIGNMENT OF QMF BASED PROCESSING DATA
    • 基于QMF的处理数据的时间对齐
    • WO2015036348A1
    • 2015-03-19
    • PCT/EP2014/069039
    • 2014-09-08
    • DOLBY INTERNATIONAL AB
    • KJOERLING, KristoferPURNHAGEN, HeikoPOPP, Jens
    • G10L21/0388
    • G10L19/167G10L19/018G10L19/0204G10L19/032G10L21/0388
    • The present document relates to time-alignment of encoded data of an audio encoder with associated metadata, such as spectral band replication (SBR) metadata. An audio decoder (100, 300) configured to determine a reconstructed frame of an audio signal (237) from an access unit (110) of a received data stream is described. The access unit (110) comprises waveform data (111) and metadata (112), wherein the waveform data (111) and the metadata (112) are associated with the same reconstructed frame of the audio signal (127). The audio decoder (100, 300) comprises a waveform processing path (101, 102, 103, 104, 105) configured to generate a plurality of waveform subband signals (123) from the waveform data (111), and a metadata processing path (108, 109) configured to generate decoded metadata (128) from the metadata (111).
    • 本文件涉及具有诸如频谱带复制(SBR)元数据的相关元数据的音频编码器的编码数据的时间对准。 被配置为从接收数据流的访问单元(110)确定音频信号(237)的重建帧的音频解码器(100,300)。 访问单元(110)包括波形数据(111)和元数据(112),其中波形数据(111)和元数据(112)与音频信号(127)的相同的重构帧相关联。 音频解码器(100,300)包括被配置为从波形数据(111)生成多个波形子带信号(123)的波形处理路径(101,102,103,104,105)和元数据处理路径 ,其被配置为从所述元数据(111)生成解码的元数据(128)。
    • 37. 发明申请
    • RECONSTRUCTION OF AUDIO SCENES FROM A DOWNMIX
    • 从下载重建音频场景
    • WO2014187989A2
    • 2014-11-27
    • PCT/EP2014/060732
    • 2014-05-23
    • DOLBY INTERNATIONAL AB
    • HIRVONEN, ToniPURNHAGEN, HeikoSAMUELSSON, Leif JonasVILLEMOES, Lars
    • G10L19/008
    • G10L19/008G10L19/0204G10L19/20G10L25/06H04S3/008H04S3/02H04S5/00H04S7/30H04S2400/03H04S2400/11H04S2420/03
    • Audio objects are associated with positional metadata. A received downmix signal comprises downmix channels that are linear combinations of one or more audio objects and are associated with respective positional locators. In a first aspect, the downmix signal, the positional metadata and frequency- dependent object gains are received. An audio object is reconstructed by applying the object gain to an upmix of the downmix signal in accordance with coefficients based on the positional metadata and the positional locators. In a second aspect, audio objects have been encoded together with at least one bed channel positioned at a positional locator of a corresponding downmix channel. The decoding system receives the downmix signal and the positional metadata of the audio objects. A bed channel is reconstructed by suppressing the content representing audio objects from the corresponding downmix channel on the basis of the positional locator of the corresponding downmix channel.
    • 音频对象与位置元数据相关联。 接收的下混合信号包括作为一个或多个音频对象的线性组合的下混通道,并且与相应的位置定位器相关联。 在第一方面,接收降混信号,位置元数据和与频率相关的对象增益。 根据基于位置元数据和位置定位器的系数,将对象增益应用于缩混信号的上混合来重构音频对象。 在第二方面,音频对象已经与位于对应的下混通道的位置定位器处的至少一个床通道一起被编码。 解码系统接收下混合信号和音频对象的位置元数据。 基于对应的下混通道的位置定位器,通过从对应的下混通道中抑制表示音频对象的内容来重构床通道。
    • 39. 发明申请
    • AUDIO PROCESSING SYSTEM
    • 音频处理系统
    • WO2014161996A2
    • 2014-10-09
    • PCT/EP2014/056857
    • 2014-04-04
    • DOLBY INTERNATIONAL AB
    • KJOERLING, KristoferPURNHAGEN, HeikoVILLEMOES, Lars
    • G10L19/008
    • G10L19/008G10L19/032G10L19/04G10L19/20
    • An audio processing system (100) comprises a front-end component (102, 103), which receives quantized spectral components and performs an inverse quantization, yielding a time-domain representation of an intermediate signal. The audio processing system further comprises a frequency-domain processing stage (104, 105, 106, 107, 108), configured to provide a time-domain representation of a processed audio signal, and a sample rate converter (109), providing a reconstructed audio signal sampled at a target sampling frequency. The respective internal sampling rates of the time-domain representation of the intermediate audio signal and of the time-domain representation of the processed audio signal are equal. In particular embodiments, the processing stage comprises a parametric upmix stage which is operable in at least two different modes and is associated with a delay stage that ensures constant total delay.
    • 音频处理系统(100)包括接收量化频谱分量并执行逆量化的前端组件(102,103),产生中间信号的时域表示。 音频处理系统还包括被配置为提供经处理的音频信号的时域表示的频域处理级(104,105,106,107,108),以及采样速率转换器(109),提供重构的 以目标采样频率采样的音频信号。 中间音频信号的时域表示和处理后的音频信号的时域表示各自的内部采样率是相等的。 在特定实施例中,处理级包括参数上混级,其可在至少两种不同模式下操作并且与确保恒定总延迟的延迟级相关联。