Patent Quick Search - Quickly search global patents, free commercial patent database - IPRDB

31. Invention application

WO2019143867A1 METHODS AND DEVICES FOR CODING SOUNDFIELD REPRESENTATION SIGNALS Pending - Published
Publication No.: WO2019143867A1
Publication date: 2019-07-25
Application No.: PCT/US2019/014090
Filing date: 2019-01-17
Applicants: DOLBY LABORATORIES LICENSING CORPORATION; DOLBY INTERNATIONAL AB
Inventors: KJOERLING, Kristofer; MCGRATH, David S.; PURNHAGEN, Heiko; THOMAS, Mark R. P.
IPC classification: G10L19/008
Abstract: The present document describes a method (400) for encoding a soundfield representation (SR) input signal (101, 301) describing a soundfield at a reference position, wherein the SR input signal (101, 301) comprises a plurality of channels for a plurality of different directivity patterns of the soundfield at the reference position. The method (400) comprises extracting (401) one or more audio objects (103, 303) from the SR input signal (101, 301). Furthermore, the method (400) comprises determining (402) a residual signal (102, 302) based on the SR input signal (101, 301) and based on the one or more audio objects (103, 303). The method (400) also comprises performing joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302). In addition, the method (400) comprises generating (403) a bitstream (701) based on data generated in the context of joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302).
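
The residual step in this abstract can be pictured with a very small sketch. It assumes a four-channel SR input, a single object that has already been extracted, and known per-channel gains for that object. The function names, the gain values, and the plain subtraction are illustrative assumptions and are not taken from the application.

```python
from typing import List

Signal = List[float]  # one channel of samples


def object_contribution(obj: Signal, gains: List[float]) -> List[Signal]:
    """Spread one object signal over the SR channels using per-channel gains."""
    return [[g * s for s in obj] for g in gains]


def residual_signal(sr_input: List[Signal], obj: Signal, gains: List[float]) -> List[Signal]:
    """Residual = SR input minus the object's contribution, channel by channel."""
    contrib = object_contribution(obj, gains)
    return [[x - c for x, c in zip(ch, cch)] for ch, cch in zip(sr_input, contrib)]


# Hypothetical 4-channel SR input (two samples per channel), built from an
# ambient bed plus one object mixed in with assumed gains.
gains = [1.0, 0.5, 0.5, 0.0]
obj = [2.0, -4.0]
bed = [[0.25, 0.25], [0.0, 0.0], [0.5, 0.5], [1.0, -1.0]]
sr_input = [[b + g * s for b, s in zip(ch, obj)] for ch, g in zip(bed, gains)]

# Removing the object's contribution leaves the ambient bed as the residual.
residual = residual_signal(sr_input, obj, gains)
```

In this toy setup the residual equals the bed exactly because the object and its gains are known; a real encoder would have to estimate both.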

32. Invention application

WO2018162472A1 INTEGRATED RECONSTRUCTION AND RENDERING OF AUDIO SIGNALS Pending - Published
Publication No.: WO2018162472A1
Publication date: 2018-09-13
Application No.: PCT/EP2018/055462
Filing date: 2018-03-06
Applicant: DOLBY INTERNATIONAL AB
Inventors: PEICHL, Klaus; FRIEDRICH, Tobias; THESING, Robin; PURNHAGEN, Heiko; WOLTERS, Martin
IPC classification: H04S7/00; H04S3/00; G10L19/008
CPC classification: H04S7/30; G10L19/008; H04S3/008; H04S2400/03; H04S2420/03
Abstract: A method for rendering an audio output based on an audio data stream including M audio signals, side information including a series of reconstruction instances of a reconstruction matrix C and first timing data, the side information allowing reconstruction of N audio objects from the M audio signals, and object metadata defining spatial relationships between the N audio objects. The method includes generating a synchronized rendering matrix based on the object metadata, the first timing data, and information relating to a current playback system configuration, the synchronized rendering matrix having a rendering instance for each reconstruction instance, multiplying each reconstruction instance with a corresponding rendering instance to form a corresponding instance of an integrated rendering matrix, and applying the integrated rendering matrix to the audio signals in order to render an audio output.
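
The matrix step in this abstract can be sketched as follows. The sketch assumes that rendering and reconstruction instances are already synchronized and can simply be paired by index, and it uses tiny hypothetical matrices; the helper names `matmul` and `integrated_matrices` are illustrative and do not come from the application.

```python
from typing import List

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain matrix product of two small nested-list matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def integrated_matrices(rendering: List[Matrix], reconstruction: List[Matrix]) -> List[Matrix]:
    """Multiply each rendering instance with its paired reconstruction instance."""
    if len(rendering) != len(reconstruction):
        raise ValueError("need one rendering instance per reconstruction instance")
    return [matmul(r, c) for r, c in zip(rendering, reconstruction)]


# Hypothetical sizes: M = 2 transmitted signals, N = 3 objects, 2 output channels.
C = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]  # N x M reconstruction instance
R = [[1.0, 0.0, 0.5], [0.0, 1.0, 0.5]]    # outputs x N rendering instance
signals = [[2.0, 4.0], [6.0, 8.0]]        # M signals, two samples each

(integrated,) = integrated_matrices([R], [C])
direct = matmul(integrated, signals)       # integrated matrix applied in one step
two_step = matmul(R, matmul(C, signals))   # reconstruct the objects, then render
```

Both paths give the same output here. The integrated path never materializes the N intermediate object signals, which appears to be the point of combining the two matrices.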

33. Invention application

WO2016172254A1 SPATIAL AUDIO SIGNAL MANIPULATION Pending - Published
Publication No.: WO2016172254A1
Publication date: 2016-10-27
Application No.: PCT/US2016/028501
Filing date: 2016-04-20
Applicants: DOLBY LABORATORIES LICENSING CORPORATION; DOLBY INTERNATIONAL AB
Inventors: BREEBAART, Dirk Jeroen; MATEOS SOLE, Antonio; PURNHAGEN, Heiko; TSINGOS, Nicolas R.
IPC classification: H04S7/00
CPC classification: H04S7/303; H04R5/02; H04S3/008; H04S7/30; H04S2400/11; H04S2420/03
Abstract: Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is received that is indicative of a position modification to be applied to the audio object in the audio environment (27). At step (38) in response to the position data, loudspeaker layout data and control data, rendering modification data is generated. Finally, at step (39) the audio signal (17) is rendered with the rendering modification data to output the audio signal (17) with the audio object at a modified object position that is between loudspeakers within the audio environment (27).

34. Invention application

WO2016066705A1 PARAMETRIC MIXING OF AUDIO SIGNALS Pending - Published
Publication No.: WO2016066705A1
Publication date: 2016-05-06
Application No.: PCT/EP2015/075022
Filing date: 2015-10-28
Applicant: DOLBY INTERNATIONAL AB
Inventors: VILLEMOES, Lars; PURNHAGEN, Heiko; LEHTONEN, Heidi-Maria
IPC classification: G10L19/008
CPC classification: H04S3/008; G10L19/008; H04S2400/01; H04S2400/03; H04S2420/03
Abstract: In an encoding section (100), a downmix section (110) forms first and second channels (L_1, L_2) of a downmix signal as linear combinations of first and second groups (401, 402) of channels, respectively, of an M-channel audio signal; and an analysis section (120) determines upmix parameters (α_LU) for parametric reconstruction of the audio signal, and mixing parameters (α_LM). In a decoding section (1200), a decorrelating section (1210) outputs a decorrelated signal (D) based on the downmix signal; and a mixing section (1220) determines mixing coefficients based on the mixing parameters or the upmix parameters, and forms a K-channel output signal (L̃_1, ..., L̃_K) as a linear combination of the downmix signal and the decorrelated signal in accordance with the mixing coefficients. The channels of the output signal approximate linear combinations of K groups (501-502, 1301-1303) of channels, respectively, of the audio signal. The K groups constitute a different partition of the audio signal than the first and second groups, and 2 ≤ K.

35. Invention application

WO2015150480A1 EXPLOITING METADATA REDUNDANCY IN IMMERSIVE AUDIO METADATA Pending - Published
Publication No.: WO2015150480A1
Publication date: 2015-10-08
Application No.: PCT/EP2015/057231
Filing date: 2015-04-01
Applicant: DOLBY INTERNATIONAL AB
Inventors: FERSCH, Christof; PURNHAGEN, Heiko; POPP, Jens; WOLTERS, Martin
IPC classification: G10L19/008
CPC classification: H04S7/30; G10L19/008; H04S3/008; H04S2400/03; H04S2400/11; H04S2400/13
Abstract: The present document relates to the field of encoding and decoding of audio. In particular, the present document relates to encoding and decoding of an audio scene comprising audio objects. A method (400) for encoding metadata relating to a plurality of audio objects (106a) of an audio scene (102) is described. The metadata comprises a first set (114, 314) of metadata and a second set (104) of metadata. The first and second sets (104, 114, 314) of metadata comprise one or more data elements which are indicative of a property of an audio object (106a) from the plurality of audio objects (106a) and/or of a downmix signal (112) derived from the plurality of audio objects (106a). The method (400) comprises identifying (401) a redundant data element which is common to the first and second sets (104, 114, 314) of metadata. Furthermore, the method comprises encoding (402) the redundant data element of the first set (114, 314) of metadata by referring to a redundant data element of a set (104) of metadata external for the first set (114, 314) of metadata.
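
The encode-by-reference idea in the abstract of WO2015150480A1 can be sketched with two dictionaries standing in for the two metadata sets. The marker value `REF`, the element names, and the equality test used to detect redundancy are all assumptions made for illustration; the application does not specify this representation.

```python
from typing import Any, Dict

REF = "__ref__"  # hypothetical marker: "take this element from the other set"


def encode_with_references(first: Dict[str, Any], second: Dict[str, Any]) -> Dict[str, Any]:
    """Replace elements of `first` that occur with an equal value in `second`."""
    return {key: REF if key in second and second[key] == value else value
            for key, value in first.items()}


def decode_with_references(encoded: Dict[str, Any], second: Dict[str, Any]) -> Dict[str, Any]:
    """Resolve reference markers by looking the element up in `second`."""
    return {key: second[key] if value == REF else value
            for key, value in encoded.items()}


# Hypothetical metadata sets: the position is common to both, the gain differs.
second_set = {"position": [0.1, 0.5, 0.0], "gain_db": -3.0}
first_set = {"position": [0.1, 0.5, 0.0], "gain_db": 0.0, "downmix_id": 1}

encoded = encode_with_references(first_set, second_set)
decoded = decode_with_references(encoded, second_set)
```

Only the shared position is replaced by a reference; the differing gain and the element missing from the second set are carried as-is, and decoding against the same second set restores the first set.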

36. Invention application

WO2015036348A1 TIME-ALIGNMENT OF QMF BASED PROCESSING DATA Pending - Published
Publication No.: WO2015036348A1
Publication date: 2015-03-19
Application No.: PCT/EP2014/069039
Filing date: 2014-09-08
Applicant: DOLBY INTERNATIONAL AB
Inventors: KJOERLING, Kristofer; PURNHAGEN, Heiko; POPP, Jens
IPC classification: G10L21/0388
CPC classification: G10L19/167; G10L19/018; G10L19/0204; G10L19/032; G10L21/0388
Abstract: The present document relates to time-alignment of encoded data of an audio encoder with associated metadata, such as spectral band replication (SBR) metadata. An audio decoder (100, 300) configured to determine a reconstructed frame of an audio signal (237) from an access unit (110) of a received data stream is described. The access unit (110) comprises waveform data (111) and metadata (112), wherein the waveform data (111) and the metadata (112) are associated with the same reconstructed frame of the audio signal (127). The audio decoder (100, 300) comprises a waveform processing path (101, 102, 103, 104, 105) configured to generate a plurality of waveform subband signals (123) from the waveform data (111), and a metadata processing path (108, 109) configured to generate decoded metadata (128) from the metadata (111).

37. Invention application

WO2014187989A2 RECONSTRUCTION OF AUDIO SCENES FROM A DOWNMIX Pending - Published
Publication No.: WO2014187989A2
Publication date: 2014-11-27
Application No.: PCT/EP2014/060732
Filing date: 2014-05-23
Applicant: DOLBY INTERNATIONAL AB
Inventors: HIRVONEN, Toni; PURNHAGEN, Heiko; SAMUELSSON, Leif Jonas; VILLEMOES, Lars
IPC classification: G10L19/008
CPC classification: G10L19/008; G10L19/0204; G10L19/20; G10L25/06; H04S3/008; H04S3/02; H04S5/00; H04S7/30; H04S2400/03; H04S2400/11; H04S2420/03
Abstract: Audio objects are associated with positional metadata. A received downmix signal comprises downmix channels that are linear combinations of one or more audio objects and are associated with respective positional locators. In a first aspect, the downmix signal, the positional metadata and frequency-dependent object gains are received. An audio object is reconstructed by applying the object gain to an upmix of the downmix signal in accordance with coefficients based on the positional metadata and the positional locators. In a second aspect, audio objects have been encoded together with at least one bed channel positioned at a positional locator of a corresponding downmix channel. The decoding system receives the downmix signal and the positional metadata of the audio objects. A bed channel is reconstructed by suppressing the content representing audio objects from the corresponding downmix channel on the basis of the positional locator of the corresponding downmix channel.

38. Invention application

WO2014187986A1 CODING OF AUDIO SCENES Pending - Published
Publication No.: WO2014187986A1
Publication date: 2014-11-27
Application No.: PCT/EP2014/060727
Filing date: 2014-05-23
Applicant: DOLBY INTERNATIONAL AB
Inventors: PURNHAGEN, Heiko; VILLEMOES, Lars; SAMUELSSON, Leif Jonas; HIRVONEN, Toni
IPC classification: G10L19/008
CPC classification: G10L19/008; G10L19/20; H04S3/02; H04S5/00; H04S2400/03; H04S2400/11; H04S2420/03; H04S2420/07
Abstract: Exemplary embodiments provide encoding and decoding methods, and associated encoders and decoders, for encoding and decoding of an audio scene which at least comprises one or more audio objects (106a). The encoder (108, 110) generates a bit stream (116) which comprises downmix signals (112) and side information which includes individual matrix elements (114) of a reconstruction matrix which enables reconstruction of the one or more audio objects (106a) in the decoder (120).

39. Invention application

WO2014161996A2 AUDIO PROCESSING SYSTEM Pending - Published
Publication No.: WO2014161996A2
Publication date: 2014-10-09
Application No.: PCT/EP2014/056857
Filing date: 2014-04-04
Applicant: DOLBY INTERNATIONAL AB
Inventors: KJOERLING, Kristofer; PURNHAGEN, Heiko; VILLEMOES, Lars
IPC classification: G10L19/008
CPC classification: G10L19/008; G10L19/032; G10L19/04; G10L19/20
Abstract: An audio processing system (100) comprises a front-end component (102, 103), which receives quantized spectral components and performs an inverse quantization, yielding a time-domain representation of an intermediate signal. The audio processing system further comprises a frequency-domain processing stage (104, 105, 106, 107, 108), configured to provide a time-domain representation of a processed audio signal, and a sample rate converter (109), providing a reconstructed audio signal sampled at a target sampling frequency. The respective internal sampling rates of the time-domain representation of the intermediate audio signal and of the time-domain representation of the processed audio signal are equal. In particular embodiments, the processing stage comprises a parametric upmix stage which is operable in at least two different modes and is associated with a delay stage that ensures constant total delay.

40. Invention application

WO2014161995A1 AUDIO ENCODER AND DECODER FOR INTERLEAVED WAVEFORM CODING Pending - Published
Publication No.: WO2014161995A1
Publication date: 2014-10-09
Application No.: PCT/EP2014/056856
Filing date: 2014-04-04
Applicant: DOLBY INTERNATIONAL AB
Inventors: KJOERLING, Kristofer; THESING, Robin; MUNDT, Harald; PURNHAGEN, Heiko; ROEDEN, Karl Jonas
IPC classification: G10L19/02; G10L21/038
CPC classification: G10L19/0208; G10L19/02; G10L19/0212; G10L19/26; G10L21/038; G10L21/0388
Abstract: There is provided methods and apparatuses for decoding and encoding of audio signals. In particular, a method for decoding includes receiving a waveform-coded signal having a spectral content corresponding to a subset of the frequency range above a cross-over frequency. The waveform-coded signal is interleaved with a parametric high frequency reconstruction of the audio signal above the cross-over frequency. In this way an improved reconstruction of the high frequency bands of the audio signal is achieved.
