会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明公开
    • Apparatus and method for realizing a SAOC downmix of 3D audio content
    • Vorrichtung und Verfahren zur Realisierung eines SAOC-Downmix von 3D-Audioinhalt
    • EP2830048A1
    • 2015-01-28
    • EP13189281.2
    • 2013-10-18
    • Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.Friedrich-Alexander-Universität Erlangen-Nürnberg
    • Disch, SaschaFuchs, HaraldHellmuth, OliverHerre, JürgenMurtaza, AdrianRidderbusch, FalkoPaulus, JouniTerentiv, Leon
    • G10L19/008H04S3/00
    • H04S3/02G10L19/008H04S3/00H04S3/006H04S3/008H04S7/305H04S2400/01H04S2400/03H04S2400/11H04S2400/13H04S2420/03
    • An apparatus for generating one or more audio output channels is provided. The apparatus comprises a parameter processor (110) for calculating output channel mixing information and a downmix processor (120) for generating the one or more audio output channels. The downmix processor (120) is configured to receive an audio transport signal comprising one or more audio transport channels, wherein two or more audio object signals are mixed within the audio transport signal, and wherein the number of the one or more audio transport channels is smaller than the number of the two or more audio object signals. The audio transport signal depends on a first mixing rule and on a second mixing rule. The first mixing rule indicates how to mix the two or more audio object signals to obtain a plurality of premixed channels. Moreover, the second mixing rule indicates how to mix the plurality of premixed channels to obtain the one or more audio transport channels of the audio transport signal. The parameter processor (110) is configured to receive information on the second mixing rule, wherein the information on the second mixing rule indicates how to mix the plurality of premixed signals such that the one or more audio transport channels are obtained. Moreover, the parameter processor (110) is configured to calculate the output channel mixing information depending on an audio objects number indicating the number of the two or more audio object signals, depending on a premixed channels number indicating the number of the plurality of premixed channels, and depending on the information on the second mixing rule. The downmix processor (120) is configured to generate the one or more audio output channels from the audio transport signal depending on the output channel mixing information.
    • 提供了一种用于产生一个或多个音频输出通道的装置。 该装置包括用于计算输出通道混合信息的参数处理器(110)和用于产生一个或多个音频输出通道的下混处理器(120)。 下混合处理器(120)被配置为接收包括一个或多个音频传输信道的音频传输信号,其中在音频传输信号内混合两个或多个音频对象信号,并且其中一个或多个音频传输信道的数量为 小于两个或更多个音频对象信号的数量。 音频传输信号取决于第一混合规则和第二混合规则。 第一混合规则指示如何混合两个或多个音频对象信号以获得多个预混频道。 此外,第二混合规则指示如何混合多个预混频道以获得音频传输信号的一个或多个音频传输信道。 参数处理器(110)被配置为接收关于第二混合规则的信息,其中关于第二混合规则的信息指示如何混合多个预混合信号,使得获得一个或多个音频传输信道。 此外,参数处理器(110)被配置为根据指示多个预混频道的数量的预设频道号码,根据指示两个或多个音频对象信号的数量的音频对象号来计算输出频道混合信息 ,并且取决于关于第二混合规则的信息。 下混合处理器(120)被配置为根据输出信道混合信息从音频传输信号生成一个或多个音频输出信道。
    • 2. 发明公开
    • Encoder, decoder and methods for backward compatible dynamic adaption of time/frequency resolution in spatial-audio-object-coding
    • 编码器,解码器和在空间音频对象编码为时间/频率分辨率的向后兼容的动态调整的方法
    • EP2717265A1
    • 2014-04-09
    • EP13167481.4
    • 2013-05-13
    • Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.Friedrich-Alexander-Universität Erlangen-Nürnberg
    • Disch, SaschaPaulus, JouniEdler, BerndHellmuth, OliverHerre, JürgenKastner, Thorsten
    • G10L19/025G10L19/008
    • G10L19/008G10L19/02G10L19/0204G10L19/025G10L19/20
    • A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal comprising a plurality of time-domain downmix samples is provided. The downmix signal encodes two or more audio object signals. The decoder comprises a window-sequence generator (134) for determining a plurality of analysis windows, wherein each of the analysis windows comprises a plurality of time-domain downmix samples of the downmix signal. Each analysis window of the plurality of analysis windows has a window length indicating the number of the time-domain downmix samples of said analysis window. The window-sequence generator (134) is configured to determine the plurality of analysis windows so that the window length of each of the analysis windows depends on a signal property of at least one of the two or more audio object signals. Moreover, the decoder comprises a t/f-analysis module (135) for transforming the plurality of time-domain downmix samples of each analysis window of the plurality of analysis windows from a time-domain to a time-frequency domain depending on the window length of said analysis window, to obtain a transformed downmix. Furthermore, the decoder comprises an un-mixing unit (136) for un-mixing the transformed downmix based on parametric side information on the two or more audio object signals to obtain the audio output signal. Moreover, an encoder is provided.
    • 提供了一种用于在包括一个或从下混信号包括时域混样品的多元性多个音频输出声道生成音频输出信号的解码器。 缩混信号编码两个或多个音频对象信号。 解码器用于确定性采矿包括窗口序列产生器(134)的分析窗口复数,worin每个分析窗口的包括缩混信号的时域混样品的多元性。 的分析窗,所述多个每个分析窗口具有窗口长度指示所述分析窗口的时域混的样本的数目。 窗口序列产生器(134)被配置为确定矿分析窗口多元性所以没有每个分析窗口的窗口长度取决于两个或更多个音频对象信号中的至少一个的信号特性。 更上方,所述解码器包括在/ F-分析模块(135),用于将的分析窗口,所述多个每个分析窗口的时域混样品的多元性从时域变换到时频域取决于窗口长度 所述分析窗口,以获得转化的缩混。 进一步,对于未混合单元(136)的解码器包括未混合的转化缩混基于关于两个或更多个音频对象信号,以获得所述音频输出信号参数侧信息。 更过,在编码器提供。
    • 4. 发明公开
    • Encoder, decoder and methods for signal-dependent zoom-transform in spatial audio object coding
    • Codierer,Decodierer und VerfahrenfürsignalabhängigeZoomumwandlung beim Spatial-Audio-Object-Coding
    • EP2717262A1
    • 2014-04-09
    • EP13167487.1
    • 2013-05-13
    • Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.Friedrich-Alexander-Universität Erlangen-Nürnberg
    • Disch, SaschaPaulus, JouniEdler, BerndHellmuth, OliverHerre, JürgenKastner, Thorsten
    • G10L19/008G10L19/02G10L19/025G10L19/20
    • G10L19/008G10L19/02G10L19/0204G10L19/025G10L19/20
    • A decoder for generating an audio output signal comprising one or more audio output channels from a downmix signal is provided. The downmix signal encodes one or more audio object signals. The decoder comprises a control unit (181) for setting an activation indication to an activation state depending on a signal property of at least one of the one or more audio object signals. Moreover, the decoder comprises a first analysis module (182) for transforming the downmix signal to obtain a first transformed downmix comprising a plurality of first subband channels. Furthermore, the decoder comprises a second analysis module (183) for generating, when the activation indication is set to the activation state, a second transformed downmix by transforming at least one of the first subband channels to obtain a plurality of second subband channels, wherein the second transformed downmix comprises the first subband channels which have not been transformed by the second analysis module and the second subband channels. Moreover, the decoder comprises an un-mixing unit (184), wherein the un-mixing unit (184) is configured to un-mix the second transformed downmix, when the activation indication is set to the activation state, based on parametric side information on the one or more audio object signals to obtain the audio output signal, and to un-mix the first transformed downmix, when the activation indication is not set to the activation state, based on the parametric side information on the one or more audio object signals to obtain the audio output signal. Furthermore, an encoder is provided.
    • 提供了一种用于从降混信号产生包括一个或多个音频输出声道的音频输出信号的解码器。 降混信号对一个或多个音频对象信号进行编码。 解码器包括用于根据一个或多个音频对象信号中的至少一个的信号属性将激活指示设置为激活状态的控制单元(181)。 此外,解码器包括用于变换下混合信号以获得包括多个第一子带信道的第一变换下混合的第一分析模块(182)。 此外,解码器包括第二分析模块(183),用于当激活指示被设置为激活状态时,通过转换第一子带信道中的至少一个以获得多个第二子带信道来产生第二变换下混合,其中 第二变换下混合包括尚未被第二分析模块和第二子带信道变换的第一子带信道。 此外,解码器包括解混合单元(184),其中,当激活指示被设置为激活状态时,解混合单元(184)被配置为基于参数侧信息来解混合第二变换下混合 在一个或多个音频对象信号上获得音频输出信号,并且当激活指示未被设置为激活状态时,基于关于一个或多个音频对象的参数侧信息来解混合第一变换下混合 信号以获得音频输出信号。 此外,提供了一种编码器。
    • 6. 发明公开
    • Encoder, decoder and methods for backward compatible multi-resolution spatial-audio-object-coding
    • 编码器,解码器和方法,用于向后兼容的空间音频对象编码多分辨率
    • EP2717261A1
    • 2014-04-09
    • EP13167485.5
    • 2013-05-13
    • Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.Friedrich-Alexander-Universität Erlangen-Nürnberg
    • Disch, SaschaFuchs, HaraldPaulus, JouniTerentiv, LeonHellmuth, OliverHerre, Jürgen
    • G10L19/008G10L19/02
    • G10L19/008G10L19/02
    • A decoder for generating an un-mixed audio signal comprising a plurality of un-mixed audio channels is provided. Moreover, an encoder and an encoded audio signal is provided. The decoder comprises an un-mixing-information determiner for determining un-mixing information by receiving first parametric side information on the at least one audio object signal and second parametric side information on the at least one audio object signal, wherein the frequency resolution of the second parametric side information is higher than the frequency resolution of the first parametric side information. Moreover, the decoder comprises an un-mix module for applying the un-mixing information on a downmix signal, indicating a downmix of at least one audio object signal, to obtain an un-mixed audio signal comprising the plurality of un-mixed audio channels. The un-mixing-information determiner is configured to determine the un-mixing information by modifying the first parametric information and the second parametric information to obtain modified parametric information, such that the modified parametric information has a frequency resolution which is higher than the first frequency resolution.
    • 提供了一种用于在非混合音频信号产生包括未混合音频信道与多个解码器。 更完了,在编码器和编码的音频信号被提供。 通过接收关于在所述至少一个音频对象信号的所述至少一个音频对象信号,并且第二参数侧信息的第一参数侧信息,worin的频率分辨率的解码器,用于确定开采未混合信息未混合-信息确定包括 第二参数侧信息比的第一参量侧信息的频率分辨率越高。 更上方,所述解码器包括到未混合组件用于施加上的下混信号的未混合的信息,表示至少一个音频对象信号的下混合,以获得包含未混合的音频信道的所述多个未混合音频信号 , 未混合信息确定器被配置为确定矿的未混合通过修改第一参数信息和第二参数的信息,以获得修改的参数信息,检查做了修改的参数信息的频率分辨率的所有比所述第一频率高 分辨率。
    • 8. 发明公开
    • APPARATUS AND METHOD FOR HARMONIC-PERCUSSIVE-RESIDUAL SOUND SEPARATION USING A STRUCTURE TENSOR ON SPECTROGRAMS
    • 使用谱图上的结构张量对谐振 - 余弦声谱分离的装置和方法
    • EP3220386A1
    • 2017-09-20
    • EP16161251.0
    • 2016-03-18
    • Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.Friedrich-Alexander-Universität Erlangen-Nürnberg
    • Niedermeier, AndreasFüg, RichardDisch, SaschaMüller, MeinardDriedger, Jonathan
    • G10H1/40G06F17/30
    • G10H1/40G06F17/30743G10H2210/031G10H2250/025G10H2250/031G10H2250/131G10H2250/221G10H2250/235
    • Apparatus and method for analysing a magnitude spectrogram of an audio signal for Harmonic-Percussive Residual Sound Separation HPSS comprising : Determining a change of a frequency for each time-frequency bin of a plurality of time-frequency bins of the magnitude spectrogram of the audio signal; classifying each time-frequency bin into a signal component group depending on the change of the frequency.
      A structural tensor is applied to the image of the spectogram for preprocessing or feature extraction by edge and corner detection, in particular by calculating predominant orientation angles in the spectrogram.The structure tensor can be considered a black box, where the input is a gray scale image and the outputs are angles n for each pixel corresponding to the direction of lowest change and a certainty or anisotropy measure for this direction for each pixel. A local frequency change is extracted from the angles : It can be determined, whether a time-frequency-bin in the spectrogram belongs to a harmonic component (= low local frequency change) or to a percussive component (= high or infinite local frequency change).
      Examples of application : (figure 1) Distinguish between harmonic, percussive, and residual signal components by employing this orientation information.
      (figure 5) Analyse an audio signal for upmixing to five audio output channels front left, center, right, left surround and right surround :
      - The harmonic weighting factor may be greater for generating the left, center and right output channels compared to the harmonic weighting factor for generating the left surround and right surround output channels.
      - The percussive weighting factor may be smaller for generating the left, center and right output channels compared to the percussive weighting factor for generating the left surround and right surround output channels.
      (figure 6) Compute source separation metrics (source to distortion ratio SDR, source to interference ratio SIR, and source to artifacts ratios SAR) in a recorded audio signal. For example : A vibrato in a singing voice has a high instantaneous frequency change rate; an assignment of a bin in the spectrogram to "residual" is dependent on the bin anisotropy.
    • 用于分析用于谐波 - 冲击声残余声音分离HPSS的音频信号的幅度谱图的装置和方法,包括:确定音频信号的幅度谱图的多个时间频率仓中的每个时间 - 频率仓的频率的变化 ; 根据频率的改变将每个时间频率分组分类为信号分量组。 将结构张量应用于谱图的图像,以通过边缘和角点检测进行预处理或特征提取,特别是通过计算谱图中的主要方位角。结构张量可以被认为是黑盒,其中输入是灰度 图像,并且输出是对应于最低改变方向的每个像素的角度n以及针对每个像素的该方向的确定性或各向异性测量。 从角度提取局部频率变化:可以确定频谱图中的时间频率区间是属于谐波分量(=低局部频率变化)还是属于冲击分量(=高或无限局部频率变化 )。 应用示例:(图1)通过采用此方向信息区分谐波,冲击和残余信号分量。 (图5)分析一个音频信号,用于向上混音至左前,中,右,左环绕和右环绕的五个音频输出声道: - 与谐波相比,谐波加权因子可能更大以产生左侧,中间和右侧输出声道 用于生成左环绕和右环绕输出声道的加权因子。 - 与用于产生左环绕和右环绕输出声道的冲击加权因子相比,用于产生左,中和右输出声道的打击加权因子可以更小。 (图6)在记录的音频信号中计算源分离量度(源与失真比SDR,源与干扰比SIR以及源与伪像比SAR)。 例如:歌声中的颤音具有较高的瞬时频率变化率; 频谱图中的箱的分配“剩余”取决于箱各向异性。
    • 9. 发明公开
    • Apparatus and method for efficient synthesis of sinusoids and sweeps by employing spectral patterns
    • 设备和方法,用于Sinosoiden和扫描的有效合成通过使用频谱图案
    • EP2720222A1
    • 2014-04-16
    • EP12199266.3
    • 2012-12-21
    • Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.Friedrich-Alexander-Universität Erlangen-Nürnberg
    • Disch, SaschaSschubert, BenjaminGeiger, RalfEdler, BerndDietz, Martin
    • G10L19/02
    • G10L19/02G10L19/0212G10L19/032
    • An apparatus for generating an audio output signal based on an encoded audio signal spectrum is provided. The apparatus comprises a processing unit (115) for processing the encoded audio signal spectrum to obtain a decoded audio signal spectrum comprising a plurality of spectral coefficients, wherein each of the spectral coefficients has a spectral location within the encoded audio signal spectrum and a spectral value, wherein the spectral coefficients are sequentially ordered according to their spectral location within the encoded audio signal spectrum so that the spectral coefficients form a sequence of spectral coefficients. Moreover, the apparatus comprises a pseudo coefficients determiner (125) for determining one or more pseudo coefficients of the decoded audio signal spectrum, each of the pseudo coefficients having a spectral value. Furthermore, the apparatus comprises a replacement unit (135) for replacing at least one or more pseudo coefficients by a determined spectral pattern to obtain a modified audio signal spectrum, wherein the determined spectral pattern comprises at least two pattern coefficients, wherein each of the at least two pattern coefficients has a spectral value. Moreover, the apparatus comprises a spectrum-time-conversion unit (145) for converting the modified audio signal spectrum to a time-domain to obtain the audio output signal.
    • 提供了一种用于在在编码的音频信号的频谱基于音频输出信号产生的装置。 该装置包括一个处理单元(115),用于处理经编码的音频信号的频谱,以获得解码音频信号的频谱,其包括频谱系数的多元worin每个频谱系数的具有经编码的音频信号频谱中的频谱位置和频谱值 ,worin频谱系数被顺序排列gemäß到他们的频谱位置中的编码的音频信号的频谱内所以做了频谱系数形成的频谱系数的序列。 更上方,该装置包括一个伪系数确定器(125),用于确定的采矿一个或解码的音频信号的频谱的多个伪系数,每一个具有一个频谱值伪系数。 进一步,该装置包括一个替换单元(135),用于通过确定性开采频谱图案替换至少一个或多个伪系数以获得经修改的音频信号的频谱,worin确定性开采频谱图案包括至少两个图案系数在worin每个 至少两个图案系数具有光谱值。 更上方,该装置包括一个频谱 - 时间变换单元(145)为修正的音频信号的频谱转换到时域,以获得所述音频输出信号。
    • 10. 发明公开
    • Audio object separation from mixture signal using object-specific time/frequency resolutions
    • Trennung von Audio-Objekt aus einem Mischsignal mit objektspezifischen Zeit- undFrequenzauflösungen
    • EP2804176A1
    • 2014-11-19
    • EP13167484.8
    • 2013-05-13
    • Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.Friedrich-Alexander-Universität Erlangen-Nürnberg
    • Disch, SaschaPaulus, JouniKastner, Thorsten
    • G10L19/008G10L19/20G10L25/18
    • G10L19/008G10L19/20G10L25/18
    • An audio decoder is proposed for decoding a multi-object audio signal consisting of a downmix signal X and side information PSI. The side information comprises object-specific side information PSI i for an audio object s i in a time/frequency region R(t R ,f R ), and object-specific time/frequency resolution information TFRI i indicative of an object-specific time/frequency resolution TFR h of the object-specific side information for the audio object s i in the time/frequency region R(t R ,f R ). The audio decoder comprises an object-specific time/frequency resolution determiner 110 configured to determine the object-specific time/frequency resolution information TFRI i from the side information PSI for the audio object s i . The audio decoder further comprises an object separator 120 configured to separate the audio object s i from the downmix signal X using the object-specific side information in accordance with the object-specific time/frequency resolution TFRI i . A corresponding encoder and corresponding methods for decoding or encoding are also described.
    • 提出了一种音频解码器,用于对由下混信号X和侧信息PSI组成的多对象音频信号进行解码。 侧面信息包括用于时间/频率区域R(t R,f R)中的音频对象si的对象特定侧信息PSI i以及指示对象特定时间/频率区域的对象特定时间/频率分辨率信息TFRI i, 在时间/频率区域R(t R,f R)中的音频对象si的对象特定侧信息的频率分辨率TFR h。 音频解码器包括对象特定的时间/频率分辨率确定器110,其被配置为从音频对象s i的侧信息PSI确定对象特定的时间/频率分辨率信息TFRI i。 音频解码器还包括对象分离器120,其被配置为根据对象特定时间/频率分辨率TFRI i,使用对象特定侧信息将音频对象s与降混信号X分离。 还描述了相应的编码器和相应的解码或编码方法。