会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明公开
    • APPARATUS AND METHOD FOR HARMONIC-PERCUSSIVE-RESIDUAL SOUND SEPARATION USING A STRUCTURE TENSOR ON SPECTROGRAMS
    • 使用谱图上的结构张量对谐振 - 余弦声谱分离的装置和方法
    • EP3220386A1
    • 2017-09-20
    • EP16161251.0
    • 2016-03-18
    • Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.Friedrich-Alexander-Universität Erlangen-Nürnberg
    • Niedermeier, AndreasFüg, RichardDisch, SaschaMüller, MeinardDriedger, Jonathan
    • G10H1/40G06F17/30
    • G10H1/40G06F17/30743G10H2210/031G10H2250/025G10H2250/031G10H2250/131G10H2250/221G10H2250/235
    • Apparatus and method for analysing a magnitude spectrogram of an audio signal for Harmonic-Percussive Residual Sound Separation HPSS comprising : Determining a change of a frequency for each time-frequency bin of a plurality of time-frequency bins of the magnitude spectrogram of the audio signal; classifying each time-frequency bin into a signal component group depending on the change of the frequency.
      A structural tensor is applied to the image of the spectogram for preprocessing or feature extraction by edge and corner detection, in particular by calculating predominant orientation angles in the spectrogram.The structure tensor can be considered a black box, where the input is a gray scale image and the outputs are angles n for each pixel corresponding to the direction of lowest change and a certainty or anisotropy measure for this direction for each pixel. A local frequency change is extracted from the angles : It can be determined, whether a time-frequency-bin in the spectrogram belongs to a harmonic component (= low local frequency change) or to a percussive component (= high or infinite local frequency change).
      Examples of application : (figure 1) Distinguish between harmonic, percussive, and residual signal components by employing this orientation information.
      (figure 5) Analyse an audio signal for upmixing to five audio output channels front left, center, right, left surround and right surround :
      - The harmonic weighting factor may be greater for generating the left, center and right output channels compared to the harmonic weighting factor for generating the left surround and right surround output channels.
      - The percussive weighting factor may be smaller for generating the left, center and right output channels compared to the percussive weighting factor for generating the left surround and right surround output channels.
      (figure 6) Compute source separation metrics (source to distortion ratio SDR, source to interference ratio SIR, and source to artifacts ratios SAR) in a recorded audio signal. For example : A vibrato in a singing voice has a high instantaneous frequency change rate; an assignment of a bin in the spectrogram to "residual" is dependent on the bin anisotropy.
    • 用于分析用于谐波 - 冲击声残余声音分离HPSS的音频信号的幅度谱图的装置和方法,包括:确定音频信号的幅度谱图的多个时间频率仓中的每个时间 - 频率仓的频率的变化 ; 根据频率的改变将每个时间频率分组分类为信号分量组。 将结构张量应用于谱图的图像,以通过边缘和角点检测进行预处理或特征提取,特别是通过计算谱图中的主要方位角。结构张量可以被认为是黑盒,其中输入是灰度 图像,并且输出是对应于最低改变方向的每个像素的角度n以及针对每个像素的该方向的确定性或各向异性测量。 从角度提取局部频率变化:可以确定频谱图中的时间频率区间是属于谐波分量(=低局部频率变化)还是属于冲击分量(=高或无限局部频率变化 )。 应用示例:(图1)通过采用此方向信息区分谐波,冲击和残余信号分量。 (图5)分析一个音频信号,用于向上混音至左前,中,右,左环绕和右环绕的五个音频输出声道: - 与谐波相比,谐波加权因子可能更大以产生左侧,中间和右侧输出声道 用于生成左环绕和右环绕输出声道的加权因子。 - 与用于产生左环绕和右环绕输出声道的冲击加权因子相比,用于产生左,中和右输出声道的打击加权因子可以更小。 (图6)在记录的音频信号中计算源分离量度(源与失真比SDR,源与干扰比SIR以及源与伪像比SAR)。 例如:歌声中的颤音具有较高的瞬时频率变化率; 频谱图中的箱的分配“剩余”取决于箱各向异性。