会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明申请
    • METHOD AND DEVICE FOR EFFICIENT FRAME ERASURE CONCEALMENT IN SPEECH CODECS
    • 方法和设备在语音编码中有效的帧消除隐藏
    • WO2007073604A8
    • 2007-12-21
    • PCT/CA2006002146
    • 2006-12-28
    • VOICEAGE CORPVAILLANCOURT TOMMYJELINEK MILANGOURNAY PHILIPPESALAMI REDWAN
    • VAILLANCOURT TOMMYJELINEK MILANGOURNAY PHILIPPESALAMI REDWAN
    • G10L19/00G10L21/02
    • G10L19/005
    • A method and device for concealing frame erasures caused by frames of an encoded sound signal erased during transmission from an encoder to a decoder and for recovery of the decoder after frame erasures comprise, in the encoder, determining concealment/recovery parameters including at least phase information related to frames of the encoded sound signal. The concealment/recovery parameters determined in the encoder are transmitted to the decoder and, in the decoder, frame erasure concealment is conducted in response to the received concealment/recovery parameters. The frame erasure concealment comprises resynchronizing, in response to the received phase information, the erasure-concealed frames with corresponding frames of the sound signal encoded at the encoder. When no concealment/recovery parameters are transmitted to the decoder, a phase information of each frame of the encoded sound signal that has been erased during transmission from the encoder to the decoder is estimated in the decoder. Also, frame erasure concealment is conducted in the decoder in response to the estimated phase information, wherein the frame erasure concealment comprises resynchronizing, in response to the estimated phase information, each erasure-concealed frame with a corresponding frame of the sound signal encoded at the encoder.
    • 一种用于隐藏在从编码器到解码器的传输期间被擦除的编码声音信号的帧引起的帧擦除和在帧擦除之后恢复解码器的方法和装置,在编码器中包括确定包括至少相位信息的隐藏/恢复参数 与编码的声音信号的帧相关。 在编码器中确定的隐藏/恢复参数被发送到解码器,并且在解码器中,响应于接收的隐藏/恢复参数进行帧擦除隐藏。 帧擦除隐藏包括响应于接收到的相位信息,重新同步擦除隐藏的帧与在编码器处编码的声音信号的相应帧。 当没有隐藏/恢复参数被发送到解码器时,在解码器中估计在从编码器到解码器的传输期间被擦除的编码声音信号的每一帧的相位信息。 此外,响应于估计的相位信息在解码器中进行帧擦除隐藏,其中帧擦除隐藏包括响应于估计的相位信息重新同步每个被擦除隐藏的帧与在该编码的声音信号的相应帧 编码器。
    • 3. 发明申请
    • METHODS AND DEVICES FOR SOURCE CONTROLLED VARIABLE BIT-RATE WIDEBAND SPEECH CODING
    • 用于源控制的可变比特率宽带语音编码的方法和设备
    • WO2004034379A3
    • 2004-12-23
    • PCT/CA0301571
    • 2003-10-09
    • NOKIA CORPJELINEK MILAN
    • JELINEK MILAN
    • G01L19/14G10L11/04G10L19/02G10L19/14G10L21/02
    • G10L19/24G10L19/012G10L19/173
    • Speech signal classification and encoding systems and methods are disclosed herein. The signal classification is done in three steps each of them discriminating a specific signal class. First, a voice activity detector (VAD) discriminates between active and inactive speech frames. If an inactive speech frame is detected (background noise signal) then the classification chain ends and the frame is encoded with comfort noise generation (CNG). If an active speech frame is detected, the frame is subjected to a second classifier dedicated to discriminate unvoiced frames. If the classifier classifies the frame as unvoiced speech signal, the classification chain ends, and the frame is encoded using a coding method optimized for unvoiced signals. Otherwise, the speech frame is passed through to the "stable voiced" classification module. If the frame is classified as stable voiced frame, then the frame is encoded using a coding method optimized for stable voiced signals. Otherwise, the frame is likely to contain a non-stationary speech segment such as a voiced onset or rapidly evolving voiced speech signal. In this case a general-purpose speech coder is used at a high bit rate for sustaining good subjective quality .
    • 在此公开了语音信号分类和编码系统和方法。 信号分类分三个步骤完成,每个步骤区分特定的信号类别。 首先,语音活动检测器(VAD)区分活动和非活动语音帧。 如果检测到不活动的语音帧(背景噪声信号),则分类链结束,并且该帧被编码以舒适噪声产生(CNG)。 如果检测到活动语音帧,则该帧经受专用于区分无声帧的第二分类器。 如果分类器将该帧分类为清音语音信号,则分类链结束,并且使用针对清音信号优化的编码方法对帧进行编码。 否则,语音帧被传递到“稳定浊音”分类模块。 如果帧被分类为稳定浊音帧,则使用针对稳定浊音信号优化的编码方法对帧进行编码。 否则,帧可能包含非平稳的语音片段,例如浊音起始或快速演变的浊音语音信号。 在这种情况下,通用语音编码器以高比特率使用,以维持良好的主观质量。
    • 5. 发明申请
    • CODING GENERIC AUDIO SIGNALS AT LOW BITRATES AND LOW DELAY
    • 编码低频和低延迟的一般音频信号
    • WO2012055016A1
    • 2012-05-03
    • PCT/CA2011001182
    • 2011-10-24
    • VOICEAGE CORPVAILLANCOURT TOMMYJELINEK MILAN
    • VAILLANCOURT TOMMYJELINEK MILAN
    • G10L19/12
    • G10L19/20G10L19/02G10L19/08
    • A mixed time-domain / frequency-domain coding device and method for coding an input sound signal, wherein a time-domain excitation contribution is calculated in response to the input sound signal. A cut-off frequency for the time-domain excitation contribution is also calculated in response to the input sound signal, and a frequency extent of the time-domain excitation contribution is adjusted in relation to this cut-off frequency. Following calculation of a frequency-domain excitation contribution in response to the input sound signal, the adjusted time-domain excitation contribution and the frequency-domain excitation contribution are added to form a mixed time-domain / frequency-domain excitation constituting a coded version of the input sound signal. In the calculation of the time-domain excitation contribution, the input sound signal may be processed in successive frames of the input sound signal and a number of sub-frames to be used in a current frame may be calculated. Corresponding encoder and decoder using the mixed time-domain / frequency-domain coding device are also described.
    • 一种用于编码输入声音信号的混合时域/频域编码装置和方法,其中响应于输入声音信号计算时域激励贡献。 还响应于输入声音信号计算时域激励贡献的截止频率,并且相对于该截止频率调整时域激励贡献的频率范围。 在响应于输入声音信号计算频域激励贡献之后,调整调整的时域激励贡献和频域激励贡献以形成构成编码版本的混合时域/频域激励 输入声音信号。 在时域激励贡献的计算中,可以在输入声音信号的连续帧中处理输入声音信号,并且可以计算要在当前帧中使用的多个子帧。 还描述了使用混合时域/频域编码装置的对应编码器和解码器。
    • 6. 发明申请
    • SIGNAL MODIFICATION METHOD FOR EFFICIENT CODING OF SPEECH SIGNALS
    • 用于语音信号有效编码的信号修改方法
    • WO03052744A2
    • 2003-06-26
    • PCT/CA0201948
    • 2002-12-13
    • VOICEAGE CORPTAMMI MIKKOJELINEK MILANLAFLAMME CLAUDERUOPPILA VESA
    • TAMMI MIKKOJELINEK MILANLAFLAMME CLAUDERUOPPILA VESA
    • G10L19/12G10L19/08
    • G10L19/08
    • For determining a long-term-prediction delay parameter characterizing a long term prediction in a technique using signal modification for digitally encoding a sound signal, the sound signal is divided into a series of successive frames, a feature of the sound signal is located in a previous frame, a corresponding feature of the sound signal is located in a current frame, and the long-term-prediction delay parameter is determined for the current frame while mapping, with the long term prediction, the signal feature of the previous frame with the corresponding signal feature of the current frame. In a signal modification method for implementation into a technique for digitally encoding a sound signal, the sound signal is divided into a series of successive frames, each frame of the sound signal is partitioned into a plurality of signal segments, and at least a part of the signal segments of the frame are warped while constraining the warped signal segments inside the frame. For searching pitch pulses in a sound signal, a residual signal is produced by filtering the sound signal through a linear prediction analysis filter, a weighted sound signal is produced by processing the sound signal through a weighting filter, the weighted sound signal being indicative of signal periodicity, a synthesized weighted sound signal is produced by filtering a synthesized speech signal produced during a last subframe of a previous frame of the sound signal through the weighting filter, a last pitch pulse of the sound signal of the previous frame is located from the residual signal, a pitch pulse prototype of given length is extracted around the position of the last pitch pulse of the sound signal of the previous frame using the synthesized weighted sound signal, and the pitch pulses are located in a current frame using the pitch pulse prototype.
    • 为了确定在使用用于数字编码声音信号的信号修改的技术中表征长期预测的长期预测延迟参数,声音信号被分成一系列连续的帧,声音信号的特征位于 前一帧,声音信号的对应特征位于当前帧中,并且为当前帧确定长期预测延迟参数,同时长期预测将前一帧的信号特征与 当前帧的相应信号特征。 在用于实现用于对声音信号进行数字编码的技术的信号修改方法中,声音信号被分成一系列连续的帧,声音信号的每个帧被划分为多个信号段,并且至少部分 框架的信号段扭曲,同时约束框架内的翘曲的信号段。 为了在声音信号中搜索音调脉冲,通过线性预测分析滤波器对声音信号进行滤波来产生残留信号,通过加权滤波器处理声音信号产生加权声音信号,加权声音信号表示信号 通过对通过加权滤波器的声音信号的先前帧的最后一个子帧产生的合成语音信号进行滤波,产生合成加权声音信号,前一帧的声音信号的最后音调脉冲位于剩余 信号,使用合成的加权声音信号在前一帧的声音信号的最后音调脉冲的位置周围提取给定长度的音调脉冲原型,并且使用音调脉冲原型将音调脉冲位于当前帧中。
    • 7. 发明申请
    • SYSTEM AND METHOD FOR ENHANCING A DECODED TONAL SOUND SIGNAL
    • 用于增强解码的声音信号的系统和方法
    • WO2009109050A1
    • 2009-09-11
    • PCT/CA2009000276
    • 2009-03-05
    • VOICEAGE CORPVAILLANCOURT TOMMYJELINEK MILANMALENOVSKY VLADIMIRSALAMI REDWAN
    • VAILLANCOURT TOMMYJELINEK MILANMALENOVSKY VLADIMIRSALAMI REDWAN
    • G10L21/02G10L19/12
    • G10L19/26G10L25/18
    • A system and method for enhancing a tonal sound signal decoded by a decoder of a speech-specific codec in response to a received coded bit stream, in which a spectral analyser is responsive to the decoded tonal sound signal to produce spectral parameters representative of the decoded tonal sound signal. A quantization noise in low-energy spectral regions of the decoded tonal sound signal is reduced in response to the spectral parameters produced by the spectral analyser. The spectral analyser divides a spectrum resulting from spectral analysis into a set of critical frequency bands each comprising a number of frequency bins, and the reducer of quantization noise comprises a noise attenuator that scales the spectrum of the decoded tonal sound signal per critical frequency band, per frequency bin, or per both critical frequency band and frequency bin.
    • 一种用于响应于接收的编码比特流来增强由语音专用编解码器的解码器解码的音调声音信号的系统和方法,其中频谱分析仪响应于解码的音调声音信号以产生表示解码的频谱参数 音调声信号。 响应于由光谱分析仪产生的光谱参数,解码的音调声音信号的低能谱区域中的量化噪声被减小。 光谱分析仪将由光谱分析得到的光谱分成一组包括多个频率仓的临界频带,并且量化噪声的衰减器包括噪声衰减器,其对每个关键频带的解码音调声音信号的频谱进行缩放, 每个频率仓,或每个临界频带和频率仓。
    • 8. 发明申请
    • METHODS FOR INTEROPERATION BETWEEN ADAPTIVE MULTI-RATE WIDEBAND (AMR-WB) AND MULTI-MODE VARIABLE BIT-RATE WIDEBAND (WMR-WB) SPEECH CODECS
    • 用于自适应多速率宽带(AMR-WB)和多模式可变比特率宽带(WMR-WB)语音编码器之间的交互的方法
    • WO2004034376A3
    • 2004-06-10
    • PCT/CA0301572
    • 2003-10-10
    • VOICEAGE CORPJELINEK MILANSALAMI REDWAN
    • JELINEK MILANSALAMI REDWAN
    • G01L19/14G10L11/04G10L19/02G10L19/14G10L21/02
    • G10L19/24G10L19/012G10L19/173
    • A source-controlled Variable bit-rate Multi-mode WideBand (VMR-WB) speech codec, having a mode of operation that is interoperable with the Adaptive Multi-Rate wideband (AMR-WB) codec, the codec comprising: at least one Interoperable full-rate (1-FR) mode, having a first bit allocation structure based an one of a AMR-WB codec coding types; and at least one comfort noise generator (CNG) coding type for encoding inactive speech frame having a second bit allocation structure based on AMR-WB SID_UPDATE coding type. Methods for i) digitally encoding a sound using a source-controlled Variable bit rate multi-mode wideband (VMR-WB) speech codec for interoperation with an adaptative multi-rate wideband (AMR-WB) codec, ii) translating a Variable bit rate multi-mode wideband (VMR-WB) speech codec-signal frame into an Adaptive Multi-Rate wideband (AMR-WB) speech signal frame, iii) translating an Adaptive Multi-Rate wideband (AMR-WB) speech signal frame into a Variable bit rate multi-mode wideband (VMR-WB) speech signal frame, and iv) translating an Adaptive Multi-Rate wideband (AMR-WB) speech signal frame into a Variable bit rate multi-mode wideband (VMR-WB) speech signal frame are also provided.
    • 一种具有与自适应多速率宽带(AMR-WB)编解码器相互操作的操作模式的源控制的可变比特率多模式宽带(VMR-WB)语音编解码器,所述编解码器包括:至少一个可互操作的 全速率(1-FR)模式,具有基于AMR-WB编解码器类型之一的第一比特分配结构; 以及至少一种用于编码基于AMR-WB SID_UPDATE编码类型的具有第二位分配结构的无效语音帧的舒适噪声发生器(CNG)编码类型。 用于i)使用源控制的可变比特率多模宽带(VMR-WB)语音编解码器对数字编码声音的方法,用于与适应性多速率宽带(AMR-WB)编解码器进行互操作,ii)将可变比特率 多模宽带(VMR-WB)语音编解码信号帧转换为自适应多速率宽带(AMR-WB)语音信号帧,iii)将自适应多速率宽带(AMR-WB)语音信号帧转换为变量 比特率多模宽带(VMR-WB)语音信号帧,以及iv)将自适应多速率宽带(AMR-WB)语音信号帧转换为可变比特率多模宽带(VMR-WB)语音信号帧 也提供。
    • 9. 发明申请
    • METHOD AND DEVICE FOR EFFICIENT IN-BAND DIM-AND-BURST SIGNALING AND HALF-RATE MAX OPERATION IN VARIABLE BIT-RATE WIDEBAND SPEECH CODING FOR CDMA WIRELESS SYSTEMS
    • 用于CDMA无线系统的可变位速率宽带语音编码中的有效带内DIM-AND-BURST信令和高达率最大值操作的方法和设备
    • WO2004006226B1
    • 2004-03-04
    • PCT/CA0300980
    • 2003-06-27
    • VOICEAGE CORPJELINEK MILANSALAMI REDWAN
    • JELINEK MILANSALAMI REDWAN
    • G10L19/12G10L19/24H03M7/30H04B1/707H04B7/24H04B7/26G10L19/14H04Q7/30
    • G10L19/24
    • In the method and device for interoperating a first station using a first communication scheme and comprising a first coder and a first decoder with a second station using a second communication scheme and comprising a second coder and a second decoder, communication between the first and second stations is conducted by transmitting signal-coding parameters related to a sound signal from the coder of one of the first and second stations to the decoder of the other station. The sound signal is classified to determine whether the signal-coding parameters should be transmitted from the coder of one station to the decoder of the other station using a first communication mode in which full bit rate is used for transmission of the signal-coding parameters. When classification of the sound signal determines that the signal-coding parameters should be transmitted using the first communication mode and when a request to transmit the signal-coding parameters from the coder of one station to the decoder of the other station using a second communication mode designed to reduce bit rate during transmission of the signal-coding parameters is received, a portion of the signal-coding parameters from the coder one station is dropped and the remaining signal-coding parameters are transmitting to the decoder of the other station using the second communication mode. The dropped portion of the signal-coding parameters are regenerated before the decoder of the other station decodes the signal-coding parameters.
    • 在用于使用第一通信方案互操作第一站的方法和设备中,包括第一编码器和具有第二站的第一解码器,并且包括第二编码器和第二解码器,第一和第二站之间的通信 通过将与来自第一和第二站中的一个的编码器的声音信号相关的信号编码参数发送到另一站的解码器来进行。 声音信号被分类以确定信号编码参数是否应当使用全位比特率用于传输信号编码参数的第一通信模式从一个站的编码器发送到另一站的解码器。 当声音信号的分类确定应当使用第一通信模式发送信号编码参数时,以及当使用第二通信模式从一个站的编码器向另一站的解码器发送信号编码参数的请求时 被设计为在信号编码参数的传输期间降低比特率被接收到,来自编码器一个站的信号编码参数的一部分被丢弃,剩下的信号编码参数使用第二个信号编码参数传送到另一台的解码器 通讯模式。 信号编码参数的丢弃部分在另一站的解码器解码信号编码参数之前被再生。