专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明申请

WO2009033288A1 METHOD AND DEVICE FOR FAST ALGEBRAIC CODEBOOK SEARCH IN SPEECH AND AUDIO CODING 审中-公开
标题翻译：用于在语音和音频编码中快速查看代码的方法和设备
公开(公告)号：WO2009033288A1
公开(公告)日：2009-03-19
申请号：PCT/CA2008/001620
申请日：2008-09-11
申请人： VOICEAGE CORPORATION , SALAMI, Redwan , EKSLER, Vaclav , JELINEK, Milan
发明人： SALAMI, Redwan , EKSLER, Vaclav , JELINEK, Milan
IPC分类号： G10L19/00 , G10L19/12 , H04M3/56 , H04N7/15
CPC分类号： G10L19/107
摘要： A method and device for searching an algebraic codebook during encoding of a sound signal, wherein the algebraic codebook comprises a set of codevectors formed of a number of pulse positions and a number of pulses distributed over the pulse positions. In the algebraic codebook searching method and device, a reference signal for use in searching the algebraic codebook is calculated. In a first stage, a position of a first pulse is determined in relation with the reference signal and among the number of pulse positions. In each of a number of stages subsequent to the first stage, (a) an algebraic codebook gain is recomputed, (b) the reference signal is updated using the recomputed algebraic codebook gain and (c ) a position of another pulse is determined in relation with the updated reference signal and among the number of pulse positions. A codevector of the algebraic codebook is computed using the positions of the pulses determined in the first and subsequent stages, wherein a number of the first and subsequent stages corresponds to the number of pulses in the codevectors of the algebraic codebook.
摘要翻译：一种用于在编码声音信号期间搜索代数码本的方法和装置，其中代数码本包括由多个脉冲位置组成的一组码矢量和分布在脉冲位置上的脉冲数。在代数码本搜索方法和装置中，计算用于搜索代数码本的参考信号。在第一阶段中，相对于参考信号和脉冲位置数确定第一脉冲的位置。在第一阶段之后的多个阶段的每个阶段中，（a）代数码本增益被重新计算，（b）使用重新计算的代数码本增益来更新参考信号，并且（c）另一个脉冲的位置被确定具有更新的参考信号和脉冲位置的数量。使用在第一级和后级中确定的脉冲的位置来计算代数码本的码矢量，其中第一级和后级的数量对应于代数码本的代码矢量中的脉冲数。

2. 发明申请

WO2008049221A1 METHOD AND DEVICE FOR CODING TRANSITION FRAMES IN SPEECH SIGNALS 审中-公开
标题翻译：用于编码语音信号中的过渡帧的方法和装置
公开(公告)号：WO2008049221A1
公开(公告)日：2008-05-02
申请号：PCT/CA2007/001896
申请日：2007-10-24
申请人： VOICEAGE CORPORATION , EKSLER, Vaclav , JELINEK, Milan , SALAMI, Redwan
发明人： EKSLER, Vaclav , JELINEK, Milan , SALAMI, Redwan
IPC分类号： G01L19/08 , G01L19/12
CPC分类号： G10L19/08 , G10L19/12
摘要： There is provided a transition mode device and method for use in a predictive-type sound signal codec for producing a transition mode excitation replacing an adaptive codebook excitation in a transition frame and/or a frame following the transition in the sound signal, comprising an input for receiving a codebook index and a transition mode codebook for generating a set of codevectors independent from past excitation. The transition mode codebook is responsive to the index for generating, in the transition frame and/or frame following the transition, one of the codevectors of the set corresponding to the transition mode excitation. There is also provided an encoding device and method and a decoding device and method using the above described transition mode device and method.
摘要翻译：提供了一种用于预测型声音信号编解码器中的过渡模式装置和方法，用于产生在过渡帧和/或声音信号中的转变之后的帧中替换自适应码本激励的转换模式激励，包括输入用于接收码本索引和用于生成独立于过去激励的一组码矢量的转换模式码本。转换模式码本响应于索引，用于在转换之后的转换帧和/或帧中生成对应于转换模式激励的集合的码矢量之一。还提供了一种编码装置和方法以及使用上述转换模式装置和方法的解码装置和方法。

3. 发明申请

WO2004006226A1 METHOD AND DEVICE FOR EFFICIENT IN-BAND DIM-AND-BURST SIGNALING AND HALF-RATE MAX OPERATION IN VARIABLE BIT-RATE WIDEBAND SPEECH CODING FOR CDMA WIRELESS SYSTEMS 审中-公开
标题翻译：用于CDMA无线系统的可变比特率宽带语音编码中有效的带内DIM-AND-BURST信令和半速率最大操作的方法和设备
公开(公告)号：WO2004006226A1
公开(公告)日：2004-01-15
申请号：PCT/CA2003/000980
申请日：2003-06-27
申请人： VOICEAGE CORPORATION , JELINEK, Milan , SALAMI, Redwan
发明人： JELINEK, Milan , SALAMI, Redwan
IPC分类号： G10L19/14
CPC分类号： G10L19/24
摘要： In the method and device for interoperating a first station using a first communication scheme and comprising a first coder and a first decoder with a second station using a second communication scheme and comprising a second coder and a second decoder, communication between the first and second stations is conducted by transmitting signal-coding parameters related to a sound signal from the coder of one of the first and second stations to the decoder of the other station. The sound signal is classified to determine whether the signal-coding parameters should be transmitted from the coder of one station to the decoder of the other station using a first communication mode in which full bit rate is used for transmission of the signal-coding parameters. When classification of the sound signal determines that the signal-coding parameters should be transmitted using the first communication mode and when a request to transmit the signal-coding parameters from the coder of one station to the decoder of the other station using a second communication mode designed to reduce bit rate during transmission of the signal-coding parameters is received, a portion of the signal-coding parameters from the coder one station is dropped and the remaining signal-coding parameters are transmitting to the decoder of the other station using the second communication mode. The dropped portion of the signal-coding parameters are regenerated before the decoder of the other station decodes the signal-coding parameters.
摘要翻译：在用于使用第一通信方案并且包括第一编码器和第一解码器以及使用第二通信方案的第二站来互操作第一站并且包括第二编码器和第二解码器的方法和设备中，第一和第二站之间的通信是通过将与来自第一和第二站之一的编码器的声音信号有关的信号编码参数发送到另一个站的解码器来进行的。声音信号被分类以确定信号编码参数是否应该使用全比特率被用于传输信号编码参数的第一通信模式从一个站的编码器传送到另一个站的解码器。当声音信号的分类确定应当使用第一通信模式传输信号编码参数时以及当使用第二通信模式从一个站的编码器传输信号编码参数到另一个站的解码器的请求时被设计成在信号编码参数的传输期间降低比特率被接收，来自编码器一站的信号编码参数的一部分被丢弃，并且其余的信号编码参数被传输到另一站的解码器，使用第二通讯模式。信号编码参数的丢失部分在另一个站的解码器解码信号编码参数之前被重新生成。

4. 发明申请

WO2007073604A1 METHOD AND DEVICE FOR EFFICIENT FRAME ERASURE CONCEALMENT IN SPEECH CODECS 审中-公开
标题翻译：语音编解码器中有效帧擦除隐藏的方法和装置
公开(公告)号：WO2007073604A1
公开(公告)日：2007-07-05
申请号：PCT/CA2006/002146
申请日：2006-12-27
申请人： VOICEAGE CORPORATION , VAILLANCOURT, Tommy , JELINEK, Milan , GOURNAY, Philippe , SALAMI, Redwan
发明人： VAILLANCOURT, Tommy , JELINEK, Milan , GOURNAY, Philippe , SALAMI, Redwan
IPC分类号： G10L19/00 , G10L21/02
CPC分类号： G10L19/005
摘要： A method and device for concealing frame erasures caused by frames of an encoded sound signal erased during transmission from an encoder to a decoder and for recovery of the decoder after frame erasures comprise, in the encoder, determining concealment/recovery parameters including at least phase information related to frames of the encoded sound signal. The concealment/recovery parameters determined in the encoder are transmitted to the decoder and, in the decoder, frame erasure concealment is conducted in response to the received concealment/recovery parameters. The frame erasure concealment comprises resynchronizing, in response to the received phase information, the erasure-concealed frames with corresponding frames of the sound signal encoded at the encoder. When no concealment/recovery parameters are transmitted to the decoder, a phase information of each frame of the encoded sound signal that has been erased during transmission from the encoder to the decoder is estimated in the decoder. Also, frame erasure concealment is conducted in the decoder in response to the estimated phase information, wherein the frame erasure concealment comprises resynchronizing, in response to the estimated phase information, each erasure-concealed frame with a corresponding frame of the sound signal encoded at the encoder.
摘要翻译：一种用于隐藏由在从编码器到解码器的传输期间擦除的编码声音信号的帧引起的帧擦除以及在帧擦除之后恢复解码器的方法和设备包括在编码器中确定隐藏 /恢复参数至少包括与编码声音信号的帧有关的相位信息。在编码器中确定的隐藏/恢复参数被发送到解码器，并且在解码器中，响应于所接收的隐藏/恢复参数进行帧删除隐藏。帧擦除隐藏包括响应于接收到的相位信息而将隐藏了擦除的帧与在编码器处编码的声音信号的对应帧再同步。当没有隐藏/恢复参数被发送到解码器时，在解码器中估计在从编码器到解码器的传输期间已被擦除的编码声音信号的每帧的相位信息。另外，响应于估计的相位信息，在解码器中进行帧擦除隐藏，其中帧擦除隐藏包括响应于估计的相位信息，将每个擦除隐藏帧与在该帧中编码的声音信号的对应帧重新同步编码器。

5. 发明申请

WO2004097797A1 METHOD AND DEVICE FOR GAIN QUANTIZATION IN VARIABLE BIT RATE WIDEBAND SPEECH CODING 审中-公开
标题翻译：用于在可变位速率宽带语音编码中增益量化的方法和装置
公开(公告)号：WO2004097797A1
公开(公告)日：2004-11-11
申请号：PCT/CA2004/000380
申请日：2004-03-12
申请人： VOICEAGE CORPORATION , JELINEK, Milan , SALAMI, Redwan
发明人： JELINEK, Milan , SALAMI, Redwan
IPC分类号： G10L19/08
CPC分类号： G10L19/083 , G10L19/24
摘要： The present invention relates to a gain quantization method and device for implementation in a technique for coding a sampled sound signal processed, during coding, by successive frames of L samples, wherein each frame is divided into a number of subframes and each subframe comprises a number N of samples, where N In the gain quantization method and device, an initial pitch gain is calculated based on a number f of subframes, a portion of a gain quantization codebook is selected in relation to the initial pitch gain, and pitch and fixed-codebook gains are jointly quantized. This joint quantization of the pitch and fixed-codebook gains comprises, for the number f of subframes, searching the gain quantization codebook in relation to a search criterion. The codebook search is restricted to the selected portion of the gain quantization codebook and an index of the selected portion of the gain quantization codebook best meeting the search criterion is found.
摘要翻译：增益量化方法和装置技术领域本发明涉及一种增益量化方法和装置，用于在由L个采样的连续帧在编码期间对采样的声音信号进行编码的技术中实现，其中每个帧被划分成多个子帧，并且每个子帧包括数字 N个样本，其中N

6. 发明申请

WO2004034376A2 METHOD FOR INTEROPERATION BETWEEN ADAPTIVE MULTI-RATE WIDEBAND (AMR-WB) AND MULTI-MODE VARIABLE BIT-RATE WIDEBAND (VMR-WB) CODECS 审中-公开
标题翻译：自适应多速率宽带（AMR-WB）与多模式可变比特率宽带（VMR-WB）编解码器之间的交互方法
公开(公告)号：WO2004034376A2
公开(公告)日：2004-04-22
申请号：PCT/CA2003/001572
申请日：2003-10-10
申请人： VOICEAGE CORPORATION , JELINEK, Milan , SALAMI, Redwan
发明人： JELINEK, Milan , SALAMI, Redwan
IPC分类号： G10L
CPC分类号： G10L19/24 , G10L19/012 , G10L19/173
摘要： A source-controlled Variable bit-rate Multi-mode WideBand (VMR-WB) codec, having a mode of operation that is interoperable with the Adaptive Multi-Rate wideband (AMR-WB) codec, the codec comprising: at least one Interoperable full-rate (1-FR) mode, having a first bit allocation structure based an one of a AMR-WB codec coding types; and at least one comfort noise generator (CNG) coding type for encoding inactive speech frame having a second bit allocation structure based an AMR-WB SID_UPDATE coding type. Methods for i) digitally encoding a sound using a source-controlled Variable bit rate multi-mode wideband (VMR-WB) codec for interoperation with an adaptative multi-rate wideband (AMR-WB) codec, ii) translating a Variable bit rate multi-mode wideband (VMR-WB) codecsignal frame into an Adaptive Multi-Rate wideband (AMR-WB) signal frame, iii) translating an Adaptive Multi-Rate wideband (AMR-WB) signal frame into a Variable bit rate multi-mode wideband (VMR-WB) signal frame, and iv) translating an Adaptive Multi-Rate wideband (AMR-WB) signal frame into a Variable bit rate multi-mode wideband (VMR-WB) signal frame are also provided.
摘要翻译：源控制的可变比特率多模宽带（VMR-WB）编解码器，具有可与自适应多速率宽带（AMR-WB）编解码器互操作的操作模式，编解码器包括：至少一个可互操作的全（1-FR）模式，具有基于AMR-WB编解码器类型之一的第一比特分配结构; 以及至少一种用于基于AMR-WB SID_UPDATE编码类型对具有第二比特分配结构的无效语音帧进行编码的舒适噪声发生器（CNG）编码类型。用于i）使用源控制的可变比特率多模宽带（VMR-WB）编解码器对适应性多速率宽带（AMR-WB）编解码器进行互操作的数字编码声音的方法，ii）将可变比特率多（VMR-WB）编解码信号帧转换为自适应多速率宽带（AMR-WB）信号帧，iii）将自适应多速率宽带（AMR-WB）信号帧转换为可变比特率多模宽带（VMR-WB）信号帧，以及iv）将自适应多速率宽带（AMR-WB）信号帧转换为可变比特率多模宽带（VMR-WB）信号帧。

7. 发明申请

WO2009109050A8 SYSTEM AND METHOD FOR ENHANCING A DECODED TONAL SOUND SIGNAL 审中-公开
公开(公告)号：WO2009109050A8
公开(公告)日：2009-09-11
申请号：PCT/CA2009/000276
申请日：2009-03-05
申请人： VOICEAGE CORPORATION , VAILLANCOURT, Tommy , JELINEK, Milan , MALENOVSKY, Vladimir , SALAMI, Redwan
发明人： VAILLANCOURT, Tommy , JELINEK, Milan , MALENOVSKY, Vladimir , SALAMI, Redwan
IPC分类号： G10L21/02 , G10L19/12
摘要： A system and method for enhancing a tonal sound signal decoded by a decoder of a speech-specific codec in response to a received coded bit stream, in which a spectral analyser is responsive to the decoded tonal sound signal to produce spectral parameters representative of the decoded tonal sound signal. A quantization noise in low-energy spectral regions of the decoded tonal sound signal is reduced in response to the spectral parameters produced by the spectral analyser. The spectral analyser divides a spectrum resulting from spectral analysis into a set of critical frequency bands each comprising a number of frequency bins, and the reducer of quantization noise comprises a noise attenuator that scales the spectrum of the decoded tonal sound signal per critical frequency band, per frequency bin, or per both critical frequency band and frequency bin.

8. 发明申请

WO2009000073A1 METHOD AND DEVICE FOR SOUND ACTIVITY DETECTION AND SOUND SIGNAL CLASSIFICATION 审中-公开
标题翻译：用于声音活动检测和声音信号分类的方法和装置
公开(公告)号：WO2009000073A1
公开(公告)日：2008-12-31
申请号：PCT/CA2008/001184
申请日：2008-06-20
申请人： VOICEAGE CORPORATION , MALENOWSKY, Vladimir , JELINEK, Milan , VAILLANCOURT, Tommy , SALAMI, Redwan
发明人： MALENOWSKY, Vladimir , JELINEK, Milan , VAILLANCOURT, Tommy , SALAMI, Redwan
IPC分类号： G10L11/00 , G10L19/02 , G10L21/02
CPC分类号： G10L25/78 , G10L19/22
摘要： A device and method for estimating a tonality of a sound signal comprise: calculating a current residual spectrum of the sound signal; detecting peaks in the current residual spectrum; calculating a correlation map between the current residual spectrum and a previous residual spectrum for each detected peak; and calculating a long-term correlation map based on the calculated correlation map, the long-term correlation map being indicative of a tonality in the sound signal.
摘要翻译：用于估计声音信号的音调的装置和方法包括：计算声音信号的当前残余频谱; 检测当前残留谱中的峰; 计算每个检测到的峰值的当前残差谱和先前残差谱之间的相关图; 以及基于所计算的相关图计算长期相关图，所述长期相关图表示所述声音信号中的音调。

9. 发明申请

WO2003102921A1 METHOD AND DEVICE FOR EFFICIENT FRAME ERASURE CONCEALMENT IN LINEAR PREDICTIVE BASED SPEECH CODECS 审中-公开
标题翻译：基于线性预测的语音编码器中有效框架隐藏的方法和装置
公开(公告)号：WO2003102921A1
公开(公告)日：2003-12-11
申请号：PCT/CA2003/000830
申请日：2003-05-30
申请人： VOICEAGE CORPORATION , JELINEK, Milan , GOURNAY, Philippe
发明人： JELINEK, Milan , GOURNAY, Philippe
IPC分类号： G10L19/00
CPC分类号： G10L19/005
摘要： The present invention relates to a method and device for improving concealment of frame erasure caused by frames of an encoded sound signal erased during transmission from an encoder (106) to a decoder (110), and for accelerating recovery of the decoder after non erased frames of the encoded sound signal have been received. For that purpose, concealment/recovery parameters are determined in the encoder or decoder. When determined in the encoder (106), the concealment/recovery parameters are transmitted to the decoder (110). In the decoder, erasure frame concealment and decoder recovery is conducted in response to the concealment/recovery parameters. The concealment/recovery parameters may be selected from the group consisting of: a signal classification parameter, an energy information parameter and a phase information parameter. The determination of the concealment/recovery parameters comprises classifying the successive frames of the encoded sound signal as unvoiced, unvoiced transition, voiced transition, voiced, or onset, and this classification is determined on the basis of at least a part of the following parameters: a normalized correlation parameter, a spectral tilt parameter, a signal-to-noise ratio parameter, a pitch stability parameter, a relative frame energy parameter, and a zero crossing parameter.
摘要翻译：本发明涉及一种用于改善由编码器（106）到解码器（110）的传输期间被擦除的编码声音信号的帧引起的帧擦除隐藏的方法和装置，并且用于在非擦除帧之后加速解码器的恢复已经接收到编码声音信号。为此，在编码器或解码器中确定隐藏/恢复参数。当在编码器（106）中确定时，隐藏/恢复参数被传送到解码器（110）。在解码器中，响应于隐藏/恢复参数进行擦除帧隐藏和解码器恢复。隐藏/恢复参数可以从由信号分类参数，能量信息参数和相位信息参数组成的组中选择。隐藏/恢复参数的确定包括将编码声音信号的连续帧分类为无声，无声转换，有声转换，有声或起始，并且该分类基于以下参数的至少一部分来确定：标准化相关参数，频谱倾斜参数，信噪比参数，音调稳定性参数，相对帧能量参数和过零参数。

10. 发明申请

WO2004034379A2 METHODS AND DEVICES FOR SOURCE CONTROLLED VARIABLE BIT-RATE WIDEBAND SPEECH CODING 审中-公开
标题翻译：源控制可变比特率宽带语音编码的方法和设备
公开(公告)号：WO2004034379A2
公开(公告)日：2004-04-22
申请号：PCT/CA2003/001571
申请日：2003-10-09
申请人： VOICEAGE CORPORATION , JELINEK, Milan
发明人： JELINEK, Milan
IPC分类号： G10L19/00
CPC分类号： G10L19/24 , G10L19/012 , G10L19/173
摘要： Speech signal classification and encoding systems and methods are disclosed herein. The signal classification is done in three steps each of them discriminating a specific signal class. First, a voice activity detector (VAD) discriminates between active and inactive speech frames. If an inactive speech frame is detected (background noise signal) then the classification chain ends and the frame is encoded with comfort noise generation (CNG). If an active speech frame is detected, the frame is subjected to a second classifier dedicated to discriminate unvoiced frames. If the classifier classifies the frame as unvoiced speech signal, the classification chain ends, and the frame is encoded using a coding method optimized for unvoiced signals. Otherwise, the speech frame is passed through to the "stable voiced" classification module. If the frame is classified as stable voiced frame, then the frame is encoded using a coding method optimized for stable voiced signals. Otherwise, the frame is likely to contain a non-stationary speech segment such as a voiced onset or rapidly evolving voiced speech signal. In this case a general-purpose speech coder is used at a high bit rate for sustaining good subjective quality .
摘要翻译：本文公开了语音信号分类和编码系统和方法。信号分类通过三个步骤完成，每个步骤区分特定的信号类别。首先，语音活动检测器（VAD）在有效和无效的语音帧之间进行区分。如果检测到无效语音帧（背景噪声信号），则分类链结束，并且以舒适噪声产生（CNG）编码该帧。如果检测到活动语音帧，则该帧经受专用于区分清音帧的第二分类器。如果分类器将帧分类为无声语音信号，则分类链结束，并且使用针对无声信号优化的编码方法对帧进行编码。否则，将语音帧传递到“稳定浊音”分类模块。如果帧被分类为稳定的有声帧，则使用针对稳定浊音信号优化的编码方法对帧进行编码。否则，该帧可能包含诸如有声开始或快速演进的有声语音信号之类的非平稳语音段。在这种情况下，通用语音编码器以高比特率被使用以维持良好的主观质量。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式