会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • Method and system for pitch contour quantization in audio coding
    • 音频编码中音调轮廓量化的方法和系统
    • US20080275695A1
    • 2008-11-06
    • US12150307
    • 2008-04-25
    • Anssi RamoJani NurminenSakari HimanenAri Heikkinen
    • Anssi RamoJani NurminenSakari HimanenAri Heikkinen
    • G10L11/04
    • G10L19/032G10L19/09
    • A method and device for improving coding efficiency in audio coding. From the pitch values of a pitch contour of an audio signal, a plurality of simplified pitch contour segments are generated to approximate the pitch contour, based on one or more pre-selected criteria. The contour segments can be linear or non-linear with each contour segment represented by a first end point and a second end point. If the contour segments are linear, then only the information regarding the end points, instead of the pitch values, are provided to a decoder for reconstructing the audio signal. The contour segment can have a fixed maximum length or a variable length, but the deviation between a contour segment and the pitch values in that segment is limited by a maximum value.
    • 一种提高音频编码效率的方法和装置。 根据音频信号的音调轮廓的音调值,基于一个或多个预先选择的标准,生成多个简化俯仰轮廓线段以近似俯仰轮廓。 轮廓段可以是由第一终点和第二终点表示的每个轮廓段线性或非线性的。 如果轮廓段是线性的,则仅将关于终点而不是音调值的信息提供给用于重建音频信号的解码器。 轮廓段可以具有固定的最大长度或可变长度,但轮廓段与该段中的俯仰值之间的偏差受到最大值的限制。
    • 2. 发明申请
    • Method and apparatus for reducing synchronization delay in packet switched voice terminals using speech decoder modification
    • 使用语音解码器修改来减少分组交换语音终端中的同步延迟的方法和装置
    • US20080235009A1
    • 2008-09-25
    • US12154487
    • 2008-05-23
    • Ari HeikkinenAri Lakaniemi
    • Ari HeikkinenAri Lakaniemi
    • G10L19/00
    • G10L19/167G10L21/04G10L2019/0012H04J3/0632
    • A device is disclosed that makes packetized and encoded speech data audible to a listener, as is a method for operating the device. The device includes a unit for generating a synchronization request for reducing an amount of synchronization delay, and further includes a speech decoder that is responsive to the synchronization delay adjustment request for executing a time-warping operation for one of lengthening or shortening a duration of a speech frame. In one embodiment the speech decoder comprises a code excited linear prediction (CELP) speech decoder, and the CELP decoder time-warping operation is applied to a reconstructed excitation signal u(k) to derive a time-warped reconstructed signal uw(k). The time-warped reconstructed signal uw(k) is input to a Linear Predictor (LP) synthesis filter to derive a CELP decoder time-warped output signal ŷw(k) In another embodiment the speech decoder comprises a parametric speech decoder, and where an adaptation of the frame length N in the parametric speech decoder results in the use of a modified frame length Nw.
    • 公开了一种使分组化和编码的语音数据可听见的收听器的设备,以及用于操作设备的方法。 该装置包括用于产生用于减少同步延迟量的同步请求的单元,并且还包括语音解码器,该语音解码器响应于同步延迟调整请求,用于执行延时或缩短持续时间的延时操作 语音框架 在一个实施例中,语音解码器包括代码激励线性预测(CELP)语音解码器,并且将CELP解码器时间扭曲操作应用于重构的激励信号u(k)以导出时间扭曲的重构信号u (k)。 时间扭曲重构信号u(k)被输入到线性预测器(LP)合成滤波器,以导出CELP解码器时间扭曲输出信号ŷŷŷ )。 在另一个实施例中,语音解码器包括参数语音解码器,并且其中参数语音解码器中的帧长度N的调整导致使用经修改的帧长度N N w N。
    • 3. 发明授权
    • Method and apparatus for reducing synchronization delay in packet switched voice terminals using speech decoder modification
    • 使用语音解码器修改来减少分组交换语音终端中的同步延迟的方法和装置
    • US07394833B2
    • 2008-07-01
    • US10364588
    • 2003-02-11
    • Ari HeikkinenAri Lakaniemi
    • Ari HeikkinenAri Lakaniemi
    • H04J3/06
    • G10L19/167G10L21/04G10L2019/0012H04J3/0632
    • A device is disclosed that makes packetized and encoded speech data audible to a listener, as is a method for operating the device. The device includes a unit for generating a synchronization request for reducing an amount of synchronization delay, and further includes a speech decoder that is responsive to the synchronization delay adjustment request for executing a time-warping operation for one of lengthening or shortening a duration of a speech frame. In one embodiment the speech decoder comprises a code excited linear prediction (CELP) speech decoder, and the CELP decoder time-warping operation is applied to a reconstructed excitation signal u(k) to derive a time-warped reconstructed signal uw(k). The time-warped reconstructed signal uw(k) is input to a Linear Predictor (LP) synthesis filter to derive a CELP decoder time-warped output signal y^w(k). In another embodiment the speech decoder comprises a parametric speech decoder, and where an adaptation of the frame length N in the parametric speech decoder results in the use of a modified frame length Nw.
    • 公开了一种使分组化和编码的语音数据可听见的收听器的设备,以及用于操作设备的方法。 该装置包括用于产生用于减少同步延迟量的同步请求的单元,并且还包括语音解码器,该语音解码器响应于同步延迟调整请求,用于执行延时或缩短持续时间的延时操作 语音框架 在一个实施例中,语音解码器包括代码激励线性预测(CELP)语音解码器,并且将CELP解码器时间扭曲操作应用于重构的激励信号u(k)以导出时间扭曲的重构信号u (k)。 时间扭曲重构信号u(k)被输入到线性预测器(LP)合成滤波器,以导出CELP解码器时间扭曲输出信号。 (k)。 在另一个实施例中,语音解码器包括参数语音解码器,并且其中参数语音解码器中的帧长度N的调整导致使用经修改的帧长度N N w N。
    • 5. 发明授权
    • Method and apparatus for speech coding with voiced/unvoiced determination
    • 用语音/清音确定语音编码的方法和装置
    • US06915257B2
    • 2005-07-05
    • US09740826
    • 2000-12-21
    • Ari HeikkinenSamuli PietilaVesa Ruoppila
    • Ari HeikkinenSamuli PietilaVesa Ruoppila
    • G10L25/93G10L11/06G10L11/04
    • G10L25/93
    • This invention presents a voicing determination algorithm for classification of a speech signal segment as voiced or unvoiced. The algorithm is based on a normalized autocorrelation where the length of the window is proportional to the pitch period. The speech segment to be classified is further divided into a number of sub-segments, and the normalized autocorrelation is calculated for each sub-segment if a certain number of the normalized autocorrelation values is above a predetermined threshold, the speech segment is classified as voiced. To improve the performance of the voicing determination algorithm in unvoiced to voiced transients, the normalized autocorrelations of the last sub-segments are emphasized. The performance of the voicing decision algorithm can be enhanced by utilizing also the possible lookahead information.
    • 本发明提出了一种用于将语音信号段分类为有声或无声的语音确定算法。 该算法基于归一化的自相关,其中窗口的长度与音调周期成比例。 要分类的语音段被进一步划分为多个子段,并且如果一定数量的归一化自相关值高于预定阈值,则针对每个子段计算归一化的自相关,该语音段被分类为有声 。 为了提高无声至浊音瞬态中的发音确定算法的性能,强调了最后一个子段的归一化自相关。 可以通过利用可能的前瞻信息来增强语音决策算法的性能。
    • 6. 发明申请
    • Speech coding
    • 语音编码
    • US20050137858A1
    • 2005-06-23
    • US10742645
    • 2003-12-19
    • Ari HeikkinenSakari HimanenAnssi Ramo
    • Ari HeikkinenSakari HimanenAnssi Ramo
    • G10L19/06G10L19/08G10L19/14
    • G10L19/265G10L19/08G10L19/16
    • The invention relates to a method for use in parametric speech coding. In order to enable an improved parametric coding of speech signals, the method comprises a first step of pre-processing a to be encoded speech based signal such that a phase structure of the to be encoded speech based signal is approached to a phase structure which is obtained when the to be encoded speech based signal is parametrically encoded and decoded again. Only in a second step, a parametric encoding is applied to this pre-processed to be encoded speech based signal. The invention relates equally to a corresponding device, to a corresponding coding module, to a corresponding system and to a corresponding software program product.
    • 本发明涉及一种用于参数语音编码的方法。 为了能够实现语音信号的改进的参数编码,该方法包括对要编码的基于语音的信号进行预处理的第一步骤,使得要编码的基于语音的信号的相位结构接近于相位结构,该相位结构是 当要编码的基于语音的信号被再次参数编码和解码时获得。 仅在第二步骤中,将参数编码应用于该预处理为被编码的基于语音的信号。 本发明同样涉及对应的设备,相应的编码模块,相应的系统和相应的软件程序产品。
    • 8. 发明申请
    • Method and system for speech coding
    • 语音编码方法和系统
    • US20050091041A1
    • 2005-04-28
    • US10692290
    • 2003-10-23
    • Anssi RamoJani NurminenSakari HimanenAri Heikkinen
    • Anssi RamoJani NurminenSakari HimanenAri Heikkinen
    • G10L20060101G10L11/06G10L19/02G10L19/04G10L19/14G10L21/04H04B1/06H04M11/00
    • G10L19/24
    • A method and device for use in conjunction with an encoder for encoding an audio signal into a plurality of parameters. Based on the behavior of the parameters, such as pitch, voicing, energy and spectral amplitude information of the audio signal, the audio signal can be segmented, so that the parameter update rate can be optimized. The parameters of the segmented audio signal are recorded in a storage medium or transmitted to a decoder so as to allow the decoder to reconstruct the audio signal based on the parameters indicative of the segment audio signals. For example, based on the pitch characteristic, the pitch contour can be approximated by a plurality of contour segments. An adaptive downsampling method is used to update the parameters based on the contour segments so as to reduce the update rate. At the decoder, the parameters are updated at the original rate.
    • 一种与用于将音频信号编码为多个参数的编码器结合使用的方法和装置。 基于音频信号的音调,发音,能量和频谱幅度信息等参数的行为,可以对音频信号进行分段,从而可以优化参数更新速率。 分段音频信号的参数被记录在存储介质中或被发送到解码器,以便允许解码器基于指示段音频信号的参数重建音频信号。 例如,基于俯仰特性,俯仰轮廓可以由多个轮廓段近似。 使用自适应下采样方法根据轮廓段更新参数,以便降低更新速率。 在解码器处,参数以原始速率更新。