专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明申请

WO2020146869A1 HIGH RESOLUTION AUDIO CODING 审中-公开
公开(公告)号：WO2020146869A1
公开(公告)日：2020-07-16
申请号：PCT/US2020/013301
申请日：2020-01-13
申请人： HUAWEI TECHNOLOGIES CO., LTD. , GAO, Yang
发明人： GAO, Yang
IPC分类号： G10L19/00
摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing long-term prediction (LTP) are described. One example of the methods includes determining a pitch gain and a pitch lag of an input audio signal for at least a predetermined number of frames. It is determined that the pitch gain of the input audio signal has exceeded a predetermined threshold and that a change of the pitch lag of the input audio signal has been within a predetermined range for at least the predetermined number of frames. In response to determining that a pitch gain of the input audio signal has exceeded the predetermined threshold and that the change of the third pitch lag has been within the predetermined range for at least the predetermined number of frames, a pitch gain is set for a current frame of the input audio signal.

2. 发明申请

WO2020146870A1 HIGH RESOLUTION AUDIO CODING 审中-公开
公开(公告)号：WO2020146870A1
公开(公告)日：2020-07-16
申请号：PCT/US2020/013303
申请日：2020-01-13
申请人： HUAWEI TECHNOLOGIES CO., LTD. , GAO, Yang
发明人： GAO, Yang
IPC分类号： G10L19/04
摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing linear predictive coding (LPC) are described. One example of the methods includes determining at least one of a differential spectrum tilt and an energy difference between a current frame and a previous frame of the audio signal. A spectral stability of the audio signal is detected based on at least one of the differential spectrum tilt and an energy difference between the current frame and the previous frame of the audio signal. In response to detecting the spectral stability of the audio signal, quantized LPC parameters for the previous frame are copied into the current frame of the audio signal.

3. 发明申请

WO2014131260A1 SYSTEM AND METHOD FOR POST EXCITATION ENHANCEMENT FOR LOW BIT RATE SPEECH CODING 审中-公开
标题翻译：用于低比特率语音编码的激活增强的系统和方法
公开(公告)号：WO2014131260A1
公开(公告)日：2014-09-04
申请号：PCT/CN2013/080254
申请日：2013-07-27
申请人： HUAWEI TECHNOLOGIES CO., LTD.
发明人： GAO, Yang
IPC分类号： G10L19/04
CPC分类号： G10L19/04 , G10L19/12 , G10L19/26
摘要： In accordance with an embodiment, a method of decoding an audio/speech signal includes decoding an excitation signal based on an incoming audio/speech information, determining a stability of a high frequency portion of the excitation signal, smoothing an energy of the high frequency portion of the excitation signal based on the stability of the high frequency portion of the excitation signal, and producing an audio signal based on smoothing the high frequency portion of the excitation signal.
摘要翻译：根据实施例，对音频/语音信号进行解码的方法包括基于输入音频/语音信息来解码激励信号，确定激励信号的高频部分的稳定性，平滑高频部分的能量基于激励信号的高频部分的稳定性的激励信号，并且基于使激励信号的高频部分平滑来产生音频信号。

4. 发明申请

WO2016015591A1 IMPROVING CLASSIFICATION BETWEEN TIME-DOMAIN CODING AND FREQUENCY DOMAIN CODING 审中-公开
标题翻译：改进时域编码和频域编码之间的分类
公开(公告)号：WO2016015591A1
公开(公告)日：2016-02-04
申请号：PCT/CN2015/084931
申请日：2015-07-23
申请人： HUAWEI TECHNOLOGIES CO., LTD.
发明人： GAO, Yang
IPC分类号： G10L19/20
CPC分类号： G10L19/125 , G10L19/002 , G10L19/22 , G10L2019/0002 , G10L2019/0011 , G10L2019/0016
摘要： A method for processing speech signals prior to encoding a digital signal comprising audio data includes selecting frequency domain coding or time domain coding based on a coding bit rate to be used for coding the digital signal and a short pitch lag detection of the digital signal.
摘要翻译：在对包括音频数据的数字信号进行编码之前处理语音信号的方法包括基于用于对数字信号进行编码的编码比特率和数字信号的短音调滞后检测来选择频域编码或时域编码。

5. 发明申请

WO2015032351A1 UNVOICED/VOICED DECISION FOR SPEECH PROCESSING 审中-公开
标题翻译：无声/有声的语音处理决定
公开(公告)号：WO2015032351A1
公开(公告)日：2015-03-12
申请号：PCT/CN2014/086058
申请日：2014-09-05
申请人： HUAWEI TECHNOLOGIES CO., LTD.
发明人： GAO, Yang
IPC分类号： G10L25/03
CPC分类号： G10L25/78 , G10L19/22 , G10L25/93
摘要： In accordance with an embodiment of the present invention, a method for speech processing includes determining an unvoicing/voicing parameter reflecting a characteristic of unvoiced/voicing speech in a current frame of a speech signal comprising a plurality of frames. A smoothed unvoicing/voicing parameter is determined to include information of the unvoicing/voicing parameter in a frame prior to the current frame of the speech signal. A difference between the unvoicing/voicing parameter and the smoothed unvoicing/voicing parameter is computed. The method further includes generating an unvoiced/voiced decision point for determining whether the current frame comprises unvoiced speech or voiced speech using the computed difference as a decision parameter.
摘要翻译：根据本发明的实施例，一种用于语音处理的方法包括：确定反映在包括多个帧的语音信号的当前帧中的清音/发声语音的特征的清音/发声参数。平滑的清音/发声参数被确定为包括语音信号的当前帧之前的帧中的清音/发声参数的信息。计算出浊音/浊音参数与平滑的浊音/浊音参数之间的差异。该方法还包括生成清音/有声决定点，用于使用所计算的差分作为判定参数来确定当前帧是否包括无声语音或浊音。

6. 发明申请

WO2014124577A1 SYSTEM AND METHOD FOR MIXED CODEBOOK EXCITATION FOR SPEECH CODING 审中-公开
标题翻译：用于语音编码的混合编码激活的系统和方法
公开(公告)号：WO2014124577A1
公开(公告)日：2014-08-21
申请号：PCT/CN2013/080268
申请日：2013-07-29
申请人： HUAWEI TECHNOLOGIES CO., LTD.
发明人： GAO, Yang
IPC分类号： G10L19/00
CPC分类号： G10L19/00 , G10L19/12
摘要： In accordance with an embodiment, a method of encoding an audio/speech signal includes determining a mixed codebook vector based on an incoming audio/speech signal, where the mixed codebook vector includes a sum of a first codebook entry from a first codebook and a second codebook entry from a second codebook. The method further includes generating an encoded audio signal based on the determined mixed codebook vector, and transmitting a coded excitation index of the determined mixed codebook vector.
摘要翻译：根据实施例，对音频/语音信号进行编码的方法包括基于输入音频/语音信号来确定混合码本矢量，其中混合码本矢量包括来自第一码本的第一码本条目和第二码本矢量的和第二码本的码本条目。该方法还包括基于所确定的混合码本矢量生成编码音频信号，并发送所确定的混合码本矢量的编码的激励索引。

7. 发明申请

WO2013096875A2 ADAPTIVELY ENCODING PITCH LAG FOR VOICED SPEECH 审中-公开
标题翻译：适应语音的自适应编码LAG
公开(公告)号：WO2013096875A2
公开(公告)日：2013-06-27
申请号：PCT/US2012/071435
申请日：2012-12-21
申请人： HUAWEI TECHNOLOGIES CO., LTD. , GAO, Yang
发明人： GAO, Yang
CPC分类号： G10L25/90 , G10L19/09 , G10L19/18
摘要： System and method embodiments for dual modes pitch coding are provided. The system and method embodiments are configured to adaptively code pitch lags of a voiced speech signal using one of two pitch coding modes according to a pitch length, stability, or both. The two pitch coding modes include a first pitch coding mode with relatively high precision and reduced dynamic range, and a second pitch coding mode with relatively large dynamic range and reduced precision. The first pitch coding mode is used upon determining that the voiced speech signal has a relatively short or substantially stable pitch. The second pitch coding mode is used upon determining that the voiced speech signal has a relatively long or less stable pitch or is a substantially noisy signal.
摘要翻译：提供了用于双模音调编码的系统和方法实施例。系统和方法实施例被配置为根据间距长度，稳定性或两者来使用两种音调编码模式之一自适应地编码有声语音信号的音调滞后。两个音调编码模式包括具有相对较高精度和降低的动态范围的第一音调编码模式，以及具有相对大的动态范围和精度降低的第二音调编码模式。在确定有声语音信号具有相对较短或基本上稳定的音调时，使用第一音调编码模式。第二音调编码模式在确定有声语音信号具有相对较长或较小的稳定音调或者是基本上噪声的信号时被使用。

8. 发明申请

WO2010127616A1 SYSTEM AND METHOD FOR FREQUENCY DOMAIN AUDIO POST-PROCESSING BASED ON PERCEPTUAL MASKING 审中-公开
标题翻译：基于显着掩码的频域音频后处理系统与方法
公开(公告)号：WO2010127616A1
公开(公告)日：2010-11-11
申请号：PCT/CN2010/072449
申请日：2010-05-05
申请人： HUAWEI TECHNOLOGIES CO., LTD. , GAO, Yang
发明人： GAO, Yang
IPC分类号： G10L19/00 , G10L19/04 , G10L21/02
CPC分类号： G10L19/26 , G10L25/18
摘要： In an embodiment, a method of frequency domain post-processing is disclosed. The method includes applying adaptive modification gain factor to each frequency coefficient, and determining gain factors based on Local Masking Magnitude and Local Masked Magnitude.
摘要翻译：在一个实施例中，公开了一种频域后处理的方法。该方法包括对每个频率系数应用自适应修改增益因子，并且基于局部掩蔽幅度和局部掩蔽幅度来确定增益因子。

9. 发明申请

WO2020146868A1 HIGH RESOLUTION AUDIO CODING 审中-公开
公开(公告)号：WO2020146868A1
公开(公告)日：2020-07-16
申请号：PCT/US2020/013296
申请日：2020-01-13
申请人： HUAWEI TECHNOLOGIES CO., LTD. , GAO, Yang
发明人： GAO, Yang
IPC分类号： G10L21/00
摘要： Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing residual quantization are described. One example of the methods includes performing a first residual quantization on a first target residual signal at a first bit rate to generate a first quantized residual signal. A second target residual signal is generated based at least on the first quantized residual signal and the first target residual signal. A second residual quantization is performed on the second target residual signal at a second bit rate to generate a second quantized residual signal, where the first bit rate is different from the second bit rate.

10. 发明申请

WO2015035896A1 ADAPTIVE BANDWIDTH EXTENSION AND APPARATUS FOR THE SAME 审中-公开
标题翻译：自适应带宽扩展及其设备
公开(公告)号：WO2015035896A1
公开(公告)日：2015-03-19
申请号：PCT/CN2014/086135
申请日：2014-09-09
申请人： HUAWEI TECHNOLOGIES CO., LTD.
发明人： GAO, Yang
IPC分类号： G10L19/032
CPC分类号： G10L19/22 , G10L19/0204 , G10L19/08 , G10L19/12 , G10L19/167 , G10L19/265 , G10L21/038
摘要： In one embodiment of the present invention, a method of decoding an encoded audio bitstream and generating frequency bandwidth extension includes decoding the audio bitstream to produce a decoded low band audio signal and generate a low band excitation spectrum corresponding to a low frequency band. A sub-band area is selected from within the low frequency band using a parameter which indicates energy information of a spectral envelope of the decoded low band audio signal. A high band excitation spectrum is generated for a high frequency band by copying a sub-band excitation spectrum from the selected sub-band area to a high sub-band area corresponding to the high frequency band. Using the generated high band excitation spectrum, an extended high band audio signal is generated by applying a high band spectral envelope. The extended high band audio signal is added to the decoded low band audio signal to generate an audio output signal having an extended frequency bandwidth.
摘要翻译：在本发明的一个实施例中，解码编码音频比特流并产生频率带宽扩展的方法包括对音频比特流进行解码以产生解码的低频带音频信号并产生对应于低频带的低频激励频谱。使用指示解码的低频带音频信号的频谱包络的能量信息的参数从低频带内选择子带区域。通过将子带激励频谱从所选择的子带区域复制到对应于高频带的高子带区域，为高频带生成高频带激励频谱。使用所产生的高频带激励频谱，通过应用高频带频谱包络来产生扩展的高频带音频信号。将扩展的高频带音频信号添加到解码的低频带音频信号以产生具有扩展的频率带宽的音频输出信号。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式