会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 5. 发明申请
    • UNVOICED/VOICED DECISION FOR SPEECH PROCESSING
    • 无声/有声的语音处理决定
    • WO2015032351A1
    • 2015-03-12
    • PCT/CN2014/086058
    • 2014-09-05
    • HUAWEI TECHNOLOGIES CO., LTD.
    • GAO, Yang
    • G10L25/03
    • G10L25/78G10L19/22G10L25/93
    • In accordance with an embodiment of the present invention, a method for speech processing includes determining an unvoicing/voicing parameter reflecting a characteristic of unvoiced/voicing speech in a current frame of a speech signal comprising a plurality of frames. A smoothed unvoicing/voicing parameter is determined to include information of the unvoicing/voicing parameter in a frame prior to the current frame of the speech signal. A difference between the unvoicing/voicing parameter and the smoothed unvoicing/voicing parameter is computed. The method further includes generating an unvoiced/voiced decision point for determining whether the current frame comprises unvoiced speech or voiced speech using the computed difference as a decision parameter.
    • 根据本发明的实施例,一种用于语音处理的方法包括:确定反映在包括多个帧的语音信号的当前帧中的清音/发声语音的特征的清音/发声参数。 平滑的清音/发声参数被确定为包括语音信号的当前帧之前的帧中的清音/发声参数的信息。 计算出浊音/浊音参数与平滑的浊音/浊音参数之间的差异。 该方法还包括生成清音/有声决定点,用于使用所计算的差分作为判定参数来确定当前帧是否包括无声语音或浊音。
    • 7. 发明申请
    • ADAPTIVELY ENCODING PITCH LAG FOR VOICED SPEECH
    • 适应语音的自适应编码LAG
    • WO2013096875A2
    • 2013-06-27
    • PCT/US2012/071435
    • 2012-12-21
    • HUAWEI TECHNOLOGIES CO., LTD.GAO, Yang
    • GAO, Yang
    • G10L25/90G10L19/09G10L19/18
    • System and method embodiments for dual modes pitch coding are provided. The system and method embodiments are configured to adaptively code pitch lags of a voiced speech signal using one of two pitch coding modes according to a pitch length, stability, or both. The two pitch coding modes include a first pitch coding mode with relatively high precision and reduced dynamic range, and a second pitch coding mode with relatively large dynamic range and reduced precision. The first pitch coding mode is used upon determining that the voiced speech signal has a relatively short or substantially stable pitch. The second pitch coding mode is used upon determining that the voiced speech signal has a relatively long or less stable pitch or is a substantially noisy signal.
    • 提供了用于双模音调编码的系统和方法实施例。 系统和方法实施例被配置为根据间距长度,稳定性或两者来使用两种音调编码模式之一自适应地编码有声语音信号的音调滞后。 两个音调编码模式包括具有相对较高精度和降低的动态范围的第一音调编码模式,以及具有相对大的动态范围和精度降低的第二音调编码模式。 在确定有声语音信号具有相对较短或基本上稳定的音调时,使用第一音调编码模式。 第二音调编码模式在确定有声语音信号具有相对较长或较小的稳定音调或者是基本上噪声的信号时被使用。
    • 10. 发明申请
    • ADAPTIVE BANDWIDTH EXTENSION AND APPARATUS FOR THE SAME
    • 自适应带宽扩展及其设备
    • WO2015035896A1
    • 2015-03-19
    • PCT/CN2014/086135
    • 2014-09-09
    • HUAWEI TECHNOLOGIES CO., LTD.
    • GAO, Yang
    • G10L19/032
    • G10L19/22G10L19/0204G10L19/08G10L19/12G10L19/167G10L19/265G10L21/038
    • In one embodiment of the present invention, a method of decoding an encoded audio bitstream and generating frequency bandwidth extension includes decoding the audio bitstream to produce a decoded low band audio signal and generate a low band excitation spectrum corresponding to a low frequency band. A sub-band area is selected from within the low frequency band using a parameter which indicates energy information of a spectral envelope of the decoded low band audio signal. A high band excitation spectrum is generated for a high frequency band by copying a sub-band excitation spectrum from the selected sub-band area to a high sub-band area corresponding to the high frequency band. Using the generated high band excitation spectrum, an extended high band audio signal is generated by applying a high band spectral envelope. The extended high band audio signal is added to the decoded low band audio signal to generate an audio output signal having an extended frequency bandwidth.
    • 在本发明的一个实施例中,解码编码音频比特流并产生频率带宽扩展的方法包括对音频比特流进行解码以产生解码的低频带音频信号并产生对应于低频带的低频激励频谱。 使用指示解码的低频带音频信号的频谱包络的​​能量信息的参数从低频带内选择子带区域。 通过将子带激励频谱从所选择的子带区域复制到对应于高频带的高子带区域,为高频带生成高频带激励频谱。 使用所产生的高频带激励频谱,通过应用高频带频谱包络来产生扩展的高频带音频信号。 将扩展的高频带音频信号添加到解码的低频带音频信号以产生具有扩展的频率带宽的音频输出信号。