会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 6. 发明授权
    • Voice barge-in in telephony speech recognition
    • 语音插入电话语音识别
    • US07437286B2
    • 2008-10-14
    • US10204034
    • 2000-12-27
    • Xiaobo PiYing Jia
    • Xiaobo PiYing Jia
    • G10L11/02
    • G01S13/66G01S13/726G01S13/9303G10L15/22G10L25/78G10L2021/02087G10L2025/783
    • An interactive voice response system is described that supports full duplex data transfer to enable the playing of a voice prompt to a user of telephony system while the system listens for voice barge-in from the user. The system includes a speech detection module that may utilize various criteria such as frame energy magnitude and duration thresholds to detect speech. The system also includes an automatic speech recognition engine. When the automatic speech recognition engine recognizes a segment of speech, a feature extraction module may be used to subtract a prompt echo spectrum, which corresponds to the currently playing voice prompt, from an echo-dirtied speech spectrum recorded by the system. In order to improve spectrum subtraction, an estimation of the time delay between the echo-dirtied speech and the prompt echo may also be performed.
    • 描述了一种交互式语音应答系统,其支持全双工数据传输,以便在系统从用户收听语音插入时,向电话系统的用户播放语音提示。 该系统包括语音检测模块,其可以利用各种标准,例如帧能量幅度和持续时间阈值来检测语音。 该系统还包括自动语音识别引擎。 当自动语音识别引擎识别出语音段时,可以使用特征提取模块从系统记录的回波污浊语音频谱中减去对应于当前播放的语音提示的提示回波频谱。 为了改进频谱减法,还可以执行回声污浊语音与提示回波之间的时间延迟的估计。
    • 8. 发明申请
    • VOICE BARGE-IN IN TELEPHONY SPEECH RECOGNITION
    • 电话语音识别中的语音
    • US20080310601A1
    • 2008-12-18
    • US12197801
    • 2008-08-25
    • Xiaobo PiYing Jia
    • Xiaobo PiYing Jia
    • H04M1/64
    • G01S13/66G01S13/726G01S13/9303G10L15/22G10L25/78G10L2021/02087G10L2025/783
    • An interactive voice response system is described that supports full duplex data transfer to enable the playing of a voice prompt to a user of telephony system while the system listens for voice barge-in from the user. The system includes a speech detection module that may utilize various criteria such as frame energy magnitude and duration thresholds to detect speech. The system also includes an automatic speech recognition engine. When the automatic speech recognition engine recognizes a segment of speech, a feature extraction module may be used to subtract a prompt echo spectrum, which corresponds to the currently playing voice prompt, from an echo-dirtied speech spectrum recorded by the system. In order to improve spectrum subtraction, an estimation of the time delay between the echo-dirtied speech and the prompt echo may also be performed.
    • 描述了一种交互式语音应答系统,其支持全双工数据传输,以便在系统从用户收听语音插入时,向电话系统的用户播放语音提示。 该系统包括语音检测模块,其可以利用各种标准,例如帧能量幅度和持续时间阈值来检测语音。 该系统还包括自动语音识别引擎。 当自动语音识别引擎识别出语音段时,可以使用特征提取模块从系统记录的回波污浊语音频谱中减去对应于当前播放的语音提示的提示回波频谱。 为了改进频谱减法,还可以执行回声污浊语音与提示回波之间的时间延迟的估计。