会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明申请
    • METHOD AND SYSTEM FOR GENERATING ADVANCED FEATURE DISCRIMINATION VECTORS FOR USE IN SPEECH RECOGNITION
    • 用于生成语音识别中使用的高级特征歧视向量的方法和系统
    • WO2014145960A2
    • 2014-09-18
    • PCT/US2014030819
    • 2014-03-17
    • SHORT KEVIN MHONE BRIAN
    • SHORT KEVIN MHONE BRIAN
    • G10L17/02
    • G10L15/02G10L25/03G10L25/18G10L25/21G10L25/24G10L25/93G10L2015/025
    • A method of renormalizing high-resolution oscillator peaks, extracted from windowed samples of an audio signal, is disclosed. Feature vectors are generated for which variations in both fundamental frequency and time duration of speech are substantially mitigated. The feature vectors may be aligned within a common coordinate space, free of those variations in frequency and time duration that occurs between speakers, and even over speech by a single speaker, to facilitate a simple and accurate determination of matches between those AFDVs generated from a sample of the audio signal and corpus AFDVs generated for known speech at the phoneme and sub-phoneme level. The renormalized feature vectors can be combined with traditional feature vectors such as MFCCs, or they can be used exclusively to identify voiced, semi-voiced and unvoiced sounds.
    • 公开了一种从音频信号的窗口采样中提取的高分辨率振荡器峰值的重新归一化方法。 生成基本频率和语音持续时间的变化的特征向量被大大减轻。 特征向量可以在公共坐标空间内对齐,没有在扬声器之间发生的频率和持续时间的这些变化,甚至在单个扬声器的语音之间的对准,以便于简单和准确地确定从一个扬声器产生的那些AFDV之间的匹配 在音素和子音素级别为已知语音生成的音频信号和语料库AFDV的样本。 重归一化特征向量可以与诸如MFCC的传统特征向量组合,或者它们可以专门用于识别有声,半声和无声的声音。