会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明授权
    • Using pitch during speech recognition post-processing to improve recognition accuracy
    • 在语音识别后处理中使用音调来提高识别精度
    • US09484027B2
    • 2016-11-01
    • US12635346
    • 2009-12-10
    • Xufang ZhaoUma Arun
    • Xufang ZhaoUma Arun
    • G10L21/00G10L25/00G10L15/20G10L25/90G10L25/15
    • G10L15/20G10L25/15G10L25/90G10L2015/027
    • A method of automated speech recognition in a vehicle. The method includes receiving audio in the vehicle, pre-processing the received audio to generate acoustic feature vectors, decoding the generated acoustic feature vectors to produce at least one speech hypothesis, and post-processing the at least one speech hypothesis using pitch to improve speech recognition accuracy. The speech hypothesis can be accepted as recognized speech during post-processing if pitch is present in the received audio. Alternatively, a pitch count for the received audio can be determined, N-best speech hypotheses can be post-processed by comparing the pitch count to syllable counts associated with the speech hypotheses, and the speech hypothesis having a syllable count equal to the pitch count can be accepted as recognized speech.
    • 一种在车辆中自动语音识别的方法。 该方法包括在车辆中接收音频,对接收的音频进行预处理以产生声学特征向量,解码所生成的声学特征向量以产生至少一个语音假设,以及使用音高对语音假设进行后处理以改善语音 识别精度。 如果接收到的音频中存在音调,则语音假设可以在后处理中被接受为识别语音。 或者,可以确定接收到的音频的音调计数,通过将音调计数与与语音假设相关联的音节计数进行比较,可以对N个最佳语音假设进行后处理,并且具有等于音高计数的音节计数的语音假设 可以被接受为公认的演讲。
    • 4. 发明申请
    • USING PITCH DURING SPEECH RECOGNITION POST-PROCESSING TO IMPROVE RECOGNITION ACCURACY
    • 语音识别后处理中使用PITCH来提高识别精度
    • US20110144987A1
    • 2011-06-16
    • US12635346
    • 2009-12-10
    • Xufang ZhaoUma Arun
    • Xufang ZhaoUma Arun
    • G10L15/00G10L15/28G10L15/20G10L21/00
    • G10L15/20G10L25/15G10L25/90G10L2015/027
    • A method of automated speech recognition in a vehicle. The method includes receiving audio in the vehicle, pre-processing the received audio to generate acoustic feature vectors, decoding the generated acoustic feature vectors to produce at least one speech hypothesis, and post-processing the at least one speech hypothesis using pitch to improve speech recognition accuracy. The speech hypothesis can be accepted as recognized speech during post-processing if pitch is present in the received audio. Alternatively, a pitch count for the received audio can be determined, N-best speech hypotheses can be post-processed by comparing the pitch count to syllable counts associated with the speech hypotheses, and the speech hypothesis having a syllable count equal to the pitch count can be accepted as recognized speech.
    • 一种在车辆中自动语音识别的方法。 该方法包括在车辆中接收音频,对接收的音频进行预处理以产生声学特征向量,解码所生成的声学特征向量以产生至少一个语音假设,以及使用音高对语音假设进行后处理以改善语音 识别精度。 如果接收到的音频中存在音调,则语音假设可以在后处理中被接受为识别语音。 或者,可以确定接收到的音频的音调计数,通过将音调计数与与语音假设相关联的音节计数进行比较,可以对N个最佳语音假设进行后处理,并且具有等于音高计数的音节计数的语音假设 可以被接受为公认的演讲。
    • 9. 发明授权
    • Method of recognizing speech
    • 识别语音的方法
    • US08433570B2
    • 2013-04-30
    • US12683387
    • 2010-01-06
    • Uma Arun
    • Uma Arun
    • G10L15/06
    • G10L25/78G10L15/08G10L15/1815G10L25/15
    • A method for recognizing speech involves presenting an utterance to a speech recognition system and determining, via the speech recognition system, that the utterance contains a particular expression, where the particular expression is capable of being associated with at least two different meanings. The method further involves splitting the utterance into a plurality of speech frames, where each frame is assigned a predetermined time segment and a frame number, and indexing the utterance to i) a predetermined frame number, or ii) a predetermined time segment. The indexing of the utterance identifies that one of the frames includes the particular expression. Then the frame including the particular expression is re-presented to the speech recognition system to verify that the particular expression was actually recited in the utterance.
    • 用于识别语音的方法包括向语音识别系统呈现话语,并且经由语音识别系统确定话语包含特定表达,其中特定表达能够与至少两个不同含义相关联。 该方法还包括将话语分成多个语音帧,其中每个帧被分配预定的时间段和帧号,并且将话语索引为i)预定帧号,或ii)预定时间段。 话音的索引识别出其中一个帧包含特定的表达式。 然后,将包括特定表达式的帧重新呈现给语音识别系统以验证特定表达在实际中被实际叙述。