会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 52. 发明授权
    • Sampling rate independent speech recognition
    • 采样率独立语音识别
    • US07983916B2
    • 2011-07-19
    • US11772992
    • 2007-07-03
    • Rathinavelu Chengalvarayan
    • Rathinavelu Chengalvarayan
    • G10L15/02G10L15/06
    • G10L15/02
    • A sampling-rate-independent method of automated speech recognition (ASR). Speech energies of a plurality of codebooks generated from training data created at an ASR sampling rate are compared to speech energies in a current frame of acoustic data generated from received audio created at an audio sampling rate below the ASR sampling rate. A codebook is selected from the plurality of codebooks, and has speech energies that correspond to speech energies in the current frame over a spectral range corresponding to the audio sampling rate. Speech energies above the spectral range are copied from the selected codebook and appended to the current frame.
    • 自动语音识别(ASR)的采样率独立方法。 将从以ASR采样率创建的训练数据生成的多个码本的语音能量与在以低于ASR采样率的音频采样率产生的接收音频生成的声音数据的当前帧中的语音能量进行比较。 从多个码本中选择码本,并且具有对应于当前帧中对应于音频采样率的频谱范围上的语音能量的语音能量。 高于光谱范围的语音能量从所选码本中复制并附加到当前帧。
    • 53. 发明授权
    • Automated speech recognition using normalized in-vehicle speech
    • 使用归一化车载语音的自动语音识别
    • US07676363B2
    • 2010-03-09
    • US11427590
    • 2006-06-29
    • Rathinavelu ChengalvarayanScott M Pennock
    • Rathinavelu ChengalvarayanScott M Pennock
    • G10L15/20G10L19/14G10L21/00
    • G10L15/20
    • A speech recognition method includes the steps of receiving speech in a vehicle, extracting acoustic data from the received speech, and applying a vehicle-specific inverse impulse response function to the extracted acoustic data to produce normalized acoustic data. The speech recognition method may also include one or more of the following steps: pre-processing the normalized acoustic data to extract acoustic feature vectors; decoding the normalized acoustic feature vectors using as input at least one of a plurality of global acoustic models built according to a plurality of Lombard levels of a Lombard speech corpus covering a plurality of vehicles; calculating the Lombard level of vehicle noise; and/or selecting the at least one of the plurality of global acoustic models that corresponds to the calculated Lombard level for application during the decoding step.
    • 语音识别方法包括以下步骤:在车辆中接收语音,从接收的语音中提取声音数据,以及将车辆特定的反向脉冲响应函数应用于所提取的声学数据,以产生归一化的声学数据。 语音识别方法还可以包括以下一个或多个步骤:预处理归一化声学数据以提取声学特征向量; 使用根据覆盖多个车辆的Lombard语音语料库的多个Lombard级别构建的多个全局声学模型中的至少一个,对归一化的声学特征向量进行解码; 计算车辆噪声的隆巴德水平; 和/或选择与在所述解码步骤期间应用的所计算的伦巴第级别对应的所述多个全局声学模型中的至少一个。
    • 55. 发明授权
    • Speech recognizer performance in car and home applications utilizing novel multiple microphone configurations
    • 使用新型多麦克风配置的汽车和家庭应用中的语音识别器性能
    • US06889189B2
    • 2005-05-03
    • US10672167
    • 2003-09-26
    • Robert BomanLuca RigazioBrian HansonRathinavelu Chengalvarayan
    • Robert BomanLuca RigazioBrian HansonRathinavelu Chengalvarayan
    • G10L15/20G10L21/02G10L21/00
    • G10L21/0208G10L15/20G10L2021/02166
    • System speakers are switched to function as sound input transducers to improve recognizer performance and to support recognizer features. A crossbar switch is selectively activated, either manually or under software control, to allow system loudspeakers to function as sound input transducers that supplement the recognition system microphone or microphone array. Using loudspeakers as “microphones” improves speech recognition in noisy environments, thus attaining better recognition performance with little added system cost. The loudspeakers, positioned in physically separate locations also provide spatial information that can be used to determine the location of the person speaking and thereby offer different functionality for different persons. Acoustic models are selected based on environmental and vehicle operating conditions and may be adapted dynamically using ambient information obtained using the loudspeakers as sound input transducers.
    • 系统扬声器切换为声音输入传感器,以提高识别器性能并支持识别器功能。 手动或软件控制下有选择地激活交叉开关,以允许系统扬声器作为补充识别系统麦克风或麦克风阵列的声音输入换能器。 使用扬声器作为“麦克风”可以改善嘈杂环境中的语音识别,从而获得更好的识别性能,增加系统成本。 放置在物理上分开的位置的扬声器还提供空间信息,其可用于确定说话者的位置,从而为不同的人提供不同的功能。 基于环境和车辆操作条件选择声学模型,并且可以使用使用扬声器获得的环境信息作为声音输入换能器动态地进行调整。
    • 58. 发明授权
    • Distinguishing out-of-vocabulary speech from in-vocabulary speech
    • 将词汇外的词汇与词汇表达式区分开来
    • US08688451B2
    • 2014-04-01
    • US11382789
    • 2006-05-11
    • Timothy J. GrostRathinavelu Chengalvarayan
    • Timothy J. GrostRathinavelu Chengalvarayan
    • G10L15/00G10L15/18G10L21/00H04M1/64
    • G10L15/32
    • A speech recognition method includes receiving input speech from a user, processing the input speech using a first grammar to obtain parameter values of a first N-best list of vocabulary, comparing a parameter value of a top result of the first N-best list to a threshold value, and if the compared parameter value is below the threshold value, then additionally processing the input speech using a second grammar to obtain parameter values of a second N-best list of vocabulary. Other preferred steps include: determining the input speech to be in-vocabulary if any of the results of the first N-best list is also present within the second N-best list, but out-of-vocabulary if none of the results of the first N-best list is within the second N-best list; and providing audible feedback to the user if the input speech is determined to be out-of-vocabulary.
    • 一种语音识别方法,包括从用户接收输入语音,使用第一语法处理输入语音,以获得第一N最佳词汇列表的参数值,将第一N最佳列表的最高结果的参数值与 阈值,并且如果比较的参数值低于阈值,则使用第二语法另外处理输入语音,以获得词汇表的第二N最佳列表的参数值。 其他优选步骤包括:如果第N个最佳列表的任何结果也存在于第二N最佳列表内,则将输入语音确定为词汇表,但是如果没有结果 第一个N最佳列表在第二个N最佳列表中; 以及如果所述输入语音被确定为超出词汇量,则向用户提供可听见的反馈。