    • 131. Granted invention patent
    • Systems and methods for off-board voice-automated vehicle navigation
    • US08738287B2
    • 2014-05-27
    • US13908421
    • 2013-06-03
    • Agero Connected Services, Inc.
    • Thomas Barton Schalk
    • G06F17/00; G06F19/00; G08G1/0968; G05D1/00
    • G01C21/00; G01C21/3608; G08G1/096811; G08G1/09685; G08G1/096872; G08G1/096894; G10L15/00; G10L21/00; G10L21/06
    • A method of providing navigational information comprises processing destination information spoken by a user of a mobile processing system. The processed voice information is transmitted to a remote data center. The processed voice information is analyzed at the data center to recognize components of the destination information. The center generates a list of hypothetical recognized components of the destination by confidence levels as calculated for each component of the information analyzed. The hypothetical recognized component list is displayed with confidence levels at the data center for selective checking by a human data center operator. A set of hypothetical components is selected based on confidence levels in the list. The accuracy of the selected set of hypothetical recognized components of the destination information is confirmed through interactive voice exchanges between the mobile system user and the remote data center. A destination is determined from confirmed components of the destination information.
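The selection step in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the `Hypothesis` class, the `select_components` function, and the `review_threshold` value are all hypothetical names and parameters chosen for illustration. The sketch keeps the highest-confidence hypothesis for each destination component and lists the components whose best hypothesis is weak enough that it would plausibly be confirmed with the user or checked by an operator.

```python
from dataclasses import dataclass


@dataclass
class Hypothesis:
    component: str     # which part of the destination, e.g. "city" or "street"
    text: str          # recognized text for that component
    confidence: float  # recognizer confidence in [0, 1]


def select_components(hypotheses, review_threshold=0.8):
    """Pick the most confident hypothesis per component.

    review_threshold is an assumed cutoff: components whose best
    hypothesis falls below it are returned for confirmation.
    """
    best = {}
    for h in hypotheses:
        current = best.get(h.component)
        if current is None or h.confidence > current.confidence:
            best[h.component] = h
    needs_confirmation = sorted(
        name for name, h in best.items() if h.confidence < review_threshold
    )
    return best, needs_confirmation
```

For example, given two candidate cities and two candidate streets, the function returns one hypothesis per component plus the names of the components that still need confirmation.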
    • 135. Invention patent application
    • Speech Recognition with Parallel Recognition Tasks
    • US20140058728A1
    • 2014-02-27
    • US14064755
    • 2013-10-28
    • Google Inc.
    • Brian Strope; Francoise Beaufays; Olivier Siohan
    • G10L15/26
    • G10L15/32; G10L15/00; G10L15/01; G10L15/26
    • The subject matter of this specification can be embodied in, among other things, a method that includes receiving an audio signal and initiating speech recognition tasks by a plurality of speech recognition systems (SRSs). Each SRS is configured to generate a recognition result specifying possible speech included in the audio signal and a confidence value indicating a confidence in a correctness of the speech result. The method also includes completing a portion of the speech recognition tasks including generating one or more recognition results and one or more confidence values for the one or more recognition results, determining whether the one or more confidence values meet a confidence threshold, aborting a remaining portion of the speech recognition tasks for SRSs that have not generated a recognition result, and outputting a final recognition result based on at least one of the generated one or more speech results.
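The control flow described in this abstract (start several recognizers, stop as soon as one result is confident enough, abort the rest) can be sketched with `asyncio`. This is only an illustration under stated assumptions, not the method claimed in the application: `fake_srs` is a hypothetical stand-in that sleeps and returns a fixed result, and `recognize_parallel` and its `(text, confidence, delay)` tuples are invented for the example.

```python
import asyncio


async def fake_srs(text, confidence, delay):
    """Hypothetical stand-in for one speech recognition system (SRS)."""
    await asyncio.sleep(delay)  # simulates recognition latency
    return text, confidence


async def recognize_parallel(specs, threshold):
    """Run all recognizers concurrently and stop early on a confident result.

    specs is a list of (text, confidence, delay) tuples for fake_srs.
    """
    tasks = [asyncio.create_task(fake_srs(*spec)) for spec in specs]
    results = []
    try:
        for next_done in asyncio.as_completed(tasks):
            results.append(await next_done)
            if results[-1][1] >= threshold:
                break  # confident enough; do not wait for the others
    finally:
        for task in tasks:
            task.cancel()  # no effect on tasks that already finished
        # Let the cancellations be processed before returning.
        await asyncio.gather(*tasks, return_exceptions=True)
    # Output the most confident of the results produced so far.
    return max(results, key=lambda r: r[1])
```

If no result reaches the threshold, every task runs to completion and the most confident result is returned. A real system would need its own policy for combining results, which the abstract leaves open.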
    • 138. Granted invention patent
    • Apparatus and method for automatic extraction of important events in audio signals
    • US08635065B2
    • 2014-01-21
    • US10985446
    • 2004-11-10
    • Silke Goronzy-Thomae; Thomas Kemp; Ralf Kompe; Yin Hay Lam; Krzysztof Marasek; Raquel Tato
    • Silke Goronzy-Thomae; Thomas Kemp; Ralf Kompe; Yin Hay Lam; Krzysztof Marasek; Raquel Tato
    • G10L15/06; G10L21/00; G10L19/12; G10L19/14; G10L17/00
    • G10L25/00; G10L15/00; G10L17/26
    • The present invention discloses an apparatus for automatic extraction of important events in audio signals comprising: signal input means for supplying audio signals; audio signal fragmenting means for partitioning audio signals supplied by the signal input means into audio fragments of a predetermined length and for allocating a sequence of one or more audio fragments to a respective audio window; feature extracting means for analyzing acoustic characteristics of the audio signals comprised in the audio fragments and for analyzing acoustic characteristics of the audio signals comprised in the audio windows; and important event extraction means for extracting important events in audio signals supplied by the audio signal fragmenting means based on predetermined important event classifying rules depending on acoustic characteristics of the audio signals comprised in the audio fragments and on acoustic characteristics of the audio signals comprised in the audio windows, wherein each important event extracted by the important event extraction means comprises a discrete sequence of cohesive audio fragments corresponding to an important event included in the audio signals.
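The fragment-and-window structure in this abstract can be made concrete with a small sketch. The classifying rule used here is an assumption made purely for illustration (a fragment is flagged when its mean energy exceeds `ratio` times the mean energy of its window). The patent refers only to "predetermined important event classifying rules", and the function names below are hypothetical.

```python
def fragment(samples, fragment_len):
    """Split samples into consecutive fixed-length fragments (ragged tail dropped)."""
    count = len(samples) // fragment_len
    return [samples[i * fragment_len:(i + 1) * fragment_len] for i in range(count)]


def energy(frag):
    """Mean squared amplitude of one fragment."""
    return sum(x * x for x in frag) / len(frag)


def important_events(samples, fragment_len, fragments_per_window, ratio=2.0):
    """Return events as lists of consecutive fragment indices.

    Assumed rule: a fragment is important when its energy exceeds
    ratio times the mean fragment energy of the window it belongs to.
    """
    energies = [energy(f) for f in fragment(samples, fragment_len)]
    events, current = [], []
    for i, e in enumerate(energies):
        start = (i // fragments_per_window) * fragments_per_window
        window = energies[start:start + fragments_per_window]
        window_mean = sum(window) / len(window)
        if window_mean > 0 and e > ratio * window_mean:
            current.append(i)       # extend the running event
        elif current:
            events.append(current)  # close the event at the first quiet fragment
            current = []
    if current:
        events.append(current)
    return events
```

Grouping adjacent flagged fragments into one list mirrors the abstract's description of an event as a sequence of cohesive audio fragments. Both the fragment-level feature and the window-level feature feed the decision, as in the abstract.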
    • 140. Granted invention patent
    • Systems and methods for extracting meaning from multimodal inputs using finite-state devices
    • US08626507B2
    • 2014-01-07
    • US13690037
    • 2012-11-30
    • AT&T Intellectual Property II, L.P.
    • Srinivas Bangalore; Michael J. Johnston
    • G10L17/00
    • G10L15/00; G06F3/167; G06K9/00355; G10L15/24
    • Multimodal utterances contain a number of different modes. These modes can include speech, gestures, and pen, haptic, and gaze inputs, and the like. This invention uses recognition results from one or more of these modes to provide compensation to the recognition process of one or more other ones of these modes. In various exemplary embodiments, a multimodal recognition system inputs one or more recognition lattices from one or more of these modes, and generates one or more models to be used by one or more mode recognizers to recognize the one or more other modes. In one exemplary embodiment, a gesture recognizer inputs a gesture input and outputs a gesture recognition lattice to a multimodal parser. The multimodal parser generates a language model and outputs it to an automatic speech recognition system, which uses the received language model to recognize the speech input that corresponds to the recognized gesture input.