    • 1. Granted invention patent
    • Method and apparatus for context-dependent estimation of multiple probability distributions of phonetic classes with multilayer perceptrons in a speech recognition system
    • US5317673A
    • 1994-05-31
    • US901716
    • 1992-06-22
    • Michael H. Cohen; Horacio E. Franco
    • Michael H. Cohen; Horacio E. Franco
    • G10L15/14; G10L5/06
    • G10L15/144
    • In a hidden Markov model-based speech recognition system, multilayer perceptrons (MLPs) are used in context-dependent estimation of a plurality of state-dependent observation probability distributions of phonetic classes. Estimation is obtained by the Bayesian factorization of the observation likelihood in terms of posterior probabilities of phone classes assuming the context and the input speech vector. The context-dependent estimation is employed as the state-dependent observation probabilities needed as parameter input to a hidden Markov model speech processor to identify the word sequence representing the unknown speech input of input speech vectors. Within the speech processor, models are provided which employ the observation probabilities in the recognition process. The number of context-dependent nets is reduced to a single net by sharing the units of the input layer and the hidden layer and the weights connecting them in the multilayer perceptron while providing one output layer for each relevant context. Each output layer is trained as an independent network on the specific examples of the corresponding context it represents. Training may be optimized at an intermediate set of weights between the context-independent-associated weights and the context-dependent associated weights to which training would normally converge.
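The shared-layer arrangement described in this abstract can be pictured with a minimal sketch: one input-to-hidden layer shared by all contexts, and a separate output layer per context that returns posterior probabilities over phone classes. The class name `SharedHiddenMLP`, the context labels, the layer sizes, the tanh hidden units, and the softmax outputs are all illustrative assumptions, not details taken from the patent; weights are random and training is omitted.

```python
import math
import random


def softmax(zs):
    """Turn raw scores into probabilities that sum to 1."""
    m = max(zs)
    exps = [math.exp(z - m) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]


def affine(weights, bias, x):
    """Compute W x + b for a list-of-rows weight matrix."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def random_layer(n_out, n_in, rng):
    """Create a randomly initialised (weights, bias) pair."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


class SharedHiddenMLP:
    """Hypothetical net: one shared hidden layer, one output layer per context."""

    def __init__(self, n_in, n_hidden, n_phones, contexts, seed=0):
        rng = random.Random(seed)
        # Input-to-hidden weights are shared across every context.
        self.hidden = random_layer(n_hidden, n_in, rng)
        # Each context owns its own hidden-to-output weights.
        self.heads = {c: random_layer(n_phones, n_hidden, rng)
                      for c in contexts}

    def posteriors(self, x, context):
        """Estimate P(phone class | context, input vector x)."""
        w_h, b_h = self.hidden
        h = [math.tanh(a) for a in affine(w_h, b_h, x)]
        w_o, b_o = self.heads[context]
        return softmax(affine(w_o, b_o, h))


# Assumed toy setup: 4 input features, 3 hidden units, 5 phone classes.
net = SharedHiddenMLP(n_in=4, n_hidden=3, n_phones=5,
                      contexts=["left_vowel", "left_nasal"])
frame = [0.2, -0.1, 0.7, 0.05]
p_vowel = net.posteriors(frame, "left_vowel")
p_nasal = net.posteriors(frame, "left_nasal")
```

Both calls reuse the same hidden activations for the frame and differ only in the output layer selected, which is the saving the abstract attributes to collapsing the context-dependent nets into a single net.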
    • 7. Granted invention patent
    • Automatic language model update
    • US07756708B2
    • 2010-07-13
    • US11396770
    • 2006-04-03
    • Michael H. Cohen; Shumeet Baluja; Pedro J. Moreno
    • Michael H. Cohen; Shumeet Baluja; Pedro J. Moreno
    • G10L15/06; G10L15/08; G10L15/00; G06F17/30
    • G10L15/065; G10L15/06; G10L15/063; G10L15/187; G10L15/26; G10L2015/0635
    • A method for generating a speech recognition model includes accessing a baseline speech recognition model, obtaining information related to recent language usage from search queries, and modifying the speech recognition model to revise probabilities of a portion of a sound occurrence based on the information. The portion of a sound may include a word. Also, a method for generating a speech recognition model, includes receiving at a search engine from a remote device an audio recording and a transcript that substantially represents at least a portion of the audio recording, synchronizing the transcript with the audio recording, extracting one or more letters from the transcript and extracting the associated pronunciation of the one or more letters from the audio recording, and generating a dictionary entry in a pronunciation dictionary.
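The first method in this abstract (revising word probabilities in a baseline model using recent search queries) can be illustrated with a small sketch. It assumes a unigram model stored as a dictionary and uses simple linear interpolation; the function name `update_unigram_model`, the `weight` parameter, and the sample data are hypothetical, and the patent does not state that interpolation is the technique used.

```python
from collections import Counter


def update_unigram_model(baseline, recent_queries, weight=0.2):
    """Blend baseline word probabilities with frequencies from recent queries.

    baseline: dict mapping word -> probability (assumed to sum to 1).
    recent_queries: iterable of query strings.
    weight: share of probability mass given to the recent-usage estimate.
    """
    counts = Counter(word
                     for query in recent_queries
                     for word in query.lower().split())
    total = sum(counts.values())
    if total == 0:
        # No recent evidence: keep the baseline unchanged.
        return dict(baseline)
    vocabulary = set(baseline) | set(counts)
    return {
        word: (1 - weight) * baseline.get(word, 0.0)
              + weight * counts[word] / total
        for word in vocabulary
    }


# Assumed toy data: "eclipse" is absent from the baseline but trending in queries.
baseline = {"weather": 0.5, "news": 0.5}
queries = ["eclipse weather", "eclipse"]
updated = update_unigram_model(baseline, queries)
```

With these inputs the new word "eclipse" receives nonzero probability while the existing words are scaled down, so the updated distribution still sums to 1.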
    • 8. Invention patent application
    • Activating Content Distribution
    • US20100121704A1
    • 2010-05-13
    • US12617266
    • 2009-11-12
    • Vincent Vanhoucke; Michael H. Cohen; Manish G. Patel; Gudmundur Hafsteinsson
    • Vincent Vanhoucke; Michael H. Cohen; Manish G. Patel; Gudmundur Hafsteinsson
    • G06Q30/00
    • G06Q30/0246; G06Q30/02; G06Q30/0255; G06Q30/0261
    • A computer-implemented method for advertisement distribution includes receiving, in a computer system, an input from an advertiser that has previously registered an advertisement for on-demand activation. The input is generated based on the advertiser having an immediate availability and directs the computer system to initiate the on-demand activation substantially in real time with receiving the input. The method includes determining, using the computer system, a geographic location of the advertiser that corresponds to the immediate availability. The method includes defining, using the computer system, a target group to which the advertisement is to be presented, the target group identified based on at least the geographic location and the immediate availability. The method includes initiating the on-demand activation using the computer system, for receipt of the advertisement by at least part of the target group, the on-demand activation initiated substantially in real time with receiving the input.
    • 9. Granted invention patent
    • Verbal labels for electronic messages
    • US07627638B1
    • 2009-12-01
    • US11019431
    • 2004-12-20
    • Michael H. Cohen
    • Michael H. Cohen
    • G06F15/16
    • G06Q10/107; H04L51/34
    • Verbal labels for electronic messages, as well as systems and methods for making and using such labels, are disclosed. A verbal label is a label containing audio data (such as a digital audio file of a user's voice and/or a speaker template thereof) that is associated with one or more electronic messages. Verbal labels permit a user to more efficiently manipulate e-mail and other electronic messages by voice. For example, a user can add such labels verbally to an e-mail or to a group of e-mails, thereby permitting these messages to be sorted and retrieved more easily.