会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Speech recognition system
    • 语音识别系统
    • US4718094A
    • 1988-01-05
    • US845155
    • 1986-03-27
    • Lalit R. BahlPeter V. deSouzaSteven V. DeGennaroRobert L. Mercer
    • Lalit R. BahlPeter V. deSouzaSteven V. DeGennaroRobert L. Mercer
    • G10L11/00G10L15/08G10L15/10G10L15/12G10L15/14G10L15/18G10L5/00
    • G10L15/142G10L15/08G10L15/144G10L15/187
    • Speech words are recognized by first recognizing each spectral vector identified by a label (feneme), then identifying the word by matching the string of labels against phones using simplified phone machines based on label and transition probabilities and Merkov chains. In one embodiment, a detailed acoustic match word score is combined with an approximate acoustic match word score to provide a total word score for a subject word. In another embodiment, a polling word score is combined with an acoustic match word score to provide a total word score for a subject word. The acoustic models employed in the acoustic matching may correspond, alternatively, to phonetic elements or to fenemes. Fenemes represent labels generated by an acoustic processor in response to a spoken input. Apparatus and method for determining word scores according to approximate acoustic matching and for determining word scores according to a polling methodology are disclosed.
    • 通过首先识别由标签(feneme)标识的每个频谱矢量,然后通过基于标签和转换概率以及Merkov链使用简化的电话机将标签串与电话匹配来识别词语来识别语音词。 在一个实施例中,将详细的声匹配词得分与近似声匹配词得分组合以提供主题词的总词分数。 在另一个实施例中,轮询词得分与声匹配词得分组合以提供主题词的总词分数。 在声学匹配中使用的声学模型可以对应于语音元件或拼音。 Fenemes表示响应于语音输入由声学处理器产生的标签。 公开了根据近似声匹配确定单词分数并根据轮询方法确定单词分数的装置和方法。
    • 2. 发明授权
    • Feneme-based Markov models for words
    • 基于Feneme的马尔可夫模型的词
    • US5165007A
    • 1992-11-17
    • US366231
    • 1989-06-12
    • Lalit R. BahlPeter V. DeSouzaRobert L. MercerMichael A. Picheny
    • Lalit R. BahlPeter V. DeSouzaRobert L. MercerMichael A. Picheny
    • G10L15/02G10L15/06G10L15/14
    • G10L15/142G10L2015/0631
    • In a speech recognition system, apparatus and method for modelling words with label-based Markov models is disclosed. The modelling includes: entering a first speech input, corresponding to words in a vocabulary, into an acoustic processor which converts each spoken word into a sequence of standard labels, where each standard label corresponds to a sound type assignable to an interval of time; representing each standard label as a probabilistic model which has a plurality of states, at least one transition from a state to a state, and at least one settable output probability at some transitions; entering selected acoustic inputs into an acoustic processor which converts the selected acoustic inputs into personalized labels, each personalized label corresponding to a sound type assigned to an interval of time; and setting each output probability as the probability of the standard label represented by a given model producing a particular personalized label at a given transition in the given model. The present invention addresses the problem of generating models of words simply and automatically in a speech recognition system.
    • 在一种语音识别系统中,公开了用基于标签的马尔可夫模型对词进行建模的装置和方法。 所述建模包括:将对应于词汇表中的单词的第一语音输入输入到将每个口语单词转换成标准标签序列的声学处理器,其中每个标准标签对应于可分配到时间间隔的声音类型; 将每个标准标签表示为具有多个状态的概率模型,至少一个从状态到状态的转变,以及在某些转换时的至少一个可设置的输出概率; 将选定的声音输入输入到将所选择的声音输入转换成个性化标签的声学处理器,每个个性化标签对应于分配给一段时间的声音类型; 并将每个输出概率设置为由给定模型表示的标准标签的概率,该给定模型在给定模型中的给定转换处产生特定个性化标签。 本发明解决了在语音识别系统中简单和自动地生成单词模型的问题。
    • 4. 发明授权
    • Speech recognition employing a set of Markov models that includes Markov
models representing transitions to and from silence
    • 语音识别采用一组马尔可夫模型,其中包括表示从沉默转换到沉默的马尔可夫模型
    • US4977599A
    • 1990-12-11
    • US289447
    • 1988-12-15
    • Lalit R. BahlPeter V. DeSouzaRobert L. MercerMichael A. Picheny
    • Lalit R. BahlPeter V. DeSouzaRobert L. MercerMichael A. Picheny
    • G10L15/02G10L15/06G10L15/14
    • G10L15/02G10L15/142G10L2015/0631
    • Apparatus and method for constructing word baseforms which can be matched against a string of generated acoustic labels. A set of phonetic phone machines are formed, wherein each phone machine has (i) a plurality of states, (ii) a plurality of transitions each of which extends from a state to a state, (iii) a stored probability for each transition, and (iv) stored label output probabilities, each label output probability corresponding to the probability of each phone machine producing a corresponding label. The set of phonetic machines is formed to include a subset of onset phone machines. The stored probabilities of each onset phone macine correspond to at least one phonetic element being uttered at the beginning of a speech segment. The set of phonetic machines is formed to include a subset of trailing phone machines. The stored probabilities of each trailing phone machine correspond to at least one single phonetic element being uttered at the end of a speech segment. Word baseforms are constructed by concatenating phone machines selected from the set.
    • 用于构建可与一串生成的声学标签匹配的字基形式的装置和方法。 形成一组语音电话机,其中每个电话机具有(i)多个状态,(ii)多个转换,每个转换从状态延伸到状态,(iii)每个转换的存储概率, 和(iv)存储的标签输出概率,每个标签输出概率对应于每个电话机产生相应标签的概率。 语音机的组合形成为包括起动电话机的一个子集。 每个起始电话机的存储概率对应于在语音段开始时发出的至少一个语音元素。 该组语音机器被形成为包括拖尾电话机的子集。 每个拖尾电话机的存储概率对应于在语音段结束时发出的至少一个单个语音元素。 字基础是通过连接从集合中选择的电话机构成的。
    • 5. 发明授权
    • Synthesizing word baseforms used in speech recognition
    • 合成语言识别中使用的词基形式
    • US4882759A
    • 1989-11-21
    • US853525
    • 1986-04-18
    • Lalit R. BahlPeter V. deSouzaRobert L. MercerMichael A. Picheny
    • Lalit R. BahlPeter V. deSouzaRobert L. MercerMichael A. Picheny
    • G10L11/00G10L15/06G10L15/14
    • G10L15/14
    • Apparatus and method for synthesizing word baseforms for words not spoken during a training session, wherein each synthesized baseform represents a series of models from a first set of models, which include: (a) uttering speech during a training session and representing the uttered speech as a sequence of models from a second set of models; (b) for each of at least some of the second set models spoken in a given phonetic model context during the training session, storing a respective string of first set models; and (c) constructing a word baseform of first set models for a word not spoken during the training session, including the step of representing each piece of a word that corresponds to a second set model in a given context by the stored respective string, if any, corresponding thereto.
    • 用于合成在训练期间未被说出的词语的词基形式的装置和方法,其中每个合成基形式表示来自第一组模型的一系列模型,其包括:(a)在训练期间发出语音并将发出的语音表示为 来自第二组模型的一系列模型; (b)对于训练期间在给定语音模型上下文中说出的至少一些第二组模型中的每一个,存储相应的第一组模型串; 以及(c)在训练会话期间为未被说出的单词构造第一组模型的单词基本形式,包括在给定上下文中通过存储的相应字符串表示对应于第二组模型的单词的每一段的步骤,如果 任何相应的。