会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Continuous speech pattern recognizer
    • 连续语音识别器
    • US4400788A
    • 1983-08-23
    • US248570
    • 1981-03-27
    • Cory S. MyersFrank C. PirzLawrence R. Rabiner
    • Cory S. MyersFrank C. PirzLawrence R. Rabiner
    • G10L11/00G10L15/00G10L15/10G10L15/12G10L1/00
    • G10L15/12G10L15/00
    • This speech recognizer concatenates a string of reference isolated-words for comparison with the unknown string of connected-words. The invention includes a level-building (LB) algorithm, "level" implying a location in a sequence of words. A constrained endpoint dynamic-time-warp algorithm, in which the slope of the warping function is restricted between 1/2 and 2, is used to find the best alignment between an unknown continuous-word test pattern, and a concatenated sequence of L reference patterns. Properties of the LB algorithm include: modification of the references; back-track decision logic; heuristic selection of multiple candidates, and syntax constraints. As a result, the processing required is less than two-level dynamic-program-matching and sampling algorithms.
    • 该语音识别器连接一串参考隔离词,用于与未知字符串的连接词进行比较。 本发明包括水平建立(LB)算法,“水平”意味着单词序列中的位置。 使用约束的端点动态时间 - 扭曲算法,其中扭曲函数的斜率被限制在1/2和2之间,用于找到未知连续字测试模式和L参考的级联序列之间的最佳对准 模式。 LB算法的属性包括:修改引用; 后轨决策逻辑; 多个候选人的启发式选择和语法约束。 因此,所需的处理小于两级动态程序匹配和采样算法。
    • 2. 发明授权
    • Method and apparatus for generating speech pattern templates
    • 用于生成语音模式模板的方法和装置
    • US4454586A
    • 1984-06-12
    • US322748
    • 1981-11-19
    • Frank C. PirzLawrence R. RabinerJay G. Wilpon
    • Frank C. PirzLawrence R. RabinerJay G. Wilpon
    • G10L11/00G10L15/02G10L15/12G10L1/00
    • G10L15/12
    • A system for generating speech pattern templates for use with either speech recognition or speech synthesis. Reference demisyllable templates are first generated from a reference first speaker using both manual and automatic analysis. The analysis for a second speaker is simplified and automated by comparing with the first speaker's templates. The second speaker speaks the same words at a rate time-warped to match the first speakers rate and template. We define a demisyllable as each of the two halves of a syllable, assuming a syllable starts and ends with a noisy consonant, and the syllable is split at its vowel center, thereby simplifying concatenation and comparison. Key features of the invention include generating a set of signals representative of the time alignment between the first and second speaker's templates, and the time-of-occurence boundaries of each syllable in a word.
    • 一种用于产生用于语音识别或语音合成的语音模式模板的系统。 首先使用手动和自动分析从参考的第一个扬声器生成参考分解模板。 通过与第一个演讲人的模板进行比较,简化和自动化第二个演讲者的分析。 第二个发言人以相同的时间说出一致的话,以匹配第一个演讲者的速度和模板。 我们将一个分音节定义为音节的两个半部分,假设音节开始和结尾是一个嘈杂的辅音,并且音节在其元音中心分裂,从而简化了连接和比较。 本发明的主要特征包括产生一组代表第一和第二说话者模板之间的时间对准的信号以及一个单词中每个音节的发生时间边界。
    • 4. 再颁专利
    • Multiple template speech recognition system
    • 多模板语音识别系统
    • USRE31188E
    • 1983-03-22
    • US336067
    • 1981-12-31
    • Frank C. PirzLawrence R. Rabiner
    • Frank C. PirzLawrence R. Rabiner
    • G10L11/02G10L15/06
    • G10L25/87G10L15/063
    • A speech analyzer for recognizing an unknown utterance as one of a set of reference words is adapted to generate a feature signal set for each utterance of every reference word. At least one template signal is produced for each reference word which template signal is representative of a group of feature signal sets. Responsive to a feature signal set formed from the unknown utterance and each reference word template signal, a signal representative of the similarity between the unknown utterance and the template signal is generated. A plurality of similarity signals for each reference word is selected and a signal corresponding to the average of said selected similarity signals is formed. The average similarity signals are compared to identify the unknown utterance as the most similar reference word. Features of the invention include: template formation by successive clustering involving partitioning feature signal sets into groups of predetermined similarity by centerpoint clustering, and recognition by comparing the average of selected similarity measures of a time-warped unknown feature signal set with the cluster-derived reference templates for each vocabulary word.
    • 6. 发明授权
    • Multiple template speech recognition system
    • 多模板语音识别系统
    • US4181821A
    • 1980-01-01
    • US956438
    • 1978-10-31
    • Frank C. PirzLawrence R. Rabiner
    • Frank C. PirzLawrence R. Rabiner
    • G10L11/00G10L11/02G10L15/02G10L15/06G10L15/10G10L1/00
    • G10L25/87G10L15/063
    • A speech analyzer for recognizing an unknown utterance as one of a set of reference words is adapted to generate a feature signal set for each utterance of every reference word. At least one template signal is produced for each reference word which template signal is representative of a group of feature signal sets. Responsive to a feature signal set formed from the unknown utterance and each reference word template signal, a signal representative of the similarity between the unknown utterance and the template signal is generated. A plurality of similarity signals for each reference word is selected and a signal corresponding to the average of said selected similarity signals is formed. The average similarity signals are compared to identify the unknown utterance as the most similar reference word. Features of the invention include: template formation by successive clustering involving partitioning feature signal sets into groups of predetermined similarity by centerpoint clustering, and recognition by comparing the average of selected similarity measures of a time-warped unknown feature signal set with the cluster-derived reference templates for each vocabulary word.
    • 用于将未知发音识别为一组参考词之一的语音分析器适于为每个参考词的每个发声产生一组特征信号。 为每个参考字产生至少一个模板信号,该模板信号代表一组特征信号组。 响应于由未知发音和每个参考字模板信号形成的特征信号组,产生表示未知发音和模板信号之间的相似性的信号。 选择每个参考字的多个相似度信号,并且形成与所述选择的相似度信号的平均值对应的信号。 比较平均相似度信号以将未知话语识别为最相似的参考词。 本发明的特征包括:通过连续聚类的模板形成,其涉及通过中心点聚类将特征信号组划分成预定相似度的组,以及通过将时变未知特征信号集合的所选相似性度量与所述聚类导出参考的平均值进行比较来进行识别 每个词汇单词的模板。
    • 9. 发明授权
    • Speech recognition employing key word modeling and non-key word modeling
    • 语音识别采用关键词建模和非关键词建模
    • US5509104A
    • 1996-04-16
    • US132430
    • 1993-10-06
    • Chin H. LeeLawrence R. RabinerJay G. Wilpon
    • Chin H. LeeLawrence R. RabinerJay G. Wilpon
    • G10L15/00G10L15/14G10L5/00
    • G10L15/142G10L2015/088
    • Speaker independent recognition of small vocabularies, spoken over the long distance telephone network, is achieved using two types of models, one type for defined vocabulary words (e.g., collect, calling-card, person, third-number and operator), and one type for extraneous input which ranges from non-speech sounds to groups of non-vocabulary words (e.g. `I want to make a collect call please`). For this type of key word spotting, modifications are made to a connected word speech recognition algorithm based on state-transitional (hidden Markov) models which allow it to recognize words from a pre-defined vocabulary list spoken in an unconstrained fashion. Statistical models of both the actual vocabulary words and the extraneous speech and background noises are created. A syntax-driven connected word recognition system is then used to find the best sequence of extraneous input and vocabulary word models for matching the actual input speech.
    • 使用两种类型的模型来实现对长途电话网络上的小词汇的独立识别,一种用于定义的词汇单词(例如,收集,呼叫卡,人,第三号码和运营商)的模型,以及一种类型 对于从非语音声音到非词汇单词组(例如“我想要收集电话”)的无关输入。 对于这种类型的关键词发现,对基于状态转换(隐马尔科夫)模型的连接词语音识别算法进行修改,这允许其识别来自以无限制方式说明的预定义词汇列表中的单词。 创建实际词汇单词和无关语音和背景噪声的统计模型。 然后使用语法驱动的连接词识别系统来找到用于匹配实际输入语音的外来输入和词汇词模型的最佳序列。