专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

21. 发明申请

US20070073541A1 Method for compressing dictionary data 审中-公开
公开(公告)号：US20070073541A1
公开(公告)日：2007-03-29
申请号：US11605655
申请日：2006-11-29
申请人： Jilei Tian
发明人： Jilei Tian
IPC分类号： G10L15/04
CPC分类号： G10L15/12 , G10L2015/025 , H03M7/30
摘要： The invention relates to pre-processing of a pronunciation dictionary for compression in a data processing device, the pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units. According to one aspect of the invention the sequence of character units and the sequence of phoneme units are aligned using a statistical algorithm. The aligned sequence of character units and aligned sequence of phoneme units are interleaved by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.

22. 发明申请

US20070038447A1 PATTERN MATCHING METHOD AND APPARATUS AND SPEECH INFORMATION RETRIEVAL SYSTEM 失效
标题翻译：模式匹配方法和设备和语音信息检索系统
公开(公告)号：US20070038447A1
公开(公告)日：2007-02-15
申请号：US11463332
申请日：2006-08-09
申请人： Kazue Kaneko
发明人： Kazue Kaneko
IPC分类号： G10L15/00
CPC分类号： G10L15/02 , G10L15/12 , G10L2015/025
摘要： A pattern matching method for matching between a first symbol sequence and a second symbol sequence which is shorter than the first symbol sequence is provided. The method includes the steps of performing DP matching between the first and second symbol sequences to create a matrix of the DP matching transition, detecting the maximum length of lengths of consecutive correct answers based on the matrix of the DP matching transition, and calculating similarity based on the maximum length.
摘要翻译：提供了一种用于在第一符号序列和比第一符号序列短的第二符号序列之间进行匹配的模式匹配方法。该方法包括以下步骤：在第一和第二符号序列之间执行DP匹配以创建DP匹配转换的矩阵，基于DP匹配转换的矩阵检测连续正确答案的长度的最大长度，并且基于最大长度。

23. 发明申请

US20060287867A1 Method and apparatus for generating a voice tag 审中-公开
标题翻译：用于生成语音标签的方法和装置
公开(公告)号：US20060287867A1
公开(公告)日：2006-12-21
申请号：US11155944
申请日：2005-06-17
申请人： Yan Cheng , Changxue Ma
发明人： Yan Cheng , Changxue Ma
IPC分类号： G10L21/00
CPC分类号： H04M3/4936 , G10L15/12 , G10L2015/223 , H04M2201/405
摘要： A method and apparatus for generating a voice tag (140) includes a means (110) for combining (205) a plurality of utterances (106, 107, 108) into a combined utterance (111) and a means (120) for extraction (210) of the voice tag as a sequence of phonemes having a high likelihood of representing the combined utterance, using a set of stored phonemes (115) and the combined utterance.
摘要翻译：一种用于生成语音标签（140）的方法和装置包括：用于将多个话语（106,107,108）组合（205）到组合话语（111）中的装置（110）和用于提取的装置（120） 210）作为具有表示组合发音的高可能性的音素序列，使用一组存储的音素（115）和组合的话语。

24. 发明授权

US07054812B2 Database annotation and retrieval 失效
公开(公告)号：US07054812B2
公开(公告)日：2006-05-30
申请号：US09840886
申请日：2001-04-25
申请人： Jason Peter Andrew Charlesworth , Philip Neil Garner
发明人： Jason Peter Andrew Charlesworth , Philip Neil Garner
IPC分类号： G10L15/04
CPC分类号： G10L15/12 , G10L2015/025 , G10L2015/085
摘要： A system is provided for determining a sequence of sub-word units representative of at least two words output by a word recognition unit in response to an input word to be recognized. In a preferred embodiment, the word alternatives output by the recognition unit are converted into sequences of phonemes. An optimum alignment between these sequences is then determined using a dynamic programming alignment technique. The sequence of phonemes representative of the input sequences is then determined using this optimum alignment.

25. 发明授权

US07006971B1 Recognition of a speech utterance available in spelled form 失效
标题翻译：识别具有拼写形式的言语言语
公开(公告)号：US07006971B1
公开(公告)日：2006-02-28
申请号：US09663585
申请日：2000-09-18
申请人： Volker Stahl , Alexander Fischer
发明人： Volker Stahl , Alexander Fischer
IPC分类号： G10L15/12
CPC分类号： G10L15/08 , G10L15/12 , G10L15/197 , G10L2015/086
摘要： This invention relates to a method of recognizing a speech utterance (s) available in spelled form, comprising a processing stage in which a corresponding letter sequence (r) is estimated by means of a letter speech recognition unit (2) based on Hidden Markov Models, and a second processing stage (3) in which the estimated result (r) produced by the first processing stage utilizing a statistical letter sequence model (4) and a statistical model (5) for the speech recognition unit (2) is post-processed, while the dynamic programming method is used during the post-processing. For providing robust and efficient speech recognition procedures for the use of speech signals for system control, a grid structure on which the dynamic programming is based and whose node points are provided for the assignment to accumulated probability values, is converted into a tree structure and that an A* algorithm is used for finding an optimum tree path. Also a speech control device wherein a complete word is input as a control signal and at least part of this word in spelled form is input, while the result of the letter speech recognition is used within the scope of the word speech recognition.
摘要翻译：本发明涉及一种识别语音话语的方法（“

26. 发明申请

US20040049387A1 Dynamic time warping device and speech recognition apparatus using the same 失效
标题翻译：动态时间扭曲装置和使用该装置的语音识别装置
公开(公告)号：US20040049387A1
公开(公告)日：2004-03-11
申请号：US10277978
申请日：2002-10-23
发明人： Hong Jeong , Yong Kim
IPC分类号： G10L015/12
CPC分类号： G10L15/12
摘要： Provided are a dynamic time warping device using speech recognition software, and a speech recognition apparatus using the same. The dynamic time warping device includes memory units for processing characterization vectors of a test pattern and a predetermined reference pattern using a FIFO queue, and a plurality of processing elements serially connected to each other, the plurality of processing elements multiplying a predetermined weight by a difference between the characterization vectors of the test and reference patterns, which are obtained by shifting them in the opposite directions, adding the multiplication result to matching cost values of adjacent nodes, and comparing the addition results to detect the smallest matching cost value. Accordingly, fast speech recognition can be realized by embedding speech recognition software using a dynamic time warping algorithm into hardware. Also, it is possible to increase a recognition rate of speech by adjusting weight according to a node to be compared, and provide a dynamic time warping device that can be mass-produced as application-specification integrated circuits (ASICs). Further, a compact speech recognition apparatus using the dynamic time warping device can be provided without requiring a computer to drive software for speech recognition.
摘要翻译：提供了一种使用语音识别软件的动态时间扭曲装置和使用其的语音识别装置。动态时间扭曲装置包括用于处理测试图案的特征向量和使用FIFO队列的预定参考图案的存储单元和多个串联连接的处理元件，所述多个处理元件将预定权重乘以差在通过相反方向移位而获得的测试和参考图案的表征向量之间，将相乘结果相加到相邻节点的匹配成本值，并比较加法结果以检测最小匹配成本值。因此，可以通过使用动态时间扭曲算法将语音识别软件嵌入硬件来实现快速语音识别。此外，通过根据要比较的节点调整权重，可以提高语音的识别率，并且提供可以作为应用规范集成电路（ASIC）批量生产的动态时间整经装置。此外，可以提供使用动态时间扭曲装置的紧凑型语音识别装置，而不需要计算机来驱动用于语音识别的软件。

27. 发明授权

US06611801B2 Gain and noise matching for speech recognition 有权
标题翻译：语音识别的增益和噪声匹配
公开(公告)号：US06611801B2
公开(公告)日：2003-08-26
申请号：US10233493
申请日：2002-09-04
申请人： Adoram Erell
发明人： Adoram Erell
IPC分类号： G10L1504
CPC分类号： G10L15/20 , G10L15/12 , G10L21/0216
摘要： A speech recognition system includes a token builder, a noise estimator, a template padder, a gain and noise adapter and a dynamic time warping (DTW) unit. The token builder produces a widened test token representing an input test utterance and at least one frame before and after the input test utterance. The noise estimator estimates noise qualities of the widened test token. The template padder pads each of a plurality of reference templates with at least one blank frame either the beginning or end of the reference template. The gain and noise adapter adapts each padded reference template with the noise and gain qualities thereby producing adapted reference templates having noise frames wherever a blank frame was originally placed and noise adapted speech where speech exists. The DTW unit performs a noise adapted DTW operation comparing the widened token with one of the noise adapted reference templates, wherein, when comparing against one of the noise frames, no duration constraints are used. The present invention includes the method performed by the system.
摘要翻译：语音识别系统包括令牌构建器，噪声估计器，模板加法器，增益和噪声适配器以及动态时间扭曲（DTW）单元。令牌构建器产生扩展的测试令牌，其表示输入测试话语，并且在输入测试话语之前和之后产生至少一个帧。噪声估计器估计扩大的测试令牌的噪声质量。模板填充器将多个参考模板中的每一个用参考模板的开头或结尾的至少一个空白框来填充。增益和噪声适配器使每个填充的参考模板具有噪声和增益质量，从而产生具有噪声帧的适配参考模板，无论空白帧最初放置在哪里，并且噪声适应的语音存在于语音中。 DTW单元执行噪声调整的DTW操作，将加宽的令牌与噪声适应的参考模板之一进行比较，其中当与噪声帧之一进行比较时，不使用持续时间约束。本发明包括由该系统执行的方法。

28. 发明申请

US20030120482A1 Method for compressing dictionary data 失效
标题翻译：压缩字典数据的方法
公开(公告)号：US20030120482A1
公开(公告)日：2003-06-26
申请号：US10292122
申请日：2002-11-11
发明人： Jilei Tian
IPC分类号： G10L019/06 , G10L015/14
CPC分类号： G10L15/12 , G10L2015/025 , H03M7/30
摘要： The invention relates to pre-processing of a pronunciation dictionary for compression in a data processing device, the pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units. According to one aspect of the invention the sequence of character units and the sequence of phoneme units are aligned using a statistical algorithm. The aligned sequence of character units and aligned sequence of phoneme units are interleaved by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.
摘要翻译：本发明涉及一种用于在数据处理设备中进行压缩的发音字典的预处理，该发音字典包括至少一个条目，该条目包括一系列字符单元和一系列音素单元。根据本发明的一个方面，使用统计算法来对齐字符单元的序列和音素单元的序列。通过将每个音素单元相对于相应的字符单元插入预定位置来交织字符单元的对齐序列和对准的音素单元的顺序。

29. 发明申请

US20020032566A1 APPARATUS, METHOD AND COMPUTER READABLE MEMORY MEDIUM FOR SPEECH RECOGNITON USING DYNAMIC PROGRAMMING 有权
标题翻译：使用动态编程的语音识别器的装置，方法和计算机可读存储器介质
公开(公告)号：US20020032566A1
公开(公告)日：2002-03-14
申请号：US09359912
申请日：1999-07-26
发明人： ELI TZIRKEL-HANCOCK , ROBERT ALEXANDER KEILLER
IPC分类号： G10L015/12 , G10L015/08 , G10L021/00
CPC分类号： H04M3/533 , G10L15/12 , G10L2015/085 , H04M1/271 , H04M1/57 , H04M1/6505 , H04M3/42204 , H04M3/53383 , H04M2201/40
摘要： A method for matching an input pattern with a number of stored reference patterns using a dynamic programming matching technique is described. The reference patterns of a reference signal which are at the end of a dynamic programming path for a current input pattern are listed in an active list. The dynamic programming paths are propagated by processing the reference patterns on the active list, and a new active list is generated for the succeeding input pattern. The amount of processing required for each pattern on the active list is reduced by using a pointer which identifies the reference pattern which is the earliest in the sequence of patterns of the current reference signal listed on the new active list during the processing of a preceding dynamic programming path. In a second aspect, a speech recognition interface is used as a control system for a telephony system.
摘要翻译：描述了使用动态编程匹配技术将输入模式与多个存储的参考模式进行匹配的方法。在当前输入模式的动态编程路径的末尾处的参考信号的参考模式被列在活动列表中。通过处理活动列表上的参考模式来传播动态编程路径，并为后续输入模式生成新的活动列表。通过使用标识在前一动态处理期间在新的活动列表上列出的当前参考信号的模式序列中最早的参考模式的指针来减少活动列表上每个模式所需的处理量编程路径。在第二方面，语音识别接口被用作电话系统的控制系统。

30. 发明授权

US06263216B1 Radiotelephone voice control device, in particular for use in a motor vehicle 有权
标题翻译：无线电话语音控制装置，特别适用于机动车辆
公开(公告)号：US06263216B1
公开(公告)日：2001-07-17
申请号：US09411382
申请日：1999-10-04
申请人： Henri Seydoux , Nicolas Besnard
发明人： Henri Seydoux , Nicolas Besnard
IPC分类号： H04B138
CPC分类号： H04M1/6075 , G10L15/12 , G10L15/20 , G10L2021/02168 , H01M10/48 , H01M2200/10 , H04M1/271
摘要： The apparatus comprises a data memory containing a series of correspondents' call numbers and, for each call number, at least one associated voice print; a sound transducer suitable for picking up the name of a desired corespondent as spoken by the user of the apparatus; voice recognition means suitable for analyzing the correspondent's name as picked up by the transducer and for transforming it into an associated voice print; selective memory addressing means including associative means suitable for finding a voice print in the memory corresponding to the print supplied by the voice recognition means, and in the event of a match, for addressing the corresponding memory position; and means co-operating with the associative means for applying the addressed call number to the radiotelephone circuits. The voice recognition means evaluate and store a current noise level as picked up by the transducer in the absence of a speech signal; when in the presence of a speech signal, they subtract the previously evaluated current noise level from the signal as picked up; and then they apply the resulting signal as obtained in this way to a DTW type voice recognition algorithm with pattern recognition by dynamic programming adapted to speech using dynamic parameter extraction functions, in particular a predictive dynamic algorithm with forward and/or backward and/or frequency masking.
摘要翻译：该装置包括数据存储器，其包含一系列通信对方的呼叫号码，并且对于每个呼叫号码，包含至少一个相关联的语音打印; 适于拾取所述装置的使用者所说的期望的记者的名称的声音传感器; 语音识别装置适用于分析由传感器拾取的记者姓名，并将其转换成相关的语音打印; 选择性存储器寻址装置，包括适于在对应于由语音识别装置提供的打印的存储器中找到语音打印的关联装置，以及在匹配的情况下，用于寻址相应的存储器位置; 并且与用于将寻址的呼叫号码应用于无线电话电路的关联装置协同工作。语音识别装置评估和存储在没有语音信号的情况下由换能器拾取的当前噪声电平; 当存在语音信号时，它们从拾取的信号中减去先前评估的当前噪声电平; 然后利用动态参数提取函数，特别是具有向前和/或向后和/或频率的预测动态算法，将以这种方式获得的结果信号应用于具有利用动态规划的动态规划的模式识别的DTW型语音识别算法掩蔽。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式