会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 25. 发明授权
    • Recognition of a speech utterance available in spelled form
    • 识别具有拼写形式的言语言语
    • US07006971B1
    • 2006-02-28
    • US09663585
    • 2000-09-18
    • Volker StahlAlexander Fischer
    • Volker StahlAlexander Fischer
    • G10L15/12
    • G10L15/08G10L15/12G10L15/197G10L2015/086
    • This invention relates to a method of recognizing a speech utterance (s) available in spelled form, comprising a processing stage in which a corresponding letter sequence (r) is estimated by means of a letter speech recognition unit (2) based on Hidden Markov Models, and a second processing stage (3) in which the estimated result (r) produced by the first processing stage utilizing a statistical letter sequence model (4) and a statistical model (5) for the speech recognition unit (2) is post-processed, while the dynamic programming method is used during the post-processing. For providing robust and efficient speech recognition procedures for the use of speech signals for system control, a grid structure on which the dynamic programming is based and whose node points are provided for the assignment to accumulated probability values, is converted into a tree structure and that an A* algorithm is used for finding an optimum tree path. Also a speech control device wherein a complete word is input as a control signal and at least part of this word in spelled form is input, while the result of the letter speech recognition is used within the scope of the word speech recognition.
    • 本发明涉及一种识别语音话语的方法(“
    • 26. 发明申请
    • Dynamic time warping device and speech recognition apparatus using the same
    • 动态时间扭曲装置和使用该装置的语音识别装置
    • US20040049387A1
    • 2004-03-11
    • US10277978
    • 2002-10-23
    • Hong JeongYong Kim
    • G10L015/12
    • G10L15/12
    • Provided are a dynamic time warping device using speech recognition software, and a speech recognition apparatus using the same. The dynamic time warping device includes memory units for processing characterization vectors of a test pattern and a predetermined reference pattern using a FIFO queue, and a plurality of processing elements serially connected to each other, the plurality of processing elements multiplying a predetermined weight by a difference between the characterization vectors of the test and reference patterns, which are obtained by shifting them in the opposite directions, adding the multiplication result to matching cost values of adjacent nodes, and comparing the addition results to detect the smallest matching cost value. Accordingly, fast speech recognition can be realized by embedding speech recognition software using a dynamic time warping algorithm into hardware. Also, it is possible to increase a recognition rate of speech by adjusting weight according to a node to be compared, and provide a dynamic time warping device that can be mass-produced as application-specification integrated circuits (ASICs). Further, a compact speech recognition apparatus using the dynamic time warping device can be provided without requiring a computer to drive software for speech recognition.
    • 提供了一种使用语音识别软件的动态时间扭曲装置和使用其的语音识别装置。 动态时间扭曲装置包括用于处理测试图案的特征向量和使用FIFO队列的预定参考图案的存储单元和多个串联连接的处理元件,所述多个处理元件将预定权重乘以差 在通过相反方向移位而获得的测试和参考图案的表征向量之间,将相乘结果相加到相邻节点的匹配成本值,并比较加法结果以检测最小匹配成本值。 因此,可以通过使用动态时间扭曲算法将语音识别软件嵌入硬件来实现快速语音识别。 此外,通过根据要比较的节点调整权重,可以提高语音的识别率,并且提供可以作为应用规范集成电路(ASIC)批量生产的动态时间整经装置。 此外,可以提供使用动态时间扭曲装置的紧凑型语音识别装置,而不需要计算机来驱动用于语音识别的软件。
    • 27. 发明授权
    • Gain and noise matching for speech recognition
    • 语音识别的增益和噪声匹配
    • US06611801B2
    • 2003-08-26
    • US10233493
    • 2002-09-04
    • Adoram Erell
    • Adoram Erell
    • G10L1504
    • G10L15/20G10L15/12G10L21/0216
    • A speech recognition system includes a token builder, a noise estimator, a template padder, a gain and noise adapter and a dynamic time warping (DTW) unit. The token builder produces a widened test token representing an input test utterance and at least one frame before and after the input test utterance. The noise estimator estimates noise qualities of the widened test token. The template padder pads each of a plurality of reference templates with at least one blank frame either the beginning or end of the reference template. The gain and noise adapter adapts each padded reference template with the noise and gain qualities thereby producing adapted reference templates having noise frames wherever a blank frame was originally placed and noise adapted speech where speech exists. The DTW unit performs a noise adapted DTW operation comparing the widened token with one of the noise adapted reference templates, wherein, when comparing against one of the noise frames, no duration constraints are used. The present invention includes the method performed by the system.
    • 语音识别系统包括令牌构建器,噪声估计器,模板加法器,增益和噪声适配器以及动态时间扭曲(DTW)单元。 令牌构建器产生扩展的测试令牌,其表示输入测试话语,并且在输入测试话语之前和之后产生至少一个帧。 噪声估计器估计扩大的测试令牌的噪声质量。 模板填充器将多个参考模板中的每一个用参考模板的开头或结尾的至少一个空白框来填充。 增益和噪声适配器使每个填充的参考模板具有噪声和增益质量,从而产生具有噪声帧的适配参考模板,无论空白帧最初放置在哪里,并且噪声适应的语音存在于语音中。 DTW单元执行噪声调整的DTW操作,将加宽的令牌与噪声适应的参考模板之一进行比较,其中当与噪声帧之一进行比较时,不使用持续时间约束。 本发明包括由该系统执行的方法。
    • 28. 发明申请
    • Method for compressing dictionary data
    • 压缩字典数据的方法
    • US20030120482A1
    • 2003-06-26
    • US10292122
    • 2002-11-11
    • Jilei Tian
    • G10L019/06G10L015/14
    • G10L15/12G10L2015/025H03M7/30
    • The invention relates to pre-processing of a pronunciation dictionary for compression in a data processing device, the pronunciation dictionary comprising at least one entry, the entry comprising a sequence of character units and a sequence of phoneme units. According to one aspect of the invention the sequence of character units and the sequence of phoneme units are aligned using a statistical algorithm. The aligned sequence of character units and aligned sequence of phoneme units are interleaved by inserting each phoneme unit at a predetermined location relative to the corresponding character unit.
    • 本发明涉及一种用于在数据处理设备中进行压缩的发音字典的预处理,该发音字典包括至少一个条目,该条目包括一系列字符单元和一系列音素单元。 根据本发明的一个方面,使用统计算法来对齐字符单元的序列和音素单元的序列。 通过将每个音素单元相对于相应的字符单元插入预定位置来交织字符单元的对齐序列和对准的音素单元的顺序。
    • 30. 发明授权
    • Radiotelephone voice control device, in particular for use in a motor vehicle
    • 无线电话语音控制装置,特别适用于机动车辆
    • US06263216B1
    • 2001-07-17
    • US09411382
    • 1999-10-04
    • Henri SeydouxNicolas Besnard
    • Henri SeydouxNicolas Besnard
    • H04B138
    • H04M1/6075G10L15/12G10L15/20G10L2021/02168H01M10/48H01M2200/10H04M1/271
    • The apparatus comprises a data memory containing a series of correspondents' call numbers and, for each call number, at least one associated voice print; a sound transducer suitable for picking up the name of a desired corespondent as spoken by the user of the apparatus; voice recognition means suitable for analyzing the correspondent's name as picked up by the transducer and for transforming it into an associated voice print; selective memory addressing means including associative means suitable for finding a voice print in the memory corresponding to the print supplied by the voice recognition means, and in the event of a match, for addressing the corresponding memory position; and means co-operating with the associative means for applying the addressed call number to the radiotelephone circuits. The voice recognition means evaluate and store a current noise level as picked up by the transducer in the absence of a speech signal; when in the presence of a speech signal, they subtract the previously evaluated current noise level from the signal as picked up; and then they apply the resulting signal as obtained in this way to a DTW type voice recognition algorithm with pattern recognition by dynamic programming adapted to speech using dynamic parameter extraction functions, in particular a predictive dynamic algorithm with forward and/or backward and/or frequency masking.
    • 该装置包括数据存储器,其包含一系列通信对方的呼叫号码,并且对于每个呼叫号码,包含至少一个相关联的语音打印; 适于拾取所述装置的使用者所说的期望的记者的名称的声音传感器; 语音识别装置适用于分析由传感器拾取的记者姓名,并将其转换成相关的语音打印; 选择性存储器寻址装置,包括适于在对应于由语音识别装置提供的打印的存储器中找到语音打印的关联装置,以及在匹配的情况下,用于寻址相应的存储器位置; 并且与用于将寻址的呼叫号码应用于无线电话电路的关联装置协同工作。 语音识别装置评估和存储在没有语音信号的情况下由换能器拾取的当前噪声电平; 当存在语音信号时,它们从拾取的信号中减去先前评估的当前噪声电平; 然后利用动态参数提取函数,特别是具有向前和/或向后和/或频率的预测动态算法,将以这种方式获得的结果信号应用于具有利用动态规划的动态规划的模式识别的DTW型语音识别算法 掩蔽。