会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 82. 发明授权
    • Speech recognition
    • 语音识别
    • US4956865A
    • 1990-09-11
    • US191824
    • 1988-05-02
    • Matthew LennigPaul MermelsteinVishwa N. Gupta
    • Matthew LennigPaul MermelsteinVishwa N. Gupta
    • G10L11/02G10L15/02
    • G10L25/87G10L15/02
    • In a speech recognizer, for recognizing unknown utterances in isolated-word speech or continuous speech, improved recognition accuracy is obtained by augmenting the usual spectral representation of the unknown utterance with a dynamic component. A corresponding dynamic component is provided in the templates with which the spectral representation of the utterance is compared. In preferred embodiments, the representation is mel-based cepstral and the dynamic components comprise vector differences between pairs of primary cepstra. Preferably the time interval between each pair is about 50 milliseconds. It is also preferable to compute a dynamic perceptual loudness component along with the dynamic parameters.
    • 在语音识别器中,为了识别孤立词语音或连续语音中的未知语音,通过用动态分量增加未知语音的常规频谱表示来获得改进的识别精度。 在模板中提供相应的动态分量,与之对比发音的频谱表示。 在优选实施例中,该表示是基于mel的倒频谱,并且动态分量包括主要cepstra对之间的矢量差异。 优选地,每对之间的时间间隔约为50毫秒。 还优选地计算动态感知响度分量以及动态参数。
    • 83. 发明授权
    • Speech recognition apparatus
    • 语音识别装置
    • US4833714A
    • 1989-05-23
    • US228149
    • 1988-08-04
    • Mitsuo ShimotaniMasahiro HibinoKenji Shima
    • Mitsuo ShimotaniMasahiro HibinoKenji Shima
    • G10L11/02G10L11/04G10L15/00
    • G10L25/87G10L15/00G10L25/90
    • A word speech recognition apparatus recognizes a speech inputted to a microphone (11). A feature extracting portion (20) extracts a feature parameter based on a aural signal waveform outputted from the microphone. The feature extracting circuit comprises a pitch cycle extraction circuit (21) for extracting pitch frequency of the speech signal waveform, a digital filter (23) for extracting, as a feature parameter, spectrum data of the speech signal waveform, and a filter coefficient setting circuit (22) for setting a filter coefficient so that a resonance frequency of the digital filter is an integral multiple of the pitch frequency. The feature parameter extracted from the feature extracting circuit is stored in an input pattern memory (3). A recognition processing portion (50) evaluates similarity between the feature parameter stored in advance in a registration pattern memory (4) and the feature parameter stored in the input pattern memory, so that speech recognition processing is made. Improved signal in noise performance results from very narrow bandwidth filters.
    • 字语音识别装置识别输入到麦克风(11)的语音。 特征提取部(20)基于从麦克风输出的听觉信号波形提取特征参数。 特征提取电路包括用于提取语音信号波形的音调频率的音调周期提取电路(21),用于提取语音信号波形的频谱数据作为特征参数的数字滤波器(23)和滤波器系数设置 电路(22),用于设置滤波器系数,使得数字滤波器的谐振频率是音调频率的整数倍。 从特征提取电路提取的特征参数存储在输入图案存储器(3)中。 识别处理部分(50)评估预先存储在注册模式存储器(4)中的特征参数与存储在输入模式存储器中的特征参数之间的相似度,从而进行语音识别处理。 改进的噪声性能信号来自非常窄的带宽滤波器。
    • 85. 发明授权
    • Speech recognition training method
    • 语音识别训练方法
    • US4718088A
    • 1988-01-05
    • US593891
    • 1984-03-27
    • James K. BakerJohn W. KlovstadChin-Hui LeeKalyan Ganesan
    • James K. BakerJohn W. KlovstadChin-Hui LeeKalyan Ganesan
    • G10L11/02G10L15/02G10L15/06G10L15/08G10L15/12G10L5/00
    • G10L25/87G10L15/063G10L15/083G10L15/02G10L15/12G10L2015/0638G10L25/27
    • A speech recognition method and apparatus employ a speech processing circuitry for repetitively deriving from a speech input, at a frame repetition rate, a plurality of acoustic parameters. The acoustic parameters represent the speech input signal for a frame time. A plurality of template matching and cost processing circuitries are connected to a system bus, along with the speech processing circuitry, for determining, or identifying, the speech units in the input speech, by comparing the acoustic parameters with stored template patterns. The apparatus can be expanded by adding more template matching and cost processing circuitry to the bus thereby increasing the speech recognition capacity of the apparatus. Template pattern generation is advantageously aided by using a "joker" word to specify the time boundaries of utterances spoken in isolation, by finding the beginning and ending of an utterance surrounded by silence.
    • 语音识别方法和装置采用语音处理电路,以帧重复率重复地从语音输入中导出多个声学参数。 声学参数表示帧时间的语音输入信号。 通过将声学参数与存储的模板图案进行比较,多个模板匹配和成本处理电路连同语音处理电路连接到用于确定或识别输入语音中的语音单元的系统总线。 可以通过向总线添加更多的模板匹配和成本处理电路来扩展该装置,从而增加装置的语音识别能力。 通过使用“小丑”字通过找到由沉默包围的话语的开始和结束来有助于指定孤立地说出的话语的时间边界。
    • 86. 发明授权
    • Apparatus for detecting an utterance boundary
    • 用于检测话语边界的装置
    • US4696041A
    • 1987-09-22
    • US575383
    • 1984-01-30
    • Tomio Sakata
    • Tomio Sakata
    • G10L11/00G10L11/02G10L15/04G10L5/00
    • G10L25/87
    • An utterance boundary detecting apparatus of this invention includes an acoustic processor for generating speech parameter time sequence data according to an input speech signal. The speech parameter time sequence data generated from the acoustic processor is delivered to a buffer memory and noise level determining circuit. The noise level determining circuit calculates the average value of speech parameter values of a background noise corresponding to a silent period when a speech signal is input as words uttered. The apparatus includes a threshold value calculating circuit for calculating an utterance boundary detection threshold value on the basis of an average value calculated by the noise level determining circuit. An utterance boundary detecting circuit generates utterance boundary data on the basis of the threshold value from the threshold value calculating circuit and speech parameter time sequence data in a buffer memory.
    • 本发明的发声边界检测装置包括声处理器,用于根据输入的语音信号产生语音参数时间序列数据。 从声学处理器生成的语音参数时间序列数据被传送到缓冲存储器和噪声电平确定电路。 噪声电平确定电路计算当语音信号被输入时与静音时段对应的背景噪声的语音参数值的平均值。 该装置包括阈值计算电路,用于根据由噪声电平确定电路计算的平均值来计算发声边界检测阈值。 话音边界检测电路根据来自阈值计算电路的阈值和缓冲存储器中的语音参数时间序列数据产生话音边界数据。
    • 88. 再颁专利
    • Multiple template speech recognition system
    • 多模板语音识别系统
    • USRE31188E
    • 1983-03-22
    • US336067
    • 1981-12-31
    • Frank C. PirzLawrence R. Rabiner
    • Frank C. PirzLawrence R. Rabiner
    • G10L11/02G10L15/06
    • G10L25/87G10L15/063
    • A speech analyzer for recognizing an unknown utterance as one of a set of reference words is adapted to generate a feature signal set for each utterance of every reference word. At least one template signal is produced for each reference word which template signal is representative of a group of feature signal sets. Responsive to a feature signal set formed from the unknown utterance and each reference word template signal, a signal representative of the similarity between the unknown utterance and the template signal is generated. A plurality of similarity signals for each reference word is selected and a signal corresponding to the average of said selected similarity signals is formed. The average similarity signals are compared to identify the unknown utterance as the most similar reference word. Features of the invention include: template formation by successive clustering involving partitioning feature signal sets into groups of predetermined similarity by centerpoint clustering, and recognition by comparing the average of selected similarity measures of a time-warped unknown feature signal set with the cluster-derived reference templates for each vocabulary word.
    • 89. 发明授权
    • Training circuit for audio signal recognition computer
    • 音频信号识别计算机训练电路
    • US4297528A
    • 1981-10-27
    • US073781
    • 1979-09-10
    • Richard D. Beno
    • Richard D. Beno
    • G06K9/66G06T1/00G10L11/00G10L11/02G10L15/06G10L1/00
    • G10L25/87G10L15/063
    • A signal pattern recognition system includes a reference pattern memory storing plural reference data patterns against which input patterns are compared for recognition. The reference data patterns are formed by merging training patterns together. Each training pattern, to be accepted for merging, must match the previously merged patterns by a threshold amount. The threshold is automatically varied as the number of previously merged training patterns increases. If a predetermined number of successive training patterns is below the threshold, the training process is repeated, from the beginning, for the reference pattern which generates the errors. The system automatically trains each reference pattern with the same number of training patterns to assure uniformity when input data patterns are compared for recognition.
    • 信号模式识别系统包括存储多个参考数据模式的参考模式存储器,用于识别输入模式。 通过将训练模式合并在一起形成参考数据模式。 要被合并的每个训练模式必须与先前合并的模式相匹配,达到阈值。 随着先前合并训练模式的数量增加,阈值自动变化。 如果预定数量的连续训练模式低于阈值,则从一开始就针对产生错误的参考模式重复训练过程。 系统使用相同数量的训练模式自动训练每个参考模式,以确保在输入数据模式进行比较以进行识别时的均匀性。
    • 90. 发明授权
    • Audio signal recognition computer
    • 音频信号识别电脑
    • US4292470A
    • 1981-09-29
    • US73792
    • 1979-09-10
    • Byung H. An
    • Byung H. An
    • G10L11/00G10L11/02G10L15/02G10L15/28G10L1/00
    • G10L25/87G10L15/285
    • A signal encoder and classifier particularly adapted to speech recognition includes a circularly addressed buffer which is independently addressed by a new data writing address system and a buffered data reading system so that writing and reading of data may be accomplished on a time shared basis. This time shared operation permits serial writing and reading of the pattern data without interrupting income signal storage. The writing data address system addresses the data into the buffer in a circular fashion while the reading data address system utilizes stored addresses identifying the beginning and end of the signal patterns for addressing sequential patterns from the buffer.
    • 特别适用于语音识别的信号编码器和分类器包括循环寻址的缓冲器,其由新的数据写入地址系统和缓冲的数据读取系统独立地寻址,使得数据的写入和读取可以在时间上共享的基础上完成。 这种共享操作允许串行写入和读取模式数据而不中断收入信号存储。 写入数据地址系统以循环方式将数据寻址到缓冲器中,而读取数据地址系统利用标识信号模式的开始和结束的存储的地址来从缓冲器寻址顺序模式。