专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US4209672A Method and apparatus for measuring characteristics of a loudspeaker 失效
标题翻译：用于测量扬声器特性的方法和装置
公开(公告)号：US4209672A
公开(公告)日：1980-06-24
申请号：US924357
申请日：1978-07-13
申请人： Tsuneo Nitta , Masatoshi Tanaka
发明人： Tsuneo Nitta , Masatoshi Tanaka
IPC分类号： H04R29/00
CPC分类号： H04R29/001
摘要： A loudspeaker and a microphone are arranged in a normal room, the loudspeaker being supplied with an impulse signal. A direct response sound from the loudspeaker and reflected sounds from wall surfaces in three directions of the normal room are converted into a response signal by the microphone. The response signal is A/D-converted, and then Fourier-transformed. The Fourier-transformed response signal is converted into a response signal with an absolute value, and then into a logarithmic response signal. The logarithmic response signal is filtered to eliminate signal components corresponding to the reflected sound. The filtered logarithmic response signal is A/D-converted, and supplied to a recorder.
摘要翻译：扬声器和麦克风布置在正常的房间中，扬声器被提供脉冲信号。来自扬声器的直接响应声音和来自正常房间三个方向的墙壁表面的反射声音被麦克风转换成响应信号。响应信号进行A / D转换，然后进行傅里叶变换。傅里叶变换的响应信号被转换为具有绝对值的响应信号，然后转换成对数响应信号。对数响应信号被滤波以消除对应于反射声的信号分量。经滤波的对数响应信号进行A / D转换，并提供给记录器。

2. 发明授权

US4677673A Continuous speech recognition apparatus 失效
标题翻译：连续语音识别装置
公开(公告)号：US4677673A
公开(公告)日：1987-06-30
申请号：US563755
申请日：1983-12-21
申请人： Teruhiko Ukita , Tsuneo Nitta , Sadakazu Watanabe
发明人： Teruhiko Ukita , Tsuneo Nitta , Sadakazu Watanabe
IPC分类号： G10L11/00 , G10L15/00 , G10L15/10 , G10L5/00
CPC分类号： G10L15/00
摘要： Continuous speech signal is recognized using "rough" and "detail" parameters derived from prestored reference speech and current unknown speech. The detail parameters are 16 spectral coefficients, the rough parameters 2 or 4 spectral coefficients representing the signal. A word interval detector decides segmentation based on rough parameter similarity.
摘要翻译：使用从预存储的参考语音和当前未知语音导出的“粗略”和“详细”参数识别连续语音信号。详细参数是16个频谱系数，粗略参数2或4个表示信号的频谱系数。字间隔检测器根据粗略的参数相似度来决定分割。

3. 发明授权

US4405838A Phoneme information extracting apparatus 失效
标题翻译：音素信息提取装置
公开(公告)号：US4405838A
公开(公告)日：1983-09-20
申请号：US273400
申请日：1981-06-15
申请人： Tsuneo Nitta , Hideki Kasuya
发明人： Tsuneo Nitta , Hideki Kasuya
IPC分类号： G10L11/00 , G10L15/00 , G10L15/04 , G10L15/10 , G10L1/00
CPC分类号： G10L15/00
摘要： A phoneme information extracting apparatus includes correlation data generators for successively generating correlation data representing the correlation between the acoustic power spectrum data corresponding to input voice and power spectrum data of various reference phonemes, selection circuits for successively transferring these correlation data when they detect that three or more successive correlation data have values greater than a predetermined value, maximum data hold circuits for holding the maximum correlation data among the correlation data transferred from the respective selection circuits, and a phoneme determination circuit for determining the optimum phoneme by detecting one of the data hold circuits that is holding the maximum correlation data among the correlation data held in the data hold circuits.
摘要翻译：音素信息提取装置包括：相关数据发生器，用于连续产生表示与输入声音相对应的声功率谱数据与各种参考音素的功率谱数据之间的相关性的相关数据;选择电路，当它们检测到三个或更连续的相关数据具有大于预定值的值，用于保持从各个选择电路传送的相关数据之间的最大相关数据的最大数据保持电路和用于通过检测数据保持之一来确定最佳音素的音素确定电路在保持在数据保持电路中的相关数据中保持最大相关数据的电路。

4. 发明授权

US5615300A Text-to-speech synthesis with controllable processing time and speech quality 失效
标题翻译：具有可控处理时间和语音质量的文本到语音合成
公开(公告)号：US5615300A
公开(公告)日：1997-03-25
申请号：US67079
申请日：1993-05-26
申请人： Yoshiyuki Hara , Tsuneo Nitta
发明人： Yoshiyuki Hara , Tsuneo Nitta
IPC分类号： G06F3/16 , G06F17/28 , G10L13/04 , G10L13/06 , G10L13/08 , G10L9/00
CPC分类号： G10L13/047 , G10L13/04
摘要： Synthesized speech is generated by a software-implemented system with a programmed central processing unit. Phonetic parameters are generated from a series of phonetic symbols of an input text to be converted into synthesized speech, and prosodic parameters are also generated from prosodic information of the input text. The activity ratio of the central processing unit is determined, and the order of phonetic parameters or the arrangement of a synthesis unit or filter for speech synthesis is determined depending on the determined activity ratio of the central processing unit. Synthesized speech sounds are generated and filtered based on the phonetic and prosodic parameters according to the determined order of phonetic parameters or the determined arrangement of the filter.
摘要翻译：合成语音由具有编程的中央处理单元的软件实现的系统产生。语音参数是从要转换为合成语音的输入文本的一系列语音符号生成的，并且还由输入文本的韵律信息生成韵律参数。确定中央处理单元的活动比，并且根据所确定的中央处理单元的活动比确定语音合成的合成单元或滤波器的语音参数或排列顺序。根据确定的语音参数顺序或确定的滤波器布置，基于语音和韵律参数来生成和滤波合成语音。

5. 发明授权

US5133012A Speech recognition system utilizing both a long-term strategic and a short-term strategic scoring operation in a transition network thereof 失效
标题翻译：语音识别系统利用长期战略和短期战略分级操作在其过渡网络中
公开(公告)号：US5133012A
公开(公告)日：1992-07-21
申请号：US443485
申请日：1989-11-30
申请人： Tsuneo Nitta
发明人： Tsuneo Nitta
IPC分类号： G06F3/16 , G10L11/00 , G10L15/00 , G10L15/10 , G10L15/18
CPC分类号： G10L15/00
摘要： A plurality of candidate phonetic segments extracted from the input speech signal are passed through transition networks prepared for the respective words so as to obtain a score by weighting/averaging the long-term strategic scores by taking consideration of statistic distribution of the similarities or distances of phonetic segments and the short-term strategic scores by taking consideration of the environment of the phonetic segments.

6. 发明授权

US4979213A Orthogonalized dictionary speech recognition apparatus and method thereof 失效
标题翻译：正交字典语音识别装置及其方法
公开(公告)号：US4979213A
公开(公告)日：1990-12-18
申请号：US378780
申请日：1989-07-12
申请人： Tsuneo Nitta
发明人： Tsuneo Nitta
IPC分类号： G10L11/00 , G10L15/06
CPC分类号： G10L15/063
摘要： Speech pattern data representing speech of a plurality of speakers are stored in a pattern storage section in advance. Averaged pattern data obtained by averaging a plurality of speech pattern data of the first of the plurality of speakers are obtained. Data obtained by blurring and differentiating the averaged pattern data are stored in an orthogonalized dictionary as basic orthogonalized dictionary data of first and second axes, respectively. Blurred data and differentiated data obtained with respect to the second and subsequent of the plurality of speakers are selectively stored in the orthogonalized dictionary as additional dictionary data having new axes. Speech of the plurality of speakers is recognized by computing a similarity between the orthogonalized dictionary formed in this manner and input speech.
摘要翻译：代表多个扬声器的语音的语音模式数据预先存储在模式存储部分中。获得通过对多个扬声器中的第一个的多个语音图案数据进行平均而获得的平均图案数据。通过模糊和区分平均图案数据获得的数据分别存储在正交字典中作为第一和第二轴的基本正交字典数据。选择性地，在正交字典中存储相对于多个扬声器的第二和随后获得的模糊数据和微分数据作为具有新轴的附加字典数据。通过计算以这种方式形成的正交字典和输入语音之间的相似度来识别多个扬声器的语音。

7. 发明授权

US4624011A Speech recognition system 失效
标题翻译：语音识别系统
公开(公告)号：US4624011A
公开(公告)日：1986-11-18
申请号：US462042
申请日：1983-01-28
申请人： Sadakazu Watanabe , Hidenori Shinoda , Tsuneo Nitta , Yoichi Takebayashi , Shouichi Hirai , Tomio Sakata , Kensuke Uehara , Yasuo Takahashi , Haruo Asada
发明人： Sadakazu Watanabe , Hidenori Shinoda , Tsuneo Nitta , Yoichi Takebayashi , Shouichi Hirai , Tomio Sakata , Kensuke Uehara , Yasuo Takahashi , Haruo Asada
IPC分类号： G10L15/10 , G10L11/00 , G10L15/00 , G10L15/28 , G10L5/00
CPC分类号： G10L15/00
摘要： An acoustic signal processing circuit extracts input speech pattern data and subsidiary feature data from an input speech signal. The input speech pattern data comprise frequency spectra, whereas the subsidiary feature data comprise phoneme and acoustic features. These data are then stored in a data buffer memory. The similarity measures between the input speech pattern data stored in the data buffer memory and reference speech pattern data stored in a dictionary memory are computed by a similarity computation circuit. When the largest similarity measure exceeds a first threshold value and when the difference between the largest similarity measure and the second largest measure exceeds a second threshold value, category data of the reference pattern which gives the largest similarity measure is produced by a control circuit to correspond to an input speech. When recognition cannot be performed, the categories of the reference speech patterns which respectively give the largest to mth similarity measures are respectively compared with the subsidiary feature data. In this manner, subsidiary feature recognition of the input voice is performed by a subsidiary feature recognition section.
摘要翻译：声信号处理电路从输入语音信号中提取输入语音模式数据和辅助特征数据。输入语音模式数据包括频谱，而辅助特征数据包括音素和声学特征。然后将这些数据存储在数据缓冲存储器中。存储在数据缓冲存储器中的输入语音模式数据与字典存储器中存储的参考语音模式数据之间的相似性度量由相似度计算电路计算。当最大相似性度量超过第一阈值时，当最大相似性度量与第二最大量度之间的差异超过第二阈值时，通过控制电路产生给出最大相似性度量的参考模式的类别数据，以对应于输入语音。当不能执行识别时，将分别给出最大到第m个相似性度量的参考语音模式的类别分别与辅助特征数据进行比较。以这种方式，由辅助特征识别部执行输入语音的辅助特征识别。

8. 发明授权

US08626508B2 Speech search device and speech search method 失效
标题翻译：语音搜索设备和语音搜索方法
公开(公告)号：US08626508B2
公开(公告)日：2014-01-07
申请号：US13203371
申请日：2010-02-10
申请人： Koichi Katsurada , Tsuneo Nitta , Shigeki Teshima
发明人： Koichi Katsurada , Tsuneo Nitta , Shigeki Teshima
IPC分类号： G10L15/00 , G10L17/00 , G10L15/06 , G10L21/00 , G10L25/00 , G06F7/00 , G06F17/30
CPC分类号： G10L15/12 , G10L2015/025
摘要： Provided are a speech search device, the search speed of which is very fast, the search performance of which is also excellent, and which performs fuzzy search, and a speech search method. Not only the fuzzy search is performed, but also the distance between phoneme discrimination features included in speech data is calculated to determine the similarity with respect to the speech using both a suffix array and dynamic programming, and an object to be searched for is narrowed by means of search keyword division based on a phoneme and search thresholds relative to a plurality of the divided search keywords, the object to be searched for is repeatedly searched for while increasing the search thresholds in order, and whether or not there is the keyword division is determined according to the length of the search keywords, thereby implementing speech search, the search speed of which is very fast and the search performance of which is also excellent.
摘要翻译：提供了一种语音搜索装置，其搜索速度非常快，其搜索性能也优异，并且执行模糊搜索和语音搜索方法。不仅执行模糊搜索，而且还计算包括在语音数据中的音素辨别特征之间的距离，以使用后缀数组和动态编程来确定相对于语音的相似度，并且要搜索的对象被基于相对于多个划分的搜索关键词的音素和搜索阈值的搜索关键词划分的手段，重复地搜索要搜索的对象，同时依次增加搜索阈值，并且是否存在关键词分割根据搜索关键词的长度确定，从而实现语音搜索，搜索速度非常快，搜索性能也很好。

9. 发明授权

US5506933A Speech recognition using continuous density hidden markov models and the orthogonalizing karhunen-loeve transformation 失效
标题翻译：使用连续密度隐马尔可夫模型和正交化karhunen-loeve变换的语音识别
公开(公告)号：US5506933A
公开(公告)日：1996-04-09
申请号：US30618
申请日：1993-03-12
申请人： Tsuneo Nitta
发明人： Tsuneo Nitta
IPC分类号： G10L15/10 , G10L15/02 , G10L15/06 , G10L15/14 , G10L9/00
CPC分类号： G10L15/144
摘要： A recognition system comprises a feature extractor for extracting a feature vector x from an input speech signal, and a recognizing section for defining continuous density Hidden Markov Models of predetermined categories k as transition network models each having parameters of transition probabilities p(k,i,j) that a state Si transits to a next state Sj and output probabilities g(k,s) that a feature vector x is output in transition from the state Si to one of the states Si and Sj, and recognizing the input signal on the basis of similarity between a sequence X of feature vectors extracted by the feature extractor and the continuous density HMMs. Particularly, the recognizing section includes a memory section for storing a set of orthogonal vectors .phi..sub.m (k,s) provided for the continuous density HMMs, and a modified CDHMM processor for obtaining each of the output probabilities g(k,s) for the continuous density HMMs in accordance with corresponding orthogonal vectors .phi..sub.m (k,s).
摘要翻译：识别系统包括用于从输入语音信号中提取特征向量x的特征提取器，以及用于定义预定类别k的连续密度隐马尔科夫模型的识别部分，其中每个转移网络模型具有转移概率p（k，i， j）状态Si转换到下一状态Sj，并且输出特征向量x从状态Si向状态Si和Sj中的一个转变而输出的概率g（k，s），并且识别输入信号由特征提取器提取的特征向量的序列X与连续密度HMM之间的相似度的基础。特别地，识别部分包括存储部分，用于存储为连续密度HMM提供的一组正交向量phi（k，s），以及修改的CDHMM处理器，用于获得针对所述连续密度HMM的每个输出概率g（k，s）连续密度HMM根据相应的正交向量phi（k，s）。

10. 发明授权

US4851654A IC card 失效
标题翻译： IC卡
公开(公告)号：US4851654A
公开(公告)日：1989-07-25
申请号：US199762
申请日：1988-05-27
申请人： Tsuneo Nitta
发明人： Tsuneo Nitta
IPC分类号： B42D15/10 , G06F3/16 , G06K7/00 , G06K19/07 , G06K19/077 , G07C9/00 , G07F7/10 , G10L15/28 , G11C7/00 , G11C7/16 , H04B7/26 , H04L29/12 , H04R1/02
CPC分类号： G07F7/1008 , G06K19/0723 , G06K19/07703 , G06K19/07749 , G06K7/0008 , G06Q20/346 , G07C9/00071 , G07C9/00111 , G11C7/00 , G11C7/16 , G11C2207/16
摘要： An IC card used for identifying an individual, has a key pad on the surface, a memory for storing a program and data, and a planar microphone and speaker. Speech corresponding to data or a command is converted into signals by a microphone. These signals can be supplied to an output terminal of the IC card under control of the IC card.Alternatively, the IC card can store or analyze the speech and generate a corresponding command or data.As the number of functions of the IC card is increased, the number of commands must also be increased. By inputting commands or data by voice instead of dedicated function keys, the number of necessary keys is reduced and convenience is enhanced.
摘要翻译：用于识别个人的IC卡，具有表面上的键盘，用于存储程序和数据的存储器，以及平面麦克风和扬声器。与数据或命令对应的语音通过麦克风转换为信号。这些信号可以在IC卡的控制下提供给IC卡的输出端子。或者，IC卡可以存储或分析语音并产生相应的命令或数据。随着IC卡的功能数量的增加，命令数也必须增加。通过用语音输入命令或数据而不是专用功能键，所需的键的数量减少，便于提高。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式