会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 35. 发明授权
    • Speech recognition method, program and apparatus using multiple acoustic models
    • 语音识别方法,程序和设备使用多种声学模型
    • US07065487B2
    • 2006-06-20
    • US09981996
    • 2001-10-19
    • Yasunaga Miyazawa
    • Yasunaga Miyazawa
    • G10L15/20G10L15/06G10L15/28
    • G10L15/20G10L21/0208
    • The present invention provides a speech recognition method for achieving a high recognition rate even under an environment where plural types of noise exist. Noise is eliminated by the spectral subtraction noise elimination method from each of speech data on which different types of noise are superposed, and acoustic models corresponding to each of the noise types are created based on the feature vectors obtained by analyzing the features of each of the speech data which have undergone the noise elimination. When a speech recognition is performed, a first speech feature analysis is performed on speech data to be recognized, and it is determined whether the speech data is a noise segment or a speech segment. When a noise segment is detected, the feature data thereof is stored, and when a speech segment is detected, the type of the noise is determined based on the feature data which has been stored, and a corresponding acoustic model is selected based on the result thereof. The noise is eliminated by the spectral subtraction noise elimination method from the speech data to be recognized, and a second feature analysis is performed on the speech data which has undergone the noise elimination to obtain a feature vector to be used in speech recognition.
    • 本发明提供即使在存在多种噪声的环境下实现高识别率的语音识别方法。 根据不同类型的噪声叠加的每个语音数据的频谱减法噪声消除方法消除噪声,并且基于通过分析每个噪声类型的特征获得的特征向量来创建与每个噪声类型对应的声学模型 已消除噪声的语音数据。 当执行语音识别时,对要被识别的语音数据执行第一语音特征分析,并且确定语音数据是噪声段还是语音段。 当检测到噪声段时,其特征数据被存储,并且当检测到语音段时,基于已经存储的特征数据确定噪声的类型,并且基于结果选择相应的声学模型 其中。 通过来自要识别的语音数据的频谱减法噪声消除方法消除噪声,并对经过噪声消除的语音数据执行第二特征分析,以获得要在语音识别中使用的特征向量。
    • 36. 发明申请
    • Rear projection type multi-projection display
    • 背投式多投影显示
    • US20050146644A1
    • 2005-07-07
    • US10975099
    • 2004-10-28
    • Yasunaga MiyazawaHiroshi Hasegawa
    • Yasunaga MiyazawaHiroshi Hasegawa
    • H04N3/22H04N5/74H04N9/31
    • H04N5/74H04N9/3147
    • Exemplary embodiments of the invention provide a circuit that includes a plurality of projector units to modulate and project light from a light source based on image information, a transmissive screen to which projection images from the plurality of projector units are projected, an image-capturing device disposed in a housing to capture predetermined regions of the projection images projected onto the transmissive screen, a unit image information generating unit to generate image information to be inputted to each of the plurality of projector units, and a unit image information correcting unit to correct the unit image information based on a result captured by the image-capturing device. Therefore, it is possible to perform easily the adjustment process and to further reduce the adjustment time.
    • 本发明的示例性实施例提供一种电路,其包括多个投影仪单元,用于基于图像信息调制和投射来自光源的光,透射屏幕,来自多个投影仪单元的投影图像投影到该透射屏幕上;图像捕获装置 设置在壳体中以捕捉投影到透射屏幕上的投影图像的预定区域;单位图像信息生成单元,生成要输入到多个投影仪单元中的每一个的图像信息;以及单位图像信息校正单元, 基于由图像捕获装置捕获的结果的单位图像信息。 因此,可以容易地执行调整处理并进一步减少调整时间。
    • 37. 发明授权
    • Continuous speech recognition method and program medium with alternative choice selection to confirm individual words
    • 连续语音识别方法和程序介质,具有可选择选择,以确认单个单词
    • US06564185B1
    • 2003-05-13
    • US09370982
    • 1999-08-10
    • Yasunaga MiyazawaMitsuhiro InazumiHiroshi HasegawaMasahisa Ikejiri
    • Yasunaga MiyazawaMitsuhiro InazumiHiroshi HasegawaMasahisa Ikejiri
    • G10L1522
    • G10L15/22
    • The invention relates to a method and apparatus for recognition processing of continuous words of a group which is structured by a plurality of words such that a recognition result of all of the words which structures the continuous words is effectively and accurately confirmed. All of the continuous words which have been input are recognition processed, the recognition result of all of the continuous words is output, a response from a speaker showing an affirmative/negative recognition result is input and recognition processed. If affirmative is determined, the recognition result at that time is confirmed for all of the continuous words. If negative is determined, for each word from a first to an nth (third in this case) which structures continuous words, the content showing affirmative/negative from the speaker is recognized, affirmative or negative is determined, and the recognition result at that time is confirmed as a recognition processing target word.
    • 本发明涉及一种用于识别处理由多个单词构成的组的连续单词的方法和装置,使得能够有效和准确地确认构成连续单词的所有单词的识别结果。 所输入的所有连续词都是识别处理的,所有连续词的识别结果都被输出,表示肯定/否定识别结果的说话者的响应被输入并进行识别处理。 如果确定肯定,则确认所有连续词的识别结果。 如果确定为否定,对于构成连续词的第一至第n(在这种情况下为第三)的每个单词,确定从说话者显示肯定/否定的内容,确定肯定或否定,并且当时的识别结果 被确认为识别处理对象字。
    • 38. 发明授权
    • Speech recognition method, speech recognition device, and recording medium on which is recorded a speech recognition processing program
    • 语音识别方法,语音识别装置和其上记录有语音识别处理程序的记录介质
    • US06446039B1
    • 2002-09-03
    • US09378997
    • 1999-08-23
    • Yasunaga MiyazawaMitsuhiro InazumiHiroshi HasegawaMasahisa Ikejiri
    • Yasunaga MiyazawaMitsuhiro InazumiHiroshi HasegawaMasahisa Ikejiri
    • G10L1528
    • G10L15/285G10L15/06
    • This invention concerns obtaining high recognition capability while there is a large limitation on memory capacity and processing ability of a CPU. When several words are selected as registration words among a plurality of recognizable words, a recognition target speaker speaks the respective registration words, registration word data for the respective registration words from the sound data is created and saved in a RAM. When the recognition target speaker speaks a registration word, sound is recognized using the registration word data, and when recognizable words other than the registration words are recognized, sound is recognized using specific speaker group sound model data. Furthermore, speaker learning processing is performed using the registration word data and the specific speaker group sound model data, and when recognizable words other than the registration words are recognized, sound is recognized using post-speaker learning data for speaker adaptation.
    • 本发明涉及在CPU的存储容量和处理能力存在很大限制的情况下获得高识别能力。 当在多个可识别字中选择多个字作为注册字时,识别目标说话者说出各自的注册字,从声音数据中创建并保存用于各声部数据的登记字数据。 当识别目标扬声器使用注册字时,使用注册字数据识别声音,并且当识别出除注册字之外的可识别字时,使用特定扬声器组声音模型数据识别声音。 此外,使用注册字数据和特定扬声器组声音模型数据进行说话者学习处理,并且当识别出除注册字之外的可识别词时,使用用于说话者适配的后讲话者学习数据来识别声音。
    • 39. 发明授权
    • Voice model learning data creation method and its apparatus
    • 语音模型学习数据创建方法及其设备
    • US06349281B1
    • 2002-02-19
    • US09010799
    • 1998-01-22
    • Yasunaga MiyazawaHiroshi HasegawaMitsuhiro InazumiTadashi Aizawa
    • Yasunaga MiyazawaHiroshi HasegawaMitsuhiro InazumiTadashi Aizawa
    • G10L1506
    • G10L15/07
    • A voice model learning data creation method and apparatus makes possible the creation of an inexpensive voice model in a short period of time when creating a voice model for a new word not in a preexisting database. Verbal data from several persons is selected from among the verbal data held in the database. This selected verbal data is referred to as standard speaker data, and is stored in a standard speaker data storage component. The remaining verbal data in the preexisting database is designated as learning speaker data, as is stored in a learning speaker data storage component. A data conversion function from the standard speaker data space to the learning speaker data space is derived. Then, the learning data for the new word is created by the data conversion function. Thus, the data which is obtained from the standard speaker speaking the new word is converted to the learning speaker data space.
    • 语音模型学习数据创建方法和装置使得在为不在预先存在的数据库中的新单词创建语音模型的短时间内创建便宜的语音模型成为可能。 从数据库中保存的语言数据中选出来自多个人的语言数据。 该选择的语言数据被称为标准扬声器数据,并存储在标准扬声器数据存储部件中。 预先存在的数据库中的剩余语言数据被指定为学习扬声器数据,如存储在学习扬声器数据存储部件中那样。 导出从标准扬声器数据空间到学习扬声器数据空间的数据转换功能。 然后,通过数据转换功能创建新单词的学习数据。 因此,从标准说话者说出的新单词获得的数据被转换为学习扬声器数据空间。
    • 40. 发明授权
    • Cartridge-based, interactive speech recognition method with a response
creation capability
    • 基于墨盒的交互式语音识别方法,具有响应创造能力
    • US5946658A
    • 1999-08-31
    • US165512
    • 1998-10-02
    • Yasunaga MiyazawaMitsuhiro InazumiHiroshi HasegawaIsao EdatsuneOsamu Urano
    • Yasunaga MiyazawaMitsuhiro InazumiHiroshi HasegawaIsao EdatsuneOsamu Urano
    • G10L15/00G10L15/06G10L15/26G10L9/06G10L5/02
    • G10L15/26G10L2015/0638G10L2015/088
    • A technique for improving speech recognition in low-cost, speech interactive devices. This technique calls for selectively implementing a speaker-specific word enrollment and detection unit in parallel with a word detection unit to permit comprehension of spoken commands or messages when no recognizable words are found. Preferably, specific speaker detection will be based on the speaker's own personal list of words or expression. Other facets include complementing non-specific pre-registered word characteristic information with individual, speaker-specific verbal characteristics to improve recognition in cases where the speaker has unusual speech mannerisms or accent and response alteration in which speaker-specification registration functions are leveraged to provide access and permit changes to a predefined responses table according to user needs and tastes. Also disclosed is the externalization and modularization of non-specific speaker recognition, action and response information to enhance adaptability of the speech recognizer without sacrificing product cost competitiveness or overall device responsiveness.
    • 一种用于在低成本语音交互设备中改善语音识别的技术。 该技术要求与字检测单元并行地选择性地实现与扬声器特定的单词注册和检测单元,以便在找不到可识别的单词时允许理解口语命令或消息。 优选地,具体的说话者检测将基于说话者自己的单词或表达的个人列表。 其他方面包括补充非特定的预先登记的单词特征信息,具有单独的具有说话者的语言特征,以在讲话者具有不寻常的语音方式或重音和响应改变的情况下改善识别,其中利用说话者说明书注册功能来提供访问 并允许根据用户需求和口味对预定义的响应表进行更改。 还公开了非特定说话人识别,动作和响应信息的外部化和模块化,以增强语音识别器的适应性,而不牺牲产品成本竞争力或整体设备响应性。