专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US06823309B1 Speech synthesizing system and method for modifying prosody based on match to database 有权
标题翻译：语音合成系统和基于匹配数据库修改韵律的方法
公开(公告)号：US06823309B1
公开(公告)日：2004-11-23
申请号：US09701183
申请日：2000-11-27
申请人： Yumiko Kato , Kenji Matsui , Takahiro Kamai , Katsuyoshi Yamagami
发明人： Yumiko Kato , Kenji Matsui , Takahiro Kamai , Katsuyoshi Yamagami
IPC分类号： G10L1308
CPC分类号： G10L13/10 , G10L13/04 , G10L13/08
摘要： A speech synthesis system for storing in advance a degree of modification of prosodic data in a prosodic data modifying rule apparatus, the degree of modification corresponding to an approximate cost and being stored as a modifying rule, a prosodic data retrieving section for retrieving a prosodic data stored corresponding to a key data for use in retrieval, the prosodic data retrieved according to a degree of matching between the input data and the key data, the degree of matching represented by the approximate cost, a modifying section for modifying the retrieved prosodic data based on the degree of matching and the modifying rule stored in the prosodic data modifying rule means, and an output section for outputting synthesized speech based on the input data and the modified prosodic data.
摘要翻译：一种语音合成系统，用于预先存储韵律数据修改规则装置中的韵律数据的修改程度，对应于近似成本的修改程度，并被存储为修改规则;韵律数据检索部分，用于检索韵律数据存储对应于用于检索的密钥数据，根据输入数据和密钥数据之间的匹配程度检索的韵律数据，由近似成本表示的匹配程度，用于修改所检索的韵律数据的修改部分关于存储在韵律数据修改规则装置中的匹配度和修改规则，以及用于基于输入数据和修改的韵律数据输出合成语音的输出部分。

2. 发明授权

US06424937B1 Fundamental frequency pattern generator, method and program 有权
标题翻译：基本频率模式发生器，方法和程序
公开(公告)号：US06424937B1
公开(公告)日：2002-07-23
申请号：US09201298
申请日：1998-11-30
申请人： Yumiko Kato , Kenji Matsui , Takahiro Kamai , Noriyo Hara
发明人： Yumiko Kato , Kenji Matsui , Takahiro Kamai , Noriyo Hara
IPC分类号： G10L1104
CPC分类号： G10L13/10
摘要： According to this fundamental frequency generating method, a fundamental frequency pattern is set from a data base of a fundamental frequency pattern of each accent phrase standardized by the phoneme time length or the time length of the vowel and the vowel corresponding portion, and when the corresponding fundamental frequency pattern is not stored in the data base, the fundamental frequency pattern is generated by interpolating the interval between points serving as the references of the fundamental frequency pattern. With this method, a fundamental frequency pattern having higher naturalness than with conventional methods can be generated.
摘要翻译：根据该基频生成方法，从由音素时间长度或元音和元音对应部分的时间长度标准化的每个重音的基本频率图案的数据库设定基频模式，并且当相应的基频不存储基频，通过内插作为基频的基准点的间隔产生基频模式。利用这种方法，可以产生具有比常规方法更高的自然度的基本频率图案。

3. 发明授权

US07735105B2 Broadcast receiving method 有权
标题翻译：广播接收方式
公开(公告)号：US07735105B2
公开(公告)日：2010-06-08
申请号：US10415176
申请日：2002-08-22
申请人： Yumiko Kato , Takahiro Kamai , Kenji Mizutani , Hideyuki Yoshida
发明人： Yumiko Kato , Takahiro Kamai , Kenji Mizutani , Hideyuki Yoshida
IPC分类号： G06F3/00
CPC分类号： H04N21/42203 , G06Q30/02 , G06Q99/00 , G10L15/26 , H04H20/28 , H04H60/37 , H04H60/48 , H04H60/63 , H04H60/64 , H04H2201/30 , H04H2201/37 , H04N7/17318 , H04N21/4307 , H04N21/4722 , H04N21/84 , H04N21/858 , H04N21/8586
摘要： To purchase an item which is viewed on a broadcast. The items may include furniture, jewelry, clothing and automobiles. The viewer is able to utter a phrase in reference to a viewed item that they wish to view or purchase. The system then searches the broadcast information and displays all relevant items. The viewer is then able to select and purchase a particular item.
摘要翻译：购买在广播中观看的项目。这些物品可能包括家具，首饰，服装和汽车。观众能够提及他们希望查看或购买的被查看项目的短语。然后系统搜索广播信息并显示所有相关项目。然后，观众能够选择并购买特定物品。

4. 发明授权

US07698138B2 Broadcast receiving method, broadcast receiving system, recording medium, and program 有权
标题翻译：广播接收方式，广播接收系统，记录媒体和节目
公开(公告)号：US07698138B2
公开(公告)日：2010-04-13
申请号：US10542409
申请日：2003-12-26
申请人： Yumiko Kato , Takahiro Kamai , Hideyuki Yoshida , Yoshifumi Hirose
发明人： Yumiko Kato , Takahiro Kamai , Hideyuki Yoshida , Yoshifumi Hirose
IPC分类号： G10L15/00 , G10L15/06 , G10L11/00 , G10L21/00
CPC分类号： H04N5/445 , G06Q30/02 , G10L15/26 , H04H60/37 , H04H60/48 , H04N7/088 , H04N21/42203 , H04N21/432 , H04N21/4332 , H04N21/4348 , H04N21/435 , H04N21/4394 , H04N21/44222 , H04N21/454 , H04N21/47214 , H04N21/4722 , H04N21/47815 , H04N21/812 , H04N21/8405
摘要： A broadcast receiving system includes a broadcast receiving part for receiving a broadcast in which additional information that corresponds to an object appearing in broadcast contents and that contains keyword information for specifying the object is broadcasted simultaneously with the broadcast contents; a recognition vocabulary generating section for generating a recognition vocabulary set in a manner corresponding to the additional information by using a synonym dictionary; a speech recognition section for performing the speech recognition of a voice uttered by a viewing person, and for thereby specifying keyword information corresponding to a recognition vocabulary set when a word recognized as the speech recognition result is contained in the recognition vocabulary set; and a displaying section for displaying additional information corresponding to the specified keyword information.
摘要翻译：广播接收系统包括广播接收部分，用于接收广播，其中广播内容同时广播广播内容中广播与广播内容中出现的对象相对应的并且包含用于指定对象的关键字信息的附加信息; 识别词汇生成部，其通过使用同义词词典来以与所述附加信息对应的方式生成识别词汇集; 用于执行由观看者发出的语音的语音识别的语音识别部分，并且由此在识别词汇集中包含被识别为语音识别结果的单词时，指定与识别词汇集相对应的关键字信息; 以及显示部分，用于显示与指定的关键字信息相对应的附加信息。

5. 发明授权

US07526430B2 Speech synthesis apparatus 有权
标题翻译：语音合成装置
公开(公告)号：US07526430B2
公开(公告)日：2009-04-28
申请号：US11226331
申请日：2005-09-15
申请人： Yumiko Kato , Takahiro Kamai
发明人： Yumiko Kato , Takahiro Kamai
IPC分类号： G10L13/06 , G10L21/00
CPC分类号： G10L13/10
摘要： A speech synthesis apparatus, which can embed unchangeable additional information into synthesized speech without causing a deterioration of speech quality and restriction by bands, includes a language processing unit which generates synthesized speech generation information necessary for generating synthesized speech in accordance with a language string, a prosody generating unit which generates prosody information of speech based on the synthesized speech generation information, and a waveform generating unit which synthesizes speech based on the prosody information, in which the prosody generating unit embed code information as watermark information in the prosody information of a segment having a predetermined time duration within a phoneme length including a phoneme boundary.
摘要翻译：一种可将不可变附加信息嵌入合成语音而不导致语音质量恶化和频带限制的语音合成装置，包括：语言处理单元，其生成根据语言串产生合成语音所需的合成语音产生信息; 韵律产生单元，其基于合成的语音产生信息生成语音的韵律信息;以及波形生成单元，其基于韵律信息合成语音，其中，韵律生成单元将代码信息作为水印信息嵌入在片段的韵律信息中在包括音素边界的音素长度内具有预定的持续时间。

6. 发明申请

US20070203702A1 Speech synthesizer, speech synthesizing method, and program 有权
标题翻译：语音合成器，语音合成方法和程序
公开(公告)号：US20070203702A1
公开(公告)日：2007-08-30
申请号：US11783855
申请日：2007-04-12
申请人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato , Natsuki Saito
发明人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato , Natsuki Saito
IPC分类号： G10L15/14
CPC分类号： G10L13/06 , G10L13/04
摘要： A speech synthesizer that provides high-quality sound along with stable sound quality, including: a target parameter generation unit; a speech element DB; an element selection unit; a mixed parameter judgment unit which determines an optimum parameter combination of target parameters and speech elements; a parameter integration unit which integrates the parameters; and a waveform generation unit which generates synthetic speech. High-quality and stable synthetic speech is generated by combining, per parameter dimension, the parameters with stable sound quality generated by the target parameter generation unit with speech elements with high sound quality and a sense of true speech selected by the element selection unit.
摘要翻译：一种提供高质量声音以及稳定音质的语音合成器，包括：目标参数产生单元; 语音元件DB; 元件选择单元; 混合参数判断单元，其确定目标参数和语音元素的最佳参数组合; 参数集成单元，其集成参数; 以及生成合成语音的波形生成单元。通过将参数尺寸与由目标参数生成单元生成的稳定声音的参数与具有高音质的语音元素和由元素选择单元选择的真实语音感觉相结合，产生高质量和稳定的合成语音。

7. 发明申请

US20090281807A1 VOICE QUALITY CONVERSION DEVICE AND VOICE QUALITY CONVERSION METHOD 有权
标题翻译：语音质量转换设备和语音质量转换方法
公开(公告)号：US20090281807A1
公开(公告)日：2009-11-12
申请号：US12307021
申请日：2008-05-08
申请人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato
发明人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato
IPC分类号： G10L13/06 , G10L15/04 , G10L13/08
CPC分类号： G10L21/00 , G10L13/00 , G10L13/043 , G10L21/003 , G10L2015/025 , G10L2021/0135
摘要： A voice quality conversion device converts voice quality of an input speech using information of the speech. The device includes: a target vowel vocal tract information hold unit (101) holding target vowel vocal tract information of each vowel indicating target voice quality; a vowel conversion unit (103) receiving vocal tract information with phoneme boundary information of the speech including information of phonemes and phoneme durations, (ii) approximating a temporal change of vocal tract information of a vowel in the vocal tract information with phoneme boundary information applying a first function, (iii) approximating a temporal change of vocal tract information of the same vowel held in the target vowel vocal tract information hold unit (101) applying a second function, (iv) calculating a third function by combining the first function with the second function, and (v) converting the vocal tract information of the vowel applying the third function; and a synthesis unit (103) synthesizing a speech using the converted information (102).
摘要翻译：语音质量转换装置使用语音信息来转换输入语音的语音质量。该装置包括：目标元音声道信息保持单元，保持每个元音的目标元音声道信息，指示目标语音质量; 元音转换单元（103），其接收具有包括音素和音素持续时间的信息的语音的音素边界信息的声道信息，（ii）使用音素边界信息应用于声带信息中的元音的声道信息的时间变化近似第一功能，（iii）近似保持在应用第二功能的目标元音声道信息保持单元（101）中保持的同一母音的声道信息的时间变化，（iv）通过将第一功能与第二功能，（v）转换应用第三功能的元音的声道信息; 以及使用所转换的信息（102）合成语音的合成单元（103）。

8. 发明授权

US07454343B2 Speech synthesizer, speech synthesizing method, and program 有权
标题翻译：语音合成器，语音合成方法和程序
公开(公告)号：US07454343B2
公开(公告)日：2008-11-18
申请号：US11783855
申请日：2007-04-12
申请人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato , Natsuki Saito
发明人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato , Natsuki Saito
IPC分类号： G10L15/14
CPC分类号： G10L13/06 , G10L13/04
摘要： A speech synthesizer that provides high-quality sound along with stable sound quality, including: a target parameter generation unit; a speech element DB; an element selection unit; a mixed parameter judgment unit which determines an optimum parameter combination of target parameters and speech elements; a parameter integration unit which integrates the parameters; and a waveform generation unit which generates synthetic speech. High-quality and stable synthetic speech is generated by combining, per parameter dimension, the parameters with stable sound quality generated by the target parameter generation unit with speech elements with high sound quality and a sense of true speech selected by the element selection unit.
摘要翻译：一种提供高质量声音以及稳定音质的语音合成器，包括：目标参数产生单元; 语音元件DB; 元素选择单元; 混合参数判断单元，其确定目标参数和语音元素的最佳参数组合; 参数集成单元，其集成参数; 以及生成合成语音的波形生成单元。通过将参数尺寸与由目标参数生成单元生成的稳定声音的参数与具有高音质的语音元素和由元素选择单元选择的真实语音感觉相结合，产生高质量和稳定的合成语音。

9. 发明申请

US20060259299A1 Broadcast reception method, broadcast reception systm, recording medium and program (as amended) 有权
标题翻译：广播接收方式，广播接收系统，录音媒体和节目（经修改）
公开(公告)号：US20060259299A1
公开(公告)日：2006-11-16
申请号：US10542409
申请日：2003-12-26
申请人： Yumiko Kato , Takahiro Kamai , Hideyuki Yoshida , Yoshifumi Hirose
发明人： Yumiko Kato , Takahiro Kamai , Hideyuki Yoshida , Yoshifumi Hirose
IPC分类号： G10L15/00
CPC分类号： H04N5/445 , G06Q30/02 , G10L15/26 , H04H60/37 , H04H60/48 , H04N7/088 , H04N21/42203 , H04N21/432 , H04N21/4332 , H04N21/4348 , H04N21/435 , H04N21/4394 , H04N21/44222 , H04N21/454 , H04N21/47214 , H04N21/4722 , H04N21/47815 , H04N21/812 , H04N21/8405
摘要： A broadcast receiving system includes a broadcast receiving part for receiving a broadcast in which additional information that corresponds to an object appearing in broadcast contents and that contains keyword information for specifying the object is broadcasted simultaneously with the broadcast contents; a recognition vocabulary generating section for generating a recognition vocabulary set in a manner corresponding to the additional information by using a synonym dictionary; a speech recognition section for performing the speech recognition of a voice uttered by a viewing person, and for thereby specifying keyword information corresponding to a recognition vocabulary set when a word recognized as the speech recognition result is contained in the recognition vocabulary set; and a displaying section for displaying additional information corresponding to the specified keyword information.
摘要翻译：广播接收系统包括广播接收部分，用于接收广播，其中广播内容同时广播广播内容中广播与广播内容中出现的对象相对应的并且包含用于指定对象的关键字信息的附加信息; 识别词汇生成部，其通过使用同义词词典来以与所述附加信息对应的方式生成识别词汇集; 用于执行由观看者发出的语音的语音识别的语音识别部分，并且由此在识别词汇集中包含被识别为语音识别结果的单词时，指定与识别词汇集相对应的关键字信息; 以及显示部分，用于显示与指定的关键字信息相对应的附加信息。

10. 发明授权

US08311831B2 Voice emphasizing device and voice emphasizing method 有权
标题翻译：语音强调设备和语音强调方法
公开(公告)号：US08311831B2
公开(公告)日：2012-11-13
申请号：US12447775
申请日：2008-09-29
申请人： Yumiko Kato , Takahiro Kamai , Masakatsu Hoshimi
发明人： Yumiko Kato , Takahiro Kamai , Masakatsu Hoshimi
IPC分类号： G10L13/06
CPC分类号： G10L21/02 , G10L21/0232 , G10L25/87
摘要： A voice emphasizing device emphasizes in a speech a “strained rough voice” at a position where a speaker or user of the speech intends to generate emphasis or musical expression. Thereby, the voice emphasizing device can provide the position with emphasis of anger, excitement, tension, or an animated way of speaking, or musical expression of Enka (Japanese ballad), blues, rock, or the like. As a result, rich vocal expression can be achieved. The voice emphasizing device includes: an emphasis utterance section detection unit (12) detecting, from an input speech waveform, an emphasis section that is a time duration having a waveform intended by the speaker or user to be converted; and a voice emphasizing unit (13) increasing fluctuation of an amplitude envelope of the waveform in the detected emphasis section.
摘要翻译：语音强调装置在讲话中强调了一个紧张的粗糙声音，其中讲话者或言语用户意图产生强调或音乐表达。因此，声音强调装置可以以Enka（日本民谣），蓝调，摇滚等的强调愤怒，兴奋，紧张或动画的演绎方式或音乐表现为出发点。因此，可以实现丰富的声乐表达。语音强调装置包括：强调话音部分检测单元，从输入语音波形检测作为要被转换的扬声器或用户想要的波形的持续时间的强调部分; 以及语音强调单元（13），增加检测到的强调部分中波形的振幅包络的波动。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式