专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明申请

US20090281807A1 VOICE QUALITY CONVERSION DEVICE AND VOICE QUALITY CONVERSION METHOD 有权
标题翻译：语音质量转换设备和语音质量转换方法
公开(公告)号：US20090281807A1
公开(公告)日：2009-11-12
申请号：US12307021
申请日：2008-05-08
申请人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato
发明人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato
IPC分类号： G10L13/06 , G10L15/04 , G10L13/08
CPC分类号： G10L21/00 , G10L13/00 , G10L13/043 , G10L21/003 , G10L2015/025 , G10L2021/0135
摘要： A voice quality conversion device converts voice quality of an input speech using information of the speech. The device includes: a target vowel vocal tract information hold unit (101) holding target vowel vocal tract information of each vowel indicating target voice quality; a vowel conversion unit (103) receiving vocal tract information with phoneme boundary information of the speech including information of phonemes and phoneme durations, (ii) approximating a temporal change of vocal tract information of a vowel in the vocal tract information with phoneme boundary information applying a first function, (iii) approximating a temporal change of vocal tract information of the same vowel held in the target vowel vocal tract information hold unit (101) applying a second function, (iv) calculating a third function by combining the first function with the second function, and (v) converting the vocal tract information of the vowel applying the third function; and a synthesis unit (103) synthesizing a speech using the converted information (102).
摘要翻译：语音质量转换装置使用语音信息来转换输入语音的语音质量。该装置包括：目标元音声道信息保持单元，保持每个元音的目标元音声道信息，指示目标语音质量; 元音转换单元（103），其接收具有包括音素和音素持续时间的信息的语音的音素边界信息的声道信息，（ii）使用音素边界信息应用于声带信息中的元音的声道信息的时间变化近似第一功能，（iii）近似保持在应用第二功能的目标元音声道信息保持单元（101）中保持的同一母音的声道信息的时间变化，（iv）通过将第一功能与第二功能，（v）转换应用第三功能的元音的声道信息; 以及使用所转换的信息（102）合成语音的合成单元（103）。

2. 发明授权

US07454343B2 Speech synthesizer, speech synthesizing method, and program 有权
标题翻译：语音合成器，语音合成方法和程序
公开(公告)号：US07454343B2
公开(公告)日：2008-11-18
申请号：US11783855
申请日：2007-04-12
申请人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato , Natsuki Saito
发明人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato , Natsuki Saito
IPC分类号： G10L15/14
CPC分类号： G10L13/06 , G10L13/04
摘要： A speech synthesizer that provides high-quality sound along with stable sound quality, including: a target parameter generation unit; a speech element DB; an element selection unit; a mixed parameter judgment unit which determines an optimum parameter combination of target parameters and speech elements; a parameter integration unit which integrates the parameters; and a waveform generation unit which generates synthetic speech. High-quality and stable synthetic speech is generated by combining, per parameter dimension, the parameters with stable sound quality generated by the target parameter generation unit with speech elements with high sound quality and a sense of true speech selected by the element selection unit.
摘要翻译：一种提供高质量声音以及稳定音质的语音合成器，包括：目标参数产生单元; 语音元件DB; 元素选择单元; 混合参数判断单元，其确定目标参数和语音元素的最佳参数组合; 参数集成单元，其集成参数; 以及生成合成语音的波形生成单元。通过将参数尺寸与由目标参数生成单元生成的稳定声音的参数与具有高音质的语音元素和由元素选择单元选择的真实语音感觉相结合，产生高质量和稳定的合成语音。

3. 发明申请

US20060259299A1 Broadcast reception method, broadcast reception systm, recording medium and program (as amended) 有权
标题翻译：广播接收方式，广播接收系统，录音媒体和节目（经修改）
公开(公告)号：US20060259299A1
公开(公告)日：2006-11-16
申请号：US10542409
申请日：2003-12-26
申请人： Yumiko Kato , Takahiro Kamai , Hideyuki Yoshida , Yoshifumi Hirose
发明人： Yumiko Kato , Takahiro Kamai , Hideyuki Yoshida , Yoshifumi Hirose
IPC分类号： G10L15/00
CPC分类号： H04N5/445 , G06Q30/02 , G10L15/26 , H04H60/37 , H04H60/48 , H04N7/088 , H04N21/42203 , H04N21/432 , H04N21/4332 , H04N21/4348 , H04N21/435 , H04N21/4394 , H04N21/44222 , H04N21/454 , H04N21/47214 , H04N21/4722 , H04N21/47815 , H04N21/812 , H04N21/8405
摘要： A broadcast receiving system includes a broadcast receiving part for receiving a broadcast in which additional information that corresponds to an object appearing in broadcast contents and that contains keyword information for specifying the object is broadcasted simultaneously with the broadcast contents; a recognition vocabulary generating section for generating a recognition vocabulary set in a manner corresponding to the additional information by using a synonym dictionary; a speech recognition section for performing the speech recognition of a voice uttered by a viewing person, and for thereby specifying keyword information corresponding to a recognition vocabulary set when a word recognized as the speech recognition result is contained in the recognition vocabulary set; and a displaying section for displaying additional information corresponding to the specified keyword information.
摘要翻译：广播接收系统包括广播接收部分，用于接收广播，其中广播内容同时广播广播内容中广播与广播内容中出现的对象相对应的并且包含用于指定对象的关键字信息的附加信息; 识别词汇生成部，其通过使用同义词词典来以与所述附加信息对应的方式生成识别词汇集; 用于执行由观看者发出的语音的语音识别的语音识别部分，并且由此在识别词汇集中包含被识别为语音识别结果的单词时，指定与识别词汇集相对应的关键字信息; 以及显示部分，用于显示与指定的关键字信息相对应的附加信息。

4. 发明授权

US08204747B2 Emotion recognition apparatus 有权
标题翻译：情感识别装置
公开(公告)号：US08204747B2
公开(公告)日：2012-06-19
申请号：US11997458
申请日：2007-05-21
申请人： Yumiko Kato , Takahiro Kamai , Yoshihisa Nakatoh , Yoshifumi Hirose
发明人： Yumiko Kato , Takahiro Kamai , Yoshihisa Nakatoh , Yoshifumi Hirose
IPC分类号： G10L15/04
CPC分类号： G10L17/26 , G10L2015/025
摘要： An emotion recognition apparatus performs accurate and stable speech-based emotion recognition, irrespective of individual, regional, and language differences of prosodic information. The emotion recognition apparatus includes: a speech recognition unit which recognizes types of phonemes included in the input speech; a characteristic tone detection unit which detects a characteristic tone that relates to a specific emotion, in the input speech; a characteristic tone occurrence indicator computation unit which computes a characteristic tone occurrence indicator for each of the phonemes, based on the types of the phonemes recognized by the speech recognition unit, the characteristic tone occurrence indicator relating to an occurrence frequency of the characteristic tone; and an emotion judgment unit which judges an emotion of the speaker in a phoneme at which the characteristic tone occurs in the input speech, based on the characteristic tone occurrence indicator computed by the characteristic tone occurrence indicator computing unit.
摘要翻译：情感识别装置执行准确和稳定的基于语音的情感识别，而不管韵律信息的个体，区域和语言差异。情感识别装置包括：语音识别单元，其识别输入语音中包括的音素的类型; 在输入语音中检测与特定情感相关的特征音的特征音检测单元; 特征音发生指示符计算单元，其基于由语音识别单元识别的音素的类型来计算每个音素的特征音发生指示符，与特征音的发生频率相关的特征音发生指示符; 以及情绪判断单元，其基于由特征音发生指示符计算单元计算的特征音发生指标，判断在输入语音中出现特征音的音素中的说话者的情感。

5. 发明申请

US20090254349A1 SPEECH SYNTHESIZER 审中-公开
标题翻译：语音合成器
公开(公告)号：US20090254349A1
公开(公告)日：2009-10-08
申请号：US12303455
申请日：2007-05-11
申请人： Yoshifumi Hirose , Yumiko Kato , Takahiro Kamai
发明人： Yoshifumi Hirose , Yumiko Kato , Takahiro Kamai
IPC分类号： G10L13/06 , G10L13/08 , G06F17/30
CPC分类号： G10L13/033 , G10L13/04
摘要： A speech synthesizer can execute speech content editing at high speed and generate speech content easily. The speech synthesizer includes a small speech element DB (101), a small speech element selection unit (102), a small speech element concatenation unit (103), a prosody modification unit (104), a large speech element DB (105), a correspondence DB (106) that associates the small speech element DB (101) with the large speech element DB (105), a speech element candidate obtainment unit (107), a large speech element selection unit (108), and a large speech element concatenation unit (109). By editing synthetic speech using the small speech element DB (101) and performing quality enhancement on an editing result using the large speech element DB (105), speech content can be generated easily on a mobile terminal.
摘要翻译：语音合成器可以高速执行语音内容编辑并且容易地产生语音内容。语音合成器包括小语音元素DB（101），小语音元素选择单元（102），小语音元素连接单元（103），韵律修改单元（104），大语音元素DB（105）将小语音元件DB（101）与大语音元件DB（105）相关联的对应DB（106），语音元素候补获取单元（107），大语音元素选择单元（108）和大语音元素级联单元（109）。通过使用小语音元素DB（101）编辑合成语音并且使用大语音元素DB（105）对编辑结果进行质量增强，可以在移动终端上容易地生成语音内容。

6. 发明申请

US20070094029A1 Speech synthesis method and information providing apparatus 审中-公开
标题翻译：语音合成方法和信息提供装置
公开(公告)号：US20070094029A1
公开(公告)日：2007-04-26
申请号：US11434153
申请日：2006-05-16
申请人： Natsuki Saito , Takahiro Kamai , Yumiko Kato , Yoshifumi Hirose
发明人： Natsuki Saito , Takahiro Kamai , Yumiko Kato , Yoshifumi Hirose
IPC分类号： G10L13/08
CPC分类号： G10L13/033
摘要： To provide a speech synthesis method of reading out units of synthesized speech without fail and in an easy to understand manner, even when playback of the units of synthesized speech are simultaneously requested. The duration prediction unit predicts the playback duration of synthesized speech to be synthesized based on text. The time constraint satisfaction judgment unit judges whether a constraint condition concerning the playback timing of the synthesized speech is satisfied or not, based on the predicted playback duration. If it judged that the constraint condition is not satisfied, the content modification unit shifts the playback starting timing of the synthesized speech of the text forward or backward, and modifies the contents of the text indicating time and distance in accordance with the shifted time. The synthesized speech generation unit generates synthesized speech based on the text having the modified contents and plays it back.
摘要翻译：即使在同时请求合成语音单元的回放的同时，提供一种无故障地以容易理解的方式读出合成语音单元的语音合成方法。持续时间预测单元基于文本预测要合成的合成语音的播放持续时间。时间约束满足判断单元基于预测的播放持续时间判断与合成语音的重放定时有关的约束条件是否满足。如果判定不满足约束条件，则内容修改单元向前或向后移动文本的合成语音的重放开始定时，并根据移动的时间修改指示时间和距离的文本的内容。合成语音生成单元基于具有修改内容的文本生成合成语音，并且回放。

7. 发明授权

US07698138B2 Broadcast receiving method, broadcast receiving system, recording medium, and program 有权
标题翻译：广播接收方式，广播接收系统，记录媒体和节目
公开(公告)号：US07698138B2
公开(公告)日：2010-04-13
申请号：US10542409
申请日：2003-12-26
申请人： Yumiko Kato , Takahiro Kamai , Hideyuki Yoshida , Yoshifumi Hirose
发明人： Yumiko Kato , Takahiro Kamai , Hideyuki Yoshida , Yoshifumi Hirose
IPC分类号： G10L15/00 , G10L15/06 , G10L11/00 , G10L21/00
CPC分类号： H04N5/445 , G06Q30/02 , G10L15/26 , H04H60/37 , H04H60/48 , H04N7/088 , H04N21/42203 , H04N21/432 , H04N21/4332 , H04N21/4348 , H04N21/435 , H04N21/4394 , H04N21/44222 , H04N21/454 , H04N21/47214 , H04N21/4722 , H04N21/47815 , H04N21/812 , H04N21/8405
摘要： A broadcast receiving system includes a broadcast receiving part for receiving a broadcast in which additional information that corresponds to an object appearing in broadcast contents and that contains keyword information for specifying the object is broadcasted simultaneously with the broadcast contents; a recognition vocabulary generating section for generating a recognition vocabulary set in a manner corresponding to the additional information by using a synonym dictionary; a speech recognition section for performing the speech recognition of a voice uttered by a viewing person, and for thereby specifying keyword information corresponding to a recognition vocabulary set when a word recognized as the speech recognition result is contained in the recognition vocabulary set; and a displaying section for displaying additional information corresponding to the specified keyword information.
摘要翻译：广播接收系统包括广播接收部分，用于接收广播，其中广播内容同时广播广播内容中广播与广播内容中出现的对象相对应的并且包含用于指定对象的关键字信息的附加信息; 识别词汇生成部，其通过使用同义词词典来以与所述附加信息对应的方式生成识别词汇集; 用于执行由观看者发出的语音的语音识别的语音识别部分，并且由此在识别词汇集中包含被识别为语音识别结果的单词时，指定与识别词汇集相对应的关键字信息; 以及显示部分，用于显示与指定的关键字信息相对应的附加信息。

8. 发明申请

US20070203702A1 Speech synthesizer, speech synthesizing method, and program 有权
标题翻译：语音合成器，语音合成方法和程序
公开(公告)号：US20070203702A1
公开(公告)日：2007-08-30
申请号：US11783855
申请日：2007-04-12
申请人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato , Natsuki Saito
发明人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato , Natsuki Saito
IPC分类号： G10L15/14
CPC分类号： G10L13/06 , G10L13/04
摘要： A speech synthesizer that provides high-quality sound along with stable sound quality, including: a target parameter generation unit; a speech element DB; an element selection unit; a mixed parameter judgment unit which determines an optimum parameter combination of target parameters and speech elements; a parameter integration unit which integrates the parameters; and a waveform generation unit which generates synthetic speech. High-quality and stable synthetic speech is generated by combining, per parameter dimension, the parameters with stable sound quality generated by the target parameter generation unit with speech elements with high sound quality and a sense of true speech selected by the element selection unit.
摘要翻译：一种提供高质量声音以及稳定音质的语音合成器，包括：目标参数产生单元; 语音元件DB; 元件选择单元; 混合参数判断单元，其确定目标参数和语音元素的最佳参数组合; 参数集成单元，其集成参数; 以及生成合成语音的波形生成单元。通过将参数尺寸与由目标参数生成单元生成的稳定声音的参数与具有高音质的语音元素和由元素选择单元选择的真实语音感觉相结合，产生高质量和稳定的合成语音。

9. 发明授权

US08898055B2 Voice quality conversion device and voice quality conversion method for converting voice quality of an input speech using target vocal tract information and received vocal tract information corresponding to the input speech 有权
标题翻译：语音质量转换装置和语音质量转换方法，用于使用对应于输入语音的目标声道信息和接收到的声道信息来转换输入语音的语音质量
公开(公告)号：US08898055B2
公开(公告)日：2014-11-25
申请号：US12307021
申请日：2008-05-08
申请人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato
发明人： Yoshifumi Hirose , Takahiro Kamai , Yumiko Kato
IPC分类号： G10L19/00 , G10L19/02 , G10L21/00 , G10L21/02 , G10L13/04 , G10L13/00 , G10L21/013
CPC分类号： G10L21/00 , G10L13/00 , G10L13/043 , G10L21/003 , G10L2015/025 , G10L2021/0135
摘要： A voice quality conversion device including: a target vowel vocal tract information hold unit holding target vowel vocal tract information of each vowel indicating target voice quality; a vowel conversion unit (i) receiving vocal tract information with phoneme boundary information of the speech including information of phonemes and phoneme durations, (ii) approximating a temporal change of vocal tract information of a vowel in the vocal tract information with phoneme boundary information applying a first function, (iii) approximating a temporal change of vocal tract information of the same vowel held in the target vowel vocal tract information hold unit applying a second function, (iv) calculating a third function by combining the first function with the second function, and (v) converting the vocal tract information of the vowel applying the third function; and a synthesis unit synthesizing a speech using the converted information.
摘要翻译：一种语音质量转换装置，包括：目标元音声道信息保持单元，保持表示目标语音质量的每个元音的目标元音声道信息; 元音转换单元（i）接收具有包括音素和音素持续时间信息的语音的音素边界信息的声道信息，（ii）使用音素边界信息应用于声带信息中的元音的声道信息的时间变化近似第一功能，（iii）近似在应用第二功能的目标元音声道信息保持单元中保持的同一元音的声道信息的时间变化，（iv）通过将第一功能与第二功能组合来计算第三功能（v）转换使用第三功能的元音的声道信息; 以及使用所转换的信息合成语音的合成单元。

10. 发明申请

US20090313019A1 EMOTION RECOGNITION APPARATUS 有权
标题翻译：感应识别装置
公开(公告)号：US20090313019A1
公开(公告)日：2009-12-17
申请号：US11997458
申请日：2007-05-21
申请人： Yumiko Kato , Takahiro Kamai , Yoshihisa Nakatoh , Yoshifumi Hirose
发明人： Yumiko Kato , Takahiro Kamai , Yoshihisa Nakatoh , Yoshifumi Hirose
IPC分类号： G10L15/04
CPC分类号： G10L17/26 , G10L2015/025
摘要： An emotion recognition apparatus is capable of performing accurate and stable speech-based emotion recognition, irrespective of individual, regional, and language differences of prosodic information. The emotion recognition apparatus is an apparatus for recognizing an emotion of a speaker from an input speech, and includes: a speech recognition unit (106) which recognizes types of phonemes included in the input speech; a characteristic tone detection unit (104) which detects a characteristic tone that relates to a specific emotion, in the input speech; a characteristic tone occurrence indicator computation unit (111) which computes a characteristic tone occurrence indicator for each of the phonemes, based on the types of the phonemes recognized by the speech recognition unit (106), the characteristic tone occurrence indicator relating to an occurrence frequency of the characteristic tone; and an emotion judgment unit (113) which judges an emotion of the speaker in a phoneme at which the characteristic tone occurs in the input speech, based on the characteristic tone occurrence indicator computed by the characteristic tone occurrence indicator computing unit (111).
摘要翻译：情感识别装置能够执行准确和稳定的基于语音的情感识别，而不管韵律信息的个体，区域和语言差异。情感识别装置是用于从输入语音识别扬声器的情感的装置，包括：识别输入语音中包含的音素类型的语音识别单元（106）在输入语音中检测与特定情绪相关的特征音的特征音检测单元（104）; 基于由语音识别单元（106）识别的音素的类型来计算每个音素的特征音发生指示符的特征音发生指示计算单元（111），与发生频率相关的特征音发生指示符的特征色调; 以及感觉判断单元，其基于由特征音发生指示计算单元计算出的特征音发生指标，判断在输入语音中出现特征音的音素中的说话者的情绪。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式