会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Speech synthesizing system and method for modifying prosody based on match to database
    • 语音合成系统和基于匹配数据库修改韵律的方法
    • US06823309B1
    • 2004-11-23
    • US09701183
    • 2000-11-27
    • Yumiko KatoKenji MatsuiTakahiro KamaiKatsuyoshi Yamagami
    • Yumiko KatoKenji MatsuiTakahiro KamaiKatsuyoshi Yamagami
    • G10L1308
    • G10L13/10G10L13/04G10L13/08
    • A speech synthesis system for storing in advance a degree of modification of prosodic data in a prosodic data modifying rule apparatus, the degree of modification corresponding to an approximate cost and being stored as a modifying rule, a prosodic data retrieving section for retrieving a prosodic data stored corresponding to a key data for use in retrieval, the prosodic data retrieved according to a degree of matching between the input data and the key data, the degree of matching represented by the approximate cost, a modifying section for modifying the retrieved prosodic data based on the degree of matching and the modifying rule stored in the prosodic data modifying rule means, and an output section for outputting synthesized speech based on the input data and the modified prosodic data.
    • 一种语音合成系统,用于预先存储韵律数据修改规则装置中的韵律数据的修改程度,对应于近似成本的修改程度,并被存储为修改规则;韵律数据检索部分,用于检索韵律数据 存储对应于用于检索的密钥数据,根据输入数据和密钥数据之间的匹配程度检索的韵律数据,由近似成本表示的匹配程度,用于修改所检索的韵律数据的修改部分 关于存储在韵律数据修改规则装置中的匹配度和修改规则,以及用于基于输入数据和修改的韵律数据输出合成语音的输出部分。
    • 5. 发明授权
    • Speech synthesis apparatus
    • 语音合成装置
    • US07526430B2
    • 2009-04-28
    • US11226331
    • 2005-09-15
    • Yumiko KatoTakahiro Kamai
    • Yumiko KatoTakahiro Kamai
    • G10L13/06G10L21/00
    • G10L13/10
    • A speech synthesis apparatus, which can embed unchangeable additional information into synthesized speech without causing a deterioration of speech quality and restriction by bands, includes a language processing unit which generates synthesized speech generation information necessary for generating synthesized speech in accordance with a language string, a prosody generating unit which generates prosody information of speech based on the synthesized speech generation information, and a waveform generating unit which synthesizes speech based on the prosody information, in which the prosody generating unit embed code information as watermark information in the prosody information of a segment having a predetermined time duration within a phoneme length including a phoneme boundary.
    • 一种可将不可变附加信息嵌入合成语音而不导致语音质量恶化和频带限制的语音合成装置,包括:语言处理单元,其生成根据语言串产生合成语音所需的合成语音产生信息; 韵律产生单元,其基于合成的语音产生信息生成语音的韵律信息;以及波形生成单元,其基于韵律信息合成语音,其中,韵律生成单元将代码信息作为水印信息嵌入在片段的韵律信息中 在包括音素边界的音素长度内具有预定的持续时间。
    • 7. 发明申请
    • VOICE QUALITY CONVERSION DEVICE AND VOICE QUALITY CONVERSION METHOD
    • 语音质量转换设备和语音质量转换方法
    • US20090281807A1
    • 2009-11-12
    • US12307021
    • 2008-05-08
    • Yoshifumi HiroseTakahiro KamaiYumiko Kato
    • Yoshifumi HiroseTakahiro KamaiYumiko Kato
    • G10L13/06G10L15/04G10L13/08
    • G10L21/00G10L13/00G10L13/043G10L21/003G10L2015/025G10L2021/0135
    • A voice quality conversion device converts voice quality of an input speech using information of the speech. The device includes: a target vowel vocal tract information hold unit (101) holding target vowel vocal tract information of each vowel indicating target voice quality; a vowel conversion unit (103) receiving vocal tract information with phoneme boundary information of the speech including information of phonemes and phoneme durations, (ii) approximating a temporal change of vocal tract information of a vowel in the vocal tract information with phoneme boundary information applying a first function, (iii) approximating a temporal change of vocal tract information of the same vowel held in the target vowel vocal tract information hold unit (101) applying a second function, (iv) calculating a third function by combining the first function with the second function, and (v) converting the vocal tract information of the vowel applying the third function; and a synthesis unit (103) synthesizing a speech using the converted information (102).
    • 语音质量转换装置使用语音信息来转换输入语音的语音质量。 该装置包括:目标元音声道信息保持单元,保持每个元音的目标元音声道信息,指示目标语音质量; 元音转换单元(103),其接收具有包括音素和音素持续时间的信息的语音的音素边界信息的声道信息,(ii)使用音素边界信息应用于声带信息中的元音的声道信息的时间变化近似 第一功能,(iii)近似保持在应用第二功能的目标元音声道信息保持单元(101)中保持的同一母音的声道信息的时间变化,(iv)通过将第一功能与 第二功能,(v)转换应用第三功能的元音的声道信息; 以及使用所转换的信息(102)合成语音的合成单元(103)。
    • 8. 发明授权
    • Speech synthesizer, speech synthesizing method, and program
    • 语音合成器,语音合成方法和程序
    • US07454343B2
    • 2008-11-18
    • US11783855
    • 2007-04-12
    • Yoshifumi HiroseTakahiro KamaiYumiko KatoNatsuki Saito
    • Yoshifumi HiroseTakahiro KamaiYumiko KatoNatsuki Saito
    • G10L15/14
    • G10L13/06G10L13/04
    • A speech synthesizer that provides high-quality sound along with stable sound quality, including: a target parameter generation unit; a speech element DB; an element selection unit; a mixed parameter judgment unit which determines an optimum parameter combination of target parameters and speech elements; a parameter integration unit which integrates the parameters; and a waveform generation unit which generates synthetic speech. High-quality and stable synthetic speech is generated by combining, per parameter dimension, the parameters with stable sound quality generated by the target parameter generation unit with speech elements with high sound quality and a sense of true speech selected by the element selection unit.
    • 一种提供高质量声音以及稳定音质的语音合成器,包括:目标参数产生单元; 语音元件DB; 元素选择单元; 混合参数判断单元,其确定目标参数和语音元素的最佳参数组合; 参数集成单元,其集成参数; 以及生成合成语音的波形生成单元。 通过将参数尺寸与由目标参数生成单元生成的稳定声音的参数与具有高音质的语音元素和由元素选择单元选择的真实语音感觉相结合,产生高质量和稳定的合成语音。
    • 10. 发明授权
    • Voice emphasizing device and voice emphasizing method
    • 语音强调设备和语音强调方法
    • US08311831B2
    • 2012-11-13
    • US12447775
    • 2008-09-29
    • Yumiko KatoTakahiro KamaiMasakatsu Hoshimi
    • Yumiko KatoTakahiro KamaiMasakatsu Hoshimi
    • G10L13/06
    • G10L21/02G10L21/0232G10L25/87
    • A voice emphasizing device emphasizes in a speech a “strained rough voice” at a position where a speaker or user of the speech intends to generate emphasis or musical expression. Thereby, the voice emphasizing device can provide the position with emphasis of anger, excitement, tension, or an animated way of speaking, or musical expression of Enka (Japanese ballad), blues, rock, or the like. As a result, rich vocal expression can be achieved. The voice emphasizing device includes: an emphasis utterance section detection unit (12) detecting, from an input speech waveform, an emphasis section that is a time duration having a waveform intended by the speaker or user to be converted; and a voice emphasizing unit (13) increasing fluctuation of an amplitude envelope of the waveform in the detected emphasis section.
    • 语音强调装置在讲话中强调了一个紧张的粗糙声音,其中讲话者或言语用户意图产生强调或音乐表达。 因此,声音强调装置可以以Enka(日本民谣),蓝调,摇滚等的强调愤怒,兴奋,紧张或动画的演绎方式或音乐表现为出发点。 因此,可以实现丰富的声乐表达。 语音强调装置包括:强调话音部分检测单元,从输入语音波形检测作为要被转换的扬声器或用户想要的波形的持续时间的强调部分; 以及语音强调单元(13),增加检测到的强调部分中波形的振幅包络的波动。