会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明申请
    • VOICE QUALITY CONVERSION DEVICE AND VOICE QUALITY CONVERSION METHOD
    • 语音质量转换设备和语音质量转换方法
    • US20090281807A1
    • 2009-11-12
    • US12307021
    • 2008-05-08
    • Yoshifumi HiroseTakahiro KamaiYumiko Kato
    • Yoshifumi HiroseTakahiro KamaiYumiko Kato
    • G10L13/06G10L15/04G10L13/08
    • G10L21/00G10L13/00G10L13/043G10L21/003G10L2015/025G10L2021/0135
    • A voice quality conversion device converts voice quality of an input speech using information of the speech. The device includes: a target vowel vocal tract information hold unit (101) holding target vowel vocal tract information of each vowel indicating target voice quality; a vowel conversion unit (103) receiving vocal tract information with phoneme boundary information of the speech including information of phonemes and phoneme durations, (ii) approximating a temporal change of vocal tract information of a vowel in the vocal tract information with phoneme boundary information applying a first function, (iii) approximating a temporal change of vocal tract information of the same vowel held in the target vowel vocal tract information hold unit (101) applying a second function, (iv) calculating a third function by combining the first function with the second function, and (v) converting the vocal tract information of the vowel applying the third function; and a synthesis unit (103) synthesizing a speech using the converted information (102).
    • 语音质量转换装置使用语音信息来转换输入语音的语音质量。 该装置包括:目标元音声道信息保持单元,保持每个元音的目标元音声道信息,指示目标语音质量; 元音转换单元(103),其接收具有包括音素和音素持续时间的信息的语音的音素边界信息的声道信息,(ii)使用音素边界信息应用于声带信息中的元音的声道信息的时间变化近似 第一功能,(iii)近似保持在应用第二功能的目标元音声道信息保持单元(101)中保持的同一母音的声道信息的时间变化,(iv)通过将第一功能与 第二功能,(v)转换应用第三功能的元音的声道信息; 以及使用所转换的信息(102)合成语音的合成单元(103)。
    • 5. 发明授权
    • Speech synthesizer, speech synthesizing method, and program
    • 语音合成器,语音合成方法和程序
    • US07454343B2
    • 2008-11-18
    • US11783855
    • 2007-04-12
    • Yoshifumi HiroseTakahiro KamaiYumiko KatoNatsuki Saito
    • Yoshifumi HiroseTakahiro KamaiYumiko KatoNatsuki Saito
    • G10L15/14
    • G10L13/06G10L13/04
    • A speech synthesizer that provides high-quality sound along with stable sound quality, including: a target parameter generation unit; a speech element DB; an element selection unit; a mixed parameter judgment unit which determines an optimum parameter combination of target parameters and speech elements; a parameter integration unit which integrates the parameters; and a waveform generation unit which generates synthetic speech. High-quality and stable synthetic speech is generated by combining, per parameter dimension, the parameters with stable sound quality generated by the target parameter generation unit with speech elements with high sound quality and a sense of true speech selected by the element selection unit.
    • 一种提供高质量声音以及稳定音质的语音合成器,包括:目标参数产生单元; 语音元件DB; 元素选择单元; 混合参数判断单元,其确定目标参数和语音元素的最佳参数组合; 参数集成单元,其集成参数; 以及生成合成语音的波形生成单元。 通过将参数尺寸与由目标参数生成单元生成的稳定声音的参数与具有高音质的语音元素和由元素选择单元选择的真实语音感觉相结合,产生高质量和稳定的合成语音。
    • 6. 发明授权
    • Emotion recognition apparatus
    • 情感识别装置
    • US08204747B2
    • 2012-06-19
    • US11997458
    • 2007-05-21
    • Yumiko KatoTakahiro KamaiYoshihisa NakatohYoshifumi Hirose
    • Yumiko KatoTakahiro KamaiYoshihisa NakatohYoshifumi Hirose
    • G10L15/04
    • G10L17/26G10L2015/025
    • An emotion recognition apparatus performs accurate and stable speech-based emotion recognition, irrespective of individual, regional, and language differences of prosodic information. The emotion recognition apparatus includes: a speech recognition unit which recognizes types of phonemes included in the input speech; a characteristic tone detection unit which detects a characteristic tone that relates to a specific emotion, in the input speech; a characteristic tone occurrence indicator computation unit which computes a characteristic tone occurrence indicator for each of the phonemes, based on the types of the phonemes recognized by the speech recognition unit, the characteristic tone occurrence indicator relating to an occurrence frequency of the characteristic tone; and an emotion judgment unit which judges an emotion of the speaker in a phoneme at which the characteristic tone occurs in the input speech, based on the characteristic tone occurrence indicator computed by the characteristic tone occurrence indicator computing unit.
    • 情感识别装置执行准确和稳定的基于语音的情感识别,而不管韵律信息的个体,区域和语言差异。 情感识别装置包括:语音识别单元,其识别输入语音中包括的音素的类型; 在输入语音中检测与特定情感相关的特征音的特征音检测单元; 特征音发生指示符计算单元,其基于由语音识别单元识别的音素的类型来计算每个音素的特征音发生指示符,与特征音的发生频率相关的特征音发生指示符; 以及情绪判断单元,其基于由特征音发生指示符计算单元计算的特征音发生指标,判断在输入语音中出现特征音的音素中的说话者的情感。
    • 7. 发明申请
    • SPEECH SYNTHESIZER
    • 语音合成器
    • US20090254349A1
    • 2009-10-08
    • US12303455
    • 2007-05-11
    • Yoshifumi HiroseYumiko KatoTakahiro Kamai
    • Yoshifumi HiroseYumiko KatoTakahiro Kamai
    • G10L13/06G10L13/08G06F17/30
    • G10L13/033G10L13/04
    • A speech synthesizer can execute speech content editing at high speed and generate speech content easily. The speech synthesizer includes a small speech element DB (101), a small speech element selection unit (102), a small speech element concatenation unit (103), a prosody modification unit (104), a large speech element DB (105), a correspondence DB (106) that associates the small speech element DB (101) with the large speech element DB (105), a speech element candidate obtainment unit (107), a large speech element selection unit (108), and a large speech element concatenation unit (109). By editing synthetic speech using the small speech element DB (101) and performing quality enhancement on an editing result using the large speech element DB (105), speech content can be generated easily on a mobile terminal.
    • 语音合成器可以高速执行语音内容编辑并且容易地产生语音内容。 语音合成器包括小语音元素DB(101),小语音元素选择单元(102),小语音元素连接单元(103),韵律修改单元(104),大语音元素DB(105) 将小语音元件DB(101)与大语音元件DB(105)相关联的对应DB(106),语音元素候补获取单元(107),大语音元素选择单元(108)和大语音 元素级联单元(109)。 通过使用小语音元素DB(101)编辑合成语音并且使用大语音元素DB(105)对编辑结果进行质量增强,可以在移动终端上容易地生成语音内容。
    • 8. 发明申请
    • Speech synthesis method and information providing apparatus
    • 语音合成方法和信息提供装置
    • US20070094029A1
    • 2007-04-26
    • US11434153
    • 2006-05-16
    • Natsuki SaitoTakahiro KamaiYumiko KatoYoshifumi Hirose
    • Natsuki SaitoTakahiro KamaiYumiko KatoYoshifumi Hirose
    • G10L13/08
    • G10L13/033
    • To provide a speech synthesis method of reading out units of synthesized speech without fail and in an easy to understand manner, even when playback of the units of synthesized speech are simultaneously requested. The duration prediction unit predicts the playback duration of synthesized speech to be synthesized based on text. The time constraint satisfaction judgment unit judges whether a constraint condition concerning the playback timing of the synthesized speech is satisfied or not, based on the predicted playback duration. If it judged that the constraint condition is not satisfied, the content modification unit shifts the playback starting timing of the synthesized speech of the text forward or backward, and modifies the contents of the text indicating time and distance in accordance with the shifted time. The synthesized speech generation unit generates synthesized speech based on the text having the modified contents and plays it back.
    • 即使在同时请求合成语音单元的回放的同时,提供一种无故障地以容易理解的方式读出合成语音单元的语音合成方法。 持续时间预测单元基于文本预测要合成的合成语音的播放持续时间。 时间约束满足判断单元基于预测的播放持续时间判断与合成语音的重放定时有关的约束条件是否满足。 如果判定不满足约束条件,则内容修改单元向前或向后移动文本的合成语音的重放开始定时,并根据移动的时间修改指示时间和距离的文本的内容。 合成语音生成单元基于具有修改内容的文本生成合成语音,并且回放。
    • 10. 发明申请
    • EMOTION RECOGNITION APPARATUS
    • 感应识别装置
    • US20090313019A1
    • 2009-12-17
    • US11997458
    • 2007-05-21
    • Yumiko KatoTakahiro KamaiYoshihisa NakatohYoshifumi Hirose
    • Yumiko KatoTakahiro KamaiYoshihisa NakatohYoshifumi Hirose
    • G10L15/04
    • G10L17/26G10L2015/025
    • An emotion recognition apparatus is capable of performing accurate and stable speech-based emotion recognition, irrespective of individual, regional, and language differences of prosodic information. The emotion recognition apparatus is an apparatus for recognizing an emotion of a speaker from an input speech, and includes: a speech recognition unit (106) which recognizes types of phonemes included in the input speech; a characteristic tone detection unit (104) which detects a characteristic tone that relates to a specific emotion, in the input speech; a characteristic tone occurrence indicator computation unit (111) which computes a characteristic tone occurrence indicator for each of the phonemes, based on the types of the phonemes recognized by the speech recognition unit (106), the characteristic tone occurrence indicator relating to an occurrence frequency of the characteristic tone; and an emotion judgment unit (113) which judges an emotion of the speaker in a phoneme at which the characteristic tone occurs in the input speech, based on the characteristic tone occurrence indicator computed by the characteristic tone occurrence indicator computing unit (111).
    • 情感识别装置能够执行准确和稳定的基于语音的情感识别,而不管韵律信息的个体,区域和语言差异。 情感识别装置是用于从输入语音识别扬声器的情感的装置,包括:识别输入语音中包含的音素类型的语音识别单元(106) 在输入语音中检测与特定情绪相关的特征音的特征音检测单元(104); 基于由语音识别单元(106)识别的音素的类型来计算每个音素的特征音发生指示符的特征音发生指示计算单元(111),与发生频率相关的特征音发生指示符 的特征色调; 以及感觉判断单元,其基于由特征音发生指示计算单元计算出的特征音发生指标,判断在输入语音中出现特征音的音素中的说话者的情绪。