    • 4. Granted invention patent
    • Speech synthesis apparatus and method for causing a computer to perform speech synthesis by calculating product of parameters for a speech waveform and a read waveform generation matrix
    • US5745651A
    • 1998-04-28
    • US452545
    • 1995-05-30
    • Mitsuru Otsuka; Yasunori Ohora; Takashi Aso; Toshiaki Fukada
    • G10L13/00; G10L13/02; G10L13/04; G10L13/06; G10L13/08; G10L3/02
    • G10L13/033; G10L13/08
    • A speech synthesis method and a speech synthesis apparatus include a system for synthesis by rule that prevents the quality of synthesized speech from deteriorating and reduces the number of calculations required to generate a speech waveform. The speech synthesis apparatus includes a character series input section, for inputting a character series as phonetic text; a pitch waveform generator, for generating a pitch waveform by calculating a product of a matrix, which has been acquired for each pitch, and the character series, which is input by the character series input section; and a device for connecting the pitch waveforms generated by the pitch waveform generator and for providing a speech waveform. This calculation method greatly reduces the number of calculations required to generate a pitch waveform. In addition, in the calculation for the generation of a pitch waveform, a function that determines a frequency response is employed to convert a spectral envelope, which is obtained from a parameter, so that the timbre of synthesized speech can be changed without parameter operations.
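The abstract above describes generating each pitch waveform as a matrix product and then connecting the results. A minimal sketch of that idea follows. It is not the patented method: the cosine basis in `waveform_matrix`, the per-period cache, and all function names are assumptions chosen only to make the example runnable, since the listing does not say how the matrix is built.

```python
import math

def waveform_matrix(period, n_params):
    """Build a (period x n_params) matrix for one pitch period.

    Assumption: a plain cosine basis stands in for the patent's
    "waveform generation matrix", whose real definition is not in the listing.
    """
    return [
        [math.cos(2 * math.pi * m * k / period) for m in range(n_params)]
        for k in range(period)
    ]

_matrix_cache = {}  # hypothetical cache: one matrix per (period, n_params)

def pitch_waveform(params, period):
    """Return one pitch waveform as the product of the cached matrix and params."""
    key = (period, len(params))
    if key not in _matrix_cache:
        _matrix_cache[key] = waveform_matrix(period, len(params))
    matrix = _matrix_cache[key]
    return [sum(w * p for w, p in zip(row, params)) for row in matrix]

def synthesize(frames):
    """Connect the pitch waveforms of successive (params, period) frames."""
    speech = []
    for params, period in frames:
        speech.extend(pitch_waveform(params, period))
    return speech
```

Because the matrix depends only on the pitch period and the parameter count, it can be computed once and reused, so each pitch waveform costs a single matrix-vector product. That reuse is what the sketch takes "acquired for each pitch" to mean.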
    • 8. Granted invention patent
    • Synthesizing phoneme string of predetermined duration by adjusting initial phoneme duration on values from multiple regression by adding values based on their standard deviations
    • US06546367B2
    • 2003-04-08
    • US09264866
    • 1999-03-09
    • Mitsuru Otsuka
    • G10L13/08
    • G10L13/10; G10L13/08
    • Statistical data including an average value, a standard deviation, and a minimum value of a phoneme duration of each phoneme is stored in a memory. When speech production time is determined for a phoneme string in a predetermined expiratory paragraph, the total phoneme duration of the phoneme string is set so as to become equal to the speech production time. Based on the set phoneme duration, phonemes are connected and a speech waveform is generated. To set a phoneme duration for each phoneme, a phoneme duration initial value is first set based on an average value, obtained by equally dividing the speech production time by phonemes of the phoneme string, and a phoneme duration range, set based on statistical data of each phoneme. Then, the phoneme duration initial value is adjusted based on the statistical data and the speech production time.
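The two-step duration setting in the abstract above (an initial value from the equal split limited to a per-phoneme range, followed by an adjustment so the total equals the speech production time) can be sketched as below. This is an illustration under assumptions, not the patented procedure: the range `mean ± k·std` floored at the minimum, the factor `k`, and the choice to distribute the residual in proportion to each standard deviation are all hypothetical.

```python
from dataclasses import dataclass

@dataclass
class PhonemeStats:
    mean: float     # average duration (e.g. in ms)
    std: float      # standard deviation of the duration
    minimum: float  # minimum observed duration

def set_durations(stats, total_time, k=3.0):
    """Return per-phoneme durations that sum to total_time.

    Assumptions: the allowed range is [max(minimum, mean - k*std), mean + k*std],
    and the residual is shared in proportion to std.
    """
    if not stats:
        raise ValueError("phoneme string is empty")
    equal_share = total_time / len(stats)

    # Step 1: initial value = equal share, clamped to each phoneme's range.
    initial = []
    for s in stats:
        low = max(s.minimum, s.mean - k * s.std)
        high = s.mean + k * s.std
        initial.append(min(max(equal_share, low), high))

    # Step 2: spread the remaining time using the standard deviations.
    residual = total_time - sum(initial)
    weight_sum = sum(s.std for s in stats)
    if weight_sum == 0:
        return initial  # no variability information to distribute the residual
    return [d + residual * s.std / weight_sum for d, s in zip(initial, stats)]
```

For example, with statistics (100, 10, 50) and (200, 20, 80) and a production time of 330, the clamped initial values are 130 and 165, and the remaining 35 is split 1:2, so the phoneme with the larger spread absorbs more of the adjustment. The sketch does not re-check the range after the adjustment.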
    • 10. Granted invention patent
    • Speech synthesizing method and apparatus
    • US06993484B1
    • 2006-01-31
    • US09386049
    • 1999-08-30
    • Masayuki Yamada; Yasuhiro Komori; Mitsuru Otsuka
    • G10L13/00; G10L13/06
    • G10L13/07; G10L25/21
    • An amplitude altering magnification (r) to be applied to sub-phoneme units of a voiced portion and an amplitude altering magnification (s) to be applied to sub-phoneme units of an unvoiced portion are determined based upon a target phoneme average power (p0) of synthesized speech and power (p) of a selected phoneme unit. Sub-phoneme units are extracted from a phoneme to be synthesized. From among the extracted sub-phoneme units, a sub-phoneme unit of the voiced portion is multiplied by the amplitude altering magnification (r), and a sub-phoneme unit of the unvoiced portion is multiplied by the amplitude altering magnification (s). Synthesized speech is obtained using the sub-phoneme units thus obtained. This makes it possible to realize power control in which any decline in the quality of synthesized speech is reduced.
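The power control in the abstract above applies separate gains to voiced and unvoiced sub-phoneme units. A rough sketch follows. The abstract does not give the formulas for r and s, so the choices here are assumptions: r is taken as sqrt(p0 / p), and s is the same value limited by a hypothetical cap `unvoiced_cap`, so that unvoiced units are not amplified as strongly.

```python
import math

def mean_power(units):
    """Average power (mean squared sample) over all sub-phoneme units."""
    samples = [x for unit, _voiced in units for x in unit]
    if not samples:
        raise ValueError("no samples")
    return sum(x * x for x in samples) / len(samples)

def scale_units(units, target_power, unvoiced_cap=2.0):
    """Scale voiced units by r and unvoiced units by s.

    units: list of (samples, is_voiced) pairs.
    Assumptions: r = sqrt(p0 / p) and s = min(r, unvoiced_cap).
    """
    p = mean_power(units)
    if p == 0:
        raise ValueError("selected unit has zero power")
    r = math.sqrt(target_power / p)  # gain for voiced sub-phoneme units
    s = min(r, unvoiced_cap)         # gain for unvoiced sub-phoneme units
    return [
        [x * (r if voiced else s) for x in unit]
        for unit, voiced in units
    ]
```

Keeping two gains lets the voiced portion reach the target power while the unvoiced portion is scaled more conservatively. That is one plausible reading of how the quality decline is reduced, not a statement of what the patent claims.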