会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 51. 发明授权
    • Speech processing and speech synthesis using a linear combination of bases at peak frequencies for spectral envelope information
    • 在频谱包络信息的峰值频率处使用基线的线性组合的语音处理和语音合成
    • US08321208B2
    • 2012-11-27
    • US12327399
    • 2008-12-03
    • Masatsune TamuraKatsumi TsuchiyaTakehiko Kagoshima
    • Masatsune TamuraKatsumi TsuchiyaTakehiko Kagoshima
    • G10L13/06G10L19/02
    • G10L13/06
    • An information extraction unit extracts spectral envelope information of L-dimension from each frame of speech data by discrete Fourier transform. The spectral envelope information is represented by L points. A basis storage unit stores N bases (L>N>1). Each basis is differently a frequency band having a maximum as a peak frequency in a spectral domain having L-dimension. A value corresponding to a frequency outside the frequency band along a frequency axis of the spectral domain is zero. Two frequency bands of which two peak frequencies are adjacent along the frequency axis partially overlap. A parameter calculation unit minimizes a distortion between the spectral envelope information and a linear combination of each basis with a coefficient for each of L points of the spectral envelope information by changing the coefficient, and sets the coefficient of each basis from which the distortion is minimized to a spectral envelope parameter of the spectral envelope information.
    • 信息提取单元通过离散傅里叶变换从每个语音数据帧提取L维的频谱包络信息。 频谱包络信息由L点表示。 基准存储单元存储N个碱基(L> N> 1)。 每个基准在具有L维的谱域中具有作为峰值频率的最大值的频带不同。 对应于沿着频域的频率轴的频带外的频率的值为零。 两个峰值频率沿频率轴相邻的两个频带部分重叠。 参数计算单元通过改变系数,将频谱包络信息和每个基线的线性组合之间的失真与频谱包络信息中的每个L点的系数最小化,并且设置失真最小化的每个基准的系数 到频谱包络信息的频谱包络参数。
    • 54. 发明申请
    • SPEECH SYNTHESIS APPARATUS AND METHOD
    • 语音合成设备和方法
    • US20110087488A1
    • 2011-04-14
    • US12970162
    • 2010-12-16
    • Ryo MorinakaTakehiko Kagoshima
    • Ryo MorinakaTakehiko Kagoshima
    • G10L11/04G10L13/06
    • G10L13/06G10L13/033G10L19/097G10L25/15G10L2021/0135
    • According to an embodiment, a speech synthesis apparatus includes a selecting unit configured to select speaker's parameters one by one for respective speakers and obtain a plurality of speakers' parameters, the speaker's parameters being prepared for respective pitch waveforms corresponding to speaker's speech sounds, the speaker's parameters including formant frequencies, formant phases, formant powers, and window functions concerning respective formants that are contained in the respective pitch waveforms. The apparatus includes a mapping unit configured to make formants correspond to each other between the plurality of speakers' parameters using a cost function based on the formant frequencies and the formant powers. The apparatus includes a generating unit configured to generate an interpolated speaker's parameter by interpolating, at desired interpolation ratios, the formant frequencies, formant phases, formant powers, and window functions of formants which are made to correspond to each other.
    • 根据实施例,语音合成装置包括:选择单元,被配置为逐个选择说话者的参数,并且获得多个扬声器的参数;所述说话者的参数是针对对应于说话者的语音的各个音调波形而准备的, 参数包括共振峰频率,共振峰相位,共振峰功率,以及相关螺旋波形中包含的各共振峰的窗函数。 该装置包括:映射单元,其被配置为使用基于共振峰频率和共振峰功率的成本函数在多个扬声器的参数之间使得共振峰彼此对应。 该装置包括:生成单元,被配置为通过以期望的内插比率内插使彼此对应的共振峰的共振峰频率,共振峰相位,共振峰功率和窗函数来生成内插说话者的参数。
    • 55. 发明授权
    • Speech synthesis system and method
    • 语音合成系统及方法
    • US07630896B2
    • 2009-12-08
    • US11233092
    • 2005-09-23
    • Masatsune TamuraGou HirabayashiTakehiko Kagoshima
    • Masatsune TamuraGou HirabayashiTakehiko Kagoshima
    • G10L13/06
    • G10L13/07
    • A speech synthesis system in a preferred embodiment includes a speech unit storage section, a phonetic environment storage section, a phonetic sequence/prosodic information input section, a plural-speech-unit selection section, a fused-speech-unit sequence generation section, and a fused-speech-unit modification/concatenation section. By fusing a plurality of selected speech units in the fused speech unit sequence generation section, a fused speech unit is generated. In the fused speech unit sequence generation section, the average power information is calculated for a plurality of selected M speech units, N speech units are fused together, and the power information of the fused speech unit is so corrected as to be equalized with the average power information of the M speech units.
    • 优选实施例中的语音合成系统包括语音单元存储部分,语音环境存储部分,语音序列/韵律信息输入部分,多语音单元选择部分,融合语音单元序列生成部分和 融合语音单元修改/级联部分。 通过在融合语音单元序列生成部中融合多个选择的语音单元,生成融合语音单元。 在融合语音单元序列产生部分中,针对多个所选择的M个语音单元计算平均功率信息,将N个语音单元融合在一起,并将融合语音单元的功率信息校正为与平均值相等 M个语音单元的功率信息。
    • 57. 发明授权
    • Speech synthesis method
    • 语音合成方法
    • US07184958B2
    • 2007-02-27
    • US10792888
    • 2004-03-05
    • Takehiko KagoshimaMasami Akamine
    • Takehiko KagoshimaMasami Akamine
    • G10L13/00G10L19/04
    • G10L13/07G10L25/90
    • A speech synthesis method subjects a reference speech signal to windowing to extract a speech pitch wave having a window function of a window length double a pitch period of the reference speech signal from the reference speech signal. A linear prediction coefficient is generated by subjecting the reference speech signal to a linear prediction analysis. The speech pitch wave is subjected to inverse-filtering based on the linear prediction coefficient to produce a residual pitch wave, which is then stored as information of a speech synthesis unit in a voiced period in a storage. Speech using the information of the speech synthesis unit is then synthesized.
    • 语音合成方法使参考语音信号进行加窗以从参考语音信号中提取具有参考语音信号的音高周期的窗口长度双倍的窗函数的语音音调波。 通过对参考语音信号进行线性预测分析来生成线性预测系数。 语音音调波基于线性预测系数进行逆滤波以产生残余音调波,然后作为语音合成单元的信息存储在存储器中的有声周期中。 然后合成使用语音合成单元的信息的语音。
    • 58. 发明授权
    • Phonemic unit dictionary based on shifted portions of source codebook vectors, for text-to-speech synthesis
    • 基于源码本向量的偏移部分的音素单位词典,用于文本到语音合成
    • US06202048B1
    • 2001-03-13
    • US09239966
    • 1999-01-29
    • Katsumi TsuchiyaTakehiko KagoshimaMasami Akamine
    • Katsumi TsuchiyaTakehiko KagoshimaMasami Akamine
    • G10L1306
    • G10L19/12G10L13/06
    • A speech synthesis apparatus synthesize a speech signal by filtering a speech source signal through a synthesis filter. A speech source signal codebook stores a plurality of speech source signals as a code vector. A unit dictionary memory stores a plurality of synthesis units corresponding to phonemic symbols, each synthesis unit comprising an index of the code vector in the speech source codebook and a shift number for the code vector to decode the speech source signal. A unit selection section selects a synthesis unit corresponding to phonemic symbols to be synthesized from the unit dictionary memory. A synthesis unit decoder selects the code vector corresponding to the index in the synthesis unit from the speech source signal codebook, and shifts the code vector according to the shift number in the synthesis unit.
    • 语音合成装置通过合成滤波器对语音源信号进行滤波来合成语音信号。 语音源信号码本存储多个语音源信号作为码矢量。 单元字典存储器存储对应于音素符号的多个合成单元,每个合成单元包括语音源码本中的码矢量的索引和用于解码语音源信号的码矢量的移位号。 单元选择部从单位字典存储器中选择与要合成的音素符号对应的合成单位。 合成单元解码器从语音源信号码本中选择与合成单元中的索引相对应的码矢量,并根据合成单元中的移位号移位码矢量。