专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

51. 发明授权

US08321208B2 Speech processing and speech synthesis using a linear combination of bases at peak frequencies for spectral envelope information 有权
标题翻译：在频谱包络信息的峰值频率处使用基线的线性组合的语音处理和语音合成
公开(公告)号：US08321208B2
公开(公告)日：2012-11-27
申请号：US12327399
申请日：2008-12-03
申请人： Masatsune Tamura , Katsumi Tsuchiya , Takehiko Kagoshima
发明人： Masatsune Tamura , Katsumi Tsuchiya , Takehiko Kagoshima
IPC分类号： G10L13/06 , G10L19/02
CPC分类号： G10L13/06
摘要： An information extraction unit extracts spectral envelope information of L-dimension from each frame of speech data by discrete Fourier transform. The spectral envelope information is represented by L points. A basis storage unit stores N bases (L>N>1). Each basis is differently a frequency band having a maximum as a peak frequency in a spectral domain having L-dimension. A value corresponding to a frequency outside the frequency band along a frequency axis of the spectral domain is zero. Two frequency bands of which two peak frequencies are adjacent along the frequency axis partially overlap. A parameter calculation unit minimizes a distortion between the spectral envelope information and a linear combination of each basis with a coefficient for each of L points of the spectral envelope information by changing the coefficient, and sets the coefficient of each basis from which the distortion is minimized to a spectral envelope parameter of the spectral envelope information.
摘要翻译：信息提取单元通过离散傅里叶变换从每个语音数据帧提取L维的频谱包络信息。频谱包络信息由L点表示。基准存储单元存储N个碱基（L> N> 1）。每个基准在具有L维的谱域中具有作为峰值频率的最大值的频带不同。对应于沿着频域的频率轴的频带外的频率的值为零。两个峰值频率沿频率轴相邻的两个频带部分重叠。参数计算单元通过改变系数，将频谱包络信息和每个基线的线性组合之间的失真与频谱包络信息中的每个L点的系数最小化，并且设置失真最小化的每个基准的系数到频谱包络信息的频谱包络参数。

52. 发明授权

US08224646B2 Speech synthesizing device, method and computer program product 失效
标题翻译：语音合成装置，方法和计算机程序产品
公开(公告)号：US08224646B2
公开(公告)日：2012-07-17
申请号：US12563551
申请日：2009-09-21
申请人： Ryutaro Tokuda , Takehiko Kagoshima
发明人： Ryutaro Tokuda , Takehiko Kagoshima
IPC分类号： G10L13/00
CPC分类号： G10L13/00
摘要： The speech synthesizing device acquires numerical data at regular time intervals, each piece of the numerical data representing a value having a plurality of digits, detects a change between two values represented by the numerical data that is acquired at two consecutive times, determines which digit of the value represented by the numerical data is used to generate speech data depending on the detected change, generates numerical information that indicates the determined digit of the value represented by the numerical data, and generates speech data from the digit indicated by the numerical information.
摘要翻译：语音合成装置以规则的时间间隔获取数字数据，表示具有多个数字的值的每一数字数据检测由连续两次获取的数字数据表示的两个值之间的变化，确定由数值数据表示的值用于根据检测到的变化产生语音数据，生成指示由数字数据表示的值的确定的数字的数字信息，并从由数字信息指示的数字生成语音数据。

53. 发明申请

US20110246199A1 SPEECH SYNTHESIZER 失效
标题翻译：语音合成器
公开(公告)号：US20110246199A1
公开(公告)日：2011-10-06
申请号：US12881397
申请日：2010-09-14
申请人： Osamu NISHIYAMA , Takehiko Kagoshima
发明人： Osamu NISHIYAMA , Takehiko Kagoshima
IPC分类号： G10L13/00
CPC分类号： G10L13/06 , G10L25/69
摘要： According to one embodiment, a speech synthesizer generates a speech segment sequence and synthesizes speech by connecting speech segments of the generated speech segment sequence. If a speech segment of a synthesized first speech segment sequence is different from the speech segment of a synthesized second speech segment sequence having the same synthesis unit as the first speech segment sequence, the speech synthesizer disables the speech segment of the first speech segment sequence that is different from the speech segment of the second speech segment sequence.
摘要翻译：根据一个实施例，语音合成器生成语音片段序列并通过连接所产生的语音片段序列的语音片段来合成语音。如果合成的第一语音段序列的语音片段与具有与第一语音片段序列具有相同合成单位的合成的第二语音片段序列的语音片段不同，则语音合成器禁止第一语音片段序列的语音片段，与第二语音段序列的语音段不同。

54. 发明申请

US20110087488A1 SPEECH SYNTHESIS APPARATUS AND METHOD 有权
标题翻译：语音合成设备和方法
公开(公告)号：US20110087488A1
公开(公告)日：2011-04-14
申请号：US12970162
申请日：2010-12-16
申请人： Ryo Morinaka , Takehiko Kagoshima
发明人： Ryo Morinaka , Takehiko Kagoshima
IPC分类号： G10L11/04 , G10L13/06
CPC分类号： G10L13/06 , G10L13/033 , G10L19/097 , G10L25/15 , G10L2021/0135
摘要： According to an embodiment, a speech synthesis apparatus includes a selecting unit configured to select speaker's parameters one by one for respective speakers and obtain a plurality of speakers' parameters, the speaker's parameters being prepared for respective pitch waveforms corresponding to speaker's speech sounds, the speaker's parameters including formant frequencies, formant phases, formant powers, and window functions concerning respective formants that are contained in the respective pitch waveforms. The apparatus includes a mapping unit configured to make formants correspond to each other between the plurality of speakers' parameters using a cost function based on the formant frequencies and the formant powers. The apparatus includes a generating unit configured to generate an interpolated speaker's parameter by interpolating, at desired interpolation ratios, the formant frequencies, formant phases, formant powers, and window functions of formants which are made to correspond to each other.
摘要翻译：根据实施例，语音合成装置包括：选择单元，被配置为逐个选择说话者的参数，并且获得多个扬声器的参数;所述说话者的参数是针对对应于说话者的语音的各个音调波形而准备的，参数包括共振峰频率，共振峰相位，共振峰功率，以及相关螺旋波形中包含的各共振峰的窗函数。该装置包括：映射单元，其被配置为使用基于共振峰频率和共振峰功率的成本函数在多个扬声器的参数之间使得共振峰彼此对应。该装置包括：生成单元，被配置为通过以期望的内插比率内插使彼此对应的共振峰的共振峰频率，共振峰相位，共振峰功率和窗函数来生成内插说话者的参数。

55. 发明授权

US07630896B2 Speech synthesis system and method 失效
标题翻译：语音合成系统及方法
公开(公告)号：US07630896B2
公开(公告)日：2009-12-08
申请号：US11233092
申请日：2005-09-23
申请人： Masatsune Tamura , Gou Hirabayashi , Takehiko Kagoshima
发明人： Masatsune Tamura , Gou Hirabayashi , Takehiko Kagoshima
IPC分类号： G10L13/06
CPC分类号： G10L13/07
摘要： A speech synthesis system in a preferred embodiment includes a speech unit storage section, a phonetic environment storage section, a phonetic sequence/prosodic information input section, a plural-speech-unit selection section, a fused-speech-unit sequence generation section, and a fused-speech-unit modification/concatenation section. By fusing a plurality of selected speech units in the fused speech unit sequence generation section, a fused speech unit is generated. In the fused speech unit sequence generation section, the average power information is calculated for a plurality of selected M speech units, N speech units are fused together, and the power information of the fused speech unit is so corrected as to be equalized with the average power information of the M speech units.
摘要翻译：优选实施例中的语音合成系统包括语音单元存储部分，语音环境存储部分，语音序列/韵律信息输入部分，多语音单元选择部分，融合语音单元序列生成部分和融合语音单元修改/级联部分。通过在融合语音单元序列生成部中融合多个选择的语音单元，生成融合语音单元。在融合语音单元序列产生部分中，针对多个所选择的M个语音单元计算平均功率信息，将N个语音单元融合在一起，并将融合语音单元的功率信息校正为与平均值相等 M个语音单元的功率信息。

56. 发明申请

US20080312931A1 SPEECH SYNTHESIS METHOD, SPEECH SYNTHESIS SYSTEM, AND SPEECH SYNTHESIS PROGRAM 有权
标题翻译：语音合成方法，语音合成系统和语音合成程序
公开(公告)号：US20080312931A1
公开(公告)日：2008-12-18
申请号：US12193530
申请日：2008-08-18
申请人： Tatsuya MIZUTANI , Takehiko Kagoshima
发明人： Tatsuya MIZUTANI , Takehiko Kagoshima
IPC分类号： G10L13/08 , G10L13/00
CPC分类号： G10L13/06 , G10L13/04
摘要： A speech synthesis system stores a group of speech units in a memory, selects a plurality of speech units from the group based on prosodic information of target speech, the speech units selected corresponding to each of segments which are obtained by segmenting a phoneme string of the target speech and minimizing distortion of synthetic speech generated from the speech units selected to the target speech, generates a new speech unit corresponding to the each of the segments, by fusing the speech units selected, to obtain a plurality of new speech units corresponding to the segments respectively, and generates synthetic speech by concatenating the new speech units.
摘要翻译：语音合成系统将一组语音单元存储在存储器中，基于目标语音的韵律信息从组中选择多个语音单元，对应于每个段选择的语音单元，该段是通过分割目标语音和最小化从选择到目标语音的语音单元产生的合成语音的失真，通过融合所选择的语音单元来生成对应于每个段的新语音单元，以获得对应于该语音单元的多个新语音单元并且通过连接新的语音单元来产生合成语音。

57. 发明授权

US07184958B2 Speech synthesis method 有权
标题翻译：语音合成方法
公开(公告)号：US07184958B2
公开(公告)日：2007-02-27
申请号：US10792888
申请日：2004-03-05
申请人： Takehiko Kagoshima , Masami Akamine
发明人： Takehiko Kagoshima , Masami Akamine
IPC分类号： G10L13/00 , G10L19/04
CPC分类号： G10L13/07 , G10L25/90
摘要： A speech synthesis method subjects a reference speech signal to windowing to extract a speech pitch wave having a window function of a window length double a pitch period of the reference speech signal from the reference speech signal. A linear prediction coefficient is generated by subjecting the reference speech signal to a linear prediction analysis. The speech pitch wave is subjected to inverse-filtering based on the linear prediction coefficient to produce a residual pitch wave, which is then stored as information of a speech synthesis unit in a voiced period in a storage. Speech using the information of the speech synthesis unit is then synthesized.
摘要翻译：语音合成方法使参考语音信号进行加窗以从参考语音信号中提取具有参考语音信号的音高周期的窗口长度双倍的窗函数的语音音调波。通过对参考语音信号进行线性预测分析来生成线性预测系数。语音音调波基于线性预测系数进行逆滤波以产生残余音调波，然后作为语音合成单元的信息存储在存储器中的有声周期中。然后合成使用语音合成单元的信息的语音。

58. 发明授权

US06202048B1 Phonemic unit dictionary based on shifted portions of source codebook vectors, for text-to-speech synthesis 有权
标题翻译：基于源码本向量的偏移部分的音素单位词典，用于文本到语音合成
公开(公告)号：US06202048B1
公开(公告)日：2001-03-13
申请号：US09239966
申请日：1999-01-29
申请人： Katsumi Tsuchiya , Takehiko Kagoshima , Masami Akamine
发明人： Katsumi Tsuchiya , Takehiko Kagoshima , Masami Akamine
IPC分类号： G10L1306
CPC分类号： G10L19/12 , G10L13/06
摘要： A speech synthesis apparatus synthesize a speech signal by filtering a speech source signal through a synthesis filter. A speech source signal codebook stores a plurality of speech source signals as a code vector. A unit dictionary memory stores a plurality of synthesis units corresponding to phonemic symbols, each synthesis unit comprising an index of the code vector in the speech source codebook and a shift number for the code vector to decode the speech source signal. A unit selection section selects a synthesis unit corresponding to phonemic symbols to be synthesized from the unit dictionary memory. A synthesis unit decoder selects the code vector corresponding to the index in the synthesis unit from the speech source signal codebook, and shifts the code vector according to the shift number in the synthesis unit.
摘要翻译：语音合成装置通过合成滤波器对语音源信号进行滤波来合成语音信号。语音源信号码本存储多个语音源信号作为码矢量。单元字典存储器存储对应于音素符号的多个合成单元，每个合成单元包括语音源码本中的码矢量的索引和用于解码语音源信号的码矢量的移位号。单元选择部从单位字典存储器中选择与要合成的音素符号对应的合成单位。合成单元解码器从语音源信号码本中选择与合成单元中的索引相对应的码矢量，并根据合成单元中的移位号移位码矢量。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式