Quick Patent Search - Search global patents quickly, free commercial patent database - IPRDB

1. Patent application

US20140136192A1 WAVEFORM PROCESSING DEVICE, WAVEFORM PROCESSING METHOD, AND WAVEFORM PROCESSING PROGRAM (status: in force)
Publication number: US20140136192A1
Publication date: 2014-05-15
Application number: US14131460
Filing date: 2012-06-26
Applicants: Masanori Kato, Reishi Kondo, Yasuyuki Mitsui
Inventors: Masanori Kato, Reishi Kondo, Yasuyuki Mitsui
IPC classification: G10L25/90
CPC classification: G10L25/90, G10L13/033, G10L13/06, G10L13/07
Abstract: There is provided a waveform processing device for changing the power of each pitch waveform of a segment in order to acquire natural synthesized speech. A power calculation means 71 selects pitch waveforms one by one from a group of pitch waveforms corresponding to a segment, and calculates a scalar indicating the power of the selected pitch waveform. A normalization degree calculation means 72 calculates a degree of normalization, which is an index indicating the degree of normalization of the pitch waveform selected by the power calculation means 71, as a function value of an increasing function using the scalar as a variable. A change coefficient calculation means 73 calculates a change coefficient for changing an amplitude value of the pitch waveform selected by the power calculation means 71 based on the scalar and the degree of normalization. An amplitude change means 74 multiplies the amplitude value at each sampling point of the pitch waveform selected by the power calculation means 71 by the change coefficient.
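
The four means named in the abstract above (power calculation, normalization degree, change coefficient, amplitude change) can be illustrated with a minimal Python sketch. The abstract does not give the formulas, so everything concrete here is an assumption made for illustration: RMS amplitude as the power scalar, a logistic curve as the increasing function, and the names `midpoint`, `steepness`, and `target_power` are all hypothetical.

```python
import math

def rms_power(waveform):
    """Scalar power of one pitch waveform (assumed here: RMS amplitude)."""
    if not waveform:
        return 0.0
    return math.sqrt(sum(x * x for x in waveform) / len(waveform))

def normalization_degree(power, midpoint=0.1, steepness=30.0):
    """Increasing function of the power scalar, in (0, 1).

    The logistic shape and its parameters are illustrative assumptions.
    """
    return 1.0 / (1.0 + math.exp(-steepness * (power - midpoint)))

def change_coefficient(power, degree, target_power=0.2):
    """Blend between 'unchanged' (gain 1) and 'fully normalized' gain."""
    if power == 0.0:
        return 1.0  # silent waveform: nothing to scale
    full_gain = target_power / power
    return (1.0 - degree) + degree * full_gain

def process_segment(pitch_waveforms):
    """Scale every sample of each pitch waveform by its own coefficient."""
    result = []
    for waveform in pitch_waveforms:
        power = rms_power(waveform)
        degree = normalization_degree(power)
        coeff = change_coefficient(power, degree)
        result.append([x * coeff for x in waveform])
    return result
```

Under these assumptions, a loud pitch waveform is pulled almost all the way to the target power, while a very quiet one is only slightly amplified, so low-level noise is not boosted to full level.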

2. Patent application

US20100076768A1 SPEECH SYNTHESIZING APPARATUS, METHOD, AND PROGRAM (status: in force)
Publication number: US20100076768A1
Publication date: 2010-03-25
Application number: US12527802
Filing date: 2008-02-15
Applicants: Masanori Kato, Reishi Kondo, Yasuyuki Mitsui
Inventors: Masanori Kato, Reishi Kondo, Yasuyuki Mitsui
IPC classification: G10L13/00
CPC classification: G10L13/06, G10L13/04
Abstract: Disclosed is a speech synthesizing apparatus including a segment selection unit that selects a segment suited to a target segment environment from candidate segments. The segment selection unit includes a prosody change amount calculation unit that calculates the prosody change amount of each candidate segment based on prosody information of the candidate segments and the target segment environment, a selection criterion calculation unit that calculates a selection criterion based on the prosody change amount, a candidate selection unit that narrows down selection candidates based on the prosody change amount and the selection criterion, and an optimum segment search unit that searches for an optimum segment from among the narrowed-down candidate segments.
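
As a rough illustration of the narrowing step in the abstract above, the following Python sketch computes a prosody change amount per candidate, derives a selection criterion from those amounts, and keeps only candidates that meet it. The abstract does not define these quantities, so the choices here are assumptions: the change amount is the absolute pitch ratio in octaves, the criterion is the `keep`-th smallest change amount, and the names `prosody_change_amount`, `narrow_candidates`, and `keep` are hypothetical.

```python
import math

def prosody_change_amount(candidate_f0, target_f0):
    """Assumed measure: how far pitch must be shifted, in octaves."""
    return abs(math.log2(target_f0 / candidate_f0))

def narrow_candidates(candidates, target_f0, keep=2):
    """Keep candidates whose change amount is within the criterion.

    candidates: list of (name, f0_in_hz) pairs.
    The criterion is the keep-th smallest change amount (an assumption).
    """
    if not candidates:
        return []
    amounts = [prosody_change_amount(f0, target_f0) for _, f0 in candidates]
    criterion = sorted(amounts)[min(keep, len(amounts)) - 1]
    return [c for c, a in zip(candidates, amounts) if a <= criterion]

candidates = [("a", 100.0), ("b", 200.0), ("c", 120.0), ("d", 60.0)]
narrowed = narrow_candidates(candidates, target_f0=110.0, keep=2)
print([name for name, _ in narrowed])
```

An optimum segment search would then run only over `narrowed`, which is where the reduction in computation comes from.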

3. Granted patent

US09324316B2 Prosody generator, speech synthesizer, prosody generating method and prosody generating program (status: in force)
Publication number: US09324316B2
Publication date: 2016-04-26
Application number: US14004148
Filing date: 2012-05-10
Applicants: Yasuyuki Mitsui, Reishi Kondo, Masanori Kato
Inventors: Yasuyuki Mitsui, Reishi Kondo, Masanori Kato
IPC classification: G10L13/00, G10L13/027, G10L13/10
CPC classification: G10L13/027, G10L13/10
Abstract: There is provided a prosody generator that generates prosody information for implementing highly natural speech synthesis without unnecessarily collecting large quantities of learning data. A data dividing means 81 divides into subspaces the data space of a learning database as an assembly of learning data indicative of the feature quantities of speech waveforms. A density information extracting means 82 extracts density information indicative of the density state in terms of information quantity of the learning data in each of the subspaces divided by the data dividing means 81. A prosody information generating method selecting means 83 selects either a first method or a second method as a prosody information generating method based on the density information, the first method involving generating the prosody information using a statistical technique, the second method involving generating the prosody information using rules based on heuristics.
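
The density-based choice between the two generation methods described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the data space is one-dimensional, subspaces are fixed-width bins, density is a plain sample count, and `bin_width`, `min_count`, and both function names are hypothetical rather than taken from the patent.

```python
from collections import Counter

def divide_into_subspaces(features, bin_width):
    """Count learning samples per fixed-width bin of a 1-D feature."""
    return Counter(int(f // bin_width) for f in features)

def select_method(density, subspace, min_count=3):
    """Use the statistical method only where learning data is dense."""
    if density.get(subspace, 0) >= min_count:
        return "statistical"
    return "rule_based"

# Hypothetical 1-D feature values of the learning data.
features = [1.0, 1.2, 1.4, 1.9, 5.5]
density = divide_into_subspaces(features, bin_width=1.0)
print(select_method(density, 1))  # dense bin
print(select_method(density, 5))  # sparse bin
```

The design point is that sparse regions fall back to heuristic rules instead of requiring more learning data to be collected for them.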

4. Granted patent

US08407054B2 Speech synthesis device, speech synthesis method, and speech synthesis program (status: in force)
Publication number: US08407054B2
Publication date: 2013-03-26
Application number: US12599317
Filing date: 2008-04-28
Applicants: Masanori Kato, Yasuyuki Mitsui, Reishi Kondo
Inventors: Masanori Kato, Yasuyuki Mitsui, Reishi Kondo
IPC classification: G10L13/06
CPC classification: G10L13/10
Abstract: A speech synthesis device is provided with: a central segment selection unit for selecting a central segment from among a plurality of speech segments; a prosody generation unit for generating prosody information based on the central segment; a non-central segment selection unit for selecting a non-central segment, which is a segment outside of a central segment section, based on the central segment and the prosody information; and a waveform generation unit for generating a synthesized speech waveform based on the prosody information, the central segment, and the non-central segment. The speech synthesis device first selects a central segment that forms a basis for prosody generation and generates prosody information based on the central segment, so that it is possible to sufficiently reduce both concatenation distortion and sound quality degradation accompanying prosody control in the section of the central segment.

5. Patent application

US20100211393A1 SPEECH SYNTHESIS DEVICE, SPEECH SYNTHESIS METHOD, AND SPEECH SYNTHESIS PROGRAM (status: in force)
Publication number: US20100211393A1
Publication date: 2010-08-19
Application number: US12599317
Filing date: 2008-04-28
Applicants: Masanori Kato, Yasuyuki Mitsui, Reishi Kondo
Inventors: Masanori Kato, Yasuyuki Mitsui, Reishi Kondo
IPC classification: G10L13/06, G10L13/08
CPC classification: G10L13/10
Abstract: A speech synthesis device is provided with: a central segment selection unit for selecting a central segment from among a plurality of speech segments; a prosody generation unit for generating prosody information based on the central segment; a non-central segment selection unit for selecting a non-central segment, which is a segment outside of a central segment section, based on the central segment and the prosody information; and a waveform generation unit for generating a synthesized speech waveform based on the prosody information, the central segment, and the non-central segment. The speech synthesis device first selects a central segment that forms a basis for prosody generation and generates prosody information based on the central segment, so that it is possible to sufficiently reduce both concatenation distortion and sound quality degradation accompanying prosody control in the section of the central segment.

6. Granted patent

US09520125B2 Speech synthesis device, speech synthesis method, and speech synthesis program (status: in force)
Publication number: US09520125B2
Publication date: 2016-12-13
Application number: US14131409
Filing date: 2012-06-08
Applicants: Yasuyuki Mitsui, Masanori Kato, Reishi Kondo
Inventors: Yasuyuki Mitsui, Masanori Kato, Reishi Kondo
IPC classification: G10L13/00, G10L13/06, G10L13/08, G10L21/00, G10L13/04, G10L15/08, G10L13/10
CPC classification: G10L15/08, G10L13/08, G10L2013/105
Abstract: There are provided a speech synthesis device, a speech synthesis method, and a speech synthesis program which can represent a phoneme with a duration shorter than the duration used when modeling according to a statistical method. A speech synthesis device 80 according to the present invention includes a phoneme boundary updating means 81 which, by using a voiced utterance likelihood index, which is an index indicating the degree of voiced utterance likelihood of each state representing a phoneme modeled by a statistical method, updates a phoneme boundary position, which is a boundary with other phonemes neighboring the phoneme.

7. Patent application

US20140149116A1 SPEECH SYNTHESIS DEVICE, SPEECH SYNTHESIS METHOD, AND SPEECH SYNTHESIS PROGRAM (status: in force)
Publication number: US20140149116A1
Publication date: 2014-05-29
Application number: US14131409
Filing date: 2012-06-08
Applicants: Yasuyuki Mitsui, Masanori Kato, Reishi Kondo
Inventors: Yasuyuki Mitsui, Masanori Kato, Reishi Kondo
IPC classification: G10L15/08
CPC classification: G10L15/08, G10L13/08, G10L2013/105
Abstract: There are provided a speech synthesis device, a speech synthesis method, and a speech synthesis program which can represent a phoneme with a duration shorter than the duration used when modeling according to a statistical method. A speech synthesis device 80 according to the present invention includes a phoneme boundary updating means 81 which, by using a voiced utterance likelihood index, which is an index indicating the degree of voiced utterance likelihood of each state representing a phoneme modeled by a statistical method, updates a phoneme boundary position, which is a boundary with other phonemes neighboring the phoneme.

8. Granted patent

US08620663B2 Speech synthesis system for generating speech information obtained by converting text into speech (status: in force)
Publication number: US08620663B2
Publication date: 2013-12-31
Application number: US13000340
Filing date: 2009-06-22
Applicants: Reishi Kondo, Masanori Kato, Yasuyuki Mitsui
Inventors: Reishi Kondo, Masanori Kato, Yasuyuki Mitsui
IPC classification: G10L13/08
CPC classification: G10L13/08, G10L15/30, G10L2015/025
Abstract: A speech synthesis system includes a server device and a client device. The server device stores speech element information and speech element identification information in association with each other so that, in a case that speech element information representing respective speech elements included in speech uttered by a speech registering user are arranged in the order of arrangement of the speech elements in the speech, at least one of speech element identification information identifying the respective speech element information has different information from information arranged in accordance with a predetermined rule. The client device transmits speech element identification information to the server device based on accepted text information. The client device executes a speech synthesis process based on the speech element information received from the server device.

9. Patent application

US20110137655A1 SPEECH SYNTHESIS SYSTEM (status: in force)
Publication number: US20110137655A1
Publication date: 2011-06-09
Application number: US13000340
Filing date: 2009-06-22
Applicants: Reishi Kondo, Masanori Kato, Yasuyuki Mitsui
Inventors: Reishi Kondo, Masanori Kato, Yasuyuki Mitsui
IPC classification: G10L13/00
CPC classification: G10L13/08, G10L15/30, G10L2015/025
Abstract: A speech synthesis system includes a server device and a client device. The server device stores speech element information and speech element identification information in association with each other so that, in a case that speech element information representing respective speech elements included in speech uttered by a speech registering user are arranged in the order of arrangement of the speech elements in the speech, at least one of speech element identification information identifying the respective speech element information has different information from information arranged in accordance with a predetermined rule. The client device transmits speech element identification information to the server device based on accepted text information. The client device executes a speech synthesis process based on the speech element information received from the server device.

10. Patent application

US20100305949A1 SPEECH SYNTHESIS DEVICE, SPEECH SYNTHESIS METHOD, AND SPEECH SYNTHESIS PROGRAM (status: under examination, published)
Publication number: US20100305949A1
Publication date: 2010-12-02
Application number: US12744807
Filing date: 2008-11-25
Applicants: Masanori Kato, Yasuyuki Mitsui, Reishi Kondo
Inventors: Masanori Kato, Yasuyuki Mitsui, Reishi Kondo
IPC classification: G10L13/06
CPC classification: G10L13/04, G10L13/06
Abstract: It is possible to provide a speech synthesis device, speech synthesis method, and speech synthesis program which can improve speech quality and reduce the amount of calculation with a preferable balance between them. The speech synthesis device includes: a sub-score calculation unit (60/65) which calculates a segment selection sub-score for selecting an optimal segment; and a candidate narrowing unit (70/73) for narrowing the candidates according to the number of candidate segments and the segment selection sub-score. The speech synthesis device performs candidate narrowing by the sub-score calculation unit (60/65) and the candidate narrowing unit (70/73) in the candidate selection process when generating synthesized speech from an input text.
