Quick Patent Search: fast search of global patents, free commercial patent database (IPRDB)

1. Patent application

US20130080155A1 APPARATUS AND METHOD FOR CREATING DICTIONARY FOR SPEECH SYNTHESIS (in force)
Publication number: US20130080155A1
Publication date: 2013-03-28
Application number: US13535782
Filing date: 2012-06-28
Applicants: Kentaro Tachibana, Masahiro Morita, Takehiko Kagoshima
Inventors: Kentaro Tachibana, Masahiro Morita, Takehiko Kagoshima
IPC classification: G06F17/21
CPC classification: G10L13/02, G10L13/06, G10L25/60
Abstract: Apparatus for creating a dictionary for speech synthesis includes a sentence storage unit configured to store N sentences, a sentence display unit configured to selectively display a first sentence which is one of the N sentences, a recording unit configured to record each user speech, a necessity determination unit configured to make a determination of whether to create the dictionary, a dictionary creation unit configured to create the dictionary by utilizing the user speech, and a speech synthesis unit configured to convert a second sentence to a synthesized speech with the dictionary. The determination unit makes the determination under a condition that the recording unit records the user speech of M first sentences (M is less than N) and the determination is based on at least one of an instruction from the user, M and an amount of the recorded user speech.
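
The necessity determination described in this abstract can be illustrated with a minimal sketch. It assumes the three criteria (user instruction, M, amount of recorded speech) are combined with a simple "any of" rule; the names `RecordingState` and `should_create_dictionary` and both threshold values are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class RecordingState:
    total_sentences: int          # N: sentences held by the sentence storage unit
    recorded_sentences: int       # M: sentences the user has recorded so far
    recorded_seconds: float       # amount of recorded user speech
    user_requested: bool = False  # explicit instruction from the user


def should_create_dictionary(state, min_sentences=50, min_seconds=300.0):
    """Decide whether to build the dictionary, possibly before all N sentences exist.

    The thresholds are illustrative assumptions, not values from the patent.
    """
    if state.recorded_sentences == 0:
        return False  # nothing recorded yet, so there is nothing to build from
    if state.recorded_sentences >= state.total_sentences:
        return True   # every stored sentence has been recorded
    # M < N: decide from at least one of instruction, M, or amount of speech.
    return (
        state.user_requested
        or state.recorded_sentences >= min_sentences
        or state.recorded_seconds >= min_seconds
    )
```

With this rule a user who has recorded only a few sentences can still request a dictionary early, while an unattended session builds one once enough material has accumulated.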

2. Granted patent

US09129596B2 Apparatus and method for creating dictionary for speech synthesis utilizing a display to aid in assessing synthesis quality (in force)
Publication number: US09129596B2
Publication date: 2015-09-08
Application number: US13535782
Filing date: 2012-06-28
Applicants: Kentaro Tachibana, Masahiro Morita, Takehiko Kagoshima
Inventors: Kentaro Tachibana, Masahiro Morita, Takehiko Kagoshima
IPC classification: G06F17/21, G10L13/00, G10L13/02, G10L13/06, G10L25/60
CPC classification: G10L13/02, G10L13/06, G10L25/60
Abstract: Apparatus for creating a dictionary for speech synthesis includes a sentence storage unit configured to store N sentences, a sentence display unit configured to selectively display a first sentence which is one of the N sentences, a recording unit configured to record each user speech, a necessity determination unit configured to make a determination of whether to create the dictionary, a dictionary creation unit configured to create the dictionary by utilizing the user speech, and a speech synthesis unit configured to convert a second sentence to a synthesized speech with the dictionary. The display unit is configured to stop displaying the currently displayed sentence according to an evaluation of a quality of its synthesis. The determination unit makes the determination under a condition that the recording unit records the user speech of M first sentences (M is less than N) and the determination is based on at least one of an instruction from the user, M and an amount of the recorded user speech.

3. Patent application

US20120239390A1 APPARATUS AND METHOD FOR SUPPORTING READING OF DOCUMENT, AND COMPUTER READABLE MEDIUM (in force)
Publication number: US20120239390A1
Publication date: 2012-09-20
Application number: US13232478
Filing date: 2011-09-14
Applicants: Kosei Fume, Masaru Suzuki, Masahiro Morita, Kentaro Tachibana, Kouichirou Mori, Yuji Shimizu, Takehiko Kagoshima, Masatsune Tamura, Tomohiro Yamasaki
Inventors: Kosei Fume, Masaru Suzuki, Masahiro Morita, Kentaro Tachibana, Kouichirou Mori, Yuji Shimizu, Takehiko Kagoshima, Masatsune Tamura, Tomohiro Yamasaki
IPC classification: G10L19/00
CPC classification: G10L13/10, G10L13/08, G10L25/63
Abstract: According to one embodiment, an apparatus for supporting reading of a document includes a model storage unit, a document acquisition unit, a feature information extraction unit, and an utterance style estimation unit. The model storage unit is configured to store a model which has trained a correspondence relationship between first feature information and an utterance style. The first feature information is extracted from a plurality of sentences in a training document. The document acquisition unit is configured to acquire a document to be read. The feature information extraction unit is configured to extract second feature information from each sentence in the document to be read. The utterance style estimation unit is configured to compare the second feature information of a plurality of sentences in the document to be read with the model, and to estimate an utterance style of each sentence of the document to be read.

4. Granted patent

US09280967B2 Apparatus and method for estimating utterance style of each sentence in documents, and non-transitory computer readable medium thereof (in force)
Publication number: US09280967B2
Publication date: 2016-03-08
Application number: US13232478
Filing date: 2011-09-14
Applicants: Kosei Fume, Masaru Suzuki, Masahiro Morita, Kentaro Tachibana, Kouichirou Mori, Yuji Shimizu, Takehiko Kagoshima, Masatsune Tamura, Tomohiro Yamasaki
Inventors: Kosei Fume, Masaru Suzuki, Masahiro Morita, Kentaro Tachibana, Kouichirou Mori, Yuji Shimizu, Takehiko Kagoshima, Masatsune Tamura, Tomohiro Yamasaki
IPC classification: G10L13/08, G10L13/10, G10L25/63
CPC classification: G10L13/10, G10L13/08, G10L25/63
Abstract: According to one embodiment, an apparatus for supporting reading of a document includes a model storage unit, a document acquisition unit, a feature information extraction unit, and an utterance style estimation unit. The model storage unit is configured to store a model which has trained a correspondence relationship between first feature information and an utterance style. The first feature information is extracted from a plurality of sentences in a training document. The document acquisition unit is configured to acquire a document to be read. The feature information extraction unit is configured to extract second feature information from each sentence in the document to be read. The utterance style estimation unit is configured to compare the second feature information of a plurality of sentences in the document to be read with the model, and to estimate an utterance style of each sentence of the document to be read.

5. Granted patent

US08655664B2 Text presentation apparatus, text presentation method, and computer program product (in force)
Publication number: US08655664B2
Publication date: 2014-02-18
Application number: US13207575
Filing date: 2011-08-11
Applicants: Kentaro Tachibana, Gou Hirabayashi, Takehiko Kagoshima
Inventors: Kentaro Tachibana, Gou Hirabayashi, Takehiko Kagoshima
IPC classification: G10L13/00, G10L15/26, G10L15/00, G10L15/06, G10L15/16, G06F17/20, G06F17/27, G06F17/21, G10L13/08, G10L21/00, G10L25/00
CPC classification: G10L13/08
Abstract: According to an embodiment, a text presentation apparatus presenting text for a speaker to read aloud for voice recording includes: a text storing unit for storing first text; a presenting unit for presenting the first text; a determination unit for determining whether or not the first text needs to be replaced, on the basis of a speaker's input for the first text presented; a preliminary text storing unit for storing preliminary text; a select unit configured to select, if it is determined that the first text needs to be replaced, second text to replace the first text from among the preliminary text, the selecting being performed on the basis of attribute information describing an attribute of the first text and on the basis of at least one of attribute information describing pronunciation of the first text and attribute information describing a stress type of the first text; and a control unit configured to control the presenting unit so that the presenting unit presents the second text.
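
As a rough illustration of the select unit in this abstract, the sketch below picks the preliminary text whose pronunciation and stress-type attributes best match the text being replaced. The dictionary keys (`text`, `phonemes`, `stress_type`), the function name `select_replacement`, and the scoring rule are all assumptions made for the example; the patent does not specify them here.

```python
def select_replacement(first, preliminary):
    """Return the preliminary text that best covers the attributes of `first`.

    Each text is a dict with hypothetical keys:
      "text"        - the sentence shown to the speaker
      "phonemes"    - pronunciation attribute, as a list of phoneme labels
      "stress_type" - stress (accent) type attribute, as an integer label
    Returns None when no preliminary text is available.
    """
    def score(candidate):
        # Number of distinct phonemes shared with the text being replaced.
        shared = len(set(first["phonemes"]) & set(candidate["phonemes"]))
        # 1 if the stress type matches, else 0; used only to break ties.
        same_stress = int(first["stress_type"] == candidate["stress_type"])
        return (shared, same_stress)

    return max(preliminary, key=score, default=None)
```

Ranking by shared phonemes first and stress type second is one possible policy; a recording tool could equally weight the two attributes or require an exact stress-type match.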

6. Patent application

US20120065981A1 TEXT PRESENTATION APPARATUS, TEXT PRESENTATION METHOD, AND COMPUTER PROGRAM PRODUCT (in force)
Publication number: US20120065981A1
Publication date: 2012-03-15
Application number: US13207575
Filing date: 2011-08-11
Applicants: Kentaro Tachibana, Gou Hirabayashi, Takehiko Kagoshima
Inventors: Kentaro Tachibana, Gou Hirabayashi, Takehiko Kagoshima
IPC classification: G10L11/00
CPC classification: G10L13/08
Abstract: According to an embodiment, a text presentation apparatus presenting text for a speaker to read aloud for voice recording includes: a text storing unit for storing first text; a presenting unit for presenting the first text; a determination unit for determining whether or not the first text needs to be replaced, on the basis of a speaker's input for the first text presented; a preliminary text storing unit for storing preliminary text; a select unit configured to select, if it is determined that the first text needs to be replaced, second text to replace the first text from among the preliminary text, the selecting being performed on the basis of attribute information describing an attribute of the first text and on the basis of at least one of attribute information describing pronunciation of the first text and attribute information describing a stress type of the first text; and a control unit configured to control the presenting unit so that the presenting unit presents the second text.

7. Granted patent

US08438014B2 Separating speech waveforms into periodic and aperiodic components, using artificial waveform generated from pitch marks (in force)
Publication number: US08438014B2
Publication date: 2013-05-07
Application number: US13358702
Filing date: 2012-01-26
Applicants: Masahiro Morita, Javier Latorre, Takehiko Kagoshima
Inventors: Masahiro Morita, Javier Latorre, Takehiko Kagoshima
IPC classification: G10L11/06, G10L11/04
CPC classification: G10L25/93, G10L25/90
Abstract: According to one embodiment, in a speech processing device, an extractor windows a part of the speech signal and extracts a partial waveform. A calculator performs frequency analysis of the partial waveform to calculate a frequency spectrum. An estimator generates an artificial waveform that is a waveform according to an interval between the pitch marks for each harmonic component having a frequency that is a predetermined multiple of a fundamental frequency of the speech signal and estimates harmonic spectral features representing characteristics of the frequency spectrum of the harmonic component from each of the artificial waveforms. A separator separates the partial waveform into a periodic component produced from periodic vocal-fold vibration as an acoustic source and an aperiodic component produced from aperiodic acoustic sources other than the vocal-fold vibration by using the respective harmonic spectral features and the frequency spectrum of the partial waveform.

8. Granted patent

US08195464B2 Speech processing apparatus and program (lapsed)
Publication number: US08195464B2
Publication date: 2012-06-05
Application number: US12212759
Filing date: 2008-09-18
Applicants: Masahiro Morita, Takehiko Kagoshima
Inventors: Masahiro Morita, Takehiko Kagoshima
IPC classification: G01L13/00
CPC classification: G10L13/07
Abstract: A speech synthesizer includes a periodic component fusing unit and an aperiodic component fusing unit, and fuses periodic components and aperiodic components of a plurality of speech units for each segment, which are selected by a unit selector, by a periodic component fusing unit and an aperiodic component fusing unit, respectively. The speech synthesizer is further provided with an adder, so that the adder adds, edits, and concatenates the periodic components and the aperiodic components of the fused speech units to generate a speech waveform.

9. Patent application

US20090216537A1 SPEECH SYNTHESIS APPARATUS AND METHOD THEREOF (under examination, published)
Publication number: US20090216537A1
Publication date: 2009-08-27
Application number: US11570208
Filing date: 2006-10-19
Applicants: Osamu Nishiyama, Masahiro Morita, Takehiko Kagoshima
Inventors: Osamu Nishiyama, Masahiro Morita, Takehiko Kagoshima
IPC classification: G10L13/06
CPC classification: G10L13/04
Abstract: A speech synthesis apparatus includes a text obtaining device that obtains text data for speech synthesis from the outside, a language processor that carries out morphological analysis/parsing on the text data, a prosodic processor that outputs, to a speech synthesizer, a synthesis unit string based on the prosodic and language related attributes of the text data such as accents and word classes, the speech synthesizer that generates synthesized speech from the synthesis unit string, and a speech waveform output device that reproduces a prescribed amount of output synthesized speech after it is accumulated or sequentially as it is output.

10. Granted patent

US08108216B2 Speech synthesis system and speech synthesis method (in force)
Publication number: US08108216B2
Publication date: 2012-01-31
Application number: US12051104
Filing date: 2008-03-19
Applicants: Masahiro Morita, Takehiko Kagoshima
Inventors: Masahiro Morita, Takehiko Kagoshima
IPC classification: G10L13/06, G10L13/00
CPC classification: G10L13/07
Abstract: In a speech synthesis, a selecting unit selects one string from first speech unit strings corresponding to a first segment sequence obtained by dividing a phoneme string corresponding to target speech into segments. The selecting unit performs repeatedly generating, based on at most W second speech unit strings corresponding to a second segment sequence as a partial sequence of the first sequence, third speech unit strings corresponding to a third segment sequence obtained by adding a segment to the second sequence, and selecting at most W strings from the third strings based on an evaluation value of each of the third strings. The value is obtained by correcting a total cost of each third string candidate with a penalty coefficient for each of the third strings. The coefficient is based on a restriction concerning quickness of speech unit data acquisition, and depends on the extent to which the restriction is approached.
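
The selection loop in this abstract resembles a beam search of width W whose ranking score is a cost multiplied by a penalty. The sketch below is a simplified illustration under several assumptions that are not taken from the patent: the "restriction concerning quickness of speech unit data acquisition" is modelled as a cap `max_slow` on the number of units that live in slow storage, the penalty coefficient is the linear form `1 + alpha * slow_count / max_slow`, and the names `evaluation`, `select_units`, and `join_cost` are hypothetical.

```python
def evaluation(total_cost, slow_count, max_slow, alpha):
    """Total cost corrected by a penalty that grows as the slow-unit cap is approached."""
    return total_cost * (1.0 + alpha * slow_count / max_slow)


def select_units(candidates, join_cost, beam_width, max_slow, alpha=1.0):
    """Beam search over speech unit strings.

    candidates: one list per segment of (unit_id, target_cost, is_slow) tuples.
    join_cost:  function(prev_unit_id, unit_id) -> concatenation cost.
    beam_width: W, the maximum number of partial strings kept per step.
    max_slow:   assumed cap (> 0) on units fetched from slow storage.
    Returns the best unit string as a tuple, or None if no string satisfies the cap.
    """
    beam = [((), 0.0, 0)]  # (units so far, total cost, slow-unit count)
    for segment in candidates:
        expanded = []
        for units, cost, slow in beam:
            for unit_id, target_cost, is_slow in segment:
                new_slow = slow + int(is_slow)
                if new_slow > max_slow:
                    continue  # this extension would violate the restriction
                step = target_cost
                if units:
                    step += join_cost(units[-1], unit_id)
                expanded.append((units + (unit_id,), cost + step, new_slow))
        # Keep at most W strings, ranked by the penalised evaluation value.
        expanded.sort(key=lambda e: evaluation(e[1], e[2], max_slow, alpha))
        beam = expanded[:beam_width]
        if not beam:
            return None
    return beam[0][0]
```

Setting `alpha=0.0` turns the penalty off and reduces this to an ordinary cost-ranked beam search with a hard cap, which makes it easy to compare how the penalty shifts selection toward quickly accessible units.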
