会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • APPARATUS AND METHOD FOR CREATING DICTIONARY FOR SPEECH SYNTHESIS
    • 用于创建语音合成词典的装置和方法
    • US20130080155A1
    • 2013-03-28
    • US13535782
    • 2012-06-28
    • Kentaro TachibanaMasahiro MoritaTakehiko Kagoshima
    • Kentaro TachibanaMasahiro MoritaTakehiko Kagoshima
    • G06F17/21
    • G10L13/02G10L13/06G10L25/60
    • Apparatus for creating a dictionary for speech synthesis includes a sentence storage unit configured to store N sentences, a sentence display unit configured to selectively display a first sentence which is one of the N sentences, a recording unit configured to record each user speech, a necessity determination unit configured to make a determination of whether to create the dictionary, a dictionary creation unit configured to create the dictionary by utilizing the user speech, and a speech synthesis unit configured to convert a second sentence to a synthesized speech with the dictionary. The determination unit makes the determination under a condition that the recording unit records the user speech of M first sentences (M is less than N) and the determination is based on at least one of an instruction from the user, M and an amount of the recorded user speech.
    • 用于创建用于语音合成的字典的装置包括被配置为存储N个句子的句子存储单元,被配置为选择性地显示作为N个句子之一的第一句子的句子显示单元,被配置为记录每个用户语音的记录单元 确定单元,被配置为确定是否创建字典,字典创建单元,被配置为通过利用用户语音来创建字典;以及语音合成单元,被配置为将第二句子转换成具有字典的合成语音。 确定单元在记录单元记录M个第一句子(M小于N)的用户语音的条件下进行确定,并且该确定基于来自用户的指令M和 记录用户言语。
    • 2. 发明授权
    • Apparatus and method for creating dictionary for speech synthesis utilizing a display to aid in assessing synthesis quality
    • 用于使用显示器创建用于语音合成的词典的装置和方法,以帮助评估合成质量
    • US09129596B2
    • 2015-09-08
    • US13535782
    • 2012-06-28
    • Kentaro TachibanaMasahiro MoritaTakehiko Kagoshima
    • Kentaro TachibanaMasahiro MoritaTakehiko Kagoshima
    • G06F17/21G10L13/00G10L13/02G10L13/06G10L25/60
    • G10L13/02G10L13/06G10L25/60
    • Apparatus for creating a dictionary for speech synthesis includes a sentence storage unit configured to store N sentences, a sentence display unit configured to selectively display a first sentence which is one of the N sentences, a recording unit configured to record each user speech, a necessity determination unit configured to make a determination of whether to create the dictionary, a dictionary creation unit configured to create the dictionary by utilizing the user speech, and a speech synthesis unit configured to convert a second sentence to a synthesized speech with the dictionary. The display unit is configured to stop displaying the currently displayed sentence according to an evaluation of a quality of its synthesis. The determination unit makes the determination under a condition that the recording unit records the user speech of M first sentences (M is less than N) and the determination is based on at least one of an instruction from the user, M and an amount of the recorded user speech.
    • 用于创建用于语音合成的字典的装置包括被配置为存储N个句子的句子存储单元,被配置为选择性地显示作为N个句子之一的第一句子的句子显示单元,被配置为记录每个用户语音的记录单元 确定单元,被配置为确定是否创建字典,字典创建单元,被配置为通过利用用户语音来创建字典;以及语音合成单元,被配置为将第二句子转换成具有字典的合成语音。 显示单元被配置为根据其合成的质量的评估来停止显示当前显示的句子。 确定单元在记录单元记录M个第一句子(M小于N)的用户语音的条件下进行确定,并且该确定基于来自用户的指令M和 记录用户言语。
    • 5. 发明授权
    • Text presentation apparatus, text presentation method, and computer program product
    • 文本呈现装置,文本呈现方法和计算机程序产品
    • US08655664B2
    • 2014-02-18
    • US13207575
    • 2011-08-11
    • Kentaro TachibanaGou HirabayashiTakehiko Kagoshima
    • Kentaro TachibanaGou HirabayashiTakehiko Kagoshima
    • G10L13/00G10L15/26G10L15/00G10L15/06G10L15/16G06F17/20G06F17/27G06F17/21G10L13/08G10L21/00G10L25/00
    • G10L13/08
    • According to an embodiment, a text presentation apparatus presenting text for a speaker to read aloud for voice recording includes: a text storing unit for storing first text; a presenting unit for presenting the first text; a determination unit for determining whether or not the first text needs to be replaced, on the basis of a speaker's input for the first text presented; a preliminary text storing unit for storing preliminary text; a select unit configured to select, if it is determined that the first text needs to be replaced, second text to replace the first text from among the preliminary text, the selecting being performed on the basis of attribute information describing an attribute of the first text and on the basis of at least one of attribute information describing pronunciation of the first text and attribute information describing a stress type of the first text; and a control unit configured to control the presenting unit so that the presenting unit presents the second text.
    • 根据一个实施例,呈现用于语音录音的扬声器的文本的文本呈现装置包括:文本存储单元,用于存储第一文本; 用于呈现第一文本的呈现单元; 确定单元,用于基于所呈现的第一文本的说话者的输入来确定是否需要替换第一文本; 用于存储初步文本的初步文本存储单元; 选择单元,其被配置为:如果确定需要替换所述第一文本,则从所述初步文本中选择替换所述第一文本的第二文本,所述选择是基于描述所述第一文本的属性的属性信息执行的 并且基于描述第一文本的发音的属性信息和描述第一文本的应力类型的属性信息中的至少一个; 以及控制单元,被配置为控制所述呈现单元,使得所述呈现单元呈现所述第二文本。
    • 6. 发明申请
    • TEXT PRESENTATION APPARATUS, TEXT PRESENTATION METHOD, AND COMPUTER PROGRAM PRODUCT
    • 文本陈述装置,文本介绍方法和计算机程序产品
    • US20120065981A1
    • 2012-03-15
    • US13207575
    • 2011-08-11
    • Kentaro TachibanaGou HirabayashiTakehiko Kagoshima
    • Kentaro TachibanaGou HirabayashiTakehiko Kagoshima
    • G10L11/00
    • G10L13/08
    • According to an embodiment, a text presentation apparatus presenting text for a speaker to read aloud for voice recording includes: a text storing unit for storing first text; a presenting unit for presenting the first text; a determination unit for determining whether or not the first text needs to be replaced, on the basis of a speaker's input for the first text presented; a preliminary text storing unit for storing preliminary text; a select unit configured to select, if it is determined that the first text needs to be replaced, second text to replace the first text from among the preliminary text, the selecting being performed on the basis of attribute information describing an attribute of the first text and on the basis of at least one of attribute information describing pronunciation of the first text and attribute information describing a stress type of the first text; and a control unit configured to control the presenting unit so that the presenting unit presents the second text.
    • 根据一个实施例,呈现用于语音录音的扬声器的文本的文本呈现装置包括:文本存储单元,用于存储第一文本; 用于呈现第一文本的呈现单元; 确定单元,用于基于所呈现的第一文本的说话者的输入来确定是否需要替换第一文本; 用于存储初步文本的初步文本存储单元; 选择单元,其被配置为:如果确定需要替换所述第一文本,则从所述初步文本中选择替换所述第一文本的第二文本,所述选择是基于描述所述第一文本的属性的属性信息执行的 并且基于描述第一文本的发音的属性信息和描述第一文本的应力类型的属性信息中的至少一个; 以及控制单元,被配置为控制所述呈现单元,使得所述呈现单元呈现所述第二文本。
    • 7. 发明授权
    • Separating speech waveforms into periodic and aperiodic components, using artificial waveform generated from pitch marks
    • 将语音波形分为周期性和非周期性分量,使用由间距标记产生的人造波形
    • US08438014B2
    • 2013-05-07
    • US13358702
    • 2012-01-26
    • Masahiro MoritaJavier LatorreTakehiko Kagoshima
    • Masahiro MoritaJavier LatorreTakehiko Kagoshima
    • G10L11/06G10L11/04
    • G10L25/93G10L25/90
    • According to one embodiment, in a speech processing device, an extractor windows a part of the speech signal and extracts a partial waveform. A calculator performs frequency analysis of the partial waveform to calculate a frequency spectrum. An estimator generates an artificial waveform that is a waveform according to an interval between the pitch marks for each harmonic component having a frequency that is a predetermined multiple of a fundamental frequency of the speech signal and estimates harmonic spectral features representing characteristics of the frequency spectrum of the harmonic component from each of the artificial waveforms. A separator separates the partial waveform into a periodic component produced from periodic vocal-fold vibration as an acoustic source and an aperiodic component produced from aperiodic acoustic sources other than the vocal-fold vibration by using the respective harmonic spectral features and the frequency spectrum of the partial waveform.
    • 根据一个实施例,在语音处理设备中,提取器对一部分语音信号进行窗口并提取部分波形。 计算器执行部分波形的频率分析以计算频谱。 估计器产生人造波形,其是根据具有作为语音信号的基频的预定倍数的频率的每个谐波分量的音调标记之间的间隔的波形,并且估计表示频率的频谱特性的谐波谱特征 来自每个人造波形的谐波分量。 分离器将部分波形分离为由周期性声带振动产生的周期分量,作为声源,并且通过使用相应的谐波频谱特征和频谱的频谱,从除声带之外的非周期声源产生的非周期分量 部分波形。
    • 10. 发明授权
    • Speech synthesis system and speech synthesis method
    • 语音合成系统和语音合成方法
    • US08108216B2
    • 2012-01-31
    • US12051104
    • 2008-03-19
    • Masahiro MoritaTakehiko Kagoshima
    • Masahiro MoritaTakehiko Kagoshima
    • G10L13/06G10L13/00
    • G10L13/07
    • In a speech synthesis, a selecting unit selects one string from first speech unit strings corresponding to a first segment sequence obtained by dividing a phoneme string corresponding to target speech into segments. The selecting unit performs repeatedly generating, based on maximum W second speech unit strings corresponding to a second segment sequence as a partial sequence of the first sequence, third speech unit strings corresponding to a third segment sequence obtained by adding a segment to the second sequence, and selecting maximum W strings from the third strings based on a evaluation value of each of the third strings. The value is obtained by correcting a total cost of each of the third string candidate with a penalty coefficient for each of the third strings. The coefficient is based on a restriction concerning quickness of speech unit data acquisition, and depends on extent in which the restriction is approached.
    • 在语音合成中,选择单元从与通过将对应于目标语音的音素串分割成段而获得的第一段序列相对应的第一语音单元串中选择一个字符串。 选择单元基于与作为第一序列的部分序列的第二片段序列相对应的最大W第二语音单元串重复地生成与通过将第二序列相加而获得的第三片段序列相对应的第三语音单元串, 以及基于每个第三串的评估值从第三串中选择最大W字符串。 该值通过用第三串中的每一个的罚分系数校正第三串候选者的总成本来获得。 该系数基于对语音单元数据采集的快速性的限制,并且取决于接近限制的程度。