    • 31. Invention application
    • SPEECH SYNTHESIS SYSTEM, SPEECH SYNTHESIS PROGRAM PRODUCT, AND SPEECH SYNTHESIS METHOD
    • US20090070115A1
    • 2009-03-12
    • US12192510
    • 2008-08-15
    • Ryuki Tachibana; Masafumi Nishimura
    • G10L13/08
    • G10L13/00; G10L13/07; G10L13/10
    • It is an objective of the present invention to provide waveform concatenation speech synthesis with high sound quality utilizing its advantages in the case where there is a large quantity of speech segments while providing waveform concatenation speech synthesis with accurate accents in other cases. Prosody with both high accuracy and high sound quality is achieved by performing a two-path search including a speech segment search and a prosody modification value search. In the preferred embodiment of the present invention, an accurate accent is secured by evaluating the consistency of the prosody by using a statistical model of prosody variations (the slope of fundamental frequency) for both of two paths of the speech segment selection and the modification value search. In the prosody modification value search, a prosody modification value sequence that minimizes a modified prosody cost is searched for. This allows a search for a modification value sequence that can increase the likelihood of absolute values or variations of the prosody to the statistical model as high as possible with minimum modification values.
    • 33. Invention application
    • METHODS AND APPARATUS FOR NATURAL SPOKEN LANGUAGE SPEECH RECOGNITION
    • US20080221872A1
    • 2008-09-11
    • US12045198
    • 2008-03-10
    • Shinsuke Mori; Masafumi Nishimura; Nobuyasu Itoh
    • G06F17/27
    • G10L15/19
    • A word prediction method and apparatus improves precision and accuracy. For the prediction of a sixth word “?”, a partial analysis tree having a modification relationship with the sixth word is predicted. “sara-ni sho-senkyoku no” has two partial analysis trees, “sara-ni” and “sho-senkyoku no”. It is predicted that “sara-ni” does not have a modification relationship with the sixth word, and that “sho-senkyoku no” does. Then, “donyu”, which is the sixth word from “sho-senkyoku no”, is predicted. In this example, since “sara-ni” is not useful information for the prediction of “donyu”, it is preferable that “donyu” be predicted only by “sho-senkyoku no”.
    • 34. Invention application
    • System And Method For Supporting Text-To-Speech
    • US20080046247A1
    • 2008-02-21
    • US11774798
    • 2007-07-09
    • Gakuto Kurata; Toru Nagano; Masafumi Nishimura; Ryuki Tachibana
    • G10L13/00
    • G10L13/04; G10L15/26
    • A system for generating high-quality synthesized text-to-speech includes a learning data generating unit, a frequency data generating unit, and a setting unit. The learning data generating unit recognizes inputted speech, and then generates first learning data in which wordings of phrases are associated with readings thereof. The frequency data generating unit generates, based on the first learning data, frequency data indicating appearance frequencies of both wordings and readings of phrases. The setting unit sets the thus generated frequency data for a language processing unit in order to approximate outputted speech of text-to-speech to the inputted speech. Furthermore, the language processing unit generates, from a wording of text, a reading corresponding to the wording, on the basis of the appearance frequencies.
    • 37. Granted patent
    • Recognizing speech, and processing data
    • US08150687B2
    • 2012-04-03
    • US11000165
    • 2004-11-30
    • Shinsuke Mori; Nobuyasu Itoh; Masafumi Nishimura
    • G06F17/27; G10L15/00; G10L15/16
    • G10L15/26
    • An example embodiment of the invention includes a speech recognition processing unit for specifying speech segments for speech data, recognizing a speech in each of the speech segments, and associating a character string of obtained recognition data with the speech data for each speech segment, based on information on a time of the speech, and an output control unit for displaying/outputting the text prepared by sorting the recognition data in each speech segment. Sometimes, the system further includes a text editing unit for editing the prepared text, and a speech correspondence estimation unit for associating a character string in the edited text with the speech data by using a technique of dynamic programming.
    • 39. Granted patent
    • System and method for supporting text-to-speech
    • US07921014B2
    • 2011-04-05
    • US11774798
    • 2007-07-09
    • Gakuto Kurata; Toru Nagano; Masafumi Nishimura; Ryuki Tachibana
    • G10L13/00
    • G10L13/04; G10L15/26
    • A system for generating high-quality synthesized text-to-speech includes a learning data generating unit, a frequency data generating unit, and a setting unit. The learning data generating unit recognizes inputted speech, and then generates first learning data in which wordings of phrases are associated with readings thereof. The frequency data generating unit generates, based on the first learning data, frequency data indicating appearance frequencies of both wordings and readings of phrases. The setting unit sets the thus generated frequency data for a language processing unit in order to approximate outputted speech of text-to-speech to the inputted speech. Furthermore, the language processing unit generates, from a wording of text, a reading corresponding to the wording, on the basis of the appearance frequencies.
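The frequency-based reading selection described in the text-to-speech abstracts (entries 34 and 39) can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names `build_frequency_data` and `choose_reading`, the romanized sample readings, and the rule of picking the single most frequent reading are all assumptions made for illustration; the abstract only says that readings are generated "on the basis of the appearance frequencies".

```python
from collections import Counter, defaultdict


def build_frequency_data(learning_data):
    """Count how often each reading appears for each wording.

    learning_data is an iterable of (wording, reading) pairs, standing in
    for the "first learning data" produced from recognized speech.
    """
    freq = defaultdict(Counter)
    for wording, reading in learning_data:
        freq[wording][reading] += 1
    return freq


def choose_reading(freq, wording):
    """Return the most frequent reading for a wording, or None if unseen.

    Assumption: the most frequent reading wins; the abstract does not
    state the exact selection rule.
    """
    readings = freq.get(wording)
    if not readings:
        return None
    return readings.most_common(1)[0][0]


# Hypothetical learning data: one wording observed with two readings.
samples = [("今日", "kyou"), ("今日", "konnichi"), ("今日", "kyou")]
frequency_data = build_frequency_data(samples)
print(choose_reading(frequency_data, "今日"))
```

Because the counts come from recognized input speech, the chosen readings lean toward how the recorded speaker actually reads each phrase, which is the approximation effect the abstract describes.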