会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 5. 发明授权
    • Voice personalization of speech synthesizer
    • 语音合成器的语音个性化
    • US06970820B2
    • 2005-11-29
    • US09792928
    • 2001-02-26
    • Jean-Claude JunquaFlorent PerronninRoland KuhnPatrick Nguyen
    • Jean-Claude JunquaFlorent PerronninRoland KuhnPatrick Nguyen
    • G10L13/08G10L13/02G10L13/04G10L13/06G10L21/00G10L13/00
    • G10L13/04G10L2021/0135
    • The speech synthesizer is personalized to sound like or mimic the speech characteristics of an individual speaker. The individual speaker provides a quantity of enrollment data, which can be extracted from a short quantity of speech, and the system modifies the base synthesis parameters to more closely resemble those of the new speaker. More specifically, the synthesis parameters may be decomposed into speaker dependent parameters, such as context-independent parameters, and speaker independent parameters, such as context dependent parameters. The speaker dependent parameters are adapted using enrollment data from the new speaker. After adaptation, the speaker dependent parameters are combined with the speaker independent parameters to provide a set of personalized synthesis parameters. To adapt the parameters with a small amount of enrollment data, an eigenspace is constructed and used to constrain the position of the new speaker so that context independent parameters not provided by the new speaker may be estimated.
    • 语音合成器被个性化以发音或模仿单个扬声器的语音特征。 单个扬声器提供一定数量的登记数据,其可以从短语言中提取,并且系统将基本合成参数修改为更接近于新说话者的参考数据。 更具体地,合成参数可以被分解为与扬声器相关的参数,诸如与上下文无关的参数,以及与扬声器无关的参数,诸如与上下文相关的参数。 使用来自新扬声器的注册数据来调整与扬声器相关的参数。 在适应之后,将扬声器依赖参数与扬声器独立参数组合以提供一组个性化合成参数。 为了使参数具有少量的注册数据,构造本征空间并用于约束新的说话者的位置,以便可以估计不能由新发言者提供的上下文独立参数。
    • 7. 发明授权
    • Universal remote control allowing natural language modality for television and multimedia searches and requests
    • 通用遥控器允许电视和多媒体搜索和请求的自然语言模式
    • US06553345B1
    • 2003-04-22
    • US09383762
    • 1999-08-26
    • Roland KuhnTony DavisJean-Claude JunquaYi ZhaoWeiying Li
    • Roland KuhnTony DavisJean-Claude JunquaYi ZhaoWeiying Li
    • G10L1522
    • H04N5/4403G08C2201/31G10L15/26H04M1/72533H04N5/44543H04N5/44582H04N21/42207H04N21/42209H04N21/42222H04N21/42224H04N21/482H04N2005/4407H04N2005/441H04N2005/4428H04N2005/443H04N2005/4432H04N2005/4435
    • The remote control unit supports multi-modal dialog with the user, through which the user can easily select programs for viewing or recording. The remote control houses a microphone into which the user can input natural language speech. The input speech is recognized and interpreted by a natural language parser that extracts the semantic content of the user's speech. The parser works in conjunction with an electronic program guide, through which the remote control system is able to ascertain what programs are available for viewing or recording and supply appropriate prompts to the user. In one embodiment, the remote control includes a touch screen display upon which the user may view prompts or make selections by pen input or tapping. Selections made on the touch screen automatically limit the context of the ongoing dialog between user and remote control, allowing the user to interact naturally with the unit. The remote control unit can control virtually any audio-video component, including those designed before the current technology. The remote control system can be packaged entirely within the remote control handheld unit, or components may be distributed in other systems attached to the user's multimedia equipment.
    • 遥控器支持与用户的多模态对话,用户可以轻松地选择节目进行观看或录制。 遥控器装有麦克风,用户可以在其中输入自然语言语音。 输入语音由提取用户语音的语义内容的自然语言解析器识别和解释。 解析器与电子节目指南一起工作,通过该电子节目指南,遥控系统能够确定哪些节目可用于观看或录制,并向用户提供适当的提示。 在一个实施例中,遥控器包括触摸屏显示器,用户可以通过触摸屏显示器通过笔输入或点击来查看提示或进行选择。 在触摸屏上进行的选择自动限制用户和遥控器之间正在进行的对话框的上下文,从而允许用户自然地与本机进行交互。 遥控器可以实际控制任何音频 - 视频组件,包括在当前技术之前设计的。 远程控制系统可以完全包装在遥控手持单元内,或者组件可以分布在附接到用户的多媒体设备的其它系统中。
    • 10. 发明授权
    • Method for letter-to-sound in text-to-speech synthesis
    • 文字到语音合成中的字母对声音的方法
    • US06029132A
    • 2000-02-22
    • US70300
    • 1998-04-30
    • Roland KuhnJean-Claude Junqua
    • Roland KuhnJean-Claude Junqua
    • G10L13/08G10L5/00G10L9/00
    • G10L13/08
    • A two-stage pronunciation generator utilizes mixed decision trees that includes a network of yes-no questions about letter, syntax, context, and dialect in a spelled word sequence. A second stage utilizes decision trees that includes a network of yes-no questions about adjacent phonemes in the phoneme sequence corresponding to the spelled word sequence. Leaf nodes of the mixed decision trees provide information about which phonetic transcriptions are most probable. Using the mixed trees, scores are developed for each of a plurality of possible pronunciations, and these scores can be used to select the best pronunciation as well as to rank pronunciations in order of probability. The pronunciations generated by the system can be used in speech synthesis and speech recognition applications as well as lexicography applications.
    • 两阶段发音生成器利用混合决策树,其中包含有拼写单词序列中关于字母,语法,上下文和方言的是 - 否问题的网络。 第二阶段利用对应于拼写单词序列的音素序列中包含关于相邻音素的是 - 否问题的网络的决策树。 混合决策树的叶节点提供了哪些语音转录最有可能的信息。 使用混合树,为多个可能的发音中的每一个开发分数,并且这些分数可以用于选择最佳发音以及按概率的排序排列发音。 系统生成的发音可用于语音合成和语音识别应用以及词典应用。