    • 2. Invention application
    • METHOD AND SYSTEM FOR GENERATING SOUND EFFECTS INTERACTIVELY
    • US20070233494A1
    • 2007-10-04
    • US11691511
    • 2007-03-27
    • Liqin Shen; Hai Ping Li; Qin Shi; Zhiwei Shuang
    • Liqin Shen; Hai Ping Li; Qin Shi; Zhiwei Shuang
    • G10L13/08
    • G10H1/0091; G10H2240/145; G10H2250/315; G10L13/033
    • The invention provides a method and system for generating sound effects interactively. The method provides a plurality of sound effect tags to a user, wherein each of the plurality of sound effects corresponds to a specific sound effect object. The sound effect object includes a seed sound representing a predefined audio file and a sound effect action representing an operation on sound. Then the user selects at least one of the sound effect tags for a whole source sound or at least a piece of the source sound. The method edits the source sound by using the selected sound effect tags to form a sound effect expression, interprets the sound effect expression to determine the operations corresponding to respective sound effect tags in the sound effect expression and the execution order of the operations, and executes the operations in said order to output a sound with the sound effects. The method of the invention enables a user to perform sound effect editing on sound in real time and dynamically, thus providing more customized sound effects.
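The abstract above describes a pipeline: tags are attached to all or part of a source sound to form a sound effect expression, the expression is interpreted into operations with an execution order, and the operations are executed in that order. The following is a minimal sketch of that flow, not the patented implementation. Everything named here is an assumption made for illustration: sounds are plain lists of samples, the tag table `EFFECTS` and its three tags are hypothetical, and an expression is modeled as an ordered list of `(tag, start, end)` entries.

```python
from typing import Callable, Dict, List, Optional, Tuple

Sound = List[float]
Action = Callable[[Sound, Optional[Sound]], Sound]


def louder(piece: Sound, seed: Optional[Sound]) -> Sound:
    """Sound effect action: double the amplitude of the piece."""
    return [2.0 * s for s in piece]


def reverse(piece: Sound, seed: Optional[Sound]) -> Sound:
    """Sound effect action: play the piece backwards."""
    return piece[::-1]


def mix_seed(piece: Sound, seed: Optional[Sound]) -> Sound:
    """Sound effect action: add the (looped) seed sound to the piece."""
    seed = seed or [0.0]
    return [s + seed[i % len(seed)] for i, s in enumerate(piece)]


# Hypothetical tag table: tag -> (seed sound, sound effect action).
EFFECTS: Dict[str, Tuple[Optional[Sound], Action]] = {
    "louder": (None, louder),
    "reverse": (None, reverse),
    "bell": ([0.5, -0.5], mix_seed),
}

# An expression entry is (tag, start, end); end=None means "to the end".
Entry = Tuple[str, int, Optional[int]]
Op = Tuple[Action, Optional[Sound], int, Optional[int]]


def interpret(expression: List[Entry]) -> List[Op]:
    """Resolve each tag to its operation; list order is the execution order."""
    ops: List[Op] = []
    for tag, start, end in expression:
        seed, action = EFFECTS[tag]
        ops.append((action, seed, start, end))
    return ops


def execute(source: Sound, ops: List[Op]) -> Sound:
    """Run the operations in order on a copy of the source sound."""
    out = list(source)
    for action, seed, start, end in ops:
        stop = len(out) if end is None else end
        out[start:stop] = action(out[start:stop], seed)
    return out


def apply_expression(source: Sound, expression: List[Entry]) -> Sound:
    return execute(source, interpret(expression))
```

For example, `apply_expression([1.0, 2.0, 3.0, 4.0], [("louder", 0, None), ("reverse", 2, 4)])` first amplifies the whole sound and then reverses its last two samples. Keeping interpretation separate from execution mirrors the two steps the abstract names.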
    • 4. Granted invention patent
    • Generating a frequency warping function based on phoneme and context
    • US08401861B2
    • 2013-03-19
    • US11654447
    • 2007-01-17
    • Shuang Zhi Wei; Raimo Bakis; Ellen Marie Eide; Liqin Shen
    • Shuang Zhi Wei; Raimo Bakis; Ellen Marie Eide; Liqin Shen
    • G10L21/00; G10L13/06
    • G10L15/07; G10L2021/0135
    • A method for generating a frequency warping function comprising preparing the training speech of a source and a target speaker; performing frame alignment on the training speech of the speakers; selecting aligned frames from the frame-aligned training speech of the speakers; extracting corresponding sets of formant parameters from the selected aligned frames; and generating a frequency warping function based on the corresponding sets of formant parameters. The step of selecting aligned frames preferably selects a pair of aligned frames in the middle of the same or similar frame-aligned phonemes with the same or similar contexts in the speech of the source speaker and target speaker. The step of generating a frequency warping function preferably uses the various pairs of corresponding formant parameters in the corresponding sets of formant parameters as key positions in a piecewise linear frequency warping function to generate the frequency warping function.
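The last step of the abstract above, using pairs of corresponding formant parameters as key positions in a piecewise linear frequency warping function, can be sketched as follows. This is an illustrative sketch only. The function name `make_warping_function`, the fixed end points (0, 0) and (`max_freq`, `max_freq`), and the example formant values are assumptions that do not come from the patent text; frame alignment and formant extraction are assumed to have been done already.

```python
from bisect import bisect_right
from typing import Callable, List


def make_warping_function(
    source_formants: List[float],
    target_formants: List[float],
    max_freq: float,
) -> Callable[[float], float]:
    """Build a piecewise linear map from source to target frequency.

    Each (source formant, target formant) pair is a key position.
    Assumes the formants are distinct and lie strictly between 0 and
    max_freq, and pins the end points at (0, 0) and (max_freq, max_freq).
    """
    pairs = sorted(zip(source_formants, target_formants))
    xs = [0.0] + [s for s, _ in pairs] + [max_freq]
    ys = [0.0] + [t for _, t in pairs] + [max_freq]

    def warp(freq: float) -> float:
        # Clamp to the modeled band, then find the enclosing segment.
        freq = min(max(freq, 0.0), max_freq)
        i = min(bisect_right(xs, freq) - 1, len(xs) - 2)
        x0, x1 = xs[i], xs[i + 1]
        y0, y1 = ys[i], ys[i + 1]
        # Linear interpolation between the two neighboring key positions.
        return y0 + (freq - x0) * (y1 - y0) / (x1 - x0)

    return warp


# Hypothetical formant frequencies (Hz) from one pair of aligned frames.
warp = make_warping_function([500.0, 1500.0, 2500.0],
                             [600.0, 1400.0, 2600.0],
                             max_freq=4000.0)
```

With these values each source formant maps exactly onto its target counterpart, and frequencies between key positions are interpolated linearly.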
    • 5. Granted invention patent
    • Method and system for generating synthesized speech based on human recording
    • US07899672B2
    • 2011-03-01
    • US11475820
    • 2006-06-27
    • Yong Qin; Liqin Shen; Wei Zhang; Weibin Zhu
    • Yong Qin; Liqin Shen; Wei Zhang; Weibin Zhu
    • G10L13/08; G10L13/00
    • G10L13/04
    • A method and system that incorporates human recording with a TTS system to generate synthesized speech with high quality by searching over a database of pre-recorded utterances to select an utterance best matching text content to be synthesized into speech; dividing the best-matched utterance into a plurality of segments to generate remaining segments that are the same as corresponding parts of the text content and difference segments that are different from corresponding parts of the text content; synthesizing speech for the parts of the text content corresponding to the difference segments; and splicing the synthesized speech segments with the remaining segments of the best-matched utterance.
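The search, divide, and splice steps in the abstract above can be illustrated at the text level. The sketch below is an assumption-laden illustration rather than the patented method: it compares word sequences with Python's standard `difflib.SequenceMatcher` (a swapped-in matching technique), the helper names `best_match` and `plan_splice` are hypothetical, and it only produces a splice plan; actual audio segmentation and TTS synthesis are out of scope.

```python
from difflib import SequenceMatcher
from typing import List, Tuple

Words = List[str]


def best_match(text_words: Words, utterances: List[Words]) -> Words:
    """Pick the pre-recorded utterance most similar to the target text."""
    return max(
        utterances,
        key=lambda u: SequenceMatcher(None, u, text_words, autojunk=False).ratio(),
    )


def plan_splice(text_words: Words, utterance_words: Words) -> List[Tuple[str, Words]]:
    """Split the target text into recorded and to-be-synthesized parts.

    "recorded" parts are the remaining segments shared with the utterance;
    "synthesized" parts are the text for the difference segments.
    """
    plan: List[Tuple[str, Words]] = []
    matcher = SequenceMatcher(None, utterance_words, text_words, autojunk=False)
    for op, i1, i2, j1, j2 in matcher.get_opcodes():
        if op == "equal":
            plan.append(("recorded", utterance_words[i1:i2]))
        elif j2 > j1:
            # "replace" or "insert": this text has no recording, synthesize it.
            plan.append(("synthesized", text_words[j1:j2]))
        # "delete": recorded words absent from the text are simply dropped.
    return plan


# Hypothetical database of pre-recorded utterances and a target text.
database = [
    "the flight to boston departs at nine".split(),
    "welcome to the airport".split(),
]
text = "the flight to denver departs at nine".split()
chosen = best_match(text, database)
plan = plan_splice(text, chosen)
```

Here only the word "denver" would need to be synthesized; the rest of the output would reuse the human recording, which is the quality benefit the abstract claims.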
    • 6. Invention application
    • Method and apparatus for generating a frequency warping function and for frequency warping
    • US20070185715A1
    • 2007-08-09
    • US11654447
    • 2007-01-17
    • Shuang Wei; Raimo Bakis; Ellen Eide; Liqin Shen
    • Shuang Wei; Raimo Bakis; Ellen Eide; Liqin Shen
    • G10L15/04
    • G10L15/07; G10L2021/0135
    • A method for generating a frequency warping function comprising preparing the training speech of a source and a target speaker; performing frame alignment on the training speech of the speakers; selecting aligned frames from the frame-aligned training speech of the speakers; extracting corresponding sets of formant parameters from the selected aligned frames; and generating a frequency warping function based on the corresponding sets of formant parameters. The step of selecting aligned frames preferably selects a pair of aligned frames in the middle of the same or similar frame-aligned phonemes with the same or similar contexts in the speech of the source speaker and target speaker. The step of generating a frequency warping function preferably uses the various pairs of corresponding formant parameters in the corresponding sets of formant parameters as key positions in a piecewise linear frequency warping function to generate the frequency warping function.