会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 21. 发明申请
    • DYNAMIC QUANTIZER STRUCTURES FOR EFFICIENT COMPRESSION
    • 用于有效压缩的动态量子结构
    • US20080107348A1
    • 2008-05-08
    • US11855778
    • 2007-09-14
    • Jani NurminenSakari Himanen
    • Jani NurminenSakari Himanen
    • G06K9/46
    • H04N19/126G10L19/032H04N19/46
    • A method and system are introduced that provide dynamic quantizer structures which are configurable during run time. A quantizer configuration and data are stored in a binary format. The dynamic quantizer data is represented as a bitstream, and the bitstream in turn is used as additional input during initialization (or re-initialization/re-configuration) of a speech coder. A configuration header fully specifies the structure and configuration of the dynamic quantizer for each quantized parameter, and the dynamic quantizer data and configurations are fully and dynamically allocated into the speech coder memory. This enables easy re-configuration of a codec associated with the quantizer structures for different scenarios. The use of dynamic quantizer structures in turn enhances compression efficiency of an input signal. The dynamic quantizer structures can also be applied to other compression applications that allow lossy compression.
    • 引入了一种提供在运行时可配置的动态量化器结构的方法和系统。 量化器配置和数据以二进制格式存储。 动态量化器数据被表示为比特流,并且在语音编码器的初始化(或重新初始化/重新配置)期间,比特流又被用作附加输入。 配置头完全指定每个量化参数的动态量化器的结构和配置,动态量化器数据和配置被完全和动态地分配到语音编码器存储器中。 这使得能够容易地重新配置与用于不同场景的量化器结构相关联的编解码器。 动态量化器结构的使用又提高了输入信号的压缩效率。 动态量化器结构也可以应用于允许有损压缩的其他压缩应用。
    • 22. 发明申请
    • Supporting a concatenative text-to-speech synthesis
    • 支持连贯的文本到语音合成
    • US20070011009A1
    • 2007-01-11
    • US11177250
    • 2005-07-08
    • Jani NurminenSakari HimanenAnssi RamoJanne Vainio
    • Jani NurminenSakari HimanenAnssi RamoJanne Vainio
    • G10L13/08
    • G10L13/06
    • The invention relates to a support of a concatenative TTS synthesis. In order to generate a speech database as a basis for the TTS synthesis, first, a speech processing including a segmental parametric speech encoding of speech data based on a parametric modeling of speech is performed, which results in compressed parameterized speech segments. Then, the compressed parameterized speech segments are assembled in a speech database. In order to synthesize output speech, compressed parameterized speech segments are selected from the speech database based on an available text and decompressed to regain parameterized speech segments. The parameterized speech segments are then concatenated in a parameter domain. The output speech is synthesized based on these concatenated parametric speech segments.
    • 本发明涉及一种级联TTS合成的支持。 为了生成语音数据库作为TTS综合的基础,首先,执行包括基于语音的参数建模的语音数据的分段参数语音编码的语音处理,这导致压缩的参数化语音段。 然后,压缩的参数化语音段被组合在语音数据库中。 为了合成输出语音,基于可用文本从语音数据库中选择压缩的参数化语音段,并且解压缩以重新获得参数化语音段。 参数化语音段然后在参数域中连接。 基于这些连接的参数语音段来合成输出语音。
    • 24. 发明申请
    • Method and system for speech coding
    • 语音编码方法和系统
    • US20050091041A1
    • 2005-04-28
    • US10692290
    • 2003-10-23
    • Anssi RamoJani NurminenSakari HimanenAri Heikkinen
    • Anssi RamoJani NurminenSakari HimanenAri Heikkinen
    • G10L20060101G10L11/06G10L19/02G10L19/04G10L19/14G10L21/04H04B1/06H04M11/00
    • G10L19/24
    • A method and device for use in conjunction with an encoder for encoding an audio signal into a plurality of parameters. Based on the behavior of the parameters, such as pitch, voicing, energy and spectral amplitude information of the audio signal, the audio signal can be segmented, so that the parameter update rate can be optimized. The parameters of the segmented audio signal are recorded in a storage medium or transmitted to a decoder so as to allow the decoder to reconstruct the audio signal based on the parameters indicative of the segment audio signals. For example, based on the pitch characteristic, the pitch contour can be approximated by a plurality of contour segments. An adaptive downsampling method is used to update the parameters based on the contour segments so as to reduce the update rate. At the decoder, the parameters are updated at the original rate.
    • 一种与用于将音频信号编码为多个参数的编码器结合使用的方法和装置。 基于音频信号的音调,发音,能量和频谱幅度信息等参数的行为,可以对音频信号进行分段,从而可以优化参数更新速率。 分段音频信号的参数被记录在存储介质中或被发送到解码器,以便允许解码器基于指示段音频信号的参数重建音频信号。 例如,基于俯仰特性,俯仰轮廓可以由多个轮廓段近似。 使用自适应下采样方法根据轮廓段更新参数,以便降低更新速率。 在解码器处,参数以原始速率更新。