会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • Complexity Adjustment for a Signal Encoder
    • 信号编码器的复杂性调整
    • US20080120098A1
    • 2008-05-22
    • US11562067
    • 2006-11-21
    • Jari M. MakinenJuha MarilaHannu J. MikkolaJanne VainioTuomas VaittinenSakari HimanenKai K. Samposalo
    • Jari M. MakinenJuha MarilaHannu J. MikkolaJanne VainioTuomas VaittinenSakari HimanenKai K. Samposalo
    • G10L19/00
    • G10L19/22
    • The present invention provides, methods, computer-readable media, and apparatuses for tuning and adjusting the computational complexity of algorithm that is executed by a signal encoder. The signal encoder may comprise a speech encoder. When a resource shortage on a computer platform is detected, a degree of the resource shortage and a corresponding complexity adjustment for a speech encoder are determined. The speech encoder is then tuned to adjust the computational complexity of an executed speech processing algorithm. The resource shortage may correspond to a computational capability, audio buffer memory, or battery of a mobile device. A speech process being executed by the mobile device is tuned to adjust the computational demands in accordance with a complexity adjustment. A number of iteration rounds may be adjusted while the speech encoder is executing a speech processing algorithm. The iterations may correspond to an algebraic codebook search.
    • 本发明提供了用于调整和调整由信号编码器执行的算法的计算复杂度的方法,计算机可读介质和装置。 信号编码器可以包括语音编码器。 当检测到计算机平台上的资源短缺时,确定了语音编码器的资源短缺程度和对应的复杂度调整。 然后调谐语音编码器以调整执行的语音处理算法的计算复杂度。 资源短缺可能对应于移动设备的计算能力,音频缓冲存储器或电池。 调整由移动设备执行的语音过程以根据复杂性调整来调整计算需求。 当语音编码器执行语音处理算法时,可以调整多个迭代轮。 迭代可以对应于代数码本搜索。
    • 4. 发明申请
    • Supporting a concatenative text-to-speech synthesis
    • 支持连贯的文本到语音合成
    • US20070011009A1
    • 2007-01-11
    • US11177250
    • 2005-07-08
    • Jani NurminenSakari HimanenAnssi RamoJanne Vainio
    • Jani NurminenSakari HimanenAnssi RamoJanne Vainio
    • G10L13/08
    • G10L13/06
    • The invention relates to a support of a concatenative TTS synthesis. In order to generate a speech database as a basis for the TTS synthesis, first, a speech processing including a segmental parametric speech encoding of speech data based on a parametric modeling of speech is performed, which results in compressed parameterized speech segments. Then, the compressed parameterized speech segments are assembled in a speech database. In order to synthesize output speech, compressed parameterized speech segments are selected from the speech database based on an available text and decompressed to regain parameterized speech segments. The parameterized speech segments are then concatenated in a parameter domain. The output speech is synthesized based on these concatenated parametric speech segments.
    • 本发明涉及一种级联TTS合成的支持。 为了生成语音数据库作为TTS综合的基础,首先,执行包括基于语音的参数建模的语音数据的分段参数语音编码的语音处理,这导致压缩的参数化语音段。 然后,压缩的参数化语音段被组合在语音数据库中。 为了合成输出语音,基于可用文本从语音数据库中选择压缩的参数化语音段,并且解压缩以重新获得参数化语音段。 参数化语音段然后在参数域中连接。 基于这些连接的参数语音段来合成输出语音。
    • 6. 发明授权
    • Method and system for pitch contour quantization in audio coding
    • 音频编码中音调轮廓量化的方法和系统
    • US08380496B2
    • 2013-02-19
    • US12150307
    • 2008-04-25
    • Anssi RämöJani NurminenSakari HimanenAri Heikkinen
    • Anssi RämöJani NurminenSakari HimanenAri Heikkinen
    • G10L11/04G10L19/00
    • G10L19/032G10L19/09
    • A method and device for improving coding efficiency in audio coding. From the pitch values of a pitch contour of an audio signal, a plurality of simplified pitch contour segments are generated to approximate the pitch contour, based on one or more pre-selected criteria. The contour segments can be linear or non-linear with each contour segment represented by a first end point and a second end point. If the contour segments are linear, then only the information regarding the end points, instead of the pitch values, are provided to a decoder for reconstructing the audio signal. The contour segment can have a fixed maximum length or a variable length, but the deviation between a contour segment and the pitch values in that segment is limited by a maximum value.
    • 一种提高音频编码效率的方法和装置。 根据音频信号的音调轮廓的音调值,基于一个或多个预先选择的标准,生成多个简化俯仰轮廓线段以近似俯仰轮廓。 轮廓段可以是由第一终点和第二终点表示的每个轮廓段线性或非线性的。 如果轮廓段是线性的,则仅将关于终点而不是音调值的信息提供给用于重建音频信号的解码器。 轮廓段可以具有固定的最大长度或可变长度,但轮廓段与该段中的俯仰值之间的偏差受到最大值的限制。
    • 7. 发明申请
    • Reusing codebooks in parameter quantization
    • 在参数量化中重用码本
    • US20060080090A1
    • 2006-04-13
    • US10961471
    • 2004-10-07
    • Anssi RamoSakari HimanenJani Nurminen
    • Anssi RamoSakari HimanenJani Nurminen
    • G10L19/12
    • G10L19/07
    • The present invention provides a new methodology for reusing codebooks for a multistage vector quantization of parameter quantizers of signals. Prior art multistage vector quantization is done in such a way that each stage has different optimized codebooks. The prior art codebooks, thus, use quite a lot of a memory storage space. Using the same codebook stages several times, according to the present invention, reduces the memory usage and a codebook structure maintains good quality by using optimized codebooks for the most important (first) stages in the quantization. The number of codebooks is reduced by reusing the same codebooks in the refining stages. Additionally, according to the present invention, using many predictors is space-wise efficient since they need only a few of coefficients instead of larger codebooks.
    • 本发明提供了一种用于重新使用信号参数量化器的多级矢量量化码本的新方法。 现有技术的多级矢量量化是以每一级具有不同优化码本的方式完成的。 因此,现有技术的码本使用相当多的存储器存储空间。 使用相同的码本阶段,根据本发明,通过使用量化中最重要的(第一)级的优化码本来减少存储器使用并且码本结构保持良好的质量。 通过在精炼阶段重复使用相同的码本来减少码本的数量。 此外,根据本发明,使用许多预测器是空间有效的,因为它们仅需要少数系数而不是较大的码本。
    • 8. 发明授权
    • Dynamic quantizer structures for efficient compression
    • 用于高效压缩的动态量化器结构
    • US08086057B2
    • 2011-12-27
    • US11855778
    • 2007-09-14
    • Jani NurminenSakari Himanen
    • Jani NurminenSakari Himanen
    • G06K9/00
    • H04N19/126G10L19/032H04N19/46
    • A method and system are introduced that provide dynamic quantizer structures which are configurable during run time. A quantizer configuration and data are stored in a binary format. The dynamic quantizer data is represented as a bitstream, and the bitstream in turn is used as additional input during initialization (or re-initialization/re-configuration) of a speech coder. A configuration header fully specifies the structure and configuration of the dynamic quantizer for each quantized parameter, and the dynamic quantizer data and configurations are fully and dynamically allocated into the speech coder memory. This enables easy re-configuration of a codec associated with the quantizer structures for different scenarios. The use of dynamic quantizer structures in turn enhances compression efficiency of an input signal. The dynamic quantizer structures can also be applied to other compression applications that allow lossy compression.
    • 引入了一种提供在运行时可配置的动态量化器结构的方法和系统。 量化器配置和数据以二进制格式存储。 动态量化器数据被表示为比特流,并且在语音编码器的初始化(或重新初始化/重新配置)期间,比特流又被用作附加输入。 配置头完全指定每个量化参数的动态量化器的结构和配置,动态量化器数据和配置被完全和动态地分配到语音编码器存储器中。 这使得能够容易地重新配置与用于不同场景的量化器结构相关联的编解码器。 动态量化器结构的使用又提高了输入信号的压缩效率。 动态量化器结构也可以应用于允许有损压缩的其他压缩应用。
    • 10. 发明申请
    • Method and system for pitch contour quantization in audio coding
    • 音频编码中音调轮廓量化的方法和系统
    • US20080275695A1
    • 2008-11-06
    • US12150307
    • 2008-04-25
    • Anssi RamoJani NurminenSakari HimanenAri Heikkinen
    • Anssi RamoJani NurminenSakari HimanenAri Heikkinen
    • G10L11/04
    • G10L19/032G10L19/09
    • A method and device for improving coding efficiency in audio coding. From the pitch values of a pitch contour of an audio signal, a plurality of simplified pitch contour segments are generated to approximate the pitch contour, based on one or more pre-selected criteria. The contour segments can be linear or non-linear with each contour segment represented by a first end point and a second end point. If the contour segments are linear, then only the information regarding the end points, instead of the pitch values, are provided to a decoder for reconstructing the audio signal. The contour segment can have a fixed maximum length or a variable length, but the deviation between a contour segment and the pitch values in that segment is limited by a maximum value.
    • 一种提高音频编码效率的方法和装置。 根据音频信号的音调轮廓的音调值,基于一个或多个预先选择的标准,生成多个简化俯仰轮廓线段以近似俯仰轮廓。 轮廓段可以是由第一终点和第二终点表示的每个轮廓段线性或非线性的。 如果轮廓段是线性的,则仅将关于终点而不是音调值的信息提供给用于重建音频信号的解码器。 轮廓段可以具有固定的最大长度或可变长度,但轮廓段与该段中的俯仰值之间的偏差受到最大值的限制。