会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 62. 发明申请
    • AUDIO DECODING DEVICE AND AUDIO DECODING METHOD
    • 音频解码设备和音频解码方法
    • US20140029752A1
    • 2014-01-30
    • US13904165
    • 2013-05-29
    • Fujitsu Limited
    • Yohei KISHIAkira KamanoShunsuke TakeuchiMiyuki ShirakawaMasanao Suzuki
    • G10L19/008G10L19/04
    • G10L19/008G10L19/0204G10L19/04G10L25/12
    • An audio decoding device includes a processor; and a memory which stores a plurality of instructions, which when executed by the processor, cause the processor to execute, decoding, using a first channel signal and a second channel signal included in a plurality of channels of an audio signal having a first frequency range and a second frequency range, a first prediction coefficient of the first frequency range and a second prediction coefficient of the second frequency range, both selected from a code book when prediction-encoding a third channel signal that is not subjected to prediction encoding and that is included in the plurality of channels; decoding a residual signal included in the first frequency range, the residual signal representing an error occurring in prediction encoding; and prediction-decoding the third channel signal subjected to prediction-encoding in the second frequency range from the first channel signal, the second channel signal.
    • 音频解码装置包括处理器; 以及存储器,其存储多个指令,所述指令在由所述处理器执行时使所述处理器执行,解码使用包括在具有第一频率范围的音频信号的多个通道中的第一通道信号和第二通道信号 以及第二频率范围,第一频率范围的第一预测系数和第二频率范围的第二预测系数,两者都是从对未进行预测编码的第三频道信号进行预测编码时的码本中选择的, 包括在多个通道中; 对包含在第一频率范围内的残差信号进行解码,残差信号表示在预测编码中发生的误差; 并从第一信道信号,第二信道信号预测解码在第二频率范围内进行预测编码的第三信道信号。
    • 64. 发明申请
    • Methods and Systems for Generating Filter Coefficients and Configuring Filters
    • 用于生成滤波器系数和配置滤波器的方法和系统
    • US20130317833A1
    • 2013-11-28
    • US13983892
    • 2012-02-08
    • Mark F. Davis
    • Mark F. Davis
    • G10L19/04
    • G10L19/04G10L19/0017
    • Methods for generating a palette of feedback (IIR) filter coefficient sets and using the palette to configure (e.g., adaptively update) a prediction filter which includes a feedback filter, and a system for performing any of the methods. Examples of the system include an encoder, including a prediction filter and configured to encode data indicative of a waveform signal (e.g., samples of an audio signal), and a decoder. In some embodiments, the prediction filter is included in an encoder operable to generate (and assert to a decoder) encoded data including filter coefficient data indicative of the selected IIR coefficient set with which the prediction filter was configured during generation of the encoded data. In some embodiments, the timing with which adaptive updating of prediction filter configuration occurs or is allowed to occur is constrained (e.g., to optimize efficiency of prediction encoding).
    • 用于生成反馈调色板(IIR)滤波器系数组并使用调色板来配置(例如,自适应地更新)包括反馈滤波器的预测滤波器的方法和用于执行任何方法的系统的方法。 系统的示例包括编码器,包括预测滤波器并且被配置为对表示波形信号(例如,音频信号的样本)的数据和解码器进行编码。 在一些实施例中,预测滤波器被包括在可操作以生成(并且向解码器断言)编码数据的编码数据中,所述编码数据包括指示在生成编码数据期间配置了预测滤波器的所选择的IIR系数组的滤波器系数数据。 在一些实施例中,发生或允许发生预测过滤器配置的自适应更新的定时被限制(例如,以优化预测编码的效率)。
    • 65. 发明申请
    • CODING DEVICE, CODING METHOD, DECODING DEVICE, DECODING METHOD, AND STORAGE MEDIUM
    • 编码设备,编码方法,解码设备,解码方法和存储介质
    • US20130246073A1
    • 2013-09-19
    • US13727370
    • 2012-12-26
    • CASIO COMPUTER CO., LTD.
    • Goro SAKATA
    • G10L19/04
    • G10L19/04G10H1/0041G10H7/002G10H2230/041G10H2250/601G10L19/08H03M7/30
    • For respective sampling data of waveform data of sounds to be coded, a prediction residual value is calculated as sampling residual data, and an effective bit length is calculated from this residual waveform data. Then, for the effective bit length data, a maximum effective bit length among processing targets is generated as common effective actual data, and coded data in which this common effective actual data and information indicating the common effective bit length are arranged in a predetermined configuration format are generated. The information included in the coded data is analyzed and each of the plurality of the common effective bit information is extracted. Then, waveform data of the sounds are decoded by performing inverse linear prediction processing from an analysis result on the residual waveform data decompressed by performing bit extension which adds a portion other than the common effective bit length.
    • 对于要编码的声音的波形数据的各个采样数据,计算预测残差值作为采样残差数据,并根据该残差波形数据计算有效位长度。 然后,对于有效比特长度数据,生成作为公共有效实际数据的处理对象之间的最大有效比特长度,以及其中表示公共有效比特长度的公共有效实际数据和信息以预定配置格式排列的编码数据 被生成。 分析包含在编码数据中的信息,并且提取多个公共有效位信息中的每一个。 然后,通过对通过添加除了公共有效位长度以外的部分的比特扩展进行解压缩的残差波形数据的分析结果进行逆线性预测处理来解码声音的波形数据。
    • 66. 发明授权
    • Automated distortion classification
    • 自动失真分类
    • US08438030B2
    • 2013-05-07
    • US12626101
    • 2009-11-25
    • Gaurav TalwarRathinavelu Chengalvarayan
    • Gaurav TalwarRathinavelu Chengalvarayan
    • G10L19/04
    • G10L17/26G10L15/20G10L21/0208
    • A method of and system for automated distortion classification. The method includes steps of (a) receiving audio including a user speech signal and at least some distortion associated with the signal; (b) pre-processing the received audio to generate acoustic feature vectors; (c) decoding the generated acoustic feature vectors to produce a plurality of hypotheses for the distortion; and (d) post-processing the plurality of hypotheses to identify at least one distortion hypothesis of the plurality of hypotheses as the received distortion. The system can include one or more distortion models including distortion-related acoustic features representative of various types of distortion and used by a decoder to compare the acoustic feature vectors with the distortion-related acoustic features to produce the plurality of hypotheses for the distortion.
    • 一种自动失真分类的方法和系统。 该方法包括以下步骤:(a)接收包括用户语音信号的音频和至少一些与该信号相关的失真; (b)预处理所接收的音频以产生声学特征向量; (c)对生成的声学特征向量进行解码以产生用于失真的多个假设; 以及(d)后处理所述多个假设以将所述多个假设中的至少一个失真假设识别为接收到的失真。 该系统可以包括一个或多个失真模型,包括代表各种类型的失真的失真相关的声学特征,并被解码器用于将声学特征向量与失真相关的声学特征进行比较,以产生用于失真的多个假设。
    • 67. 发明授权
    • Predictive speech signal coding
    • 预测语音信号编码
    • US08433563B2
    • 2013-04-30
    • US12455478
    • 2009-06-02
    • Koen Bernard VosSoren Skak Jensen
    • Koen Bernard VosSoren Skak Jensen
    • G10L19/04G10L19/08
    • G10L19/12G10L19/09
    • A method, system and computer program for encoding speech according to a source-filter model. The method comprises deriving a spectral envelope signal representative of a modelled filter and a first remaining signal representative of a modelled source signal, and deriving a second remaining signal from the first remaining signal by, at intervals during the encoding: exploiting a correlation between approximately periodic portions in the first remaining signal to generate a predicted version of a later portion from a stored version of an earlier portion, and using the predicted-version of the later portion to remove an effect of said periodicity from the first remaining signal. The method further comprises, once every number of intervals, transforming the stored version of the earlier portion of the first remaining signal prior to generating the predicted version of the respective later portion.
    • 一种根据源滤波器模型对语音进行编码的方法,系统和计算机程序。 该方法包括导出代表建模过滤器的频谱包络信号和表示建模源信号的第一剩余信号,以及在编码期间间隔从第一剩余信号导出第二剩余信号:利用近似周期性的相关性 第一剩余信号中的部分,以从早期部分的存储版本生成稍后部分的预测版本,并​​且使用后面部分的预测版本来从第一剩余信号中去除所述周期性的影响。 该方法还包括:每产生一次间隔之后,在生成相应较后部分的预测版本之前变换第一剩余信号的较早部分的存储版本。
    • 69. 发明申请
    • METHOD AND APPARATUS FOR PROCESSING AN AUDIO SIGNAL
    • 用于处理音频信号的方法和装置
    • US20130096928A1
    • 2013-04-18
    • US13636922
    • 2011-03-23
    • Gyuhyeok JeongDaehwan KimChangheon LeeLagyoung KimHyejeong JeonByungsuk LeeIngyu Kang
    • Gyuhyeok JeongDaehwan KimChangheon LeeLagyoung KimHyejeong JeonByungsuk LeeIngyu Kang
    • G10L19/04
    • G10L19/04G10L19/06G10L19/22G10L19/24
    • The present invention relates to a method for processing an audio signal, comprising: determining bandwidth information indicating to which of a plurality of bands the current frame corresponds; determining information on the order corresponding to the present frame on the basis of the bandwidth information; performing a linear predictive analysis of the present frame to generate a first set linear predictive transform coefficient of a first order; performing a vector quantization on the first set linear predictive coefficient to generate a first index; performing a linear predictive analysis of the current frame to generate a second set linear predictive transform coefficient of a second order in accordance with the information on the order; and performing a vector quantization on a second set difference by using the first set index and the second set linear predictive transform coefficient, when the second set linear predictive coefficient is generated.
    • 本发明涉及一种用于处理音频信号的方法,包括:确定指示当前帧对应于多个频带中的哪个频带的带宽信息; 基于所述带宽信息确定与所述当前帧相对应的顺序的信息; 对当前帧执行线性预测分析以产生第一阶的第一组线性预测变换系数; 对所述第一组线性预测系数执行矢量量化以产生第一指标; 执行当前帧的线性预测分析,以根据关于订单的信息生成二阶的第二组线性预测变换系数; 以及当产生所述第二组线性预测系数时,通过使用所述第一设定索引和所述第二组线性预测变换系数,对第二设定差执行向量量化。
    • 70. 发明授权
    • Dynamic pruning for automatic speech recognition
    • 动态修剪自动语音识别
    • US08392187B2
    • 2013-03-05
    • US12362668
    • 2009-01-30
    • Qifeng Zhu
    • Qifeng Zhu
    • G10L19/04
    • G10L15/083
    • Methods, speech recognition systems, and computer readable media are provided that recognize speech using dynamic pruning techniques. A search network is expanded based on a frame from a speech signal, a best hypothesis is determined in the search network, a default beam threshold is modified, and the search network is pruned using the modified beam threshold. The search network may be further pruned based on the search depth of the best hypothesis and/or the average number of frames per state for a search path.
    • 提供了使用动态修剪技术来识别语音的方法,语音识别系统和计算机可读介质。 基于来自语音信号的帧扩展搜索网络,在搜索网络中确定最佳假设,修改默认波束阈值,并且使用修改的波束阈值修剪搜索网络。 可以基于搜索路径的最佳假设的搜索深度和/或每个状态的平均帧数来进一步修剪搜索网络。