会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明授权
    • Audio transcoder using encoder-generated side information to transcode to target bit-rate
    • 音频代码转换器使用编码器生成的侧面信息转码为目标比特率
    • US08457958B2
    • 2013-06-04
    • US11938194
    • 2007-11-09
    • Kazuhito KoishidaSanjeev MehrotraWei-Ge Chen
    • Kazuhito KoishidaSanjeev MehrotraWei-Ge Chen
    • G10L19/02H04B1/66
    • G10L19/173
    • An audio encoder encodes side information into a compressed audio bitstream containing encoding parameters used by the encoder for one or more encoding techniques, such as a noise-mask-ratio curve used for rate control. A transcoder uses the encoder generated side information to transcode the audio from the original compressed bitstream having an initial bit-rate into a second bitstream having a new bit-rate. Because the side information is derived from the original audio, the transcoder is able to better maintain audio quality of the transcoding. The side information also allows the transcoder to re-encode from an intermediate decoding/encoding stage for faster and lower complexity transcoding.
    • 音频编码器将侧信息编码成包含由编码器使用的编码参数的压缩音频比特流,用于一种或多种编码技术,例如用于速率控制的噪声屏蔽比曲线。 代码转换器使用编码器产生的侧信息将来自具有初始比特率的原始压缩比特流的音频转码为具有新比特率的第二比特流。 因为侧信息是从原始音频导出的,所以代码转换器能够更好地保持转码的音频质量。 侧面信息还允许代码转换器从中间解码/编码级重新编码,以实现更快和更低复杂度的代码转换。
    • 5. 发明申请
    • Query and Matching for Content Recognition
    • 内容识别的查询和匹配
    • US20120296938A1
    • 2012-11-22
    • US13110185
    • 2011-05-18
    • Kazuhito KoishidaDavid NisterIan SimonTom Butcher
    • Kazuhito KoishidaDavid NisterIan SimonTom Butcher
    • G06F17/30
    • G06F17/30743
    • Various embodiments enable audio data, such as music data, to be captured, by a device, from a background environment and processed to formulate a query that can then be transmitted to a content recognition service. In one or more embodiments, multiple queries are transmitted to the content recognition service. In at least some embodiments, subsequent queries can progressively incorporate previous queries plus additional data that is captured. In one or more embodiments, responsive to receiving the query, the content recognition service can employ a multi-stage matching technique to identify content items responding to the query. This matching technique can be employed as queries are progressively received.
    • 各种实施例使得诸如音乐数据的音频数据能够被设备从背景环境中捕获并被处理以制定可以被发送到内容识别服务的查询。 在一个或多个实施例中,多个查询被发送到内容识别服务。 在至少一些实施例中,后续查询可以逐渐地并入先前查询加上所捕获的附加数据。 在一个或多个实施例中,响应于接收查询,内容识别服务可以采用多阶段匹配技术来识别响应于查询的内容项目。 可以采用这种匹配技术,因为逐渐接收到查询。
    • 8. 发明申请
    • TRANSCODER USING ENCODER GENERATED SIDE INFORMATION
    • 使用编码器生成侧信息的TRANSCODER
    • US20090125315A1
    • 2009-05-14
    • US11938194
    • 2007-11-09
    • Kazuhito KoishidaSanjeev MehrotraWei-Ge Chen
    • Kazuhito KoishidaSanjeev MehrotraWei-Ge Chen
    • G10L19/00
    • G10L19/173
    • An audio encoder encodes side information into a compressed audio bitstream containing encoding parameters used by the encoder for one or more encoding techniques, such as a noise-mask-ratio curve used for rate control. A transcoder uses the encoder generated side information to transcode the audio from the original compressed bitstream having an initial bit-rate into a second bitstream having a new bit-rate. Because the side information is derived from the original audio, the transcoder is able to better maintain audio quality of the transcoding. The side information also allows the transcoder to re-encode from an intermediate decoding/encoding stage for faster and lower complexity transcoding.
    • 音频编码器将侧信息编码成包含由编码器使用的编码参数的压缩音频比特流,用于一种或多种编码技术,例如用于速率控制的噪声屏蔽比曲线。 代码转换器使用编码器产生的侧信息将来自具有初始比特率的原始压缩比特流的音频转码为具有新比特率的第二比特流。 因为侧信息是从原始音频导出的,所以代码转换器能够更好地保持转码的音频质量。 侧面信息还允许代码转换器从中间解码/编码级重新编码,以实现更快和更低复杂度的代码转换。
    • 9. 发明申请
    • CODING OF SPARSE DIGITAL MEDIA SPECTRAL DATA
    • 编码数字媒体光谱数据
    • US20080312758A1
    • 2008-12-18
    • US11764108
    • 2007-06-15
    • Kazuhito KoishidaSanjeev MehrotraWei-Ge Chen
    • Kazuhito KoishidaSanjeev MehrotraWei-Ge Chen
    • G06F17/00
    • G10L19/02G10L19/0212G10L19/032G10L19/18
    • An audio encoder/decoder provides efficient compression of spectral transform coefficient data characterized by sparse spectral peaks. The audio encoder/decoder applies a temporal prediction of the frequency position of spectral peaks. The spectral peaks in the transform coefficients that are predicted from those in a preceding transform coding block are encoded as a shift in frequency position from the previous transform coding block and two non-zero coefficient levels. The prediction may avoid coding very large zero-level transform coefficient runs as compared to conventional run length coding. For spectral peaks not predicted from those in a preceding transform coding block, the spectral peaks are encoded as a value trio of a length of a run of zero-level spectral transform coefficients, and two non-zero coefficient levels.
    • 音频编码器/解码器提供以稀疏频谱峰值为特征的频谱变换系数数据的有效压缩。 音频编码器/解码器对频谱峰值的频率位置进行时间预测。 从前一变换编码块中预测的变换系数中的频谱峰值被编码为来自先前变换编码块和两个非零系数电平的频率位置的移位。 与常规游程长度编码相比,预测可以避免编码非常大的零电平变换系数运行。 对于未在前面的变换编码块中预测的频谱峰值,频谱峰值被编码为零电平频谱变换系数的行程的长度和两个非零系数电平的三值。
    • 10. 发明授权
    • Gain constrained noise suppression
    • 增加约束噪声抑制
    • US07454332B2
    • 2008-11-18
    • US10869467
    • 2004-06-15
    • Kazuhito KoishidaFeng ZhugeHosam A. KhalilTian WangWei-ge Chen
    • Kazuhito KoishidaFeng ZhugeHosam A. KhalilTian WangWei-ge Chen
    • G10L21/02G10L19/14
    • G10L21/0208G10L21/0232
    • A gain-constrained noise suppression for speech more precisely estimates noise, including during speech, to reduce musical noise artifacts introduced from noise suppression. The noise suppression operates by applying a spectral gain G(m, k) to each short-time spectrum value S(m, k) of a speech signal, where m is the frame number and k is the spectrum index. The spectrum values are grouped into frequency bins, and a noise characteristic estimated for each bin classified as a “noise bin.” An energy parameter is smoothed in both the time domain and the frequency domain to improve noise estimation per bin. The gain factors G(m, k) are calculated based on the current signal spectrum and the noise estimation, then smoothed before being applied to the signal spectral values S(m, k). First, a noisy factor is computed based on a ratio of the number of noise bins to the total number of bins for the current frame, where a zero-valued noisy factor means only using constant gain for all the spectrum values and noisy factor of one means no smoothing at all. Then, this noisy factor is used to alter the gain factors, such as by cutting off the high frequency components of the gain factors in the frequency domain.
    • 用于语音的增益约束噪声抑制更精确地估计包括在语音期间的噪声,以减少从噪声抑制引入的音乐噪声伪像。 通过对语音信号的每个短时间频谱值S(m,k)应用频谱增益G(m,k)来进行噪声抑制,其中m是帧号,k是频谱索引。 频谱值被分组成频率仓,并且对于被分类为“噪声仓”的每个仓估计的噪声特性。 能量参数在时域和频域均被平滑,以改善每个bin的噪声估计。 基于当前信号频谱和噪声估计来计算增益因子G(m,k),然后在施加到信号频谱值S(m,k)之前进行平滑处理。 首先,基于噪声箱数与当前帧的总数的比率来计算噪声因子,其中零值噪声因子意味着仅对所有频谱值使用恒定增益并且噪声因子为1 意味着没有平滑。 然后,这种噪声因子用于改变增益因子,例如通过切断频域中增益因子的高频分量。