会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明授权
    • Shape and scale parameters for extended-band frequency coding
    • 扩展频带编码的形状和缩放参数
    • US07953604B2
    • 2011-05-31
    • US11336618
    • 2006-01-20
    • Sanjeev MehrotraWei-Ge ChenKazuhito KoishidaChao He
    • Sanjeev MehrotraWei-Ge ChenKazuhito KoishidaChao He
    • G10L19/00
    • G10L21/038
    • An audio encoder performs frequency extension coding that comprises determining one or more shape parameters using a displacement vector that corresponds to a displacement of an even number (e.g., an even number of sub-bands between a sub-band in a baseband frequency range and a sub-band in an extended-band frequency range). The shape parameters can be determined on a per-audio-block basis. Restricting a displacement to an even number (in frequency extension coding or in other signal modulation schemes) can improve the quality of reconstructed audio. An audio encoder also can perform frequency extension coding that comprises determining one or more scale parameters at one or more audio blocks, and determining one or more anchor points for interpolating the one or more scale parameters.
    • 音频编码器执行频率扩展编码,其包括使用对应于偶数位移的位移矢量来确定一个或多个形状参数(例如,基带频率范围中的子带和偶数个子带之间的偶数个子带) 子带在扩展频带范围内)。 形状参数可以基于每个音频块来确定。 将位移限制为偶数(在频率扩展编码或其他信号调制方案中)可以提高重构音频的质量。 音频编码器还可以执行频率扩展编码,其包括确定一个或多个音频块处的一个或多个缩放参数,以及确定用于内插一个或多个缩放参数的一个或多个定位点。
    • 5. 发明申请
    • COMPLEX-TRANSFORM CHANNEL CODING WITH EXTENDED-BAND FREQUENCY CODING
    • 具有扩展频段编码的复杂变换通道编码
    • US20110035226A1
    • 2011-02-10
    • US12907889
    • 2010-10-19
    • Sanjeev MehrotraWei-Ge Chen
    • Sanjeev MehrotraWei-Ge Chen
    • G10L19/00
    • G10L21/038G10L19/008
    • An audio encoder receives multi-channel audio data comprising a group of plural source channels and performs channel extension coding, which comprises encoding a combined channel for the group and determining plural parameters for representing individual source channels of the group as modified versions of the encoded combined channel. The encoder also performs frequency extension coding. The frequency extension coding can comprise, for example, partitioning frequency bands in the multi-channel audio data into a baseband group and an extended band group, and coding audio coefficients in the extended band group based on audio coefficients in the baseband group. The encoder also can perform other kinds of transforms. An audio decoder performs corresponding decoding and/or additional processing tasks, such as a forward complex transform.
    • 音频编码器接收包括一组多个源信道的多声道音频数据,并执行信道扩展编码,其包括对该组的组合信道进行编码,并确定用于表示该组的各个源信道的多个参数,作为编码组合的修改版本 渠道。 编码器还执行频率扩展编码。 频率扩展编码可以包括例如将多声道音频数据中的频带划分为基带组和扩展频带组,并且基于基带组中的音频系数对扩展频带组中的音频系数进行编码。 编码器还可以执行其他类型的转换。 音频解码器执行相应的解码和/或附加处理任务,例如前向复合变换。
    • 7. 发明授权
    • Flexible frequency and time partitioning in perceptual transform coding of audio
    • 音频感知变换编码中灵活的频率和时间分割
    • US07761290B2
    • 2010-07-20
    • US11764134
    • 2007-06-15
    • Kazuhito KoishidaSanjeev MehrotraWei-Ge Chen
    • Kazuhito KoishidaSanjeev MehrotraWei-Ge Chen
    • G10L19/02
    • G10L19/0208G10L19/032
    • An audio encoder/decoder performs band partitioning for vector quantization encoding of spectral holes and missing high frequencies that result from quantization when encoding at low bit rates. The encoder/decoder determines a band structure for spectral holes based on two threshold parameters: a minimum hole size threshold and a maximum band size threshold. Spectral holes wider than the minimum hole size threshold are partitioned evenly into bands not exceeding the maximum band size threshold in size. Such hole filling bands are configured up to a preset number of hole filling bands. The bands for missing high frequencies are then configured by dividing the high frequency region into bands having binary-increasing, linearly-increasing or arbitrarily-configured band sizes up to a maximum overall number of bands.
    • 音频编码器/解码器对以低比特率进行编码的频谱空间矢量量化编码和由量化产生的缺失高频进行频带划分。 编码器/解码器基于两个阈值参数确定频谱孔的频带结构:最小孔尺寸阈值和最大频带尺寸阈值。 比最小孔尺寸阈值更宽的光谱孔被均匀地分割成不超过最大带尺寸阈值的带。 这样的孔填充带被配置成预定数量的填充孔。 然后通过将高频区域划分成具有二进制增加,线性增加或任意配置的频带大小直到最大总带数的频带来配置用于缺失高频的频带。
    • 9. 发明授权
    • Receiver-driven layered error correction multicast over heterogeneous packet networks
    • 接收器驱动的分层纠错多播在异构分组网络上
    • US07697514B2
    • 2010-04-13
    • US11109250
    • 2005-04-18
    • Philip ChouAlbert WangSanjeev MehrotraAlexander Mohr
    • Philip ChouAlbert WangSanjeev MehrotraAlexander Mohr
    • H04L12/56
    • H04L1/007H04L1/0009H04L1/0017H04L1/06H04L1/1812H04L2001/0093
    • A system and method for correcting errors and losses occurring during a receiver-driven layered multicast (RLM) of real-time media over a heterogeneous packet network such as the Internet. This is accomplished by augmenting RLM with one or more layers of error correction information. This allows each receiver to separately optimize the quality of received audio and video information by subscribing to at least one error correction layer. Ideally, each source layer in a RLM would have one or more multicasted error correction data streams (i.e., layers) associated therewith. Each of the error correction layers would contain information that can be used to replace lost packets from the associated source layer. More than one error correction layer is proposed as some of the error correction packets contained in the data stream needed to replace the packets lost in the associated source stream may themselves be lost in transmission. A preferred process for generating the error correction streams involves the use of a unique adaptation of the Forward Error Correction (FEC) techniques. This process encodes the transmission data using a linear transform which adds redundant elements. The redundancy permits losses to be corrected because any of the original data elements can be derived from any of the encoded elements. Thus, as long as enough of the encoded data elements are received so as to equal the number of the original data elements, it is possible to derive all the original elements.
    • 一种用于在异构分组网络(例如因特网)下校正在实时媒体的接收机驱动分层多播(RLM)期间发生的错误和损失的系统和方法。 这是通过用一层或多层纠错信息增强RLM来实现的。 这允许每个接收机通过订阅至少一个纠错层来分别优化所接收的音频和视频信息的质量。 理想地,RLM中的每个源层将具有与其相关联的一个或多个多播的纠错数据流(即,层)。 每个纠错层将包含可用于替换相关源层丢失的分组的信息。 提出了多于一个纠错层,因为包含在替换相关源流中丢失的分组所需的数据流中的一些纠错分组本身可能在传输中丢失。 用于产生纠错流的优选过程涉及使用前向纠错(FEC)技术的唯一适配。 该过程使用添加冗余元素的线性变换对传输数据进行编码。 冗余允许修正损失,因为任何原始数据元素可以从任何编码元素导出。 因此,只要接收到足够的编码数据元素以便等于原始数据元素的数量,就有可能导出所有的原始元素。
    • 10. 发明申请
    • FINE-GRAINED CLIENT-SIDE CONTROL OF SCALABLE MEDIA DELIVERY
    • 精细的客户端控制可扩展的媒体交付
    • US20100080290A1
    • 2010-04-01
    • US12242524
    • 2008-09-30
    • Sanjeev Mehrotra
    • Sanjeev Mehrotra
    • H04N11/02
    • H04N19/34H04N19/115H04N19/124H04N19/147H04N19/164H04N19/187H04N19/19H04N19/30H04N19/44H04N19/46H04N19/61
    • Techniques and tools for adjusting quality and bit rate of multiple chunks of media delivered over a network are described. For example, each of the multiple chunks is encoded as multiple layers (e.g., a base layer and multiple embedded residual layers) for fine-grained scalability at different rate/quality points. A server stores the encoded data for the layers of chunks as well as curve information that parameterizes rate-distortion curves for the chunks. The server sends the curve information to a client. For the multiple chunks, the client uses the curve information to determine rate-distortion preferences for the respective chunks, then sends feedback indicating the rate-distortion preferences to the server. For each of the multiple chunks, the server, based at least in part upon the feedback, selects one or more scalable layers of the chunk to deliver to the client.
    • 描述了通过网络传送多个媒体块的质量和比特率的技术和工具。 例如,多个块中的每一个被编码为多个层(例如,基本层和多个嵌入的残余层),用于在不同速率/质量点处进行细粒度可扩展性。 服务器存储块的编码数据以及参数化块的速率 - 失真曲线的曲线信息。 服务器将曲线信息发送给客户端。 对于多个块,客户端使用曲线信息来确定各个块的速率失真偏好,然后将指示速率失真偏好的反馈发送到服务器。 对于多个块中的每一个,服务器至少部分地基于反馈,选择块的一个或多个可缩放层以递送给客户端。