会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 82. 发明授权
    • System and method for data de-duplication
    • 用于重复数据删除的系统和方法
    • US09465823B2
    • 2016-10-11
    • US11584782
    • 2006-10-19
    • Kam ShergillBharath AletiDheerai PandeySujatha MuthulingamAmit Ganesh
    • Kam ShergillBharath AletiDheerai PandeySujatha MuthulingamAmit Ganesh
    • G06F17/30G06F11/14
    • G06F17/30303G06F11/1453G06F17/30156
    • Disclosed are methods, systems, and computer program products for processing a file which include using a computer system that is programmed for performing a process of receiving the file in response to a request for storing the file, determining whether a database already contains the file, and storing the file in the database if the database does not already contain the file. The process may alternatively include receiving the file in response to a request for storing the file, determining whether a database already contains the file, and storing the file without storing the received file if the database already contains the file. The process may also alternatively include receiving the file in response to a request for storing the file in a database, separating the file into a plurality of portions, and storing the plurality of portions so each of the plurality of portions can be individually accessed.
    • 公开了用于处理文件的方法,系统和计算机程序产品,其包括使用被编程为响应于存储文件的请求执行接收文件的处理的计算机系统,确定数据库是否已经包含文件, 并且如果数据库尚未包含文件,则将该文件存储在数据库中。 该过程可以替代地包括响应于存储文件的请求而接收文件,确定数据库是否已经包含该文件,以及如果数据库已经包含该文件,则存储该文件而不存储接收到的文件。 该过程还可以包括响应于将文件存储在数据库中的请求而接收文件,将文件分离成多个部分,并且存储多个部分,使得多个部分中的每一个可以被单独访问。
    • 86. 发明授权
    • Compression analyzer
    • 压缩分析仪
    • US08356060B2
    • 2013-01-15
    • US12769508
    • 2010-04-28
    • Vineet MarwahVikram KapoorAmit GaneshJesse KampSachin KulkarniRoger MacnicolKam ShergillManosiz Bhattacharyya
    • Vineet MarwahVikram KapoorAmit GaneshJesse KampSachin KulkarniRoger MacnicolKam ShergillManosiz Bhattacharyya
    • G06F7/00
    • H03M7/30G06F17/30595
    • Techniques are described herein for automatically selecting the compression techniques to be used on tabular data. A compression analyzer gives users high-level control over the selection process without requiring the user to know details about the specific compression techniques that are available to the compression analyzer. Users are able to specify, for a given set of data, a “balance point” along the spectrum between “maximum performance” and “maximum compression”. The point thus selected is used by the compression analyzer in a variety of ways. For example, in one embodiment, the compression analyzer uses the user-specified balance point to determine which of the available compression techniques qualify as “candidate techniques” for the given set of data. The compression analyzer selects the compression technique to use on a set of data by actually testing the candidate compression techniques against samples from the set of data. After testing the candidate compression techniques against the samples, the resulting compression ratios are compared. The compression technique to use on the set of data is then selected based, in part, on the compression ratios achieved during the compression tests performed on the sample data.
    • 这里描述了用于自动选择要在表格数据上使用的压缩技术的技术。 压缩分析仪为用户提供了对选择过程的高级控制,而不需要用户了解有关压缩分析器可用的特定压缩技术的细节。 用户可以为给定的数据集指定沿最大性能和最大压缩之间的平衡点。 所选择的点由压缩分析器以各种方式使用。 例如,在一个实施例中,压缩分析器使用用户指定的平衡点来确定哪些可用的压缩技术被鉴定为用于给定的一组数据的候选技术。 压缩分析仪通过对来自该组数据的样本实际测试候选压缩技术来选择对一组数据使用的压缩技术。 在针对样品测试候选压缩技术之后,比较所得到的压缩比。 然后,部分地基于在对样本数据执行的压缩测试期间实现的压缩比来选择在该组数据上使用的压缩技术。
    • 87. 发明授权
    • Techniques for compression and processing optimizations by using data transformations
    • 使用数据转换进行压缩和处理优化的技术
    • US08239421B1
    • 2012-08-07
    • US12871862
    • 2010-08-30
    • Vineet MarwahVikram KapoorJesse KampKam ShergillRoger MacNicolManosiz BhattacharyyaAmit Ganesh
    • Vineet MarwahVikram KapoorJesse KampKam ShergillRoger MacNicolManosiz BhattacharyyaAmit Ganesh
    • G06F17/20
    • H03M7/30H03M7/3084
    • Described herein are compression and processing optimizations by using data transformation techniques. In example embodiments, a byte-wise differential transformation is applied to columnar data represented as a list of length-value pairs to determine a list of delta pairs that is subsequently compressed and stored on persistent storage. A length separation transformation is applied to separate a list of length-value pairs into a length array and a corresponding data value array, where these two arrays are subsequently compressed and stored separately on persistent storage. A native number transformation is applied to a set of number values to remove the lengths stored in the number values, where the transformed set is stored on persistent storage instead of the original set of number values. A native datetime-type transformation is applied to a set of datetime values to generate an encoding that is used to encode the set of datetime values into an encoded set that is stored on persistent storage instead of the original set.
    • 这里描述的是使用数据变换技术的压缩和处理优化。 在示例实施例中,将逐字节差分变换应用于表示为长度值对列表的列数据,以确定随后压缩并存储在持久存储器上的增量对列表。 应用长度分离变换将长度值对列表分隔成长度数组和对应的数据值数组,其中这两个数组随后被压缩并分别存储在持久存储器上。 原始数字变换被应用于一组数字值以去除存储在数字值中的长度,其中变换的集合存储在永久存储器上而不是原始的数字集合。 本机datetime类型转换应用于一组datetime值,以生成用于将datetime值集合编码为存储在永久存储而不是原始集合的编码集中的编码。
    • 89. 发明申请
    • STRUCTURE OF HIERARCHICAL COMPRESSED DATA STRUCTURE FOR TABULAR DATA
    • 用于数据数据的分层压缩数据结构的结构
    • US20100278446A1
    • 2010-11-04
    • US12617669
    • 2009-11-12
    • Amit GaneshVikram KapoorVineet MarwahKam ShergillRoger MacNicolSachin KulkarniJesse Kamp
    • Amit GaneshVikram KapoorVineet MarwahKam ShergillRoger MacNicolSachin KulkarniJesse Kamp
    • G06K9/36
    • G06F17/30315
    • A highly flexible and extensible structure is provided for physically storing tabular data. The structure, is referred to as a compression unit, and may be used to physically store tabular data that logically resides in any type of table-like structure. According to one embodiment, compression units are recursive. Thus, a compression unit may have a “parent” compression unit to which it belongs, and may have one or more “child” compression units that belong to it. In one embodiment, compression units include metadata that indicates how the tabular data is stored within them. The metadata for a compression unit may indicate, for example, whether the data within the compression unit is stored in row-major or column major-format (or some combination thereof), the order of the columns within the compression unit (which may differ from the logical order of the columns dictated by the definition of their logical container), a compression technique for the compression unit, the child compression units (if any), etc.
    • 提供了高度灵活和可扩展的结构,用于物理存储表格数据。 该结构被称为压缩单元,并且可以用于物理地存储逻辑上驻留在任何类型的表状结构中的表格数据。 根据一个实施例,压缩单元是递归的。 因此,压缩单元可以具有它所属的“父”压缩单元,并且可以具有属于它的一个或多个“子”压缩单元。 在一个实施例中,压缩单元包括指示表格数据如何被存储在其中的元数据。 压缩单元的元数据例如可以指示压缩单元内的数据是以主要或列主格式(或其某种组合)存储的,压缩单元中的列的顺序(其可以不同 从逻辑容器的定义指定的列的逻辑顺序),压缩单元的压缩技术,子压缩单元(如果有的话)等等
    • 90. 发明申请
    • CASCADING INDEX COMPRESSION
    • US20090292947A1
    • 2009-11-26
    • US12469637
    • 2009-05-20
    • Amit GaneshJuan R. LoaizaKrishna Kunchithapadam
    • Amit GaneshJuan R. LoaizaKrishna Kunchithapadam
    • G06F11/07G06F17/30
    • G06F11/1474G06F17/30
    • Techniques for maintaining a cascading index are provided. In one approach, one or more branch node compression techniques are applied to the main index of a cascading index. In an approach, a Bloom filter is generated and associated with, e.g., a branch node in the main index. The Bloom filter is used to determine whether, without accessing any leaf blocks, a particular key value exists, e.g., in leaf blocks associated with the branch node. In an approach, a new redo record is generated in response to a merge operation between two levels of the cascading index. The new redo record comprises (a) one or more addresses of blocks that are affected by the merge operation, (b) data is that being “pushed down” to a lower level of the cascading index, and (c) one or more addresses of blocks that are written to disk storage as a result of the merge operation.
    • 提供了维护级联索引的技术。 在一种方法中,一个或多个分支节点压缩技术被应用于级联索引的主索引。 在一种方法中,生成Bloom过滤器并与例如主索引中的分支节点相关联。 Bloom过滤器用于确定在不访问任何叶片块的情况下是否存在特定的键值,例如在与分支节点相关联的叶片块中。 在一种方法中,响应于级联索引的两个级别之间的合并操作,生成新的重做记录。 新的重做记录包括(a)受合并操作影响的块的一个或多个地址,(b)数据被“下推”到级联索引的较低级,以及(c)一个或多个地址 作为合并操作的结果写入磁盘存储的块。