    • 3. Granted invention patent
    • Compressibility estimation of non-unique indexes in a database management system
    • US07895171B2
    • 2011-02-22
    • US12057055
    • 2008-03-27
    • Bishwaranjan Bhattacharjee; Lipyeow Lim; George A. Mihaila
    • Bishwaranjan Bhattacharjee; Lipyeow Lim; George A. Mihaila
    • G06F17/30
    • G06F17/30312; H03M7/30
    • A method, information processing system, and computer readable storage product estimate a compression factor. A set of key values within an index are analyzed. Each key value is associated with a record identifier (“RID”) list comprising a set of RIDs. The index is in an uncompressed format and includes a total byte length. A number of RIDs associated with each key value is estimated for each key value in the set of key values. A total byte length for all RID deltas between each at least two consecutive RIDs within a RID list is estimated for each RID list based on the number of RIDs that have been determined. The total byte length estimated for each RID list is accumulated. A compression factor associated with the index is determined by dividing the total byte length that has been accumulated by the byte length of the index.
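The estimation steps in the abstract above can be sketched roughly as follows. This is a minimal illustration, not the patented procedure: it assumes RIDs for a key are spread evenly over a RID space (so the average delta is about `rid_space / n`) and that each delta is stored with a variable-byte code carrying 7 payload bits per byte. The names `varbyte_len`, `estimate_compression_factor`, `rid_counts`, and `rid_space` are hypothetical.

```python
import math


def varbyte_len(value: int) -> int:
    """Bytes needed to store a non-negative integer at 7 payload bits per byte."""
    return max(1, math.ceil(value.bit_length() / 7))


def estimate_compression_factor(rid_counts, rid_space, index_bytes):
    """Estimate (compressed RID-delta bytes) / (uncompressed index bytes).

    rid_counts  -- mapping of key value -> estimated number of RIDs (assumed input)
    rid_space   -- size of the RID range the RIDs are spread over (assumption)
    index_bytes -- total byte length of the uncompressed index
    """
    total_delta_bytes = 0
    for n in rid_counts.values():
        if n == 0:
            continue
        # Assumption: evenly spaced RIDs, so every delta is about rid_space / n.
        avg_delta = max(1, rid_space // n)
        # Accumulate the estimated byte length of this RID list's deltas.
        total_delta_bytes += n * varbyte_len(avg_delta)
    return total_delta_bytes / index_bytes


if __name__ == "__main__":
    counts = {"a": 1000, "b": 10}
    # Uncompressed index assumed to hold 1010 RIDs at 4 bytes each.
    print(estimate_compression_factor(counts, 1_000_000, 4040))
```

Under these assumptions, a frequent key yields many small deltas and a rare key yields a few large ones, so the factor depends mostly on how skewed the key distribution is.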
    • 4. Invention patent application
    • COMPRESSABILITY ESTIMATION OF NON-UNIQUE INDEXES IN A DATABASE MANAGEMENT SYSTEM
    • US20090248725A1
    • 2009-10-01
    • US12057055
    • 2008-03-27
    • Bishwaranjan Bhattacharjee; Lipyeow Lim; George A. Mihaila
    • Bishwaranjan Bhattacharjee; Lipyeow Lim; George A. Mihaila
    • G06F17/30
    • G06F17/30312; H03M7/30
    • A method, information processing system, and computer readable storage product estimate a compression factor. A set of key values within an index are analyzed. Each key value is associated with a record identifier (“RID”) list comprising a set of RIDs. The index is in an uncompressed format and includes a total byte length. A number of RIDs associated with each key value is estimated for each key value in the set of key values. A total byte length for all RID deltas between each at least two consecutive RIDs within a RID list is estimated for each RID list based on the number of RIDs that have been determined. The total byte length estimated for each RID list is accumulated. A compression factor associated with the index is determined by dividing the total byte length that has been accumulated by the byte length of the index.
    • 5. Granted invention patent
    • Method and apparatus for selecting an optimal delete-safe compression method on list of delta encoded integers
    • US08990173B2
    • 2015-03-24
    • US12056979
    • 2008-03-27
    • Bishwaranjan Bhattacharjee; Lipyeow Lim; Timothy Ray Malkemus; George Andrei Mihaila
    • Bishwaranjan Bhattacharjee; Lipyeow Lim; Timothy Ray Malkemus; George Andrei Mihaila
    • G06F7/00; H03M7/30
    • H03M7/30
    • Techniques are disclosed for selecting a delete-safe compression method for a plurality of delta encoded data values (e.g., delta encoded integers or deltas). For example, a computer-implemented method for selecting an optimal delete-safe compression algorithm from among two or more compression algorithms for use on a plurality of delta encoded data values includes the following steps. The maximum number of data values eliminated by each of the two or more compression algorithms is computed. For the plurality of delta encoded data values to be compressed, the minimum size of the plurality of delta encoded data values before compression thereof is computed. A delete-safe threshold value is computed based on the minimum size of the plurality of delta encoded data values. Then, the compression algorithm is selected from the two or more compression algorithms that achieves the delete-safe threshold value.
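The abstract above does not spell out what makes a compression method "delete-safe". The sketch below assumes one plausible reading: deleting a value from the list must never make the encoded list larger. It shows why this matters for delta-encoded data, since removing a value merges two deltas into their sum. The two size models (`varbyte_bits`, `fixed_width_bits`) and the checker `is_delete_safe` are hypothetical illustrations, not the selection procedure claimed in the patent.

```python
def to_deltas(values):
    """Delta-encode a sorted list of integers (first delta is taken from 0)."""
    deltas, prev = [], 0
    for v in values:
        deltas.append(v - prev)
        prev = v
    return deltas


def varbyte_bits(deltas):
    """Size in bits if each delta uses a variable-byte code (7 payload bits per byte)."""
    return sum(8 * max(1, -(-d.bit_length() // 7)) for d in deltas)


def fixed_width_bits(deltas):
    """Size in bits if every delta is packed at the width of the largest delta."""
    if not deltas:
        return 0
    width = max(1, max(d.bit_length() for d in deltas))
    return width * len(deltas)


def is_delete_safe(size_fn, values):
    """True if removing any single value never increases the encoded size."""
    before = size_fn(to_deltas(values))
    return all(
        size_fn(to_deltas(values[:i] + values[i + 1:])) <= before
        for i in range(len(values))
    )


if __name__ == "__main__":
    rids = [3, 6, 9, 12, 15, 18, 21, 24]
    print(is_delete_safe(varbyte_bits, rids))
    print(is_delete_safe(fixed_width_bits, rids))
```

For this particular list, deleting the first value turns two deltas of 3 into one delta of 6, which widens every slot in the fixed-width model and makes the list grow, while the variable-byte model only shrinks.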
    • 6. Granted invention patent
    • Method and apparatus for encoding list of variable length structures to support bi-directional scans
    • US08126929B2
    • 2012-02-28
    • US12057012
    • 2008-03-27
    • Bishwaranjan Bhattacharjee; Lipyeow Lim; Timothy Ray Malkemus; George Andrei Mihaila
    • Bishwaranjan Bhattacharjee; Lipyeow Lim; Timothy Ray Malkemus; George Andrei Mihaila
    • G06F17/30
    • G06F17/30958; G06F17/30286
    • Techniques are disclosed for encoding a variable length structure such that it facilitates forward and reverse scans of a list of such structures as needed. While the techniques are applicable to a wide variety of applications, they are particularly well-suited for use with structures such as those found in compressed database indexes. For example, a computer-implemented method for processing one or more variable length data structures includes the following steps. Each variable length data structure is obtained. Each variable length structure comprises one or more data blocks. A variable length encoding process is applied to the one or more blocks of each variable length data structure which comprises setting a continuation data value in each block to a first value or a second value, wherein the setting of the continuation data values enables bi-directional scanning of each variable length structure.
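One way to picture a continuation value that permits scans in both directions is the byte-oriented sketch below. It assumes each block is one byte with 7 payload bits, and that the high bit is 1 when more blocks of the same value follow and 0 on the final block. The patent may assign the two values differently; the function names `encode`, `scan_forward`, and `scan_backward` are hypothetical.

```python
def encode(value: int) -> bytes:
    """Encode a non-negative integer as 7-bit blocks, most significant block first.

    The high bit of each byte is the continuation flag:
    1 = more blocks of this value follow, 0 = this is the last block.
    """
    if value < 0:
        raise ValueError("only non-negative integers are supported")
    blocks = []
    while True:
        blocks.append(value & 0x7F)
        value >>= 7
        if value == 0:
            break
    blocks.reverse()
    out = bytearray(b | 0x80 for b in blocks[:-1])
    out.append(blocks[-1])
    return bytes(out)


def scan_forward(buf: bytes):
    """Decode all values from the start of the buffer to the end."""
    values, acc = [], 0
    for byte in buf:
        acc = (acc << 7) | (byte & 0x7F)
        if not byte & 0x80:  # flag 0 closes the current value
            values.append(acc)
            acc = 0
    return values


def scan_backward(buf: bytes):
    """Decode all values from the end of the buffer to the start."""
    values, i = [], len(buf)
    while i > 0:
        i -= 1
        acc, shift = buf[i] & 0x7F, 7  # final block of a value (flag 0)
        # Preceding bytes with flag 1 belong to the same value.
        while i > 0 and buf[i - 1] & 0x80:
            i -= 1
            acc |= (buf[i] & 0x7F) << shift
            shift += 7
        values.append(acc)
    return values


if __name__ == "__main__":
    data = b"".join(encode(v) for v in [5, 300, 0, 16384])
    print(scan_forward(data))
    print(scan_backward(data))
```

The backward scan works only when it starts at a value boundary, such as the end of the list; that is an assumption of this sketch rather than something the abstract states.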
    • 9. Granted invention patent
    • Method and apparatus for organizing data sources
    • US07529740B2
    • 2009-05-05
    • US11503713
    • 2006-08-14
    • Yuan-chi Chang; Lipyeow Lim; Min Wang; Zhen Zhang
    • Yuan-chi Chang; Lipyeow Lim; Min Wang; Zhen Zhang
    • G06F17/30
    • G06F17/30705; Y10S707/99933; Y10S707/99953
    • A method for organizing deep Web services is provided. In one aspect, the method obtains a collection of sources and their associated attributes and/or input modes, for instance, using a crawling algorithm. The method uses this information to organize the sources into communities. A mining algorithm such as the hyperclique mining algorithm is used to obtain cliques of highly correlated attributes. A clustering algorithm such as the hierarchical agglomerative clustering algorithm is used to further cluster the cliques of attributes into larger cliques, which in the present disclosure are referred to as signatures. The sources that are associated with each signature form a community, and a graph representation of the communities is constructed, where the vertices are communities and the edges are the shared attributes.
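The last step of the abstract above, building a graph whose vertices are communities and whose edges are shared attributes, can be sketched as follows. The mining and clustering stages are skipped: the signatures are taken as given. The rule that a source joins a community when it has at least one attribute in the signature is an assumption made for this illustration, and `build_community_graph` and the sample source names are hypothetical.

```python
from itertools import combinations


def build_community_graph(source_attrs, signatures):
    """Return (communities, edges) for the given sources and signatures.

    source_attrs -- mapping of source name -> set of attribute names
    signatures   -- mapping of signature name -> set of attribute names
    """
    # Assumption: a source is associated with a signature if they share an attribute.
    communities = {
        name: {src for src, attrs in source_attrs.items() if attrs & sig}
        for name, sig in signatures.items()
    }
    # An edge links two communities whose signatures share attributes;
    # the edge label is the set of shared attributes.
    edges = {}
    for a, b in combinations(sorted(signatures), 2):
        shared = signatures[a] & signatures[b]
        if shared:
            edges[(a, b)] = shared
    return communities, edges


if __name__ == "__main__":
    sources = {
        "flights.example": {"origin", "destination", "date"},
        "hotels.example": {"city", "date"},
        "books.example": {"title", "author"},
    }
    sigs = {
        "travel": {"origin", "destination", "date"},
        "lodging": {"city", "date"},
        "books": {"title", "author"},
    }
    communities, edges = build_community_graph(sources, sigs)
    print(communities)
    print(edges)
```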