会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Method and system for generating a statistical summary of a database using a join synopsis
    • 使用连接概要生成数据库的统计摘要的方法和系统
    • US06477534B1
    • 2002-11-05
    • US09480261
    • 2000-01-11
    • Swarup AcharyaPhillip B. GibbonsViswanath Poosala
    • Swarup AcharyaPhillip B. GibbonsViswanath Poosala
    • G06F1730
    • G06F17/30457G06F17/30489G06F17/30536Y10S707/99933Y10S707/99935
    • A method for generating an approximate answer to a query in a database environment in which the database has a plurality of base relations. A query relating to a database is received, and an approximate answer to the query is generated such that the approximate answer is based on at least one join synopsis formed from the database. The method further includes steps of forming a sample-tuple set for at least one selected base relation of a plurality of base relations of a database such that each sample-tuple set contains at least one sample tuple from a corresponding base relation, and forming a join synopsis set for each selected base relation such that each join synopsis set contains a join synopsis for each sample tuple in a sample-tuple set. A join synopsis of a sample tuple is based on a join of the sample tuple and at least one descendent relation of the sample tuple. All join synopsis sets form a statistical summary of the database and are stored.
    • 一种在数据库具有多个基本关系的数据库环境中生成关于查询的近似答案的方法。 接收到与数据库相关的查询,并且产生查询的近似答案,使得近似答案基于从数据库形成的至少一个连接概要。 该方法还包括以下步骤:为数据库的多个基本关系中的至少一个选定的基本关系形成样本组元组,使得每个样本组元组包含来自相应基础关系的至少一个样本元组,并形成 为每个选定的基础关系加入概要集,使得每个连接概要集包含样本元组中每个样本元组的连接概要。 样本元组的连接概要基于样本元组的连接和样本元组的至少一个后代关系。 所有连接概要集形成数据库的统计摘要,并存储。
    • 2. 发明授权
    • Join synopsis-based approximate query answering
    • 加入基于概要的近似查询回答
    • US06912524B2
    • 2005-06-28
    • US10216295
    • 2002-08-12
    • Swarup AcharyaPhillip B. GibbonsViswanath PoosalaSridhar Ramaswamy
    • Swarup AcharyaPhillip B. GibbonsViswanath PoosalaSridhar Ramaswamy
    • G06F17/30
    • G06F17/30489G06F17/30536Y10S707/99933Y10S707/99934
    • A method for generating an approximate answer to a query in a database environment in which the database has a plurality of base relations. A query relating to a database is received, and an approximate answer to the query is generated such that the approximate answer is based on at least one join synopsis formed from the database. The method further includes steps of forming a sample-tuple set for at least one selected base relation of a plurality of base relations of a database such that each sample-tuple set contains at least one sample tuple from a corresponding base relation, and forming a join synopsis set for each selected base relation such that each join synopsis set contains a join synopsis for each sample tuple in a sample-tuple set. A join synopsis of a sample tuple is based on a join of the sample tuple and at least one descendent relation of the sample tuple. All join synopsis sets form a statistical summary of the database and are stored.
    • 一种在数据库具有多个基本关系的数据库环境中生成关于查询的近似答案的方法。 接收到与数据库相关的查询,并且产生查询的近似答案,使得近似答案基于从数据库形成的至少一个连接概要。 该方法还包括以下步骤:为数据库的多个基本关系中的至少一个选定的基本关系形成样本组元组,使得每个样本组元组包含来自相应基础关系的至少一个样本元组,并形成 为每个选定的基础关系加入概要集,使得每个连接概要集包含样本元组中每个样本元组的连接概要。 样本元组的连接概要基于样本元组的连接和样本元组的至少一个后代关系。 所有连接概要集形成数据库的统计摘要,并存储。
    • 4. 发明授权
    • Incremental maintenance of an approximate histogram in a database system
    • 在数据库系统中增加一个近似直方图的维护
    • US5870752A
    • 1999-02-09
    • US915804
    • 1997-08-21
    • Phillip B. GibbonsYossi MatiasViswanath PoosalaAndrew Witkowski
    • Phillip B. GibbonsYossi MatiasViswanath PoosalaAndrew Witkowski
    • G06F17/30
    • G06F17/30368Y10S707/99942Y10S707/99943Y10S707/99945
    • Techniques for maintaining an approximate histogram of a relation in a database, in the presence of updates to the relation. The histogram includes a number of subsets, or "buckets," each representing at least one possible value of an attribute of the relation. Each of the subsets has a count associated therewith indicative of the frequency of occurrence of the corresponding value of the attribute. After an update to the relation, the counts associated with the subsets are compared to a threshold. If the count associated with a given subset exceeds the threshold, the given subset is separated at its median into two separate subsets. After the separation operation, the two subsets with the lowest counts are combined such that a constant number of subsets are maintained in the histogram, if the total combined count of the subsets does not exceed the threshold. If no two subsets have a total combined count which does not exceed the threshold, the histogram is recomputed from a random sample of the relation. The invention substantially reduces the number of times the histogram must be recomputed from the random sample, and is particularly well-suited for use with approximate equi-depth and compressed histograms.
    • 在关系更新的情况下维护数据库中关系的近似直方图的技术。 直方图包括多个子集或“桶”,每个子集表示该关系属性的至少一个可能的值。 每个子集具有与其相关联的计数,指示属性的相应值的出现频率。 在更新关系后,将与子集关联的计数与阈值进行比较。 如果与给定子集相关联的计数超过阈值,则给定子集在其中间被分离成两个单独的子集。 在分离操作之后,组合具有最低计数的两个子集,使得如果子集的总组合计数不超过阈值,则在直方图中维持恒定数量的子集。 如果没有两个子集具有不超过阈值的总组合计数,则从该关系的随机样本重新计算直方图。 本发明基本上减少了从随机样本重新计算直方图的次数,并且特别适用于近似等深度和压缩的直方图。
    • 5. 发明授权
    • Maintaining a random sample of a relation in a database in the presence
of updates to the relation
    • 在关系更新的情况下,在数据库中维护关系的随机抽样
    • US6012064A
    • 2000-01-04
    • US915774
    • 1997-08-21
    • Phillip B. GibbonsYossi MatiasViswanath Poosala
    • Phillip B. GibbonsYossi MatiasViswanath Poosala
    • G06F17/30
    • G06F17/30595Y10S707/954Y10S707/99933Y10S707/99934Y10S707/99942Y10S707/99943Y10S707/99944
    • Techniques for maintaining a random sample of a relation in a database in the presence of updates to the relation. The random sample of the relation is referred to as a "backing sample," and it is maintained in the presence of insert, modify and delete operations involving the relation. When a new tuple is inserted into the relation, a sample of the given tuple is added to the backing sample if the size of the backing sample is below an upper bound. Otherwise, a randomly-selected tuple of the backing sample is replaced with the new tuple if a sample of the new tuple must be inserted into the backing sample to maintain randomness or another characteristic. When a tuple in the relation is the subject of a modify operation, the backing sample is left unchanged if the modify operation does not affect an attribute of interest to an application which uses the backing sample. Otherwise, a value field in a sample of the tuple in the backing sample is updated. When a tuple is deleted from the relation, any sample of that tuple in the backing sample is removed. A new backing sample may be computed if this removal causes the size of the backing sample to fall below a prespecified lower bound. The backing sample can be of a size which is negligible in comparison to the relation, and need only be modified very infrequently. As a result, its overhead in terms of computation time and storage space is minimal.
    • 在存在关系更新的情况下,在数据库中维护关系随机抽样的技术。 该关系的随机样本被称为“后备样本”,并且在存在涉及该关系的插入,修改和删除操作的情况下保持该样本。 当一个新元组被插入关系中时,如果背景样本的大小低于上限,则将给定元组的样本添加到背景样本中。 否则,如果必须将新元组的样本插入到背景样本中以保持随机性或其他特征,则将随机选择的背衬样本的元组替换为新的元组。 当关系中的元组是修改操作的主题时,如果修改操作不影响使用后备样本的应用程序感兴趣的属性,则后备样本将保持不变。 否则,将更新背景样本中的元组样本中的值字段。 当从该关系中删除元组时,将删除该背景样本中该元组的任何样本。 如果这种去除导致背衬样品的尺寸低于预先指定的下限,则可以计算新的背衬样品。 背衬样本的尺寸可以与关系相比可以忽略不计,并且只需要非常频繁地修改。 因此,其在计算时间和存储空间方面的开销是最小的。
    • 9. 发明授权
    • Methods and apparatus for scheduling parallel processors
    • 调度并行处理器的方法和装置
    • US06434590B1
    • 2002-08-13
    • US09053873
    • 1998-04-01
    • Guy E. BlellochPhillip B. GibbonsYossi MatiasGirija J. Narlikar
    • Guy E. BlellochPhillip B. GibbonsYossi MatiasGirija J. Narlikar
    • G06F900
    • G06F9/5066G06F2209/5021
    • A parallel processing method involves the steps of determining a sequential ordering of tasks for processing, assigning priorities to available tasks on the basis of the earliest and then later in the sequential ordering, selecting a number of tasks greater than a total number of available parallel processing elements from all available tasks having the highest priorities, partitioning the selected tasks into a number of groups equal to the available number of parallel processing elements, and executing the tasks in the groups in the parallel processing elements. The determining step establishes an ordering with a specific predetermined sequential schedule that is independent of the parallel execution, and the assigning step assigns priorities for parallel execution on the basis of the sequential schedule that is independent of the parallel execution.
    • 并行处理方法包括以下步骤:确定用于处理的任务的顺序排序,基于顺序排序中的最早然后稍后的顺序为可用任务分配优先级,选择大于总数的可用并行处理 来自具有最高优先级的所有可用任务的元素,将所选择的任务划分成等于可用数量的并行处理元素的多个组,以及在并行处理元素中执行组中的任务。 确定步骤建立具有独立于并行执行的特定预定顺序调度的排序,并且分配步骤基于独立于并行执行的顺序调度分配用于并行执行的优先级。
    • 10. 发明授权
    • Distinct sampling system and a method of distinct sampling for optimizing distinct value query estimates
    • 不同的抽样系统和不同抽样的方法来优化不同的价值查询估计
    • US07047230B2
    • 2006-05-16
    • US10237993
    • 2002-09-09
    • Phillip B. Gibbons
    • Phillip B. Gibbons
    • G06F17/30
    • G06F17/30536G06F17/30457G06F17/30489Y10S707/99932Y10S707/99944
    • For use with a database that accommodates distinct value queries having predicates, a distinct sampling system and a method of distinct sampling. In one embodiment, the distinct sampling system includes a scanning subsystem that is configured to scan each row in the database for a distinct target attribute, employ a hash function to map the distinct target attribute to an attribute priority level, maintain random samples of each row based on a sample priority level and a sample size, and produce a distinct sample therefrom. The distinct sampling system further includes a distinct query estimator that is configured to receive the distinct value queries, cause the distinct value queries to be executed on the distinct sample to retrieve a result, and adjust the result to produce a distinct estimate therefrom.
    • 用于容纳具有谓词的不同值查询的数据库,独特的采样系统和不同采样的方法。 在一个实施例中,不同采样系统包括扫描子系统,其被配置为扫描数据库中的每一行以获得不同的目标属性,采用散列函数将不同的目标属性映射到属性优先级,维护每行的随机采样 基于样本优先级和样本大小,并从中产生不同的样本。 不同的采样系统还包括被配置为接收不同值查询的不同查询估计器,导致在不同样本上执行不同值查询以检索结果,并且调整结果以从其产生不同的估计。