    • 1. Granted invention patent
    • System and method for searching computer files and returning identified files and associated files
    • US07930301B2
    • 2011-04-19
    • US10403063
    • 2003-03-31
    • Cezary Marcjan; Ryszard Kott; Surajit Chaudhuri; Lili Cheng
    • G06F7/00
    • G06F17/30106
    • A search of an index database or another search method is conducted to identify preliminary results listing one or more selected computer objects having selected identifying information stored in an index database. In addition, one or more selected computer objects of the preliminary search results are correlated with one or more other computer objects that have associations with the selected computer objects of the preliminary search results. Integrated search results are then returned and include the preliminary search results and one or more other computer objects that have associations with the selected computer objects of the preliminary search results. The associations may be determined by an association system and represent relationships between computer files based upon user or other interactions between the objects. The associations between the objects may include similarities between them and their importance.
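The integrated search described in this abstract can be illustrated with a minimal sketch. It assumes the index is a plain dictionary from query term to matching objects and that the association system is a dictionary of pairwise strengths; the names `integrated_search`, `associations`, and `min_strength` are hypothetical and do not come from the patent.

```python
def integrated_search(query, index, associations, min_strength=0.5):
    """Return index hits followed by objects associated with those hits."""
    # Step 1: preliminary results straight from the index.
    preliminary = sorted(index.get(query, set()))

    # Step 2: collect objects associated with any preliminary hit,
    # keeping the strongest association seen for each one.
    related = {}
    for obj in preliminary:
        for other, strength in associations.get(obj, {}).items():
            if other not in preliminary and strength >= min_strength:
                related[other] = max(strength, related.get(other, 0.0))

    # Step 3: integrated results = preliminary hits + related objects,
    # strongest associations first (ties broken by name).
    ranked_related = sorted(related, key=lambda o: (-related[o], o))
    return preliminary + ranked_related


# Hypothetical data: file names and strengths are made up for illustration.
index = {"budget": {"budget.xlsx", "budget_notes.txt"}}
associations = {
    "budget.xlsx": {"q3_forecast.pptx": 0.9, "lunch_menu.pdf": 0.1},
    "budget_notes.txt": {"q3_forecast.pptx": 0.6, "meeting.ics": 0.7},
}
results = integrated_search("budget", index, associations)
print(results)
```

Here `q3_forecast.pptx` and `meeting.ics` never match the query term but are returned because of their associations with the preliminary hits, while the weakly associated `lunch_menu.pdf` is filtered out.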
    • 5. Invention application
    • PSEUDO-DOCUMENTS TO FACILITATE DATA DISCOVERY
    • US20130275436A1
    • 2013-10-17
    • US13444717
    • 2012-04-11
    • Surajit Chaudhuri; Lev Novik; John C. Platt
    • G06F17/30
    • G06F16/319; G06F16/245
    • Various embodiments promote the discoverability of data that can be contained within a database. In one or more embodiments, data within a database is organized in a structure having a schema. The structure and data can be processed in a manner that renders one or more pseudo-documents, each of which constitutes a sub-structure that can be indexed. Once produced and indexed, the pseudo-documents constitute a set of searchable objects, each of which relationally points back to its associated structure within the database. Searches can now be performed against the pseudo-documents, which in turn return a set of search results. The set of search results can include multiple sub-sets of pseudo-documents, each sub-set of which is associated with a different structure.
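A rough sketch of the pseudo-document idea follows. It assumes each database row is flattened into one text pseudo-document that keeps a pointer (table name and key) back to its source structure, and that indexing is a simple token-level inverted index; all function names and the sample data are hypothetical.

```python
from collections import defaultdict


def build_pseudo_documents(database):
    """Flatten each row into a text pseudo-document that points back to its table and key."""
    docs = []
    for table, rows in database.items():
        for key, row in rows.items():
            text = " ".join(str(value) for value in row.values()).lower()
            docs.append({"table": table, "key": key, "text": text})
    return docs


def build_index(docs):
    """Inverted index: token -> ids of the pseudo-documents containing it."""
    index = defaultdict(set)
    for doc_id, doc in enumerate(docs):
        for token in doc["text"].split():
            index[token].add(doc_id)
    return index


def search(term, docs, index):
    """Return matching row keys grouped by the structure (table) they point back to."""
    grouped = defaultdict(list)
    for doc_id in sorted(index.get(term.lower(), ())):
        grouped[docs[doc_id]["table"]].append(docs[doc_id]["key"])
    return dict(grouped)


# Hypothetical two-table database.
database = {
    "customers": {1: {"name": "Ada Lovelace", "city": "London"}},
    "orders": {10: {"item": "London map", "qty": 2}, 11: {"item": "Compass", "qty": 1}},
}
docs = build_pseudo_documents(database)
index = build_index(docs)
hits = search("London", docs, index)
print(hits)
```

The result contains one sub-set of hits per source table, which mirrors the abstract's point that a single search can return pseudo-documents associated with different structures.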
    • 6. Invention application
    • INTEGRATED FUZZY JOINS IN DATABASE MANAGEMENT SYSTEMS
    • US20130091120A1
    • 2013-04-11
    • US13253315
    • 2011-10-05
    • Kris Ganjam; Vivek Ravindranath Narasayya; Raghav Kaushik; Arvind Arasu; Surajit Chaudhuri
    • G06F17/30
    • G06F17/30303; G06F17/30533
    • A fuzzy joins system that is integrated in a database system generates fuzzy joins between records from two datasets. The fuzzy joins system includes a tokenizer to generate tokens for data records and a transformer to find transforms for the tokens. The fuzzy joins system invokes a signature generator, running within a runtime layer of the database system, to generate signatures for data records based on the tokens and their transforms. Subsequently, an equi-join operation joins the records from the two datasets with at least one equal signature. A similarity calculator, running within a runtime layer of the database system, computes a similarity measure using the token information of the joined records. If the similarity measure for any two records is above a threshold, the fuzzy joins system generates a fuzzy join between such two records.
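The pipeline in this abstract (tokenize, transform, generate signatures, equi-join on signatures, then verify with a similarity threshold) can be sketched in a few lines. This is only an illustration under simplifying assumptions: signatures are taken to be the canonicalized tokens themselves, similarity is Jaccard over tokens, and nothing runs inside a database runtime. The function names and the `transforms` table are hypothetical.

```python
def tokenize(record):
    """Tokenizer: lowercase whitespace tokens."""
    return set(record.lower().split())


def canonical(tokens, transforms):
    """Transformer: map each token to its canonical form if a transform exists."""
    return {transforms.get(token, token) for token in tokens}


def fuzzy_join(left, right, transforms, threshold=0.5):
    # Signature generation (assumption: one signature per canonical token).
    buckets = {}
    for record in right:
        for signature in canonical(tokenize(record), transforms):
            buckets.setdefault(signature, set()).add(record)

    joined = []
    for l_rec in left:
        l_tokens = canonical(tokenize(l_rec), transforms)
        # Equi-join step: candidates share at least one equal signature.
        candidates = set()
        for signature in l_tokens:
            candidates |= buckets.get(signature, set())
        # Similarity step: keep pairs whose Jaccard similarity exceeds the threshold.
        for r_rec in sorted(candidates):
            r_tokens = canonical(tokenize(r_rec), transforms)
            similarity = len(l_tokens & r_tokens) / len(l_tokens | r_tokens)
            if similarity > threshold:
                joined.append((l_rec, r_rec, round(similarity, 2)))
    return joined


# Hypothetical transforms and datasets.
transforms = {"bob": "robert", "st": "street"}
left = ["Bob Smith", "Main St"]
right = ["Robert Smith", "Main Street Cafe", "Alice Jones"]
pairs = fuzzy_join(left, right, transforms)
print(pairs)
```

The equi-join on signatures keeps the expensive similarity computation away from pairs such as "Bob Smith" and "Alice Jones" that share no signature at all.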
    • 7. Granted invention patent
    • Transformation rule profiling for a query optimizer
    • US08332388B2
    • 2012-12-11
    • US12818237
    • 2010-06-18
    • Surajit Chaudhuri; Leo Giakoumakis; Vivek Narasayya; Ravi Ramamurthy
    • G06F7/00; G06F17/30
    • G06F17/30463
    • Technology is described for transformation rule profiling for a query optimizer. The method can include obtaining a database query configured to be optimized by the query optimizer of a database system. An optimized query plan for the database query can be found using a host set of transformation rules. One transformation rule can be removed and checked at a time. Each transformation rule can be checked to determine whether the transformation rule affects an optimal query plan output. A test query plan can be generated after each transformation rule has been removed. The query optimizer can determine whether the test query plan is different than the optimized query plan in the absence of the removed transformation rule. An equivalent set of transformation rules can be created that includes transformation rules where the test query plan generated from the equivalent set of transformation rules is equivalent to the optimized plan.
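The leave-one-out profiling loop in this abstract can be sketched against a toy optimizer. Everything here is an assumption made for illustration: plans are plain strings, rules are string rewrites, and `optimize` simply applies rules until nothing changes. A real query optimizer is cost-based and far more involved.

```python
def optimize(query, rules):
    """Toy optimizer: apply rewrite rules repeatedly until the plan stops changing."""
    plan = query
    changed = True
    while changed:
        changed = False
        for name in sorted(rules):
            new_plan = rules[name](plan)
            if new_plan != plan:
                plan, changed = new_plan, True
    return plan


def profile_rules(query, rules):
    """Remove one rule at a time and record which removals change the plan."""
    baseline = optimize(query, rules)
    relevant = {}
    for name in rules:
        reduced = {n: r for n, r in rules.items() if n != name}
        # Test plan generated in the absence of the removed rule.
        if optimize(query, reduced) != baseline:
            relevant[name] = rules[name]
    # Check that the reduced rule set still yields the optimized plan.
    is_equivalent = optimize(query, relevant) == baseline
    return baseline, sorted(relevant), is_equivalent


# Hypothetical rewrite rules over a string-encoded plan.
rules = {
    "push_filter": lambda p: p.replace("FILTER(JOIN(A,B))", "JOIN(FILTER(A),B)"),
    "use_index": lambda p: p.replace("FILTER(A)", "INDEX_SEEK(A)"),
    "merge_sorts": lambda p: p.replace("SORT(SORT(A))", "SORT(A)"),
}
baseline, relevant, is_equivalent = profile_rules("FILTER(JOIN(A,B))", rules)
print(baseline, relevant, is_equivalent)
```

For this query the `merge_sorts` rule never fires, so removing it leaves the plan unchanged and it is excluded from the equivalent rule set.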
    • 8. Invention application
    • TRANSFORMATION RULE PROFILING FOR A QUERY OPTIMIZER
    • US20110314000A1
    • 2011-12-22
    • US12818237
    • 2010-06-18
    • Surajit Chaudhuri; Leo Giakoumakis; Vivek Narasayya; Ravi Ramamurthy
    • G06F17/30
    • G06F17/30463
    • Technology is described for transformation rule profiling for a query optimizer. The method can include obtaining a database query configured to be optimized by the query optimizer of a database system. An optimized query plan for the database query can be found using a host set of transformation rules. One transformation rule can be removed and checked at a time. Each transformation rule can be checked to determine whether the transformation rule affects an optimal query plan output. A test query plan can be generated after each transformation rule has been removed. The query optimizer can determine whether the test query plan is different than the optimized query plan in the absence of the removed transformation rule. An equivalent set of transformation rules can be created that includes transformation rules where the test query plan generated from the equivalent set of transformation rules is equivalent to the optimized plan.
    • 9. Granted invention patent
    • Transformation-based framework for record matching
    • US08032546B2
    • 2011-10-04
    • US12031715
    • 2008-02-15
    • Arvind Arasu; Surajit Chaudhuri
    • G06F7/00; G06F17/30
    • G06F17/30569; G06F17/30675; G06F17/30985
    • A transformation-based record matching technique. The technique provides a flexible way to account for synonyms and more general forms of string equivalences when performing record matching by taking as explicit input user-defined transformation rules (such as, for example, the fact that “Robert” and “Bob” are synonymous). The input string and user-defined transformation rules are used to generate a larger set of strings which are used when performing record matching. Both the input string and data elements in a database can be transformed using the user-defined transformation rules in order to generate a larger set of potential record matches. These potential record matches can then be subjected to a threshold test in order to determine one or more best matches. Additionally, signature-based similarity functions are used to improve the computational efficiency of the technique.
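As a minimal sketch of the matching idea in this abstract, the code below expands both the input string and each database record with user-defined token rules, then applies a threshold to the best similarity between any pair of variants. It assumes Jaccard similarity over tokens and omits the signature-based speed-up the abstract mentions; the names `expand`, `match`, and the rule table are hypothetical.

```python
def expand(text, rules):
    """Return the string plus every variant reachable by applying token-level rules."""
    variants = {text.lower()}
    frontier = [text.lower()]
    while frontier:
        tokens = frontier.pop().split()
        for i, token in enumerate(tokens):
            for replacement in rules.get(token, ()):
                variant = " ".join(tokens[:i] + [replacement] + tokens[i + 1:])
                if variant not in variants:
                    variants.add(variant)
                    frontier.append(variant)
    return variants


def jaccard(a, b):
    """Token-level Jaccard similarity between two non-empty strings."""
    tokens_a, tokens_b = set(a.split()), set(b.split())
    return len(tokens_a & tokens_b) / len(tokens_a | tokens_b)


def match(query, records, rules, threshold=0.6):
    """Keep records whose best variant-to-variant similarity passes the threshold."""
    query_variants = expand(query, rules)
    matches = []
    for record in records:
        best = max(
            jaccard(q, r) for q in query_variants for r in expand(record, rules)
        )
        if best >= threshold:
            matches.append((record, round(best, 2)))
    return sorted(matches, key=lambda m: (-m[1], m[0]))


# User-defined transformation rules, using the abstract's "Robert"/"Bob" example.
rules = {"bob": ["robert"], "robert": ["bob"]}
records = ["Robert Smith", "Bob Smith Jr", "Alice Smith"]
matches = match("Bob Smith", records, rules)
print(matches)
```

Without the rule, "Bob Smith" and "Robert Smith" would share only one of three distinct tokens and fall below the threshold; with it they match exactly.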