会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 21. 发明申请
    • Method and apparatus for organizing data sources
    • 组织数据源的方法和装置
    • US20080040326A1
    • 2008-02-14
    • US11503713
    • 2006-08-14
    • Yuan-chi ChangLipyeow LimMin WangZhen Zhang
    • Yuan-chi ChangLipyeow LimMin WangZhen Zhang
    • G06F17/30
    • G06F17/30705Y10S707/99933Y10S707/99953
    • A method and apparatus for organizing deep Web services are provided. In one aspect, the method and apparatus obtains a collection of sources and their associated attributes and/or input modes, for instance, using a crawling algorithm. The method and apparatus uses this information to organize the sources into communities. A mining algorithm such as the hyperclique mining algorithm is used to obtain cliques of highly correlated attributes. A clustering algorithm such as the hierarchical agglomerative clustering algorithm is used to further cluster the cliques of attributes into larger cliques, which in the present disclosure is referred to as signatures. The sources that are associated with each signature form a community and a graph representation of the communities is constructed, where the vertices are communities and the edges are the shared attributes.
    • 提供了一种用于组织深度Web服务的方法和装置。 在一个方面,该方法和装置例如使用爬行算法获得源及其相关属性和/或输入模式的集合。 该方法和装置使用这些信息将资源组织到社区。 使用诸如超临界挖掘算法的挖掘算法来获得高度相关属性的集合。 使用诸如分层聚类聚类算法的聚类算法进一步将属性集合聚类成更大的团块,其在本公开中被称为签名。 与每个签名相关联的源构成社区,并构建社区的图形表示,其中顶点是社区,边是共享属性。
    • 22. 发明授权
    • Dynamic maintenance of web indices using landmarks
    • 使用地标动态维护网络索引
    • US07299404B2
    • 2007-11-20
    • US10430049
    • 2003-05-06
    • Ramesh C. AgarwalLipyeow LimSriram K. PadmanabhanMin Wang
    • Ramesh C. AgarwalLipyeow LimSriram K. PadmanabhanMin Wang
    • G06F15/00
    • G06F17/2288G06F17/218G06F17/2247
    • A repository index records the position of document entries relative to landmark entries within the document. Landmark entries are selecting using a landmarking policy and their position relative to the document are stored in a landmark directory. During index updates, an edit transcript is generated describing the difference between old and new document versions, and both the document repository index and the landmark directory are updated as needed. Thus, the number of update operations preformed as compared with conventional indexing techniques may be substantially reduced when small, localized changes are made to the document. This is due to fact that the positions of document entries are recorded relative to the landmark entries rather than the document itself. By doing so, the document index becomes more shift-invariant, requiring fewer update operations when entries are added or inserted in localized areas of the document.
    • 存储库索引记录文档条目相对于文档中的地标条目的位置。 地标条目正在使用标记政策进行选择,并且相对于文档的位置将存储在地标目录中。 在索引更新期间,生成描述旧文档版本和新文档版本之间的差异的编辑脚本,并根据需要更新文档存储库索引和地标目录。 因此,与传统的索引技术相比,执行的更新操作的数量可以在对文档进行小的局部改变时被显着地减少。 这是因为文件条目的位置相对于里程碑条目而不是文档本身被记录。 通过这样做,文档索引变得越来越不变,当在文档的本地化区域中添加或插入条目时,需要更少的更新操作。
    • 23. 发明授权
    • Classification-based method and apparatus for string selectivity estimation
    • 用于字符串选择性估计的基于分类的方法和装置
    • US07987180B2
    • 2011-07-26
    • US12057885
    • 2008-03-28
    • Lipyeow LimMin Wang
    • Lipyeow LimMin Wang
    • G06F7/00
    • G06F17/3071Y10S707/99932Y10S707/99936Y10S707/99942
    • Histogram construction and selectivity estimation for string and substring match queries in databases of data having strings associated with attributes. The histogram construction counts string-attribute pairs in the documents, and outputs string-attribute-count triples sorted by count. The collection is partitioned into buckets. A synopsis is generated for the partition, having an average selectivity or count of the string-attribute-count triples in the partition and summary information representing the set of string-attribute pairs belonging to the bucket. Subsequent queries, both for exact and substring matches, use the synopsis to estimate the selectivity of buckets.
    • 字符串和子串的直方图构造和选择性估计在具有与属性相关联的字符串的数据的数据库中匹配查询。 直方图构造计算文档中的字符串属性对,并输出按count排序的字符串属性计数三元组。 集合被分成桶。 为分区生成概要,具有分区中的字符串属性计数三元组的平均选择性或计数,以及表示属于该分组的字符串属性对的集合的摘要信息。 随后的查询(对于精确和子串匹配)使用概要来估计存储桶的选择性。
    • 28. 发明授权
    • Semantic-aware record matching
    • 语义感知记录匹配
    • US08468160B2
    • 2013-06-18
    • US12610101
    • 2009-10-30
    • Oktie HassanzadehAnastasios KementsietsidisLipyeow LimMin Wang
    • Oktie HassanzadehAnastasios KementsietsidisLipyeow LimMin Wang
    • G06F17/30
    • G06F17/2785G06F17/30734
    • A method of semantic-aware record matching includes receiving source and target string record specifications associated with a source string record and a target string record, receiving semantic knowledge referring to tokens of the source string record and target string record, creating a first set of tokens for the source string record and a second set of tokens for the target string record based on the semantic knowledge, assigning a similarity score to the source string record and the target string record based on a semantic relationship between the first set of tokens and the second set of tokens, and matching the source string record and the target string record based on the similarity score.
    • 语义感知记录匹配的方法包括接收与源字符串记录和目标字符串记录相关联的源和目标字符串记录规范,接收引用源字符串记录和目标字符串记录的令牌的语义知识,创建第一组令牌 对于源字符串记录和基于语义知识的目标字符串记录的第二组令牌,基于第一组令牌和第二组令牌之间的语义关系向源字符串记录和目标字符串记录分配相似性分数 一组令牌,并根据相似性得分匹配源字符串记录和目标字符串记录。
    • 29. 发明授权
    • Ontology-based searching in database systems
    • 数据库系统中基于本体的搜索
    • US08135730B2
    • 2012-03-13
    • US12481009
    • 2009-06-09
    • Lipyeow LimAnastasios KementsietsidisMin Wang
    • Lipyeow LimAnastasios KementsietsidisMin Wang
    • G06F17/30
    • G06F17/3064G06Q10/10G06Q50/22
    • A method, information processing system, and computer program storage product retrieve data from a database. A search request is received from a user for a set of data in at least one database. An ontology query over is performed over at least one ontology associated with at least one database resulting in an ontological dataset associated with the search request in response to receiving the search request from the user. The ontological dataset includes at least one of a set of synonyms, a set of hypernyms, and a set of hyponyms, associated with the search request. A data query is performed over data in the at least one database using the ontological dataset in response to performing the ontology query. The set of data is returned to the user based on the data query that has been performed.
    • 方法,信息处理系统和计算机程序存储产品从数据库检索数据。 对于至少一个数据库中的一组数据,从用户接收搜索请求。 响应于接收到来自用户的搜索请求,在与至少一个数据库相关联的至少一个本体上执行本体查询,导致与搜索请求相关联的本体数据集。 本体数据集包括与搜索请求相关联的一组同义词,一组超词和一组下位词中的至少一个。 响应于执行本体查询,使用本体数据集在至少一个数据库中的数据上执行数据查询。 基于已经执行的数据查询,该组数据被返回给用户。