专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

21. 发明申请

US20080040326A1 Method and apparatus for organizing data sources 有权
标题翻译：组织数据源的方法和装置
公开(公告)号：US20080040326A1
公开(公告)日：2008-02-14
申请号：US11503713
申请日：2006-08-14
申请人： Yuan-chi Chang , Lipyeow Lim , Min Wang , Zhen Zhang
发明人： Yuan-chi Chang , Lipyeow Lim , Min Wang , Zhen Zhang
IPC分类号： G06F17/30
CPC分类号： G06F17/30705 , Y10S707/99933 , Y10S707/99953
摘要： A method and apparatus for organizing deep Web services are provided. In one aspect, the method and apparatus obtains a collection of sources and their associated attributes and/or input modes, for instance, using a crawling algorithm. The method and apparatus uses this information to organize the sources into communities. A mining algorithm such as the hyperclique mining algorithm is used to obtain cliques of highly correlated attributes. A clustering algorithm such as the hierarchical agglomerative clustering algorithm is used to further cluster the cliques of attributes into larger cliques, which in the present disclosure is referred to as signatures. The sources that are associated with each signature form a community and a graph representation of the communities is constructed, where the vertices are communities and the edges are the shared attributes.
摘要翻译：提供了一种用于组织深度Web服务的方法和装置。在一个方面，该方法和装置例如使用爬行算法获得源及其相关属性和/或输入模式的集合。该方法和装置使用这些信息将资源组织到社区。使用诸如超临界挖掘算法的挖掘算法来获得高度相关属性的集合。使用诸如分层聚类聚类算法的聚类算法进一步将属性集合聚类成更大的团块，其在本公开中被称为签名。与每个签名相关联的源构成社区，并构建社区的图形表示，其中顶点是社区，边是共享属性。

22. 发明授权

US07299404B2 Dynamic maintenance of web indices using landmarks 有权
标题翻译：使用地标动态维护网络索引
公开(公告)号：US07299404B2
公开(公告)日：2007-11-20
申请号：US10430049
申请日：2003-05-06
申请人： Ramesh C. Agarwal , Lipyeow Lim , Sriram K. Padmanabhan , Min Wang
发明人： Ramesh C. Agarwal , Lipyeow Lim , Sriram K. Padmanabhan , Min Wang
IPC分类号： G06F15/00
CPC分类号： G06F17/2288 , G06F17/218 , G06F17/2247
摘要： A repository index records the position of document entries relative to landmark entries within the document. Landmark entries are selecting using a landmarking policy and their position relative to the document are stored in a landmark directory. During index updates, an edit transcript is generated describing the difference between old and new document versions, and both the document repository index and the landmark directory are updated as needed. Thus, the number of update operations preformed as compared with conventional indexing techniques may be substantially reduced when small, localized changes are made to the document. This is due to fact that the positions of document entries are recorded relative to the landmark entries rather than the document itself. By doing so, the document index becomes more shift-invariant, requiring fewer update operations when entries are added or inserted in localized areas of the document.
摘要翻译：存储库索引记录文档条目相对于文档中的地标条目的位置。地标条目正在使用标记政策进行选择，并且相对于文档的位置将存储在地标目录中。在索引更新期间，生成描述旧文档版本和新文档版本之间的差异的编辑脚本，并根据需要更新文档存储库索引和地标目录。因此，与传统的索引技术相比，执行的更新操作的数量可以在对文档进行小的局部改变时被显着地减少。这是因为文件条目的位置相对于里程碑条目而不是文档本身被记录。通过这样做，文档索引变得越来越不变，当在文档的本地化区域中添加或插入条目时，需要更少的更新操作。

23. 发明授权

US07987180B2 Classification-based method and apparatus for string selectivity estimation 有权
标题翻译：用于字符串选择性估计的基于分类的方法和装置
公开(公告)号：US07987180B2
公开(公告)日：2011-07-26
申请号：US12057885
申请日：2008-03-28
申请人： Lipyeow Lim , Min Wang
发明人： Lipyeow Lim , Min Wang
IPC分类号： G06F7/00
CPC分类号： G06F17/3071 , Y10S707/99932 , Y10S707/99936 , Y10S707/99942
摘要： Histogram construction and selectivity estimation for string and substring match queries in databases of data having strings associated with attributes. The histogram construction counts string-attribute pairs in the documents, and outputs string-attribute-count triples sorted by count. The collection is partitioned into buckets. A synopsis is generated for the partition, having an average selectivity or count of the string-attribute-count triples in the partition and summary information representing the set of string-attribute pairs belonging to the bucket. Subsequent queries, both for exact and substring matches, use the synopsis to estimate the selectivity of buckets.
摘要翻译：字符串和子串的直方图构造和选择性估计在具有与属性相关联的字符串的数据的数据库中匹配查询。直方图构造计算文档中的字符串属性对，并输出按count排序的字符串属性计数三元组。集合被分成桶。为分区生成概要，具有分区中的字符串属性计数三元组的平均选择性或计数，以及表示属于该分组的字符串属性对的集合的摘要信息。随后的查询（对于精确和子串匹配）使用概要来估计存储桶的选择性。

24. 发明申请

US20110106821A1 Semantic-Aware Record Matching 失效
标题翻译：语义感知记录匹配
公开(公告)号：US20110106821A1
公开(公告)日：2011-05-05
申请号：US12610101
申请日：2009-10-30
申请人： Oktie Hassanzadeh , Anastasios Kementsietsidis , Lipyeow Lim , Min Wang
发明人： Oktie Hassanzadeh , Anastasios Kementsietsidis , Lipyeow Lim , Min Wang
IPC分类号： G06F17/30 , G06F17/21
CPC分类号： G06F17/2785 , G06F17/30734
摘要： A method of semantic-aware record matching includes receiving source and target string record specifications associated with a source string record and a target string record, receiving semantic knowledge referring to tokens of the source string record and target string record, creating a first set of tokens for the source string record and a second set of tokens for the target string record based on the semantic knowledge, assigning a similarity score to the source string record and the target string record based on a semantic relationship between the first set of tokens and the second set of tokens, and matching the source string record and the target string record based on the similarity score.
摘要翻译：语义感知记录匹配的方法包括接收与源字符串记录和目标字符串记录相关联的源和目标字符串记录规范，接收引用源字符串记录和目标字符串记录的令牌的语义知识，创建第一组令牌对于源字符串记录和基于语义知识的目标字符串记录的第二组令牌，基于第一组令牌和第二组令牌之间的语义关系向源字符串记录和目标字符串记录分配相似性分数一组令牌，并根据相似性得分匹配源字符串记录和目标字符串记录。

25. 发明授权

US07613682B2 Statistics collection using path-identifiers for relational databases 失效
标题翻译：使用关系数据库的路径标识符进行统计收集
公开(公告)号：US07613682B2
公开(公告)日：2009-11-03
申请号：US11435017
申请日：2006-05-16
申请人： Lipyeow Lim , George Andrei Mihaila , Min Wang
发明人： Lipyeow Lim , George Andrei Mihaila , Min Wang
IPC分类号： G06F17/30
CPC分类号： G06F17/30536 , G06F17/30442 , G06F17/30935 , Y10S707/99931 , Y10S707/99932 , Y10S707/99933 , Y10S707/99944 , Y10S707/99953
摘要： Disclosed are a method for collecting statistics associated with data in a database. The method comprises determining an amount of memory needed to collect statistics for data associated with a defined data type in a relational database. The defined data type is based upon a mark-up language using a tree structure with one or more root-to-node paths therein. The amount of memory as determined is allocated for collecting the statistics for the data of the defined data type. A statistics collection is performed for the data of the defined data type in a single pass through the database and within the amount of memory which has been allocated.
摘要翻译：公开了一种用于收集与数据库中的数据相关联的统计信息的方法。该方法包括确定为关系数据库中与定义的数据类型相关联的数据收集统计信息所需的存储器量。定义的数据类型基于使用具有一个或多个根到节点路径的树结构的标记语言。分配所确定的内存量用于收集所定义数据类型的数据的统计信息。在通过数据库的单次传递中以及已分配的内存量内，对定义的数据类型的数据执行统计信息收集。

26. 发明授权

US07533085B2 Method for searching deep web services 失效
标题翻译：搜索深度Web服务的方法
公开(公告)号：US07533085B2
公开(公告)日：2009-05-12
申请号：US11503754
申请日：2006-08-14
申请人： Yuan-chi Chang , Lipyeow Lim , Min Wang , Zhen Zhang
发明人： Yuan-chi Chang , Lipyeow Lim , Min Wang , Zhen Zhang
IPC分类号： G06F17/30
CPC分类号： G06F17/3089 , Y10S707/99933
摘要： A method for searching deep web services is provided. The method in one aspect allows organizing communities, sources and schema attributes in a multi-tier containment relationship; searching representative schema attributes in one or more communities; searching representative services in one or more communities; searching for related schema attributes; and searching for related communities.
摘要翻译：提供了一种用于搜索深度Web服务的方法。该方法在一个方面允许组织多层遏制关系中的社区，源和模式属性; 在一个或多个社区中搜索代表性模式属性; 在一个或多个社区寻找代表服务; 搜索相关的模式属性; 并搜索相关社区。

27. 发明申请

US20080208856A1 Classification-Based Method and Apparatus for String Selectivity Estimation 有权
标题翻译：用于字符串选择性估计的基于分类的方法和装置
公开(公告)号：US20080208856A1
公开(公告)日：2008-08-28
申请号：US12057885
申请日：2008-03-28
申请人： Lipyeow Lim , Min Wang
发明人： Lipyeow Lim , Min Wang
IPC分类号： G06F7/06 , G06F17/30
CPC分类号： G06F17/3071 , Y10S707/99932 , Y10S707/99936 , Y10S707/99942
摘要： Histogram construction and selectivity estimation for string and substring match queries in databases of data having strings associated with attributes. The histogram construction counts string-attribute pairs in the documents, and outputs string-attribute-count triples sorted by count. The collection is partitions the collection into buckets. A synopsis is generated for the partition, having an average selectivity or count of the string-attribute-count triples in the partition and summary information representing the set of string-attribute pairs belonging to the bucket. Subsequent queries, both for exact and substring matches, use the synopsis to estimate the selectivity of buckets.
摘要翻译：字符串和子串的直方图构造和选择性估计在具有与属性相关联的字符串的数据的数据库中匹配查询。直方图构造计算文档中的字符串属性对，并输出按count排序的字符串属性计数三元组。集合将集合分区为桶。为分区生成概要，具有分区中的字符串属性计数三元组的平均选择性或计数，以及表示属于该分组的字符串属性对的集合的摘要信息。随后的查询（对于精确和子串匹配）使用概要来估计存储桶的选择性。

28. 发明授权

US08468160B2 Semantic-aware record matching 失效
标题翻译：语义感知记录匹配
公开(公告)号：US08468160B2
公开(公告)日：2013-06-18
申请号：US12610101
申请日：2009-10-30
申请人： Oktie Hassanzadeh , Anastasios Kementsietsidis , Lipyeow Lim , Min Wang
发明人： Oktie Hassanzadeh , Anastasios Kementsietsidis , Lipyeow Lim , Min Wang
IPC分类号： G06F17/30
CPC分类号： G06F17/2785 , G06F17/30734
摘要： A method of semantic-aware record matching includes receiving source and target string record specifications associated with a source string record and a target string record, receiving semantic knowledge referring to tokens of the source string record and target string record, creating a first set of tokens for the source string record and a second set of tokens for the target string record based on the semantic knowledge, assigning a similarity score to the source string record and the target string record based on a semantic relationship between the first set of tokens and the second set of tokens, and matching the source string record and the target string record based on the similarity score.
摘要翻译：语义感知记录匹配的方法包括接收与源字符串记录和目标字符串记录相关联的源和目标字符串记录规范，接收引用源字符串记录和目标字符串记录的令牌的语义知识，创建第一组令牌对于源字符串记录和基于语义知识的目标字符串记录的第二组令牌，基于第一组令牌和第二组令牌之间的语义关系向源字符串记录和目标字符串记录分配相似性分数一组令牌，并根据相似性得分匹配源字符串记录和目标字符串记录。

29. 发明授权

US08135730B2 Ontology-based searching in database systems 失效
标题翻译：数据库系统中基于本体的搜索
公开(公告)号：US08135730B2
公开(公告)日：2012-03-13
申请号：US12481009
申请日：2009-06-09
申请人： Lipyeow Lim , Anastasios Kementsietsidis , Min Wang
发明人： Lipyeow Lim , Anastasios Kementsietsidis , Min Wang
IPC分类号： G06F17/30
CPC分类号： G06F17/3064 , G06Q10/10 , G06Q50/22
摘要： A method, information processing system, and computer program storage product retrieve data from a database. A search request is received from a user for a set of data in at least one database. An ontology query over is performed over at least one ontology associated with at least one database resulting in an ontological dataset associated with the search request in response to receiving the search request from the user. The ontological dataset includes at least one of a set of synonyms, a set of hypernyms, and a set of hyponyms, associated with the search request. A data query is performed over data in the at least one database using the ontological dataset in response to performing the ontology query. The set of data is returned to the user based on the data query that has been performed.
摘要翻译：方法，信息处理系统和计算机程序存储产品从数据库检索数据。对于至少一个数据库中的一组数据，从用户接收搜索请求。响应于接收到来自用户的搜索请求，在与至少一个数据库相关联的至少一个本体上执行本体查询，导致与搜索请求相关联的本体数据集。本体数据集包括与搜索请求相关联的一组同义词，一组超词和一组下位词中的至少一个。响应于执行本体查询，使用本体数据集在至少一个数据库中的数据上执行数据查询。基于已经执行的数据查询，该组数据被返回给用户。

30. 发明授权

US07472108B2 Statistics collection using path-value pairs for relational databases 失效
标题翻译：使用关系数据库的路径值对的统计信息收集
公开(公告)号：US07472108B2
公开(公告)日：2008-12-30
申请号：US11435353
申请日：2006-05-16
申请人： Lipyeow Lim , George Andrei Mihaila , Min Wang
发明人： Lipyeow Lim , George Andrei Mihaila , Min Wang
IPC分类号： G06F17/30 , G06F12/00
CPC分类号： G06F17/30442 , G06F17/30306 , Y10S707/99931 , Y10S707/99932 , Y10S707/99933 , Y10S707/99944 , Y10S707/99953
摘要： A method for collecting statistics associated with data in a database are disclosed. The method comprises determining an amount of memory needed to collect statistics for data associated with a defined data type in a relational database. The defined data type is based upon a mark-up language using a tree structure with one or more root-to-node paths therein. The amount of memory is allocated as determined for collecting the statistics for the data of the defined data type. A statistics collection is performed for the data of the defined data type in a single pass through the database and within the amount of memory which has been allocated. The performing includes at least determining a total number of instances of at least one path-identifier associated with a given value within a given set of documents.
摘要翻译：公开了一种用于收集与数据库中的数据相关联的统计信息的方法。该方法包括确定为关系数据库中与定义的数据类型相关联的数据收集统计信息所需的存储器量。定义的数据类型基于使用具有一个或多个根到节点路径的树结构的标记语言。分配的内存量被确定为收集定义的数据类型的数据的统计信息。在通过数据库的单次传递中以及已经分配的内存量中，对定义的数据类型的数据执行统计信息收集。执行包括至少确定与给定文档集合内的给定值相关联的至少一个路径标识符的实例的总数。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式