会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 14. 发明授权
    • System and method for indexing weighted-sequences in large databases
    • 用于索引大数据库中加权序列的系统和方法
    • US07418455B2
    • 2008-08-26
    • US10723229
    • 2003-11-26
    • Wei FanChang-Shing PerngHaixun WangPhilip Shi-Lung Yu
    • Wei FanChang-Shing PerngHaixun WangPhilip Shi-Lung Yu
    • G06F7/00G06F17/00
    • G06F17/30327G06F17/30548Y10S707/99943
    • The present invention provides an index structure for managing weighted-sequences in large databases. A weighted-sequence is defined as a two-dimensional structure in which each element in the sequence is associated with a weight. A series of network events, for instance, is a weighted-sequence because each event is associated with a timestamp. Querying a large sequence database by events' occurrence patterns is a first step towards understanding the temporal causal relationships among the events. The index structure proposed herein enables the efficient retrieval from the database of all subsequences (contiguous and non-contiguous) that match a given query sequence both by events and by weights. The index structure also takes into consideration the nonuniform frequency distribution of events in the sequence data.
    • 本发明提供了一种用于在大数据库中管理加权序列的索引结构。 加权序列被定义为二维结构,其中序列中的每个元素与权重相关联。 例如,一系列网络事件是加权序列,因为每个事件都与时间戳相关联。 通过事件发生模式查询大序列数据库是了解事件之间的时间因果关系的第一步。 这里提出的索引结构使得能够通过事件和权重从数据库有效地检索与给定查询序列匹配的所有子序列(连续的和不连续的)。 索引结构还考虑了序列数据中事件的不均匀频率分布。
    • 17. 发明授权
    • Index structure for supporting structural XML queries
    • 用于支持结构XML查询的索引结构
    • US07287023B2
    • 2007-10-23
    • US10723206
    • 2003-11-26
    • Wei FanHaixun WangPhilip Shi-Lung Yu
    • Wei FanHaixun WangPhilip Shi-Lung Yu
    • G06F17/30
    • G06F17/30911Y10S707/99933Y10S707/99943
    • The present invention provides a ViST (or “virtual suffix tree”), which is a novel index structure for searching XML documents. By representing both XML documents and XML queries in structure-encoded sequences, it is shown that querying XML data is equivalent to finding (non-contiguous) subsequence matches. A variety of XML queries, including those with branches, or wild-cards (‘*’ and ‘//’), can be expressed by structure-encoded sequences. Unlike index methods that disassemble a query into multiple sub-queries, and then join the results of these sub-queries to provide the final answers, ViST uses tree structures as the basic unit of query to avoid expensive join operations. Furthermore, ViST provides a unified index on both content and structure of the XML documents, hence it has a performance advantage over methods indexing either just content or structure. ViST supports dynamic index update, and it relies solely on B+Trees without using any specialized data structures that are not well supported by common database management systems (hereinafter referred to as “DBMSs”).
    • 本发明提供了一种ViST(或“虚拟后缀树”),其是用于搜索XML文档的新型索引结构。 通过在结构编码序列中同时表示XML文档和XML查询,显示查询XML数据等同于查找(非连续)子序列匹配。 各种XML查询(包括具有分支的查询)或通配符('*'和'//')可以由结构编码的序列表示。 不同于将查询反汇编成多个子查询的索引方法,然后加入这些子查询的结果以提供最终答案,ViST使用树结构作为查询的基本单位,以避免昂贵的连接操作。 此外,ViST为XML文档的内容和结构提供了一个统一的索引,因此与仅通过内容或结构索引方法相比,它具有性能优势。 ViST支持动态索引更新,它仅仅依赖于B< +>树,而不使用通用数据库管理系统(以下简称“DBMS”)不能很好地支持的任何专门的数据结构。
    • 20. 发明授权
    • Method for building space-splitting decision tree
    • 建立空间分裂决策树的方法
    • US06871201B2
    • 2005-03-22
    • US09918952
    • 2001-07-31
    • Philip Shi-lung YuHaixun Wang
    • Philip Shi-lung YuHaixun Wang
    • G06F7/00G06F17/30G06K9/62
    • G06F17/30705G06K9/6282Y10S707/99935Y10S707/99937Y10S707/99943
    • A method is provided for data classification that achieves improved interpretability and accuracy while preserving the efficiency and scalability of univariate decision trees. To build a compact decision tree, the method searches for clusters in subspaces to enable multivariate splitting based on weighted distances to such a cluster. To classify an instance more accurately, the method performs a nearest neighbor (NN) search among the potential nearest leaf nodes of the instance. The similarity measure used in the NN search is based on Euclidean distances defined in different subspaces for different leaf nodes. Since instances are scored by their similarity to a certain class, this approach provides an effective means for target selection that is not supported well by conventional decision trees.
    • 提供了一种用于数据分类的方法,其实现了改进的可解释性和准确性,同时保持了单变量决策树的效率和可扩展性。 为了构建一个紧凑的决策树,该方法将搜索子空间中的群集,以便根据这种群集的加权距离来启用多变量分割。 为了更精确地对实例进行分类,该方法在实例的最靠近的叶节点之间执行最近邻(NN)搜索。 NN搜索中使用的相似性度量是基于不同叶节点不同子空间中定义的欧几里德距离。 由于实例与某一类别的相似性得分,所以这种方法为常规决策树不能很好地支持目标选择提供了有效的手段。