    • 1. Granted invention patent
    • High-dimensional stratified sampling
    • US08639692B2
    • 2014-01-28
    • US12824849
    • 2010-06-28
    • Aiyou Chen, Ming Xiong
    • G06F7/00
    • G06F17/30598; G06F17/30536
    • In one aspect, a processing device of an information processing system is operative to perform high-dimensional stratified sampling of a database comprising a plurality of records arranged in overlapping sub-groups. For a given record, the processing device determines which of the sub-groups the given record is associated with, and for each of the sub-groups associated with the given record, checks if a sampling rate of the sub-group is less than a specified sampling rate. If the sampling rate of each of the sub-groups is less than the specified sampling rate, the processing device samples the given record, and otherwise does not sample the given record. The determine, check and sample operations are repeated for additional records, and samples resulting from the sample operations are processed to generate information characterizing the database. Other aspects of the invention relate to determining which records to sample through iterative optimization of an objective function that may be based, for example, on a likelihood function of the sampled records.
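The determine, check and sample loop described in the abstract above can be sketched as follows. This is a minimal illustration, not the patented implementation. It assumes that a sub-group's sampling rate is the number of records sampled so far divided by the sub-group's total size, and that sizes are known from one counting pass. The names `stratified_sample` and `dims` are hypothetical.

```python
from collections import Counter


def dims(record):
    """Hypothetical grouping: a record (region, plan) belongs to two overlapping sub-groups."""
    return [("region", record[0]), ("plan", record[1])]


def stratified_sample(records, groups_of, target_rate):
    """Sample a record only if every sub-group it belongs to is below target_rate."""
    records = list(records)
    # Assumption: sub-group sizes are obtained up front with one counting pass.
    size = Counter(g for r in records for g in groups_of(r))
    taken = Counter()  # records sampled so far, per sub-group
    sample = []
    for r in records:
        groups = list(groups_of(r))
        # Check the current sampling rate of each associated sub-group.
        if all(taken[g] / size[g] < target_rate for g in groups):
            sample.append(r)
            taken.update(groups)
    return sample


if __name__ == "__main__":
    recs = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y")]
    print(stratified_sample(recs, dims, 0.5))
```

Because a record is rejected as soon as any one of its sub-groups has reached the target rate, the result depends on the order in which records arrive. The abstract's iterative optimization of an objective function is not covered by this sketch.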
    • 2. Granted invention patent
    • Method and apparatus for detecting wireless data subscribers using natted devices
    • US08081567B2
    • 2011-12-20
    • US12011908
    • 2008-01-30
    • Li (Erran) Li, Tian Bu, Scott C. Miller, Aiyou Chen
    • G08C15/00
    • H04L29/12339; H04L61/2503; H04W8/005
    • A system and method for network based detection of wireless data subscribers using network address translation devices is provided. The method includes identifying a minimum number of devices showing the same internet protocol address. Packet identification sequences may include port numbers or internet protocol identification numbers. The method continues with grouping these applications by their packet identification sequences and applying detection logic where detection logic yields a conclusion that there are multiple host computers when a set of applications appears in a plurality of packet identification sequences. This method is particularly useful when internet protocol addresses are dynamic, as opposed to static. This method overcomes previous embodiments known in the art by being able to account for and work with live traffic, which enables real time detection.
    • 3. Granted invention patent
    • Scalable methods for detecting significant traffic patterns in a data network
    • US07779143B2
    • 2010-08-17
    • US11770430
    • 2007-06-28
    • Tian Bu, Jin Cao, Aiyou Chen, Pak-Ching Lee
    • G06F15/16
    • H04L43/028; H04L43/0876; H04L45/745; H04L49/552; H04L63/1408; H04L63/1458
    • Methods and apparatuses are provided for detecting traffic patterns in a data network. A sequential hashing scheme can be utilized that has D hash arrays. Each hash array i, wherein 1 ≤ i ≤ D, includes M_i independent hash tables each having K buckets, with each of the buckets having an associated traffic total. Each of the keys corresponds with a single bucket of each of the M_i independent hash tables of each hash array i. The keys of the data network are partitioned into D words. As traffic is received for a key, a traffic total of each bucket that corresponds with a key is updated. The hash arrays can then be utilized to identify high traffic buckets of the independent hash tables having a traffic total greater than a threshold value. The high traffic buckets can be used to detect significant traffic patterns of the data network.
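A rough sketch of the hash-array layout in the abstract above: D hash arrays, several independent tables per array, K buckets per table, and a per-bucket traffic total. Several details are assumptions made for illustration and are not stated in the abstract: hash array i hashes the prefix made of the first i words of the key, every array uses the same number of tables, words are 8-bit values, and BLAKE2b stands in for the independent hash functions. The class name `SequentialHash` and its methods are hypothetical.

```python
import hashlib


class SequentialHash:
    def __init__(self, num_words, tables_per_array, buckets):
        self.D, self.M, self.K = num_words, tables_per_array, buckets
        # counts[i][j][b]: traffic total of bucket b in table j of hash array i
        self.counts = [[[0] * buckets for _ in range(tables_per_array)]
                       for _ in range(num_words)]

    def _bucket(self, i, j, prefix):
        # Salting with (i, j) gives each table its own hash function.
        data = f"{i}/{j}/".encode() + bytes(prefix)
        digest = hashlib.blake2b(data, digest_size=8).digest()
        return int.from_bytes(digest, "big") % self.K

    def update(self, key, traffic):
        """key: tuple of D words (ints in 0..255); add traffic to each matching bucket."""
        for i in range(self.D):
            prefix = key[: i + 1]  # assumption: array i sees the first i+1 words
            for j in range(self.M):
                self.counts[i][j][self._bucket(i, j, prefix)] += traffic

    def heavy_keys(self, threshold, word_values=range(256)):
        """Extend candidate prefixes word by word, keeping those in high-traffic buckets."""
        candidates = [()]
        for i in range(self.D):
            candidates = [
                p + (w,)
                for p in candidates
                for w in word_values
                if all(self.counts[i][j][self._bucket(i, j, p + (w,))] > threshold
                       for j in range(self.M))
            ]
        return candidates


if __name__ == "__main__":
    sh = SequentialHash(num_words=4, tables_per_array=3, buckets=1024)
    sh.update((10, 0, 0, 1), 500)  # one heavy key, e.g. an IPv4 address
    print(sh.heavy_keys(threshold=400))
```

Requiring a candidate to land in a high-traffic bucket in every table of an array keeps false positives rare, since an unrelated prefix would have to collide with a heavy one in all tables at once. Collisions can still produce false positives, so the returned keys are candidates rather than a guaranteed exact answer.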
    • 4. Invention patent application
    • EFFICIENT PROBABILISTIC COUNTING SCHEME FOR STREAM-EXPRESSION CARDINALITIES
    • US20090268623A1
    • 2009-10-29
    • US12110380
    • 2008-04-28
    • Tian Bu, Jin Cao, Aiyou Chen
    • G06F11/00
    • H04L41/142; H04L43/026
    • In one embodiment, a method of monitoring a network. The method includes, at each node of a fixed set, constructing a corresponding vector of M components based on data packets received at the node during a time period, M being an integer greater than 1, the fixed set being formed of some nodes of the network; and, based on the constructed vectors, estimating how many of the received data packets have been received by all of the nodes of the set or estimating how many flows of the received data packets have data packets that have passed through all of the nodes of the set. The constructing includes updating a component of the vector of one of the nodes in response to the one of the nodes receiving a data packet. The updating includes selecting the component for updating by hashing a property of the data packet received by the one of the nodes.
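The general idea in the abstract above is that each node keeps a vector of M components, picks the component to update by hashing a packet property, and the vectors are later combined to estimate how many packets were seen by all nodes. It can be illustrated with a deliberately simple substitute technique: linear counting bitmaps combined by inclusion-exclusion. This is not the estimator claimed in the application. It is a stand-in chosen because it fits in a few lines, and the function names are hypothetical.

```python
import hashlib
import math


def make_vector(packet_ids, m):
    """Build an m-component 0/1 vector; the component is chosen by hashing the packet id."""
    vec = [0] * m
    for pid in packet_ids:
        digest = hashlib.blake2b(str(pid).encode(), digest_size=8).digest()
        vec[int.from_bytes(digest, "big") % m] = 1
    return vec


def estimate(vec):
    """Linear counting estimate of the number of distinct packets: -m * ln(zero fraction)."""
    zeros = vec.count(0)
    if zeros == 0:
        raise ValueError("vector saturated; use a larger m")
    return -len(vec) * math.log(zeros / len(vec))


def estimate_common(vec_a, vec_b):
    """Estimate packets seen by both nodes via |A| + |B| - |A union B|."""
    union = [a | b for a, b in zip(vec_a, vec_b)]
    return estimate(vec_a) + estimate(vec_b) - estimate(union)


if __name__ == "__main__":
    m = 1 << 16
    node_a = make_vector(range(0, 3000), m)     # packets 0..2999
    node_b = make_vector(range(2000, 5000), m)  # packets 2000..4999
    print(round(estimate_common(node_a, node_b)))  # true overlap is 1000
```

Both nodes must use the same hash function and the same M so that a shared packet sets the same component at every node. Inclusion-exclusion accumulates error as more nodes are combined, which is one reason a dedicated estimator would be preferred in practice.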
    • 5. Granted invention patent
    • Probabilistic aggregation over distributed data streams
    • US08204985B2
    • 2012-06-19
    • US12110431
    • 2008-04-28
    • Jin Cao, Aiyou Chen
    • G06F15/173
    • H04L41/142
    • In one embodiment, a method of monitoring a network. The method includes, at each node of a set, constructing a corresponding vector of M components based on a stream of data packets received at the node during a time period, the set including a plurality of nodes of the network, M being greater than 1; and estimating a value of a byte traffic produced by a part of the packets based on the constructed vectors, the part being the packets received by every node of the set. The constructing includes updating a component of the vector corresponding to one of the nodes in response to the one of the nodes receiving a data packet. The updating includes selecting a component of the vector to be updated by hashing a property of the received data packet.
    • 6. Invention patent application
    • Spectral Neighborhood Blocking for Entity Resolution
    • US20110258190A1
    • 2011-10-20
    • US12762441
    • 2010-04-19
    • Aiyou Chen, Liangcai Shu, Ming Xiong
    • G06F17/30; G06F7/32; G06F7/00
    • G06F17/3071; G06K9/6219
    • A processing device of an information processing system is operative to obtain a plurality of records, documents, web pages or other data objects, and to construct a binary tree using a bipartition procedure in which subsets of the data objects are associated with respective nodes of the tree. Evaluation of a designated modularity for a given one of the nodes of the tree is used as a stopping criterion to prevent further partitioning of that node and to indicate designation of that node as a leaf node of the tree. The resulting leaf nodes of the tree provide a non-overlapping partitioning of the plurality of data objects. The processing device is further operative to perform a neighborhood search on the tree to identify pairs of the plurality of data objects that match the same entity, and to store an indication of the matching pairs of data objects.
    • 8. Invention patent application
    • ESTIMATING CARDINALITY DISTRIBUTIONS IN NETWORK TRAFFIC
    • US20090296594A1
    • 2009-12-03
    • US12129883
    • 2008-05-30
    • Jin Cao, Aiyou Chen, Li Li
    • G06F11/30
    • H04L43/00
    • In one embodiment, a method of monitoring a network. The method includes: receiving, from each host of a set of two or more hosts of the network, a corresponding vector of M components constructed based on data packets received at the host during a time period, M being an integer greater than 1; and, based on the constructed vectors, using an expectation-maximization algorithm to estimate a cardinality distribution for the hosts in the set, wherein constructing a vector includes updating a component of the vector of the corresponding host in response to the corresponding host receiving a data packet, the updating including selecting the component for updating by hashing one or more fields of the data packet received by the corresponding host.
    • 10. Granted invention patent
    • Spectral neighborhood blocking for entity resolution
    • US08719267B2
    • 2014-05-06
    • US12762441
    • 2010-04-19
    • Aiyou Chen, Liangcai Shu, Ming Xiong
    • G06F17/30
    • G06F17/3071; G06K9/6219
    • A processing device of an information processing system is operative to obtain a plurality of records, documents, web pages or other data objects, and to construct a binary tree using a bipartition procedure in which subsets of the data objects are associated with respective nodes of the tree. Evaluation of a designated modularity for a given one of the nodes of the tree is used as a stopping criterion to prevent further partitioning of that node and to indicate designation of that node as a leaf node of the tree. The resulting leaf nodes of the tree provide a non-overlapping partitioning of the plurality of data objects. The processing device is further operative to perform a neighborhood search on the tree to identify pairs of the plurality of data objects that match the same entity, and to store an indication of the matching pairs of data objects.