会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Rule induction on large noisy data sets
    • 大噪声数据集上的规则归纳
    • US5719692A
    • 1998-02-17
    • US499247
    • 1995-07-07
    • William W. Cohen
    • William W. Cohen
    • G06F9/44G06N5/02G06N5/04G06F17/00G06F15/00
    • G06N5/025
    • Efficient techniques for inducing rules used in classifying data items on a noisy data set. The prior-art IREP technique, which produces a set of classification rules by inducing each rule and then pruning it and continuing thus until a stopping condition is reached, is improved with a new rule-value metric for stopping pruning and with a stopping condition which depends on the description length of the rule set. The rule set which results from the improved IREP technique is then optimized by pruning rules from the set to minimize the description length and further optimized by making a replacement rule and a modified rule for each rule and using the description length to determine whether to use the replacement rule, the modified rule, or the original rule in the rule set. Further improvement is achieved by inducing rules for data items not covered by the original set and then pruning these rules. Still further improvement is gained by repeating the steps of inducing rules for data items not covered, pruning the rules, optimizing the rules, and again pruning for a fixed number of times. The fully-developed technique has the O(nlog.sup.2 n) running time characteristic of IREP, but produces rule sets which do a substantially better job of classification than those produced by IREP.
    • 用于诱导在嘈杂数据集上分类数据项的规则的高效技术。 现有技术的IREP技术通过引导每个规则然后修剪并继续直到停止条件产生一组分类规则,通过用于停止修剪的新的规则值度量和具有停止修剪的停止条件来改进 取决于规则集的描述长度。 然后,通过从集合中修剪规则来优化来自改进的IREP技术的规则集,以最小化描述长度并通过为每个规则做出替换规则和修改的规则进一步优化,并使用描述长度来确定是否使用 替换规则,修改规则或规则集中的原始规则。 通过为原始集合未涵盖的数据项引发规则,然后修剪这些规则来实现进一步的改进。 通过重复对未包括的数据项进行规则,修剪规则,优化规则,再次修剪固定次数的步骤,可以进一步改进。 完全开发的技术具有IREP的O(nlog2n)运行时间特性,但是产生的规则集比IREP生成的分类要好得多。
    • 2. 发明授权
    • Context-dependent similarity measurements
    • 上下文相关性相似性测量
    • US08538972B1
    • 2013-09-17
    • US13532972
    • 2012-06-26
    • William W. Cohen
    • William W. Cohen
    • G06F7/00G06F17/30
    • G06F17/30675
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining similarity measures for objects in a dataset that include contextual associations of the objects with contexts. In one aspect, a method includes calculating a similarity measure for any two objects that include a common feature f based, in part, on the likelihood that the two object representations in the dataset that both include f will we associated with distinct contexts, and the likelihood that the two objects in the dataset that both include f will be associated with the same context.
    • 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于确定包括对象与上下文的上下文关联的数据集中的对象的相似性度量。 在一个方面,一种方法包括:计算包括公共特征f的任何两个对象的相似性度量,部分地基于在数据集中包括f的两个对象表示将与不同上下文相关联的可能性,以及 数据集中包含f的两个对象将与相同的上下文相关联的可能性。
    • 3. 发明授权
    • Context-dependent similarity measurements
    • 上下文相关性相似性测量
    • US08234285B1
    • 2012-07-31
    • US12506685
    • 2009-07-21
    • William W. Cohen
    • William W. Cohen
    • G06F7/00G06F17/30
    • G06F17/30675
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining similarity measures for objects in a dataset that include contextual associations of the objects with contexts. In one aspect, a method includes calculating a similarity measure for any two objects that include a common feature f based, in part, on the likelihood that the two object representations in the dataset that both include f will we associated with distinct contexts, and the likelihood that the two objects in the dataset that both include f will be associated with the same context.
    • 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于确定包括对象与上下文的上下文关联的数据集中的对象的相似度度量。 在一个方面,一种方法包括:计算包括公共特征f的任何两个对象的相似性度量,部分地基于在数据集中包括f的两个对象表示将与不同上下文相关联的可能性,以及 数据集中包含f的两个对象将与相同的上下文相关联的可能性。
    • 5. 发明授权
    • Biased learning system
    • 偏倚学习系统
    • US5481650A
    • 1996-01-02
    • US320102
    • 1994-10-07
    • William W. Cohen
    • William W. Cohen
    • G06N5/02G06F15/18
    • G06N5/025
    • The invention permits various types of background knowledge for a concept learning system to be represented in a single formal structure known as an antecedent description grammar. A user formulates background knowledge for a learning problem into such a grammar, which then becomes an input to a learning system, together with training data representing the concept to be learned. The learning system, constrained by the grammar, then uses the training data to generate a hypothesis for the concept to be learned. Such hypothesis is in the form of a set of logic clauses known as Horn clauses.
    • 本发明允许将概念学习系统的各种类型的背景知识以被称为先行描述语法的单个形式结构来表示。 用户将学习问题的背景知识制定成这样的语法,然后将该语法与学习系统的输入连同表示要学习的概念的训练数据一起构成。 学习系统受到语法约束,然后使用训练数据为要学习的概念生成假设。 这样的假设是一组被称为霍恩(Norn)子句的逻辑子句的形式。
    • 6. 发明授权
    • System and method for accessing heterogeneous databases
    • 用于访问异构数据库的系统和方法
    • US06295533B2
    • 2001-09-25
    • US09028471
    • 1998-02-24
    • William W. Cohen
    • William W. Cohen
    • G06F1730
    • G06F17/30566Y10S707/99935
    • A system and method are provided for answering queries concerning information stored in a set of collections. Each collection includes a structured entity, and each structured entity includes a field. A query is received that specifies a subset of the set of collections and a logical constraint between fields that includes a requirement that a first field match a second field. The probability that the first field matches the second field is determined automatically based upon the contents of the fields. A collection of lists is generated in response to the query, where each list includes members of the subset of collections specified in the query, and where each list has an estimate of the probability that the members of the list satisfies the logical constraint specified in the query.
    • 提供了一种用于回答关于存储在一组集合中的信息的查询的系统和方法。 每个集合包括一个结构化实体,每个结构化实体包括一个字段。 接收到一个查询,该查询指定集合集合的子集,以及包含第一个字段与第二个字段匹配的要求的字段之间的逻辑约束。 基于字段的内容自动确定第一字段与第二字段匹配的概率。 响应于查询生成列表的集合,其中每个列表包括在查询中指定的集合的子集的成员,并且其中每个列表具有对列表的成员满足在 查询。
    • 7. 发明授权
    • Biased learning system
    • 偏倚学习系统
    • US5627945A
    • 1997-05-06
    • US566198
    • 1995-12-01
    • William W. Cohen
    • William W. Cohen
    • G06N5/02G06F3/00
    • G06N5/025
    • The invention permits various types of background knowledge for a concept learning system to be represented in a single formal structure known as an antecedent description grammar. A user formulates background knowledge for a learning problem into such a grammar, which then becomes an input to a learning system, together with training data representing the concept to be learned. The learning system, constrained by the grammar, then uses the training data to generate a hypothesis for the concept to be learned. Such hypothesis is in the form of a set of logic clauses known as Horn clauses.
    • 本发明允许将概念学习系统的各种类型的背景知识以被称为先行描述语法的单个形式结构来表示。 用户将学习问题的背景知识制定成这样的语法,然后将该语法与学习系统的输入连同表示要学习的概念的训练数据一起构成。 学习系统受到语法约束,然后使用训练数据为要学习的概念生成假设。 这样的假设是一组被称为霍恩(Norn)子句的逻辑子句的形式。
    • 9. 发明授权
    • Software discovery system
    • 软件发现系统
    • US5642472A
    • 1997-06-24
    • US246437
    • 1994-05-20
    • William W. Cohen
    • William W. Cohen
    • G06F9/06G06F9/44G06F11/34G06F11/36G06F15/18G06N5/04G06F3/00
    • G06F11/3604G06F11/34G06F11/3624G06F11/3636G06N99/005G06F11/3466
    • Apparatus and methods which employ a machine learning system to "learn" the specification for a program from a trace of an execution of the program on a set of test problems. The program is instrumented to produce the trace. Performance is improved by means of a declarative bias which expresses knowledge of the user about the program and constrains the learning system to produce only specifications which are consistent with the declarative bias. The apparatus and methods of the preferred embodiment are employed to learn specifications of views in a data base for a telephone switching system from traces produced by executing the programs which produce the views. Techniques for producing more than one specification and for dealing with views which involve conversions are also disclosed.
    • 使用机器学习系统从一组测试问题的程序执行跟踪中“学习”程序的规范的装置和方法。 该程序用于生成跟踪。 通过表达对用户对程序的了解的声明偏差来改进性能,并限制学习系统仅产生与声明偏差一致的规范。 采用优选实施例的装置和方法从通过执行产生视图的程序产生的痕迹来学习电话交换系统的数据库中的视图的规范。 还公开了用于生产多个规范和处理涉及转换的视图的技术。