会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 5. 发明授权
    • Method and apparatus for simmered greedy optimization
    • 贪婪优化的方法和装置
    • US07853541B1
    • 2010-12-14
    • US11845318
    • 2007-08-27
    • Sadik KapadiaRichard Rohwer
    • Sadik KapadiaRichard Rohwer
    • G06F15/18
    • G06K9/6226G06F17/11
    • A method and apparatus comprising a fast and highly effective stochastic algorithm, referred to as Simmered Greedy Optimization (SG(N)), for solving combinatorial optimization problems, including the co-clustering problem comprising simultaneously clustering two finite sets by maximizing the mutual information between the clusterings and deriving maximally predictive feature sets. Co-clustering has found application in many areas, particularly statistical natural language processing and bio-informatics. Provided are results of tests on a suite of statistical natural language problems comparing SG(N) with simulated annealing and a publicly available implementation of co-clustering, wherein using SG(N) provided superior results with far less computation.
    • 一种包括快速且高效的随机算法(称为Simmered Greedy Optimization(SG(N)))的方法和装置,用于解决组合优化问题,包括同时聚类问题,包括通过最大化两个有限集合之间的相互信息 聚类和导出最大预测特征集。 共聚集已经在许多领域得到应用,特别是统计自然语言处理和生物信息学。 提供了一系列统计自然语言问题的测试结果,这些问题将SG(N)与模拟退火相比较,以及公共可用的共聚集实现,其中使用SG(N)提供优异的结果,计算量少得多。
    • 6. 发明授权
    • Method and apparatus for automatic entity disambiguation
    • 自动实体消歧的方法和装置
    • US07672833B2
    • 2010-03-02
    • US11234692
    • 2005-09-22
    • Matthias BlumeRichard CalmbachDayne FreitagRichard RohwerScott Zoldi
    • Matthias BlumeRichard CalmbachDayne FreitagRichard RohwerScott Zoldi
    • G06F17/21
    • G06F17/278G06Q10/10
    • Entity disambiguation resolves which names, words, or phrases in text correspond to distinct persons, organizations, locations, or other entities in the context of an entire corpus. The invention is based largely on language-independent algorithms. Thus, it is applicable not only to unstructured text from arbitrary human languages, but also to semi-structured data, such as citation databases and the disambiguation of named entities mentioned in wire transfer transaction records for the purpose of detecting money-laundering activity. The system uses multiple types of context as evidence for determining whether two mentions correspond to the same entity and it automatically learns the weight of evidence of each context item via corpus statistics. The invention uses multiple search keys to efficiently find pairs of mentions that correspond to the same entity, while skipping billions of unnecessary comparisons, yielding a system with very high throughput that can be applied to truly massive data.
    • 实体消歧解决了文本中的哪些名称,单词或短语对应于整个语料库上下文中不同的人,组织,位置或其他实体。 本发明主要基于语言无关的算法。 因此,它不仅适用于来自任意人类语言的非结构化文本,还适用于半结构化数据,例如引用数据库,以及为了检测洗钱活动而消除电汇交易记录中提到的命名实体。 该系统使用多种类型的上下文作为确定两个提及是否对应于相同实体的证据,并通过语料库统计自动学习每个上下文项的证据的权重。 本发明使用多个搜索密钥来有效地找到对应于相同实体的提及对,同时跳过数十亿次不必要的比较,产生可以应用于真实海量数据的非常高的吞吐量的系统。
    • 7. 发明申请
    • Method and apparatus for automatic entity disambiguation
    • 自动实体消歧的方法和装置
    • US20070067285A1
    • 2007-03-22
    • US11234692
    • 2005-09-22
    • Matthias BlumeRichard CalmbachDayne FreitagRichard RohwerScott Zoldi
    • Matthias BlumeRichard CalmbachDayne FreitagRichard RohwerScott Zoldi
    • G06F17/30
    • G06F17/278G06Q10/10
    • Entity disambiguation resolves which names, words, or phrases in text correspond to distinct persons, organizations, locations, or other entities in the context of an entire corpus. The invention is based largely on language-independent algorithms. Thus, it is applicable not only to unstructured text from arbitrary human languages, but also to semi-structured data, such as citation databases and the disambiguation of named entities mentioned in wire transfer transaction records for the purpose of detecting money-laundering activity. The system uses multiple types of context as evidence for determining whether two mentions correspond to the same entity and it automatically learns the weight of evidence of each context item via corpus statistics. The invention uses multiple search keys to efficiently find pairs of mentions that correspond to the same entity, while skipping billions of unnecessary comparisons, yielding a system with very high throughput that can be applied to truly massive data.
    • 实体消歧解决了文本中的哪些名称,单词或短语对应于整个语料库上下文中不同的人,组织,位置或其他实体。 本发明主要基于语言无关的算法。 因此,它不仅适用于来自任意人类语言的非结构化文本,还适用于半结构化数据,例如引用数据库,以及为了检测洗钱活动而消除电汇交易记录中提到的命名实体。 该系统使用多种类型的上下文作为确定两个提及是否对应于相同实体的证据,并通过语料库统计自动学习每个上下文项的证据的权重。 本发明使用多个搜索密钥来有效地找到对应于相同实体的提及对,同时跳过数十亿次不必要的比较,产生可以应用于真实海量数据的非常高的吞吐量的系统。