会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 5. 发明授权
    • Keyword presentation apparatus and method
    • 关键词呈现装置及方法
    • US08812504B2
    • 2014-08-19
    • US13216380
    • 2011-08-24
    • Tomoharu KokubuToshihiko ManabeKosei FumeWataru NakanoHiromi Wakaki
    • Tomoharu KokubuToshihiko ManabeKosei FumeWataru NakanoHiromi Wakaki
    • G06F17/00G06F17/30
    • G06F17/3064
    • According to one embodiment, a keyword presentation apparatus includes an extraction unit, a selection unit and a clustering unit. The extraction unit is configured to extract, as technical terms, morpheme strings, which are not defined in a general concept dictionary, from a document set. The selection unit is configured to evaluate relevancies between each of basic term candidates and the technical terms, and to preferentially select basic term candidates having high relevancies as basic terms. The clustering unit is configured to calculate weighted sums of statistical degrees of correlation between the basic terms based on the document set, to calculate conceptual degrees of correlation between the basic terms based on the general concept dictionary, and to cluster the basic terms based on the weighted sums.
    • 根据一个实施例,关键词呈现装置包括提取单元,选择单元和聚类单元。 提取单元被配置为从文档集合中提取未在通用概念字典中定义的语素字符串作为技术术语。 选择单元被配置为评估每个基本术语候选者和技术术语之间的相关性,并优先选择具有高相关性的基本术语候选者作为基本术语。 聚类单元被配置为基于文档集来计算基本项之间的统计学相关度的加权和,以基于一般概念词典计算基本术语之间的概念相关度,并且基于 加权总和。
    • 6. 发明申请
    • KEYWORD PRESENTATION APPARATUS AND METHOD
    • 关键词介绍及方法
    • US20120078907A1
    • 2012-03-29
    • US13216380
    • 2011-08-24
    • Tomoharu KokubuToshihiko ManabeKosei FumeWataru NakanoHiromi Wakaki
    • Tomoharu KokubuToshihiko ManabeKosei FumeWataru NakanoHiromi Wakaki
    • G06F17/30
    • G06F17/3064
    • According to one embodiment, a keyword presentation apparatus includes an extraction unit, a selection unit and a clustering unit. The extraction unit is configured to extract, as technical terms, morpheme strings, which are not defined in a general concept dictionary, from a document set. The selection unit is configured to evaluate relevancies between each of basic term candidates and the technical terms, and to preferentially select basic term candidates having high relevancies as basic terms. The clustering unit is configured to calculate weighted sums of statistical degrees of correlation between the basic terms based on the document set, to calculate conceptual degrees of correlation between the basic terms based on the general concept dictionary, and to cluster the basic terms based on the weighted sums.
    • 根据一个实施例,关键词呈现装置包括提取单元,选择单元和聚类单元。 提取单元被配置为从文档集合中提取未在通用概念字典中定义的语素字符串作为技术术语。 选择单元被配置为评估每个基本术语候选者和技术术语之间的相关性,并优先选择具有高相关性的基本术语候选者作为基本术语。 聚类单元被配置为基于文档集来计算基本项之间的统计学相关度的加权和,以基于一般概念词典计算基本术语之间的概念相关度,并且基于 加权总和。
    • 7. 发明申请
    • APPARATUS AND METHOD FOR RETRIEVING STRUCTURED DOCUMENTS
    • 检索结构化文档的装置和方法
    • US20090138473A1
    • 2009-05-28
    • US12205636
    • 2008-09-05
    • Toshihiko ManabeTomoharu Kokubu
    • Toshihiko ManabeTomoharu Kokubu
    • G06F17/30
    • G06F16/334
    • An apparatus for retrieving structured documents includes a first categorizing unit configured to categorize components into a first component of typical descriptions and a second component of atypical descriptions, based on statistics information for the components, a second categorizing unit configured to categorize the terms into a first term whose appearance ratio in the first component exceeds a threshold and a second term whose appearance ratio in the first component is not more than the threshold, an extraction unit configured to extract a set of structured documents each having the first component including the first term and the second component from the structured documents, and a ranking unit configured to rank the set of structured documents by a retrieval score calculating based o a relation between the second term and the second component.
    • 一种用于检索结构化文档的装置包括:第一分类单元,被配置为基于组件的统计信息将组件分类为典型描述的第一组件和非典型描述的第二组件;第二分类单元,被配置为将所述术语分类为第一 所述第一成分的出现比率超过阈值的项目,以及所述第一成分的出现比率不大于所述阈值的第二项,提取单元,被配置为提取具有包括所述第一项的第一成分的一组结构化文档, 来自结构化文档的第二组件,以及排列单元,被配置为通过基于第二项和第二组件之间的关系的检索分数来计算该组结构化文档。