会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明申请
    • DOCUMENT RETRIEVAL SYSTEM AND DOCUMENT RETRIEVAL METHOD
    • 文件检索系统和文件检索方法
    • US20080270386A1
    • 2008-10-30
    • US12029694
    • 2008-02-12
    • Hiroko OhiYoshiki NiwaKiyohiro Obara
    • Hiroko OhiYoshiki NiwaKiyohiro Obara
    • G06F17/30
    • G06F17/3069
    • A document retrieval is performed with similarities between documents in numeric data taken into consideration. To this end, generated is a set E of intervals in which each element of a set D of numeric values representing a feature A is included in any one of the intervals. Each numeric value in each document is indexed by assigning, with 1, an interval including an element x of the set D, and with 0, an interval without the element x. Each document data including numeric values is indexed by indexing its text part with term frequencies, and by indexing its numeric-value part with the above-described numeric value indexing scheme. By use of indices thus created for each of the document data, similarities between the document data are calculated using a vector space model or a probability model, and the document data are presented in order of similarity.
    • 在考虑到数字数据的文档之间执行文档检索。 为此,生成的是间隔的集合E,其中表示特征A的数值的集合D的每个元素被包括在间隔的任何一个中。 每个文档中的每个数值通过用1分配包含集合D的元素x的间隔,并且使用0,没有元素x的间隔来进行索引。 包括数值的每个文档数据通过用术语频率索引其文本部分并通过使用上述数值索引方案对其数值部分进行索引来进行索引。 通过使用如此为每个文档数据创建的索引,使用向量空间模型或概率模型来计算文档数据之间的相似性,并且以相似性的顺序呈现文档数据。
    • 4. 发明授权
    • Document retrieval system and document retrieval method
    • 文件检索系统和文件检索方法
    • US08046368B2
    • 2011-10-25
    • US12029694
    • 2008-02-12
    • Hiroko OhiYoshiki NiwaKiyohiro Obara
    • Hiroko OhiYoshiki NiwaKiyohiro Obara
    • G06F7/00G06F17/30
    • G06F17/3069
    • A document retrieval is performed with similarities between documents in numeric data taken into consideration. To this end, generated is a set E of intervals in which each element of a set D of numeric values representing a feature A is included in any one of the intervals. Each numeric value in each document is indexed by assigning, with 1, an interval including an element x of the set D, and with 0, an interval without the element x. Each document data including numeric values is indexed by indexing its text part with term frequencies, and by indexing its numeric-value part with the above-described numeric value indexing scheme. By use of indices thus created for each of the document data, similarities between the document data are calculated using a vector space model or a probability model, and the document data are presented in order of similarity.
    • 在考虑到数字数据的文档之间执行文档检索。 为此,生成的是间隔的集合E,其中表示特征A的数值的集合D的每个元素被包括在间隔的任何一个中。 每个文档中的每个数值通过用1分配包含集合D的元素x的间隔,并且使用0,没有元素x的间隔来进行索引。 包括数值的每个文档数据通过用术语频率索引其文本部分并通过使用上述数值索引方案对其数值部分进行索引来进行索引。 通过使用如此为每个文档数据创建的索引,使用向量空间模型或概率模型来计算文档数据之间的相似性,并且以相似性的顺序呈现文档数据。