会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 8. 发明授权
    • Index extraction from documents
    • 从文件索引提取
    • US08805803B2
    • 2014-08-12
    • US10916877
    • 2004-08-12
    • Steven J. SimskeDavid W. Wright
    • Steven J. SimskeDavid W. Wright
    • G06F7/00G06F17/30
    • G06F17/30613G06F17/30705
    • Systems, methods, and programs embodied in a computer readable medium are provided for index extraction. Stored in a database are ground truth documents that are organized according to a plurality of classifications, each classification having a group of predefined indices. A document to be indexed is classified by drawing an association between the document and one of the classifications. An attempt is made to extract from the document at least a subset of the group of predefined indices associated with the one of the classifications. Upon a failure to extract the subset of the group of predefined indices, attempts are made to find and correct at least one text recognition error in the document based upon a salient dictionary associated with the one of the classifications.
    • 提供体现在计算机可读介质中的系统,方法和程序用于索引提取。 存储在数据库中的是根据多个分类组织的地面真实文档,每个分类具有一组预定义的索引。 要索引的文档通过绘制文档和其中一个分类之间的关联来分类。 尝试从文档中提取与该分类之一相关联的预定义索引组的至少一个子集。 当未能提取预定义索引组的子集时,尝试基于与所述分类之一相关联的显着词典尝试在文档中查找和校正至少一个文本识别错误。