    • 12. Invention application
    • LOGICAL STRUCTURE ANALYZING APPARATUS, METHOD, AND COMPUTER PRODUCT
    • US20090112797A1
    • 2009-04-30
    • US12180202
    • 2008-07-25
    • Akihiro Minagawa, Yoshinobu Hotta, Yusaku Fujii, Katsuhito Fujimoto
    • G06F17/30
    • G06K9/00469
    • A logical structure analyzing apparatus includes an extracting unit that extracts word candidates from a form, a first generating unit that classifies each of the word candidates into a group of heading candidates or a group of data candidates to generate, based on positions of the word candidates on the form, first candidate sets each including one heading candidate and one data candidate identifiable by the heading candidate, and a second generating unit that combines the first candidate sets to generate second candidate sets that each include plural heading candidates that differ and one data candidate. The apparatus also includes a removing unit that, based on positions of the heading candidates and the data word candidate in each second candidate set, removes from among the second candidate sets, a determined set including a data item and headings identifying the data item, and an output unit that outputs the determined set.
    • 13. Granted invention patent
    • Logical structure analyzing apparatus, method, and computer product
    • US08010564B2
    • 2011-08-30
    • US12180202
    • 2008-07-25
    • Akihiro Minagawa, Yoshinobu Hotta, Yusaku Fujii, Katsuhito Fujimoto
    • G06F7/00, G06F17/30
    • G06K9/00469
    • A logical structure analyzing apparatus includes an extracting unit that extracts word candidates from a form, a first generating unit that classifies each of the word candidates into a group of heading candidates or a group of data candidates to generate, based on positions of the word candidates on the form, first candidate sets each including one heading candidate and one data candidate identifiable by the heading candidate, and a second generating unit that combines the first candidate sets to generate second candidate sets that each include plural heading candidates that differ and one data candidate. The apparatus also includes a removing unit that, based on positions of the heading candidates and the data word candidate in each second candidate set, removes from among the second candidate sets, a determined set including a data item and headings identifying the data item, and an output unit that outputs the determined set.
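The first two generating steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the `Word` type, the fixed heading vocabulary, the pixel tolerance `tol`, and the "heading is above or to the left of its data" rule are all assumptions made for illustration, and the final position-based determination step is omitted.

```python
from collections import defaultdict
from typing import NamedTuple


class Word(NamedTuple):
    """A word candidate with its position on the form (hypothetical type)."""
    text: str
    x: int
    y: int


def pair_candidates(words, heading_vocab, tol=5):
    """Build (heading, data) pairs: the 'first candidate sets'.

    Assumption: a heading identifies a data word that lies below it in the
    same column or to its right in the same row.
    """
    headings = [w for w in words if w.text in heading_vocab]
    data = [w for w in words if w.text not in heading_vocab]
    pairs = []
    for h in headings:
        for d in data:
            same_column = abs(h.x - d.x) <= tol and h.y < d.y
            same_row = abs(h.y - d.y) <= tol and h.x < d.x
            if same_column or same_row:
                pairs.append((h, d))
    return pairs


def combine_by_data(pairs):
    """Merge pairs sharing a data candidate: the 'second candidate sets'."""
    grouped = defaultdict(list)
    for h, d in pairs:
        grouped[d].append(h.text)
    # Keep only data candidates identified by more than one heading.
    return {d: sorted(hs) for d, hs in grouped.items() if len(hs) > 1}


words = [
    Word("Qty", 100, 0), Word("Price", 200, 0),
    Word("Apple", 0, 50), Word("3", 100, 50), Word("120", 200, 50),
]
sets = combine_by_data(pair_candidates(words, {"Qty", "Price", "Apple"}))
```

In this toy form, the cell "3" ends up identified by the headings "Apple" and "Qty", and "120" by "Apple" and "Price".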
    • 17. Granted invention patent
    • Document type identifying method and document type identifying apparatus
    • US08275792B2
    • 2012-09-25
    • US12585155
    • 2009-09-04
    • Akihiro Minagawa, Hiroaki Takebe, Katsuhito Fujimoto
    • G06F17/30
    • G06K9/2054, G06K2209/01
    • A document type identifying apparatus includes in advance a database storing therein keywords used as keys that identify document types in association with each document type. The document type identifying apparatus aligns word strings written on a document and generates partial keyword strings for each keyword by using the keywords stored in the database. The partial keyword strings are to be checked for matching with the word strings written on the document. Then, the document type identifying apparatus checks matching of the grouped and aligned word strings with the partial keyword strings and obtains, for each keyword, each number of matched words with the highest matching rates between the grouped word strings that are successfully matched and the partial keyword strings. Then, each number of matched words is used to calculate each evaluation value to determine the document type.
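As a rough illustration of the keyword matching described in the abstract, the Python sketch below scores document types by partial keyword matches. Everything specific here is an assumption: partial keyword strings are taken to be contiguous word sub-sequences, the evaluation value is the sum of matched-word fractions per keyword, and the sample keyword database is invented. The grouping and alignment of word strings is not modeled.

```python
def partial_strings(keyword):
    """All contiguous word sub-sequences of a keyword (assumed definition)."""
    words = keyword.split()
    n = len(words)
    return [tuple(words[i:j]) for i in range(n) for j in range(i + 1, n + 1)]


def best_match_count(keyword, doc_words):
    """Length of the longest partial keyword string found in the document."""
    best = 0
    for part in partial_strings(keyword):
        k = len(part)
        found = any(
            tuple(doc_words[i:i + k]) == part
            for i in range(len(doc_words) - k + 1)
        )
        if found:
            best = max(best, k)
    return best


def identify_type(doc_text, keyword_db):
    """Return (best type, scores); the scoring formula is an assumption."""
    doc_words = doc_text.lower().split()
    scores = {}
    for doc_type, keywords in keyword_db.items():
        scores[doc_type] = sum(
            best_match_count(kw.lower(), doc_words) / len(kw.split())
            for kw in keywords
        )
    return max(scores, key=scores.get), scores


# Hypothetical keyword database, invented for this example.
keyword_db = {
    "invoice": ["invoice number", "amount due"],
    "receipt": ["received with thanks", "receipt number"],
}
doc_type, scores = identify_type(
    "Invoice number 1042 total amount due 300", keyword_db
)
```

Here both invoice keywords match in full, while only the single word "number" of "receipt number" matches, so the sample document is classified as an invoice.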
    • 19. Granted invention patent
    • Apparatus and method of analyzing layout of document, and computer product
    • US07257253B2
    • 2007-08-14
    • US10350180
    • 2003-01-24
    • Noriaki Ozawa, Hiroaki Takebe, Katsuhito Fujimoto, Satoshi Naoi
    • G06K9/34
    • G06K9/00463
    • In an apparatus for analyzing a layout of a document, a character candidate element generator generates character candidate elements from black pixel linkage components of a document image. A horizontally oriented line rectangle generator sets a plurality of character candidate elements as a line candidate rectangle, among character candidate elements aligned in horizontal line orientation, when each amount of displacement of the set character candidate elements in a vertical orientation with respect to the horizontal line orientation, is smaller than or equal to a threshold value. A horizontally oriented paragraph-box generator sets a plurality of line candidate elements having approximately the same length as each other in the vertical orientation, as a paragraph candidate element.
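The line-rectangle step in the abstract can be sketched in Python as follows. This is a simplified illustration rather than the patented procedure: boxes are assumed to be `(x, y, width, height)` tuples, vertical displacement is measured between box centers, and each box is compared only with the last box already placed in a line. The paragraph-grouping step is left out.

```python
def group_into_lines(boxes, threshold):
    """Group character boxes into horizontal line candidates.

    A box joins a line when the vertical displacement between its center
    and the center of the line's most recent box is <= threshold.
    """
    lines = []
    for box in sorted(boxes, key=lambda b: b[0]):  # left to right
        cy = box[1] + box[3] / 2
        for line in lines:
            last = line[-1]
            if abs(cy - (last[1] + last[3] / 2)) <= threshold:
                line.append(box)
                break
        else:
            # No existing line is close enough vertically: start a new one.
            lines.append([box])
    return lines


def bounding_rect(line):
    """Smallest rectangle (x, y, width, height) enclosing all boxes in a line."""
    left = min(b[0] for b in line)
    top = min(b[1] for b in line)
    right = max(b[0] + b[2] for b in line)
    bottom = max(b[1] + b[3] for b in line)
    return (left, top, right - left, bottom - top)


boxes = [
    (0, 0, 10, 10), (12, 1, 10, 10), (24, 0, 10, 10),  # first text line
    (0, 30, 10, 10), (12, 31, 10, 10),                 # second text line
]
rects = [bounding_rect(line) for line in group_into_lines(boxes, threshold=3)]
```

With a threshold of 3 pixels the five sample boxes fall into two line candidates, one per row of characters.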