会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 91. 发明公开
    • SYSTEM AND METHOD FOR AUTOMATIC PAGE REGISTRATION AND AUTOMATIC ZONE DETECTION DURING FORMS PROCESSING
    • 系统和方法自动页面检测和形式检测
    • EP0764308A1
    • 1997-03-26
    • EP96912690.0
    • 1996-04-10
    • Rebus Technology, Inc.
    • RANGARAJAN, Vijayakumar c/o Parthasarathy,
    • G06K9G06T3
    • G06T3/608G06K9/2054G06K2209/01
    • A system and method automatically detect user defined zones in a document image of a form, compensating for skew and displacement of the image with respect to an original image of form. The system provides a mechanism to input an image for a form document, such as a scanner. The system processes the image to reduce its resolution and to remove significant skew. The image is presented to the user to define the zones. These zones are areas from which the user desires to extract meaningful data through optical character recognition, such as names, dates, addresses, and items on an invoice form. The system further processes the image to remove horizontal and vertical lines, and to create a number of blocks, representing either text or image data. The lines are removed and the blocks form by runlength smoothing with various parameters. The blocks form clusters of connected pixels. The blocks are labeled such that any set of connected blocks share a unique identification value. Additional data is collected on the commonly labeled blocks to select those blocks useful to definition of a template. The template is a collection of vectors between the centroids of each of the selected blocks. A second document image for processing is obtained, and similarly processed to minimize, deskew, and identify blocks and vectors therein. The vectors in the second document image are compared with vectors in a user selected template to determine the location of user defined zones in the second document image.
    • 94. 发明公开
    • Forms recognition management system and method
    • 表格识别管理系统和方法
    • EP0651346A2
    • 1995-05-03
    • EP94115179.7
    • 1994-09-27
    • International Business Machines Corporation
    • Burger, Mark E.Sun, Hsiao
    • G06K9/68
    • G06K9/2054G06K9/6807G06K2209/01
    • Document form templates are grouped into groups of related form templates. The number of times a particular template is used by the system is counted for each group during a forms processing interval. Then when processing submitted forms, the method scans in a plurality of them in an aggregated submission. Forms recognition processing starts for a first form. The method begins searching for a form template to match the first form, starting with the most frequent group. If recognition of the first form is successful with the most frequent primary form template, then the method searches the group for the remaining submitted forms. If recognition of the first form is not successful with the most frequent group, then the method tries to match the first form with a template in the second most frequently processed primary group. In this manner, forms recognition of preprinted forms is managed, with the order of searching the template archive changed in response to the frequency with which particular form types are processed.
    • 文档表单模板被分组到相关表单模板的组中。 在表单处理间隔期间,为每个组计算系统使用特定模板的次数。 然后,在处理提交的表单时,该方法在汇总的提交中扫描多个表单。 表单识别处理以第一种形式开始。 该方法开始搜索表单模板以匹配第一个表单,从最频繁的组开始。 如果第一种形式的识别对于最常见的主要表单模板是成功的,那么该方法搜索组中剩余的提交表单。 如果第一种形式的识别对于最频繁的组不成功,那么该方法试图将第一种形式与第二种最频繁处理的主要组中的模板进行匹配。 以这种方式,管理预打印表格的表格识别,其中搜索响应于处理特定表格类型的频率而改变的模板存档的顺序。
    • 95. 发明公开
    • Method and system for managing character recognition of a plurality of document form images having common data types
    • 方法和装置用于字符识别多个具有公共的数据类型形式的图像的管理。
    • EP0646887A2
    • 1995-04-05
    • EP94114026.1
    • 1994-09-07
    • International Business Machines Corporation
    • Billings, Douglas W.
    • G06K9/20
    • G06K9/2054G06K2209/01
    • A family of form types having a common set of data types, can be transformed into a meta-form which contains all of the data types which are arranged into a single layout. The location of the fields on the meta-form are specified by a meta-form definition data set. The single meta-form definition data set is substituted in a character recognition processor, for the plurality of form definition data sets which specify the layouts of the original form types in the family. This speeds the character recognition operation by not requiring a time consuming search for customized form definitions. Furthermore, by transforming the layout of field images for each form type in a family, into a single layout in the meta-form, the fields of the meta-form can be quickly located to speed the decompression of meaningful parts of the compressed document form image, without requiring the time consuming decompression of other parts which are not of interest.
    • 具有共同的一组数据类型的形式类型的家庭,可以转化成一个元形成包含所有被布置成一个单一的布局中的数据类型。 元表格中的字段的位置是由一元格式定义数据集中指定。 单元形式定义数据集是在字符识别处理器取代的,表单定义数据的多组其中在家庭指定原始形式类型的布局。 这加快字符识别通过不需要耗时的搜索定制表单定义操作。 进一步,通过在所述元形式转化场图像的布局中的一个家庭的每个形式类型,成一个单一的布局中,元形式的字段可以被快速地定位,以加速压缩文档形式的有意义的部分的解压缩 图像,而不需要耗费哪些是不感兴趣的其他部位的减压时间。