会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明公开
    • CHARACTER SEGMENTATION AND RECOGNITION METHOD
    • 特征分割和识别方法
    • EP3258422A1
    • 2017-12-20
    • EP15881706.4
    • 2015-06-26
    • GRG Banking Equipment Co., Ltd.
    • WANG, WeifengQIU, XinhuaWANG, RongqiuWANG, Kun
    • G06K9/20
    • G06K9/344G06K9/346G06K9/348G06K2209/01
    • A character segmentation and recognition method, which is used for resolving the problems that the prior-art segmentation method is low in capacity for recognizing characters under a complex background and poor in contamination interference resistance. The method comprises: collecting image data to obtain an image to be recognized; locating a character-row candidate area on the image to be recognized; acquiring preset character-row prior information, wherein the character-row prior information comprises the number of characters, a character pitch and a character size; acquiring a corresponding segmentation point template according to the character-row prior information; acquiring reliability of the segmentation point template on different positions when the character-row candidate area is traversed; determining a position with the highest reliability as an optimum segmentation position; segmenting the character-row candidate area according to the segmentation point template and the optimum segmentation position to obtain a plurality of single character areas; and performing character recognition on the single character area to obtain a corresponding recognition result.
    • 一种字符分割识别方法,用于解决现有分割方法在复杂背景下字符识别能力较低,抗污染干扰能力较差的问题。 该方法包括:收集图像数据以获得待识别的图像; 在要识别的图像上定位字符行候选区域; 获取预设的字符行先验信息,所述字符行先验信息包括字符数,字符间距和字符大小; 根据所述字符行先验信息获取对应的分割点模板; 当遍历所述字符行候选区域时,获取所述分割点模板在不同位置的可靠性; 确定具有最高可靠性的位置作为最佳分割位置; 根据分割点模板和最佳分割位置对字符行候选区域进行分割,得到多个单个字符区域; 并对单个字符区域进行字符识别以获得相应的识别结果。
    • 2. 发明公开
    • Character recognition device
    • Zeichenerkennungsvorrichtung
    • EP2214124A2
    • 2010-08-04
    • EP10160939.4
    • 2009-03-30
    • Fujitsu Frontech Limited
    • EGUCHI, ShinichiKAWASHIMA, HajimeKANAMOTO, KouichiHASEGAWA, ShoheiKOBARA, KatsutoshiYABUKI, Maki
    • G06K9/34G06K9/20
    • G06K9/346G06K9/00449G06K2209/01G06K2209/015
    • A character frame specification device for specifying a character frame in a form comprises: a line segment extracting unit for extracting a line segment corresponding to a character frame from image data in the form; a space table generation unit for calculating a space between two arbitrary vertical lines in each of all combinations in the case where the two arbitrary vertical lines are selected from respective vertical lines of a line segment extracted by the line segment extraction unit and generating a space table indicating the calculated spaces; a vote table generation unit for counting the number of spaces which are the same from among the spaces indicated in the space table, and generating a vote table indicating the total count value for each of the spaces; a space assumption unit for assuming a space whose total number indicated in the vote table is largest as the space of the character frame; a space modification unit for modifying the or each space in a shape pattern of a character frame registered in advance to the space assumed by the space assumption unit; and a specification unit for matching the shape pattern of the character frame whose space is modified by the space modification unit with a pattern of a line segment extracted by the line segment extraction unit and specifying a character frame on the basis of a result of the pattern matching.
    • 用于指定形式的字符帧的字符帧指定装置包括:线段提取单元,用于从形式的图像数据中提取与字符帧相对应的线段; 空间表生成单元,用于在从线段提取单元提取的线段的各垂直线中选择两条任意的垂直线的情况下,计算所有组合中的两个任意垂直线之间的间隔,并生成空格表 指示计算的空间; 投票表生成单元,用于从空格表中指示的空格中对相同的空格数进行计数,并生成表示每个空格的总计数值的投票表; 用于假设投票表中指示的总数最大的空格的空间假设单元作为字符框的空格; 空间修改单元,用于将预先登记的字符帧的形状图案中的每个空间修改为由空间假设单元假设的空间; 以及规格单元,用于将由空间修改单元修改的空格的字符框的形状图案与由线段提取单元提取的线段的图案进行匹配,并基于图案的结果指定字符帧 匹配。
    • 9. 发明公开
    • Ruled line extracting apparatus for extracting ruled line from normal document image and method thereof
    • 装置和方法用于正常文档图像内的表线的提取
    • EP0854434A1
    • 1998-07-22
    • EP97306162.5
    • 1997-08-13
    • FUJITSU LIMITED
    • Katsuyama, Yutaka
    • G06K9/20G06K9/34
    • G06K9/346G06K2209/01
    • A ruled line extracting apparatus obtains circumscribed rectangles of pixel concatenation regions included in an input pattern, and calculates the most frequent value of their heights. Additionally, the apparatus integrates segments by ignoring a wild card segment, and calculates the most frequent value of height/width of extracted straight lines and segments structuring the straight line. Next, it performs a process for integrating/deleting straight lines using each threshold value based on the highest frequency value. Then, it checks/deletes a straight line according to a distribution of black pixels around the straight line, and recognizes the remaining straight lines as ruled line candidates.
    • 的格线提取装置在输入模式获得包括像素级联区域的外接矩形的,并且计算它们的高度的最频繁的值。 另外,该装置通过忽略通配符段集成段,并计算所提取的直线和段构造的直线高度/宽度的最频繁的值。 接着,它执行用于INTEGRA婷过程/删除使用基于所述最高频率值中的每个阈值的直线。 然后,它检查/删除的直线gemäß到黑色像素的周围的直线的分布,并认识到剩余的直线为格线的候选者。
    • 10. 发明公开
    • System for extracting attached text from a table-cell frame
    • 用于从表格单元框架中提取附加文本的系统
    • EP0814422A3
    • 1998-01-28
    • EP97304087.6
    • 1997-06-11
    • CANON KABUSHIKI KAISHA
    • Shin-Ywan, Wang
    • G06K9/20
    • G06K9/00449G06K9/346G06K2209/01
    • A method for identifying and extracting text data from a table-cell frame. The method includes the steps of tracing connected components of a document image, tracing white contours within a connected component, defining a frame outline based on the white contours, identifying unattached character data inside the frame outline, and defining an initial rectangular area inside the frame outline. The method further includes detecting black pixels in a horizontal or vertical direction from the initial rectangular area in order to create an extended character area, locating boundary pixels lying inside the extended character area for each white contour, identifying black pixels positioned between boundary pixels lying inside the extended character area, combining black pixels positioned between boundary pixels lying inside the extended character area so as to form at least one connected component, recognizing the at least one connected component as a text component if it is not recognized as a vertical line, as a horizontal line, as part of a broken line, or as part of the frame, and defining a character node of a hierarchical tree structure corresponding to the extended character area and containing both the at least one connected component and any identified unattached connected components.
    • 一种从表格单元框架中识别和提取文本数据的方法。 该方法包括以下步骤:跟踪文档图像的连通分量,跟踪连通组件内的白色轮廓,基于白色轮廓定义框架轮廓,识别框架轮廓内的未附加字符数据,以及在框架内定义初始矩形区域 大纲。 该方法还包括从初始矩形区域检测水平或垂直方向上的黑色像素,以便创建扩展字符区域,对于每个白色轮廓定位位于扩展字符区域内的边界像素,识别位于位于内部的边界像素之间的黑色像素 所述扩展字符区域组合位于所述扩展字符区域内的边界像素之间的黑色像素以形成至少一个连接的组件,如果所述至少一个连接的组件未被识别为垂直线,则将所述至少一个连接的组件识别为文本组件, 水平线,作为虚线的一部分或者作为框架的一部分,并且定义对应于扩展字符区域并包含至少一个连接组件和任何识别的未连接的连接组件的层级树结构的字符节点。