    • 3. Invention application
    • DOCUMENT PAGE SEGMENTATION IN OPTICAL CHARACTER RECOGNITION
    • US20110222769A1
    • 2011-09-15
    • US12720943
    • 2010-03-10
    • Sasa Galic; Bogdan Radakovic; Nikola Todic
    • Sasa Galic; Bogdan Radakovic; Nikola Todic
    • G06K9/34; G06K9/72
    • G06K9/00456; G06K9/3283; G06K9/38; G06K9/4604; G06K2209/01
    • Page segmentation in an optical character recognition process is performed to detect textual objects and/or image objects. Textual objects in an input gray scale image are detected by selecting candidates for native lines which are sets of horizontally neighboring connected components (i.e., subsets of image pixels where each pixel from the set is connected with all remaining pixels from the set) having similar vertical statistics defined by values of baseline (the line upon which most text characters “sit”) and mean line (the line under which most of the characters “hang”). Binary classification is performed on the native line candidates to classify them as textual or non-textual through examination of any embedded regularity. Image objects are indirectly detected by detecting the image's background using the detected text to define the background. Once the background is detected, what remains (i.e., the non-background) is an image object.
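The grouping step in this abstract (horizontally neighboring connected components with similar baseline and mean line) can be illustrated with a minimal sketch. It is not the patented method: the `Box` type, the `max_gap` and `tol` parameters, and the use of each component's bottom and top edge as stand-ins for baseline and mean line are all assumptions made for illustration.

```python
from dataclasses import dataclass
from statistics import median


@dataclass
class Box:
    """Bounding box of one connected component (hypothetical helper type)."""
    left: int
    right: int
    top: int      # used here as a stand-in for the mean line
    bottom: int   # used here as a stand-in for the baseline


def group_native_lines(boxes, max_gap=10, tol=3):
    """Greedily group components into native line candidates.

    A component joins an existing candidate when it is horizontally close to
    the candidate's last component and its top/bottom edges are within `tol`
    pixels of the candidate's median top/bottom. Both thresholds are
    illustrative assumptions.
    """
    lines = []
    for box in sorted(boxes, key=lambda b: b.left):
        for line in lines:
            baseline = median(b.bottom for b in line)
            mean_line = median(b.top for b in line)
            if (box.left - line[-1].right <= max_gap
                    and abs(box.bottom - baseline) <= tol
                    and abs(box.top - mean_line) <= tol):
                line.append(box)
                break
        else:
            # No compatible candidate found: start a new one.
            lines.append([box])
    return lines
```

The binary textual/non-textual classification and the background detection described in the abstract are not covered by this sketch.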
    • 4. Granted invention patent
    • Text enhancement of a textual image undergoing optical character recognition
    • US08526732B2
    • 2013-09-03
    • US12720732
    • 2010-03-10
    • Sasa Galic; Djordje Nijemcevic; Bodin Dresevic
    • Sasa Galic; Djordje Nijemcevic; Bodin Dresevic
    • G06K9/00
    • G06K9/4638; G06K9/38; G06K2209/01; G06K2209/015
    • A method for enhancing a textual image for undergoing optical character recognition begins by receiving an image that includes native lines of text. A background line profile is determined which represents an average background intensity along the native lines in the image. Likewise, a foreground line profile is determined which represents an average foreground background intensity along the native lines in the image. The pixels in the image are assigned to either a background or foreground portion of the image based at least in part on the background line profile and the foreground line profile. The intensity of the pixels designated to the background portion of the image is adjusted to a maximum brightness so as to represent a portion of the image that does not include text.
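The idea of background and foreground line profiles can be sketched roughly as follows, assuming NumPy and a single native line supplied as a grayscale strip. The function name `enhance_line`, the provisional split at the global mean, the per-column profiles, and the midpoint threshold are illustrative assumptions; the abstract does not say how the profiles are computed.

```python
import numpy as np


def enhance_line(strip):
    """Whiten the background of one grayscale text-line strip (H x W).

    Assumed approach: split pixels provisionally at the global mean, compute
    the average background and foreground intensity per column, then call a
    pixel background when it is brighter than the midpoint of the two
    profiles in its column.
    """
    strip = strip.astype(float)
    rough_fg = strip < strip.mean()  # dark pixels are provisional text
    rough_bg = ~rough_fg

    # Global fallbacks for columns that contain only one class.
    g_fg = strip[rough_fg].mean() if rough_fg.any() else 0.0
    g_bg = strip[rough_bg].mean() if rough_bg.any() else 255.0

    fg_count = rough_fg.sum(axis=0)
    bg_count = rough_bg.sum(axis=0)
    fg_sum = np.where(rough_fg, strip, 0.0).sum(axis=0)
    bg_sum = np.where(rough_bg, strip, 0.0).sum(axis=0)
    fg_profile = np.where(fg_count > 0, fg_sum / np.maximum(fg_count, 1), g_fg)
    bg_profile = np.where(bg_count > 0, bg_sum / np.maximum(bg_count, 1), g_bg)

    # Assign each pixel to the nearer profile; set background to maximum brightness.
    is_background = strip > (fg_profile + bg_profile) / 2
    out = strip.copy()
    out[is_background] = 255
    return out.astype(np.uint8)
```

Because the threshold varies per column, this kind of scheme can tolerate gradual lighting changes along the line better than a single global threshold would.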
    • 5. Granted invention patent
    • Resolution adjustment of an image that includes text undergoing an OCR process
    • US08380009B2
    • 2013-02-19
    • US12721705
    • 2010-03-11
    • Sasa Galic
    • Sasa Galic
    • G06K9/32; G06K15/02
    • G06K9/42; G06K2209/01
    • A system and method is provided which rescales a received image to an optimal size to undergo an optical character recognition (OCR) process. The system includes an optimal size determination component that determines an optimum size for the image such that processing time of the received image is minimized without affecting accuracy. The optimal size determination component determines the optimum size of the image based at least in part on a dominant interline spacing of text and a dominant text height. The system also includes a rescaling component that resizes the received image to the determined optimum size.
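A rough sketch of choosing a rescale factor from the dominant text height and dominant interline spacing is shown below. The target height of 20 pixels, the minimum spacing of 4 pixels, and the function names are hypothetical values chosen for illustration; the abstract does not state how the optimum is computed.

```python
from collections import Counter


def dominant(values):
    """Most frequent value in a list of measurements."""
    return Counter(values).most_common(1)[0][0]


def optimal_scale(text_heights, line_spacings, target_height=20, min_spacing=4):
    """Pick a scale factor that maps the dominant text height to `target_height`.

    The factor is never allowed to shrink the dominant interline spacing
    below `min_spacing` pixels, so that adjacent lines stay separable.
    Both constants are assumptions, not values from the patent.
    """
    scale = target_height / dominant(text_heights)
    return max(scale, min_spacing / dominant(line_spacings))


def rescaled_size(width, height, scale):
    """Image dimensions after applying the scale factor."""
    return round(width * scale), round(height * scale)
```

For example, a 3000 x 2000 scan whose text is mostly 40 pixels tall would be halved under these assumptions, cutting the pixel count to a quarter.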
    • 6. Granted invention patent
    • Techniques in optical character recognition
    • US08189961B2
    • 2012-05-29
    • US12797219
    • 2010-06-09
    • Djordje Nijemcevic; Sasa Galic
    • Djordje Nijemcevic; Sasa Galic
    • G06K9/00
    • G06K9/3283; G06K2209/01
    • An image deskew system and techniques are used in the context of optical character recognition. An image is obtained of an original set of characters in an original linear (horizontal) orientation. An acquired set of characters, which is skewed relative to the original linear orientation by a rotation angle, is represented by pixels of the image. The rotation angle is estimated, and a confidence value may be associated with the estimation, to determine whether to deskew the image. In connection with rotation angle estimation, an edge detection filter is applied to the acquired set of characters to produce an edge map, which is input to a linear Hough transform filter to produce a set of output lines in parametric form. The output lines are assigned scores, and based on the scores, at least one output line is determined to be a dominant line with a slope approximating the rotation angle.
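The pipeline in this abstract (edge map, linear Hough transform, scored lines, dominant line) can be approximated with a small NumPy sketch. The vertical-difference edge filter, the angle range and step, the scoring by largest accumulator bin, and the vote-fraction confidence are all simplifying assumptions rather than details taken from the patent.

```python
import numpy as np


def edge_map(gray):
    """Mark sharp intensity changes between vertically adjacent rows."""
    diff = np.abs(np.diff(gray.astype(float), axis=0))
    peak = diff.max() if diff.size else 0.0
    if peak == 0:
        return np.zeros(diff.shape, dtype=bool)
    return diff > peak / 2


def estimate_skew(edges, max_angle=15.0, step=0.5):
    """Estimate the skew angle in degrees from a boolean edge map.

    For each candidate angle, every edge pixel votes for the offset `rho`
    of the line through it with that slope. The angle whose best line
    collects the most votes is taken as the dominant direction. Returns
    (angle, confidence), where confidence is the fraction of edge pixels
    on that line (a hypothetical measure). Positive angles mean the line
    descends to the right, since image rows grow downward.
    """
    ys, xs = np.nonzero(edges)
    if xs.size == 0:
        return 0.0, 0.0
    best_angle, best_votes = 0.0, 0
    for angle in np.arange(-max_angle, max_angle + step, step):
        t = np.deg2rad(angle)
        # Points on a line with slope tan(t) share the same value of rho.
        rho = np.round(ys * np.cos(t) - xs * np.sin(t)).astype(int)
        votes = np.bincount(rho - rho.min()).max()
        if votes > best_votes:
            best_angle, best_votes = float(angle), int(votes)
    return best_angle, best_votes / xs.size
```

A caller could compare the returned confidence against a threshold to decide whether to deskew at all, mirroring the role the abstract gives to the confidence value.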
    • 8. Invention application
    • PARAGRAPH RECOGNITION IN AN OPTICAL CHARACTER RECOGNITION (OCR) PROCESS
    • US20110222773A1
    • 2011-09-15
    • US12720992
    • 2010-03-10
    • Bogdan Radakovic; Sasa Galic; Aleksandar Uzelac
    • Bogdan Radakovic; Sasa Galic; Aleksandar Uzelac
    • G06K9/18; G06K9/62
    • G06K9/00469; G06K9/00463
    • An image processing apparatus for detecting paragraphs in a textual image includes an input component for receiving an input image in which textual lines and words have been identified and a page classification component for classifying the input image as a first or second page type. The apparatus also includes a paragraph detection component for classifying all textual lines on the input image as a beginning paragraph line or a continuation paragraph line. The apparatus is also provided with a paragraph creation component for creating paragraphs that include textual lines between two successive beginning paragraph lines, including a first of the two successive beginning paragraph lines. The paragraphs that have been identified may be classified by the type of alignment they exhibit. For instance, paragraphs may be classified according to whether they are left aligned, right aligned, center aligned or justified.
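The paragraph creation and alignment steps can be sketched as below, assuming each textual line is already reduced to a `(left, right)` pair of horizontal extents and that the beginning/continuation decision is supplied as a list of booleans. The function names, the `tol` tolerance, and the simple edge-spread test are assumptions; the abstract does not specify how lines are classified.

```python
def build_paragraphs(lines, is_beginning):
    """Group lines so each paragraph runs from one beginning line up to the next."""
    paragraphs = []
    for line, begins in zip(lines, is_beginning):
        if begins or not paragraphs:
            paragraphs.append([])
        paragraphs[-1].append(line)
    return paragraphs


def classify_alignment(paragraph, tol=2):
    """Label a paragraph by how its line edges line up.

    Each line is a (left, right) tuple. Edges count as flush when their
    spread is within `tol` pixels (an illustrative threshold).
    """
    lefts = [left for left, _ in paragraph]
    rights = [right for _, right in paragraph]
    centers = [(left + right) / 2 for left, right in paragraph]
    left_flush = max(lefts) - min(lefts) <= tol
    right_flush = max(rights) - min(rights) <= tol
    if left_flush and right_flush:
        return "justified"
    if left_flush:
        return "left"
    if right_flush:
        return "right"
    if max(centers) - min(centers) <= tol:
        return "center"
    return "unknown"
```

This sketch ignores first-line indents and short last lines, which a practical classifier would need to exclude before measuring edge spread.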
    • 10. Invention application
    • TECHNIQUES IN OPTICAL CHARACTER RECOGNITION
    • US20110305393A1
    • 2011-12-15
    • US12797219
    • 2010-06-09
    • Djordje Nijemcevic; Sasa Galic
    • Djordje Nijemcevic; Sasa Galic
    • G06K9/18
    • G06K9/3283; G06K2209/01
    • An image deskew system and techniques are used in the context of optical character recognition. An image is obtained of an original set of characters in an original linear (horizontal) orientation. An acquired set of characters, which is skewed relative to the original linear orientation by a rotation angle, is represented by pixels of the image. The rotation angle is estimated, and a confidence value may be associated with the estimation, to determine whether to deskew the image. In connection with rotation angle estimation, an edge detection filter is applied to the acquired set of characters to produce an edge map, which is input to a linear Hough transform filter to produce a set of output lines in parametric form. The output lines are assigned scores, and based on the scores, at least one output line is determined to be a dominant line with a slope approximating the rotation angle.