    • 2. Granted invention patent
    • Segmentation of a word bitmap into individual characters or glyphs during an OCR process
    • US08571270B2
    • 2013-10-29
    • US12776576
    • 2010-05-10
    • Djordje Nijemcevic
    • Djordje Nijemcevic
    • G06K9/34
    • G06K9/342; G06K2209/01
    • An image processing apparatus is provided that includes a character chopper component that segments words into individual characters in a bitmap of a textual image undergoing an OCR process. The character chopper component is configured to produce a set of (possibly curved) chop-lines which divide a bitmap of any given word into its individual character or glyph candidates. Cases where an input bitmap contains two separate words are handled by marking the place where those words should be split. The character segmentation algorithm computes the set of vertically oriented, curved chop-lines by considering glyph and background colors in a given word bitmap. The set is afterwards filtered using various heuristics, in order to preserve those lines that do separate a word's glyphs and to minimize the number of those that do not.
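The chop-line idea in the abstract above can be illustrated with a much simpler stand-in. The sketch below is not the patented method: it assumes a binary bitmap (0 = background, 1 = glyph), uses straight vertical chop-lines instead of curved ones, and applies a single illustrative filtering heuristic (one chop per run of empty columns, ignoring the margins). The function name `find_chop_columns` is hypothetical.

```python
def find_chop_columns(bitmap, background=0):
    """Return candidate chop columns for a word bitmap.

    Assumption: a chop-line is a straight vertical line through a column
    that contains only background pixels. The abstract describes curved
    lines chosen from glyph and background colors, which this does not model.
    """
    if not bitmap:
        return []
    width = len(bitmap[0])
    # A column is "empty" when every pixel in it is background.
    empty = [all(row[x] == background for row in bitmap) for x in range(width)]

    chops = []
    x = 0
    while x < width:
        if not empty[x]:
            x += 1
            continue
        start = x
        while x < width and empty[x]:
            x += 1
        end = x - 1
        # Heuristic filter: skip runs that touch the left or right border,
        # since they are margins and do not separate two glyphs.
        if start > 0 and end < width - 1:
            chops.append((start + end) // 2)  # one chop per run, at its middle
    return chops


word = [
    [1, 1, 0, 0, 1, 0, 1, 1],
    [1, 0, 0, 0, 1, 0, 1, 0],
    [1, 1, 0, 0, 1, 0, 1, 1],
]
print(find_chop_columns(word))  # [2, 5]
```

Collapsing each run of empty columns into one chop is the only filtering step here; the abstract refers to "various heuristics" without listing them, so no others are assumed.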
    • 3. Granted invention patent
    • Text enhancement of a textual image undergoing optical character recognition
    • US08526732B2
    • 2013-09-03
    • US12720732
    • 2010-03-10
    • Sasa Galic; Djordje Nijemcevic; Bodin Dresevic
    • Sasa Galic; Djordje Nijemcevic; Bodin Dresevic
    • G06K9/00
    • G06K9/4638; G06K9/38; G06K2209/01; G06K2209/015
    • A method for enhancing a textual image that is to undergo optical character recognition begins by receiving an image that includes native lines of text. A background line profile is determined which represents an average background intensity along the native lines in the image. Likewise, a foreground line profile is determined which represents an average foreground intensity along the native lines in the image. The pixels in the image are assigned to either a background or foreground portion of the image based at least in part on the background line profile and the foreground line profile. The intensity of the pixels assigned to the background portion of the image is adjusted to a maximum brightness so as to represent a portion of the image that does not include text.
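As a rough illustration of the enhancement steps in the abstract above, the sketch below processes one text line of grayscale pixels (0 = black, 255 = white). Several things are assumptions and not taken from the patent: the text is dark on a light background, each profile is reduced to a single average per line, the provisional split uses the mean intensity, and the name `enhance_line` is hypothetical.

```python
def enhance_line(line, max_brightness=255):
    """Push background pixels of one text line to maximum brightness.

    `line` is a list of rows of grayscale values. Assumption: dark text on
    a light background, and one average intensity per line stands in for
    the background and foreground line profiles.
    """
    pixels = [p for row in line for p in row]
    split = sum(pixels) / len(pixels)  # provisional split (assumption)
    bright = [p for p in pixels if p > split]
    dark = [p for p in pixels if p <= split]
    if not bright or not dark:
        # Uniform line: treat everything as background (no text found).
        return [[max_brightness] * len(row) for row in line]

    background = sum(bright) / len(bright)  # average background intensity
    foreground = sum(dark) / len(dark)      # average foreground intensity
    threshold = (background + foreground) / 2

    # Pixels closer to the background average become pure white;
    # foreground pixels keep their original intensity.
    return [[p if p < threshold else max_brightness for p in row]
            for row in line]


line = [[200, 40, 210],
        [190, 60, 220]]
print(enhance_line(line))  # [[255, 40, 255], [255, 60, 255]]
```

Computing the averages per line and not per image is what lets uneven lighting across a page be handled line by line, which is the point of using line profiles.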
    • 6. Granted invention patent
    • Resolution adjustment of an image that includes text undergoing an OCR process
    • US08311331B2
    • 2012-11-13
    • US12719894
    • 2010-03-09
    • Djordje Nijemcevic; Milan Vugdelija; Bodin Dresevic
    • Djordje Nijemcevic; Milan Vugdelija; Bodin Dresevic
    • G06K9/34
    • G06K9/3283; G06K9/00463; G06K2209/01
    • An optical character recognition process characterizes text lines in a textual image by their base-line, mean-line and x-height. The base-line for at least one text line in the image is determined by finding a parametric curve that maximizes a first fitness function that depends on the values of pixels through which the parametric curve passes and pixels below the parametric curve. The base-line corresponds to the parametric curve for which the first fitness function is maximized. The first fitness function is designed so that it increases with increasing lightness or brightness of pixels immediately below the parametric curve while also increasing with decreasing lightness of pixels through which the parametric curve passes. The mean-line is determined by incrementally shifting the base-line upward by predetermined amounts (e.g., a single pixel) until a second fitness function for the shifted base-line is maximized. The second fitness function is essentially the inverse of the first fitness function. Specifically, the second fitness function increases with increasing lightness of pixels immediately above the shifted base-line while also increasing with decreasing lightness of pixels through which the shifted base-line passes. The x-height is equal to the sum of the predetermined amounts by which the base-line is shifted upward in order to maximize the second fitness function. In some cases different groups of text-lines in the textual image may be characterized differently from one another. For example, each group may be characterized by a most probable x-height for that group.
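The two fitness functions in the abstract above can be sketched for the simplest case. The code below assumes a straight horizontal base-line (a single row index) in place of a parametric curve, a grayscale image with dark text on a light background, one-pixel upward shifts, and rows outside the image treated as white. The function names are hypothetical and the fitness formulas are one plausible reading of the abstract, not the patented definitions.

```python
def row_lightness(image, y):
    """Mean lightness of row y; rows outside the image count as white."""
    if y < 0 or y >= len(image):
        return 255.0
    return sum(image[y]) / len(image[y])


def find_baseline(image):
    """Row maximizing: lightness just below minus lightness on the row."""
    return max(range(len(image)),
               key=lambda y: row_lightness(image, y + 1) - row_lightness(image, y))


def find_mean_line(image, baseline):
    """Shift the base-line up one pixel at a time and keep the best shift.

    Second fitness: lightness just above minus lightness on the row.
    Returns (mean_line_row, x_height), where x_height is the total shift.
    """
    if baseline == 0:
        return 0, 0
    def fitness(shift):
        y = baseline - shift
        return row_lightness(image, y - 1) - row_lightness(image, y)
    best_shift = max(range(1, baseline + 1), key=fitness)
    return baseline - best_shift, best_shift


image = [
    [255, 255, 255, 255],
    [255,   0, 255, 255],  # ascender
    [  0,   0,   0, 255],  # top of the x-height zone
    [  0, 255,   0, 255],
    [  0, 255,   0, 255],
    [  0,   0,   0, 255],  # bottom of the x-height zone
    [255, 255, 255, 255],
    [255, 255, 255, 255],
]
baseline = find_baseline(image)
print(baseline, find_mean_line(image, baseline))  # 5 (2, 3)
```

Note that the ascender row does not win the second fitness function: the jump from the lightly inked ascender row to the densely inked x-height row is larger than the jump from white to the ascender.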
    • 7. Invention patent application
    • WORD RECOGNITION OF TEXT UNDERGOING AN OCR PROCESS
    • US20110268360A1
    • 2011-11-03
    • US12772376
    • 2010-05-03
    • Aleksandar Antonijevic; Ivan Mitic; Mircea Cimpoi; Djordje Nijemcevic
    • Aleksandar Antonijevic; Ivan Mitic; Mircea Cimpoi; Djordje Nijemcevic
    • G06K9/34
    • G06K9/344; G06K2209/01
    • A method for identifying words in a textual image undergoing optical character recognition includes receiving a bitmap of an input image which includes textual lines that have been segmented by a plurality of chop lines. The chop lines are each associated with a confidence level reflecting a degree to which the respective chop line properly segments the textual line into individual characters. One or more words are identified in one of the textual lines based at least in part on the textual lines and a first subset of the plurality of chop lines which have a chop line confidence level above a first threshold value. If the first word is not associated with a sufficiently high word confidence level, at least a second word in the textual line is identified based at least in part on a second subset of the plurality of chop lines which have a confidence level above a second threshold value lower than the first threshold value.
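The two-threshold retry described in the abstract above can be sketched as follows. Everything concrete here is an assumption made for illustration: chop lines are `(position, confidence)` pairs, the recognizer is a caller-supplied function returning `(word, word_confidence)`, and the threshold values, `recognize_with_fallback`, and `toy_recognize` are hypothetical.

```python
def recognize_with_fallback(chop_lines, recognize,
                            first_threshold=0.8,
                            second_threshold=0.5,
                            min_word_confidence=0.7):
    """Recognize a word, relaxing the chop-line threshold if needed.

    chop_lines: list of (position, confidence) pairs.
    recognize:  callable taking a list of positions and returning
                (word, word_confidence). Supplied by the caller.
    """
    # First pass: only chop lines above the higher threshold.
    strong = [pos for pos, conf in chop_lines if conf > first_threshold]
    word, word_conf = recognize(strong)
    if word_conf >= min_word_confidence:
        return word, word_conf

    # Second pass: also admit chop lines above the lower threshold.
    relaxed = [pos for pos, conf in chop_lines if conf > second_threshold]
    return recognize(relaxed)


def toy_recognize(positions):
    """Stand-in recognizer: pretends three chop lines segment the word well."""
    return ("word", 0.9) if len(positions) == 3 else ("wd", 0.3)


chops = [(3, 0.9), (7, 0.6), (11, 0.95)]
print(recognize_with_fallback(chops, toy_recognize))  # ('word', 0.9)
```

Starting with the high-confidence subset keeps the first pass cheap and avoids over-segmentation; the lower threshold is consulted only when the first result is not trusted.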
    • 8. Invention patent application
    • RESOLUTION ADJUSTMENT OF AN IMAGE THAT INCLUDES TEXT UNDERGOING AN OCR PROCESS
    • US20110222772A1
    • 2011-09-15
    • US12719894
    • 2010-03-09
    • Djordje Nijemcevic; Milan Vugdelija; Bodin Dresevic
    • Djordje Nijemcevic; Milan Vugdelija; Bodin Dresevic
    • G06K9/18
    • G06K9/3283; G06K9/00463; G06K2209/01
    • An optical character recognition process characterizes text lines in a textual image by their base-line, mean-line and x-height. The base-line for at least one text line in the image is determined by finding a parametric curve that maximizes a first fitness function that depends on the values of pixels through which the parametric curve passes and pixels below the parametric curve. The base-line corresponds to the parametric curve for which the first fitness function is maximized. The first fitness function is designed so that it increases with increasing lightness or brightness of pixels immediately below the parametric curve while also increasing with decreasing lightness of pixels through which the parametric curve passes. The mean-line is determined by incrementally shifting the base-line upward by predetermined amounts (e.g., a single pixel) until a second fitness function for the shifted base-line is maximized. The second fitness function is essentially the inverse of the first fitness function. Specifically, the second fitness function increases with increasing lightness of pixels immediately above the shifted base-line while also increasing with decreasing lightness of pixels through which the shifted base-line passes. The x-height is equal to the sum of the predetermined amounts by which the base-line is shifted upward in order to maximize the second fitness function. In some cases different groups of text-lines in the textual image may be characterized differently from one another. For example, each group may be characterized by a most probable x-height for that group.