    • 1. Granted patent
    • Document page segmentation in optical character recognition
    • US08509534B2
    • 2013-08-13
    • US12720943
    • 2010-03-10
    • Sasa Galic; Bogdan Radakovic; Nikola Todic
    • G06K9/36
    • G06K9/00456; G06K9/3283; G06K9/38; G06K9/4604; G06K2209/01
    • Page segmentation in an optical character recognition process is performed to detect textual objects and/or image objects. Textual objects in an input gray scale image are detected by selecting candidates for native lines which are sets of horizontally neighboring connected components (i.e., subsets of image pixels where each pixel from the set is connected with all remaining pixels from the set) having similar vertical statistics defined by values of baseline (the line upon which most text characters “sit”) and mean line (the line under which most of the characters “hang”). Binary classification is performed on the native line candidates to classify them as textual or non-textual through examination of any embedded regularity. Image objects are indirectly detected by detecting the image's background using the detected text to define the background. Once the background is detected, what remains (i.e., the non-background) is an image object.
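The line-candidate step described in the abstract above can be illustrated with a minimal sketch. It assumes connected components have already been reduced to bounding boxes, and it uses each box's top and bottom edges as rough stand-ins for the mean line and baseline. The `Box` type, the `group_line_candidates` name, and the `max_gap` and `tol` thresholds are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class Box:
    left: int
    right: int
    top: int     # assumed stand-in for the mean line
    bottom: int  # assumed stand-in for the baseline


def group_line_candidates(boxes, max_gap=10, tol=3):
    """Greedily chain boxes, left to right, whose top/bottom values are similar."""
    lines = []
    for box in sorted(boxes, key=lambda b: b.left):
        for line in lines:
            last = line[-1]
            # Horizontally neighboring: the box starts shortly after the last one ends.
            close = 0 <= box.left - last.right <= max_gap
            # Similar vertical statistics: both edges agree within the tolerance.
            similar = (abs(box.top - last.top) <= tol
                       and abs(box.bottom - last.bottom) <= tol)
            if close and similar:
                line.append(box)
                break
        else:
            lines.append([box])
    return lines
```

Each returned group is only a candidate; the abstract's subsequent textual versus non-textual classification is not covered by this sketch.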
    • 2. Patent application
    • DOCUMENT PAGE SEGMENTATION IN OPTICAL CHARACTER RECOGNITION
    • US20110222769A1
    • 2011-09-15
    • US12720943
    • 2010-03-10
    • Sasa Galic; Bogdan Radakovic; Nikola Todic
    • G06K9/34; G06K9/72
    • G06K9/00456; G06K9/3283; G06K9/38; G06K9/4604; G06K2209/01
    • Page segmentation in an optical character recognition process is performed to detect textual objects and/or image objects. Textual objects in an input gray scale image are detected by selecting candidates for native lines which are sets of horizontally neighboring connected components (i.e., subsets of image pixels where each pixel from the set is connected with all remaining pixels from the set) having similar vertical statistics defined by values of baseline (the line upon which most text characters “sit”) and mean line (the line under which most of the characters “hang”). Binary classification is performed on the native line candidates to classify them as textual or non-textual through examination of any embedded regularity. Image objects are indirectly detected by detecting the image's background using the detected text to define the background. Once the background is detected, what remains (i.e., the non-background) is an image object.
    • 4. Granted patent
    • Paragraph recognition in an optical character recognition (OCR) process
    • US08565474B2
    • 2013-10-22
    • US12720992
    • 2010-03-10
    • Bogdan Radakovic; Sasa Galic; Aleksandar Uzelac
    • G06K9/00
    • G06K9/00469; G06K9/00463
    • An image processing apparatus for detecting paragraphs in a textual image includes an input component for receiving an input image in which textual lines and words have been identified and a page classification component for classifying the input image as a first or second page type. The apparatus also includes a paragraph detection component for classifying all textual lines on the input image as a beginning paragraph line or a continuation paragraph line. The apparatus is also provided with a paragraph creation component for creating paragraphs that include textual lines between two successive beginning paragraph lines, including a first of the two successive beginning paragraph lines. The paragraphs that have been identified may be classified by the type of alignment they exhibit. For instance, paragraphs may be classified according to whether they are left aligned, right aligned, center aligned or justified.
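The paragraph creation step in the abstract above (each paragraph spans the lines from one beginning line up to, but not including, the next beginning line) can be sketched as follows. The sketch assumes the beginning versus continuation labels are already available as a list of booleans; the `create_paragraphs` name and the handling of a leading continuation line are assumptions, not details from the patent.

```python
def create_paragraphs(lines, is_beginning):
    """Group text lines into paragraphs, starting a new one at each beginning line."""
    paragraphs = []
    for line, begins in zip(lines, is_beginning):
        # Assumption: a continuation line with no paragraph open yet starts one.
        if begins or not paragraphs:
            paragraphs.append([line])
        else:
            paragraphs[-1].append(line)
    return paragraphs
```

For example, five lines labeled beginning, continuation, beginning, continuation, continuation yield two paragraphs of two and three lines.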
    • 5. Granted patent
    • Detecting position of word breaks in a textual line image
    • US08345978B2
    • 2013-01-01
    • US12749599
    • 2010-03-30
    • Aleksandar Uzelac; Bodin Dresevic; Sasa Galic; Bogdan Radakovic
    • G06K9/00
    • G06K9/344; G06K9/342; G06K2209/01
    • Line segmentation in an OCR process is performed to detect the positions of words within an input textual line image by extracting features from the input to locate breaks and then classifying the breaks into one of two break classes which include inter-word breaks and inter-character breaks. An output including the bounding boxes of the detected words and a probability that a given break belongs to the identified class can then be provided to downstream OCR or other components for post-processing. Advantageously, by reducing line segmentation to the extraction of features, including the position of each break and the number of break features, and break classification, the task of line segmentation is made less complex but with no loss of generality.
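The two-class break classification with an attached probability, as described in the abstract above, can be illustrated with a deliberately simple stand-in. This sketch uses a single feature (break width) and a logistic score around a fixed threshold. The patent's actual features and classifier are not given in the abstract, so `classify_breaks`, `threshold`, and `steepness` are all hypothetical.

```python
import math


def classify_breaks(gaps, threshold, steepness=1.0):
    """Label each (position, width) break and report the probability of that label."""
    results = []
    for position, width in gaps:
        # Logistic score: wider breaks are more likely to separate words.
        p_word = 1.0 / (1.0 + math.exp(-steepness * (width - threshold)))
        if p_word >= 0.5:
            results.append((position, "inter-word", p_word))
        else:
            results.append((position, "inter-character", 1.0 - p_word))
    return results
```

Word bounding boxes could then be derived by cutting the line at the positions labeled inter-word; that step is omitted here.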
    • 7. Patent application
    • PARAGRAPH RECOGNITION IN AN OPTICAL CHARACTER RECOGNITION (OCR) PROCESS
    • US20110222773A1
    • 2011-09-15
    • US12720992
    • 2010-03-10
    • Bogdan Radakovic; Sasa Galic; Aleksandar Uzelac
    • G06K9/18; G06K9/62
    • G06K9/00469; G06K9/00463
    • An image processing apparatus for detecting paragraphs in a textual image includes an input component for receiving an input image in which textual lines and words have been identified and a page classification component for classifying the input image as a first or second page type. The apparatus also includes a paragraph detection component for classifying all textual lines on the input image as a beginning paragraph line or a continuation paragraph line. The apparatus is also provided with a paragraph creation component for creating paragraphs that include textual lines between two successive beginning paragraph lines, including a first of the two successive beginning paragraph lines. The paragraphs that have been identified may be classified by the type of alignment they exhibit. For instance, paragraphs may be classified according to whether they are left aligned, right aligned, center aligned or justified.
    • 8. Patent application
    • Geometric parsing of mathematical expressions
    • US20080253657A1
    • 2008-10-16
    • US11784889
    • 2007-04-10
    • Bogdan Radakovic; Goran Predovic; Bodin Dresevic
    • G06K9/18
    • G06K9/00402
    • A processing device may parse a group of strokes representing a mathematical expression. The group of strokes may be examined to determine whether the group of strokes satisfies any of a finite set of rules. When the group of strokes, included in a region, satisfies any of the finite set of rules, the region may be partitioned according to a satisfied one of the finite set of rules. The group of strokes included in the region may be further examined to determine whether the group of strokes may be further partitioned according to any of the finite set of rules. After all regions have been examined and no further partitioning of regions may be performed, all mathematical symbols of the mathematical expression may be isolated in at least some of the regions and may be recognized.
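The recursive, rule-driven partitioning in the abstract above can be sketched with a small substitute rule set. The abstract does not list the patent's rules, so this example uses two projection-gap cuts (an empty gap along x, then along y) purely as an assumption. Strokes are represented as `(x0, y0, x1, y1)` bounding boxes, and `split_by_gap`, `RULES`, and `partition` are hypothetical names.

```python
def split_by_gap(strokes, lo, hi):
    """Split strokes at the first empty gap along one axis, or return None."""
    ordered = sorted(strokes, key=lo)
    reach = hi(ordered[0])
    for i in range(1, len(ordered)):
        if lo(ordered[i]) > reach:
            return ordered[:i], ordered[i:]
        reach = max(reach, hi(ordered[i]))
    return None


# Assumed finite rule set: try a vertical cut first, then a horizontal cut.
RULES = [
    lambda s: split_by_gap(s, lambda b: b[0], lambda b: b[2]),  # gap along x
    lambda s: split_by_gap(s, lambda b: b[1], lambda b: b[3]),  # gap along y
]


def partition(strokes):
    """Recursively partition a non-empty region until no rule is satisfied."""
    for rule in RULES:
        parts = rule(strokes)
        if parts:
            return partition(parts[0]) + partition(parts[1])
    return [strokes]
```

For a handwritten fraction followed by a plus sign and a digit, the x-gap rule first separates the fraction from the rest, and the y-gap rule then separates numerator, bar, and denominator. Symbol recognition on the resulting regions is outside this sketch.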
    • 10. Granted patent
    • Geometric parsing of mathematical expressions
    • US08064696B2
    • 2011-11-22
    • US11784889
    • 2007-04-10
    • Bogdan Radakovic; Goran Predovic; Bodin Dresevic
    • G06K9/00
    • G06K9/00402
    • A processing device may parse a group of strokes representing a mathematical expression. The group of strokes may be examined to determine whether the group of strokes satisfies any of a finite set of rules. When the group of strokes, included in a region, satisfies any of the finite set of rules, the region may be partitioned according to a satisfied one of the finite set of rules. The group of strokes included in the region may be further examined to determine whether the group of strokes may be further partitioned according to any of the finite set of rules. After all regions have been examined and no further partitioning of regions may be performed, all mathematical symbols of the mathematical expression may be isolated in at least some of the regions and may be recognized.