会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明授权
    • Method and system for preprocessing an image for optical character recognition
    • 用于光学字符识别的图像预处理方法和系统
    • US08218875B2
    • 2012-07-10
    • US12814448
    • 2010-06-12
    • Hussein Khalid Al-OmariMohammad Sulaiman Khorsheed
    • Hussein Khalid Al-OmariMohammad Sulaiman Khorsheed
    • G06K9/34G06K9/18G06K9/00G06K9/36H04N1/04
    • G06K9/00463G06K2209/013
    • A method and system for preprocessing an image for Optical Character Recognition (OCR), wherein the image includes a plurality of columns is disclosed. Each column includes one or more of Arabic text and non-text items. The method includes determining a plurality of components associated with one or more of the Arabic text and the non-text items, wherein a component includes a set of connected pixels. On determining the plurality of components, a line height and a column spacing is determined for the plurality of components. The plurality of components are then associated with a column of the plurality of columns based on the line height and the column spacing. Subsequently, a set of characteristic parameters are calculated for each column and the plurality of components of each column are merged based on the set of characteristic parameters to form sub-words and words.
    • 一种用于预处理光学字符识别(OCR)的图像的方法和系统,其中图像包括多个列。 每列包括一个或多个阿拉伯语文本和非文本项目。 该方法包括确定与阿拉伯语文本和非文本项目中的一个或多个相关联的多个组件,其中组件包括一组连接的像素。 在确定多个分量时,确定多个分量的行高和列间距。 然后,多个部件基于线高度和列间距与多列的列相关联。 随后,针对每列计算一组特征参数,并且基于特征参数集合来合并每列的多个分量以形成子词和词。
    • 6. 发明申请
    • METHOD AND SYSTEM FOR TEXT SEGMENTATION
    • 文本分割的方法和系统
    • US20120281919A1
    • 2012-11-08
    • US13102373
    • 2011-05-06
    • Ahmad AbdulkaderHussein Khalid Al-OmariMohammad Sulaiman Khorsheed
    • Ahmad AbdulkaderHussein Khalid Al-OmariMohammad Sulaiman Khorsheed
    • G06K9/34
    • G06K9/34G06K9/6814G06K2209/01G06K2209/013
    • A method and system for segmenting a text into a plurality of sections is provided. The text may be received in the form of an image. The method involves receiving one or more input labels from a user corresponding to one or more segmentation points of a plurality of segmentation points of the text. The plurality of segmentation points of the text are obtained by applying one or more segmentation heuristics over the text. The one or more input labels provided by the user are utilized to label the plurality of segmentation points of the text. In response to labeling, validation is performed to identify whether a segmentation point of the plurality of segmentation points is a valid segmentation point. Thereafter, based on the validation, a set of valid segmentation points is updated with one or more segmentation points of the plurality of segmentation points. The set of valid segmentation points facilitates segmentation of the text for recognizing the plurality of sections.
    • 提供了一种用于将文本分割成多个部分的方法和系统。 可以以图像的形式接收文本。 该方法涉及从对应于文本的多个分割点中的一个或多个分割点的用户接收一个或多个输入标签。 通过在文本上应用一个或多个分割启发式算法来获得文本的多个分割点。 由用户提供的一个或多个输入标签用于标记文本的多个分割点。 响应于标签,执行验证以识别多个分割点中的分割点是否是有效的分割点。 此后,基于验证,利用多个分割点中的一个或多个分割点更新一组有效分割点。 该组有效的分割点有助于文本的分割以识别多个部分。
    • 9. 发明申请
    • METHOD AND SYSTEM FOR PREPROCESSING AN IMAGE FOR OPTICAL CHARACTER RECOGNITION
    • 用于预处理光学字符识别的图像的方法和系统
    • US20110305387A1
    • 2011-12-15
    • US12814448
    • 2010-06-12
    • Hussein Khalid Al-OmariMohammad Sulaiman Khorsheed
    • Hussein Khalid Al-OmariMohammad Sulaiman Khorsheed
    • G06K9/00G06K9/18
    • G06K9/00463G06K2209/013
    • A method and system for preprocessing an image for Optical Character Recognition (OCR), wherein the image includes a plurality of columns is disclosed. Each column includes one or more of Arabic text and non-text items. The method includes determining a plurality of components associated with one or more of the Arabic text and the non-text items, wherein a component includes a set of connected pixels. On determining the plurality of components, a line height and a column spacing is determined for the plurality of components. The plurality of components are then associated with a column of the plurality of columns based on the line height and the column spacing. Subsequently, a set of characteristic parameters are calculated for each column and the plurality of components of each column are merged based on the set of characteristic parameters to form sub-words and words.
    • 一种用于预处理光学字符识别(OCR)的图像的方法和系统,其中图像包括多个列。 每列包括一个或多个阿拉伯语文本和非文本项目。 该方法包括确定与阿拉伯语文本和非文本项目中的一个或多个相关联的多个组件,其中组件包括一组连接的像素。 在确定多个分量时,确定多个分量的行高和列间距。 然后,多个部件基于线高度和列间距与多列的列相关联。 随后,针对每列计算一组特征参数,并且基于特征参数集合来合并每列的多个分量以形成子词和词。