会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 103. 发明授权
    • Image processing apparatus
    • 图像处理装置
    • US08045228B2
    • 2011-10-25
    • US12048733
    • 2008-03-14
    • Yuji Takemoto
    • Yuji Takemoto
    • H04N1/40
    • G06F17/30011G06K9/00442G06K9/00483H04N1/2166H04N2201/0094
    • An image processing apparatus including a data inputting part for inputting image data, a document recognizing part for recognizing the image data as a document, a document storing part for storing document data corresponding to the document recognized by the document recognizing part, and a stored document managing part for managing the document data stored in the document storing part is disclosed. The image processing apparatus has a document analyzing part configured to analyze the input image data, a text writing part configured to obtain an analysis result from the document analyzing part and write the analysis result in a text format, a part configured to associate the analysis result to the document data stored in the document storing part and register the analysis result in correspondence with the document data, and a part configured to search for a target document by referring to the registered analysis result.
    • 一种图像处理装置,包括用于输入图像数据的数据输入部分,用于将图像数据识别为文档的文档识别部分,用于存储与由文档识别部分识别的文档相对应的文档数据的文档存储部分,以及存储的文档 公开了用于管理存储在文档存储部分中的文档数据的管理部分。 图像处理装置具有:文件分析部,被配置为分析输入图像数据;文本写入部,被配置为从文档分析部获取分析结果,并将分析结果写入文本格式;配置为将分析结果相关联的部分 存储在文档存储部分中的文档数据并且与文档数据对应地登记分析结果,以及被配置为通过参考所登记的分析结果来搜索目标文档的部分。
    • 106. 发明申请
    • Identifying Matching Canonical Documents in Response to a Visual Query
    • 识别匹配的规范文件以响应可视化查询
    • US20110129153A1
    • 2011-06-02
    • US12852189
    • 2010-08-06
    • David PetrouAshok C. PopatMatthew R. Casey
    • David PetrouAshok C. PopatMatthew R. Casey
    • G06K9/18
    • G06F17/30244G06F17/30864G06K9/00483G06K9/036G06K9/72
    • A server system receives a visual query from a client system. The visual query is an image containing text such as a picture of a document. At the receiving server or another server, optical character recognition (OCR) is performed on the visual query to produce text recognition data representing textual characters. Each character in a contiguous region of the visual query is individually scored according to its quality. The quality score of a respective character is influenced by the quality scores of neighboring or nearby characters. Using the scores, one or more high quality strings of characters are identified. Each high quality string has a plurality of high quality characters. A canonical document containing the one or more high quality textual strings is retrieved. At least a portion of the canonical document is sent to the client system.
    • 服务器系统从客户端系统接收可视化查询。 视觉查询是包含文本的图像的图像,例如文档的图片。 在接收服务器或其他服务器上,对视觉查询执行光学字符识别(OCR),以产生表示文本字符的文本识别数据。 视觉查询的连续区域中的每个字符根据其质量单独评分。 相应角色的质量得分受邻近或附近角色质量得分的影响。 使用分数,识别出一个或多个高质量的字符串。 每个高质量的字符串都有多个高质量字符。 检索包含一个或多个高质量文本字符串的规范文档。 规范文件的至少一部分被发送到客户端系统。
    • 107. 发明授权
    • Scalable indexing for layout based document retrieval and ranking
    • 可扩展索引,用于基于布局的文档检索和排名
    • US07953679B2
    • 2011-05-31
    • US12556098
    • 2009-09-09
    • Boris ChidlovskiiLoïc M. Lecerf
    • Boris ChidlovskiiLoïc M. Lecerf
    • G06F15/18G06E1/00G06E3/00G06G7/00
    • G06F17/30247G06F17/3069G06K9/00463G06K9/00483
    • A computer-based method and a system for indexing, querying, and ranking documents based on layout are provided. The method includes providing a plurality of documents to computer memory, extracting layout blocks from the provided documents, clustering the layout blocks into a plurality of layout block clusters, computing a representative block for each of the layout block clusters, generating a document index for each provided document based on the layout blocks of the document and the computed representatives blocks, clustering the created document indexes into a plurality of document index clusters, and generating a representative cluster index for each of the document index clusters. The indexes generated, together with the representative blocks and document index clusters, can be stored and used for retrieval of documents responsive to a layout query.
    • 提供了基于计算机的方法和用于基于布局对文档进行索引,查询和排序的系统。 该方法包括向计算机存储器提供多个文档,从提供的文档中提取布局块,将布局块聚类成多个布局块集群,为每个布局块集群计算代表块,为每个布局块集群生成文档索引 基于文档的布局块和计算的代表块提供的文档,将所创建的文档索引聚类成多个文档索引集群,以及为每个文档索引集群生成代表性的聚类索引。 生成的索引以及代表性的块和文档索引簇可以被存储并用于响应于布局查询的文档的检索。
    • 108. 发明申请
    • SCALABLE INDEXING FOR LAYOUT BASED DOCUMENT RETRIEVAL AND RANKING
    • 基于布局文件检索和排名的可扩展索引
    • US20110022599A1
    • 2011-01-27
    • US12556098
    • 2009-09-09
    • Boris ChidlovskiiLoïc M. Lecerf
    • Boris ChidlovskiiLoïc M. Lecerf
    • G06F17/30
    • G06F17/30247G06F17/3069G06K9/00463G06K9/00483
    • A computer-based method and a system for indexing, querying, and ranking documents based on layout are provided. The method includes providing a plurality of documents to computer memory, extracting layout blocks from the provided documents, clustering the layout blocks into a plurality of layout block clusters, computing a representative block for each of the layout block clusters, generating a document index for each provided document based on the layout blocks of the document and the computed representatives blocks, clustering the created document indexes into a plurality of document index clusters, and generating a representative cluster index for each of the document index clusters. The indexes generated, together with the representative blocks and document index clusters, can be stored and used for retrieval of documents responsive to a layout query.
    • 提供了基于计算机的方法和用于基于布局对文档进行索引,查询和排序的系统。 该方法包括向计算机存储器提供多个文档,从提供的文档中提取布局块,将布局块聚类成多个布局块集群,为每个布局块集群计算代表块,为每个布局块集群生成文档索引 基于文档的布局块和计算的代表块提供的文档,将所创建的文档索引聚类成多个文档索引集群,以及为每个文档索引集群生成代表性的聚类索引。 生成的索引以及代表性的块和文档索引簇可以被存储并用于响应于布局查询的文档的检索。
    • 109. 发明授权
    • System and methods for data indexing and processing
    • 数据索引和处理的系统和方法
    • US07860844B2
    • 2010-12-28
    • US11487021
    • 2006-07-14
    • Michael John EbaughMatthew Joseph Morvant
    • Michael John EbaughMatthew Joseph Morvant
    • G06F7/00
    • G06F17/30616G06F17/30321G06F17/30613G06F17/30657G06K9/00463G06K9/00483G06Q10/10G06Q30/04Y02A90/26
    • Systems and methods are disclosed that allow for indexing, processing, or both of information from physical media or electronic media, which may be received from a plurality of sources. In embodiments, a document file may be matched using pattern matching methods and may include comparisons with a comparison reference database to improve or accelerate the indexing process. In embodiments, information may be presented to a user as potential matches thereby improving manual indexing processes. In embodiments, one or more additional actions may occur as part of the processing, including without limitation, association additional data with a document file, making observations from the document file, notifying individuals, creating composite messages, and billing events. In an embodiment, data from a document file may be associated with a key word, key phrase, or word frequency value that enables adaptive learning so that unindexed data may be automatically indexed based on user interaction history.
    • 公开了允许从物理介质或电子介质进行索引,处理或两者的系统和方法,其可以从多个源接收。 在实施例中,可以使用模式匹配方法来匹配文档文件,并且可以包括与比较参考数据库的比较以改善或加速索引过程。 在实施例中,可以将信息作为潜在的匹配呈现给用户,从而改进手动索引过程。 在实施例中,一个或多个附加动作可以作为处理的一部分发生,包括但不限于将附加数据与文档文件相关联,从文档文件进行观察,通知个人,创建组合消息和计费事件。 在一个实施例中,来自文档文件的数据可以与启用自适应学习的关键字,密钥短语或词频值相关联,使得可以基于用户交互历史自动索引非索引数据。