    • 1. Granted invention patent
    • Method and apparatus for discriminating between documents in batch scanned document files
    • US06996276B2
    • 2006-02-07
    • US10840617
    • 2004-05-07
    • Ming Liu; Kevyn Collins-Thompson; Daryl Lawton
    • Ming Liu; Kevyn Collins-Thompson; Daryl Lawton
    • G06K9/68; G06K9/54; G06K9/60
    • G06K9/00442
    • Discriminating between documents scanned in a batch scanning process is achieved based on various analyses of the constituent document pages. The data provided by the various analyses are compared with each other to determine whether successive pages belong to the same document. Scanned documents result in a page sequence that is analyzed to extract one or more feature attributes for each page. The feature attributes are provided to a feature comparison process in order to assess the similarity of successive pages. If a sufficient likelihood of similarity is found, the compared pages are deemed to be from the same document; otherwise, they are deemed to be from different documents, indicating the existence of a document break. Based on the document breaks, separate scan files may be established. In this manner, the present invention eliminates the requirement of user intervention.
    • 2. Granted invention patent
    • Method and apparatus for discriminating between documents in batch scanned document files
    • US06735335B1
    • 2004-05-11
    • US09583049
    • 2000-05-30
    • Ming Liu; Kevyn Collins-Thompson; Daryl Lawton
    • Ming Liu; Kevyn Collins-Thompson; Daryl Lawton
    • G06K9/68
    • G06K9/00442
    • Discriminating between documents scanned in a batch scanning process is achieved based on various analyses of the constituent document pages. The data provided by the various analyses are compared with each other to determine whether successive pages belong to the same document. Scanned documents result in a page sequence. The page sequence is then analyzed to extract one or more feature attributes for each page. The feature attributes are provided to a feature comparison process in order to assess the similarity of successive pages. If a sufficient likelihood of similarity is found, then the compared pages are deemed to be from the same document; otherwise, they are deemed to be from different documents, indicating the existence of a document break. Through the display of the page sequence, a user may optionally modify the location of one or more document breaks. Based on the document breaks, separate scan files may be established. In this manner, the present invention eliminates the requirement of user intervention.
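The page-comparison idea in the two abstracts above can be illustrated with a minimal sketch. It is not the patented method: the abstracts do not say which feature attributes are extracted or how similarity is scored, so this example assumes each page is reduced to a set of keyword strings, uses Jaccard similarity as the comparison, and picks an arbitrary threshold of 0.3. The function names `jaccard`, `find_document_breaks`, and `split_documents` are hypothetical.

```python
from typing import List, Sequence, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two feature sets (1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def find_document_breaks(pages: Sequence[Set[str]],
                         threshold: float = 0.3) -> List[int]:
    """Return every index i at which page i is assumed to start a new document.

    Successive pages are compared pairwise; a similarity below the
    (assumed) threshold is treated as a document break.
    """
    breaks = []
    for i in range(1, len(pages)):
        if jaccard(pages[i - 1], pages[i]) < threshold:
            breaks.append(i)
    return breaks


def split_documents(pages: Sequence[Set[str]],
                    breaks: Sequence[int]) -> List[List[Set[str]]]:
    """Group the page sequence into documents at the given break indices."""
    if not pages:
        return []
    starts = [0] + list(breaks)
    ends = list(breaks) + [len(pages)]
    return [list(pages[s:e]) for s, e in zip(starts, ends)]


# Hypothetical feature sets for three scanned pages.
pages = [
    {"invoice", "acme", "total"},
    {"invoice", "acme", "page2"},
    {"memo", "staff", "holiday"},
]
breaks = find_document_breaks(pages)
documents = split_documents(pages, breaks)
print(breaks, len(documents))
```

Returning the break indices separately from the split mirrors the second abstract, where a user may adjust break locations before the separate scan files are created.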
    • 4. Granted invention patent
    • System and method for improved string matching under noisy channel conditions
    • US06687697B2
    • 2004-02-03
    • US09918791
    • 2001-07-30
    • Kevyn Collins-Thompson; Charles B. Schweizer
    • Kevyn Collins-Thompson; Charles B. Schweizer
    • G06F17/30
    • G06F17/2715; G06F17/30985; G06K9/03; G06K9/723; G06K2209/01; Y10S707/99933; Y10S707/99935; Y10S707/99936
    • Described is a system and method for improving string matching in a noisy channel environment. The invention provides a method for identifying string candidates and analyzing the probability that the string candidate matches a user-defined string. In one implementation, a find engine receives a query string, converts an image file into a textual file, and identifies each instance of the query string in the textual file. The find engine identifies candidates within the textual file that may match the query string. The find engine refers to a confusion table to help identify whether candidates that are near matches to the query string are actually matches to the query string but for a common recognition error. Candidates meeting a probability threshold are identified as matches to the query string. The invention further provides for analysis options including word heuristics, language models, and OCR confidences.
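The confusion-table lookup described in the abstract above can be sketched as follows. This is an illustration under stated assumptions, not the patented find engine: the table entries, their probabilities, the per-character success probability `P_CORRECT`, the whitespace tokenization, and the 0.01 threshold are all invented for the example, and the names `CONFUSIONS`, `match_probability`, and `find_matches` are hypothetical.

```python
from functools import lru_cache
from typing import Dict, List, Tuple

# Hypothetical confusion table:
# (intended text, OCR output) -> assumed probability of that recognition error.
CONFUSIONS: Dict[Tuple[str, str], float] = {
    ("m", "rn"): 0.05,
    ("l", "1"): 0.10,
    ("O", "0"): 0.10,
    ("e", "c"): 0.03,
}
P_CORRECT = 0.9  # assumed probability that one character is read correctly


def match_probability(query: str, candidate: str) -> float:
    """Best probability that `query` was recognized by OCR as `candidate`."""

    @lru_cache(maxsize=None)
    def best(i: int, j: int) -> float:
        # Both strings fully consumed: the alignment is complete.
        if i == len(query) and j == len(candidate):
            return 1.0
        p = 0.0
        # Option 1: the next character was recognized correctly.
        if i < len(query) and j < len(candidate) and query[i] == candidate[j]:
            p = max(p, P_CORRECT * best(i + 1, j + 1))
        # Option 2: a known confusion explains the next piece of text.
        for (intended, seen), prob in CONFUSIONS.items():
            if query.startswith(intended, i) and candidate.startswith(seen, j):
                p = max(p, prob * best(i + len(intended), j + len(seen)))
        return p

    return best(0, 0)


def find_matches(query: str, text: str,
                 threshold: float = 0.01) -> List[Tuple[str, float]]:
    """Return the words of `text` whose match probability meets the threshold."""
    hits = []
    for word in text.split():
        p = match_probability(query, word)
        if p >= threshold:
            hits.append((word, p))
    return hits


# "rnodem" is accepted via the m -> rn confusion; "model" is rejected.
print(find_matches("modem", "a rnodem or a model"))
```

The abstract also mentions word heuristics, language models, and OCR confidences as further analysis options; none of those are modeled here.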