会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明公开
    • DETECTING AND EXTRACTING IMAGE DOCUMENT COMPONENTS TO CREATE FLOW DOCUMENT
    • ERKENNUNG UND EXTRAKTION VON BILDDUMUMENTKOMPONENTEN ZUR ERZEUGUNG EINES FLUSSDOKUMENTS
    • EP3117369A1
    • 2017-01-18
    • EP15714992.3
    • 2015-03-09
    • Microsoft Technology Licensing, LLC
    • SESUM, MilanVUJIC, Ivan
    • G06K9/00
    • One or more components of an image document may be detected and extracted in order to create a flow document from the image document. Components of an image document may include text, one or more paths, and one or more images. The text may be detected using optical character recognition (OCR) and the image document may be binarized. The detected text may be extracted from the binarized image document to enable detection of the paths, which may then be extracted from the binarized image document to enable detection of the images. In some examples, the images, similar to the text and paths, may be extracted from the binarized image document. The extracted text, paths, and/or images may be stored in a data store, and may be retrieved in order to create a flow document that may provide better adaption to a variety of reading experiences and provide editable documents.
    • 可以检测和提取图像文档的一个或多个组件,以从图像文档创建流文档。 图像文档的组件可以包括文本,一个或多个路径和一个或多个图像。 可以使用光学字符识别(OCR)检测文本,并且图像文档可以被二进制化。 可以从二值化图像文档中提取检测到的文本,以使得能够检测路径,然后可以从二值化图像文档中提取路径,以便能够检测图像。 在一些示例中,类似于文本和路径的图像可以从二值化的图像文档中提取出来。 提取的文本,路径和/或图像可以存储在数据存储中,并且可以被检索以便创建可以提供更好地适应于各种阅读体验并提供可编辑文档的流文档。
    • 5. 发明公开
    • DETECTION AND RECONSTRUCTION OF RIGHT-TO-LEFT TEXT DIRECTION, LIGATURES AND DIACRITICS IN A FIXED FORMAT DOCUMENT
    • 检测和右的重构左行驶和变音符号连写中的文本文档,包含给定的格式
    • EP2972991A2
    • 2016-01-20
    • EP14713643.6
    • 2014-02-28
    • Microsoft Technology Licensing, LLC
    • SESUM, MilanZARIC, DrazenANTIC, MarijaRASKOVIC, Milos
    • G06F17/22G06F17/27
    • G06F17/275G06F17/2223G06F17/2247
    • Detection of right-to-left text direction, left-to-right text direction, ligatures and diacritics in fixed format documents for reconstruction of fixed format documents into flow format documents is provided. Each text run of a fixed format document is analyzed for directionality. If text runs contain ligatures, the ligatures are mapped to corresponding characters for proper reading order of the ligatures in context with other characters comprising a text run in which the ligatures are situated or neighboring the ligature. Each text run is collected based on determined text directionality for reconstruction in a flow format document. Proper text directionality for columns of text is determined in the same manner as proper text directionality for text runs in paragraphs of text. If diacritics are present in association with one or more characters or glyphs, a determination may be made as to a carrier character or glyph associated with each diacritic.
    • 从右到左的文本方向,左到右textDirection,连字和附加符号为的固定格式文档转换成流格式文件重建固定格式文件提供的检测。 固定格式文档的每个文本运行的分析方向性。 如果文本串包含连字,连字被映射到对应的字符与其他字符包括文本串,其中连字位于或邻接的结扎上下文连字的正确阅读顺序。 每个文本串是基于确定性开采文本方向性用于流格式文件中重建收集。 文本列正确的文本方向性确定性开采以同样的方式作为文字文本正确的方向性文本段落运行。 如果附加符号存在于与一个或多个字符或字形关联,确定可以做出关于与每个音调符号相关联的载波字符或字形。