会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 82. 发明申请
    • Interactive System for Extracting Data from a Website
    • 从网站提取数据的互动系统
    • US20110191381A1
    • 2011-08-04
    • US12696061
    • 2010-01-29
    • Shuyi ZhengRuihua SongMatthew Robert ScottJi-Rong Wen
    • Shuyi ZhengRuihua SongMatthew Robert ScottJi-Rong Wen
    • G06F17/30
    • G06F16/00
    • Described is a technology for efficiently labeling a webpage. A wrapper tool labels records of a webpage at the record level. If an existing wrapper exists that is appropriate for labeling a record, the wrapper tool automatically labels that record. For unlabeled records, the tool provides a user interface to label those records, and updates the set of existing wrappers with a new wrapper that is generated based upon the labeling operation; the new wrapper is then applied to any unlabeled records if appropriate for those records. As a result, a user typically needs only to label a relatively few records, with the wrappers generated for those records automatically used to label the other unlabeled records of the webpage.
    • 描述了一种有效地标记网页的技术。 包装工具在记录级别上标记网页的记录。 如果存在适用于标记记录的现有包装器,则包装工具会自动标记该记录。 对于未标记的记录,该工具提供用户界面来标记这些记录,并使用基于标签操作生成的新包装器来更新现有包装器集合; 如果适用于这些记录,则将新的包装器应用于任何未标记的记录。 因此,用户通常仅需要标记相对较少的记录,为这些记录生成的包装器自动用于标记网页的其他未标记的记录。
    • 84. 发明授权
    • Vision-based document segmentation
    • 基于视觉的文档分割
    • US07613995B2
    • 2009-11-03
    • US11275488
    • 2006-01-09
    • Ji-Rong WenShipeng YuDeng CaiWei-Ying Ma
    • Ji-Rong WenShipeng YuDeng CaiWei-Ying Ma
    • G06F17/00
    • G06F17/30716G06F17/218G06F17/2247
    • Vision-based document segmentation identifies one or more portions of semantic content of a document. The one or more portions are identified by identifying a plurality of visual blocks in the document, and detecting one or more separators between the visual blocks of the plurality of visual blocks. A content structure for the document is constructed based at least in part on the plurality of visual blocks and the one or more separators, and the content structure identifies the one or more portions of semantic content of the document. The content structure obtained using the vision-based document segmentation can optionally be used during document retrieval.
    • 基于视觉的文档分割识别文档的语义内容的一个或多个部分。 通过识别文档中的多个可视块并且检测多个视觉块中的可视块之间的一个或多个分隔符来识别一个或多个部分。 至少部分地基于多个可视块和一个或多个分隔符来构建文档的内容结构,并且内容结构标识文档的语义内容的一个或多个部分。 使用基于视觉的文档分割获得的内容结构可以可选地在文档检索期间使用。
    • 86. 发明授权
    • Method and system for schema matching of web databases
    • Web数据库模式匹配的方法和系统
    • US07249135B2
    • 2007-07-24
    • US10846396
    • 2004-05-14
    • Wei-Ying MaJi-Rong Wen
    • Wei-Ying MaJi-Rong Wen
    • G06F17/30
    • G06F17/30731G06F17/30861Y10S707/99933Y10S707/99943
    • A method and system for identifying schemas of web databases is provided. A schema matching system generates a mapping between an interface schema and a result schema of a web database, which is used to represent the underlying database schema. The schema matching system also generates a mapping of the interface attributes and the result attributes of the web database to global attributes of a global schema whose semantics are known. Using these mappings, a search engine service can formulate queries using the global attributes, map those queries to the corresponding interface attributes, submit the query, and retrieve the values from the result attributes that correspond to the desired global attributes.
    • 提供了一种用于识别Web数据库模式的方法和系统。 模式匹配系统生成Web数据库的接口模式和结果模式之间的映射,用于表示底层数据库模式。 模式匹配系统还会将Web数据库的接口属性和结果属性的映射生成为语义已知的全局模式的全局属性。 使用这些映射,搜索引擎服务可以使用全局属性来制定查询,将这些查询映射到相应的接口属性,提交查询,并从对应于所需全局属性的结果属性中检索值。