会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 97. 发明申请
    • SCALABLE WEB DATA EXTRACTION
    • 可扩展的WEB数据提取
    • WO2016090625A1
    • 2016-06-16
    • PCT/CN2014/093670
    • 2014-12-12
    • HEWLETT-PACKARD DEVELOPMENT COMPANY, L.P.YU, Xiao-FengXIE, Jun-Qing
    • YU, Xiao-FengXIE, Jun-Qing
    • G06F17/30
    • G06N7/005G06F17/18G06F17/30563G06F17/30604G06F17/30705G06F17/30864G06N5/04G06N99/005
    • Example embodiments relate to scalable web data extraction. In example embodiments, a joint potential function is defined for data record segments of web data extracted from a web page, where the joint potential function models data record segmentation of the web data and dependencies between pairs of data segments in the data record segments. At this stage, a principal record segment and several related record segments are identified from the data record segments, where each of the plurality of related record segments is associated with the principal record segment. A related attribute is determined for each related record segment. Next, the joint potential function is applied to the principal record segment and each corresponding related segment to determine a relationship label that describes a data relationship between the principal record segment and the corresponding related segment.
    • 示例实施例涉及可伸缩网页数据提取。 在示例实施例中,为从网页提取的网络数据的数据记录段定义联合潜在函数,其中联合潜在函数模拟数据记录网络数据的分段和数据记录段中的数据段对之间的依赖关系。 在该阶段,从数据记录段识别主记录段和若干相关记录段,其中多个相关记录段中的每一个与主记录段相关联。 确定每个相关记录段的相关属性。 接下来,将联合潜在函数应用于主记录段和每个对应的相关段以确定描述主记录段和相应相关段之间的数据关系的关系标签。