会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Path-based ranking of unvisited web pages
    • 基于路径的未访问网页排名
    • US07424484B2
    • 2008-09-09
    • US10358941
    • 2003-02-05
    • Xiaochuan MaYue PanHui Su
    • Xiaochuan MaYue PanHui Su
    • G06F17/30
    • G06F17/30864Y10S707/99933Y10S707/99936Y10S707/99943
    • Path-based ranking of unvisited Web pages for WWW crawling is provided, via identifying all the paths beginning with a “seed” URL and leading to visited relevant web pages as “good-path set”, and for each unvisited web page, identifying the paths beginning from the “seed” URL leading to it as “partial-path set”; classifying all the visited web pages and labeling each web Page with the labels of a class or classes it belongs to; training a statistic model for generalizing the common patterns among all ones of “good-path set”; and evaluating the “partial-path set” with the statistic model and ranking the unvisited web pages with the evaluation results.
    • 通过识别从“种子”URL开始的所有路径,并将访问过的相关网页导向为“良好路径集”,并为每个未访问的网页标识,以提供用于WWW抓取的未访问网页的基于路径的排名 从“种子”URL开始的路径导致它作为“部分路径集”; 对所有访问的网页进行分类,并使用所属类别的标签将每个网页标注; 培养统一模式,推广“好路径”的共同模式; 并用统计模型评估“部分路径集”,并通过评估结果对未访问的网页进行排序。
    • 2. 发明申请
    • PATH-BASED RANKING OF UNVISITED WEB PAGES
    • 基于路径的无缝网页排序
    • US20080313176A1
    • 2008-12-18
    • US12183751
    • 2008-07-31
    • Xiaochuan MaYue PanHui Su
    • Xiaochuan MaYue PanHui Su
    • G06F7/06G06F17/30
    • G06F17/30864Y10S707/99933Y10S707/99936Y10S707/99943
    • Path-based ranking of unvisited Web pages for WWW crawling is provided, via identifying all the paths beginning with a “seed” URL and leading to visited relevant web pages as “good-path set”, and for each unvisited web page, identifying the paths beginning from the “seed” URL leading to it as “partial-path set”; classifying all the visited web pages and labeling each web Page with the labels of a class or classes it belongs to; training a statistic model for generalizing the common patterns among all ones of “good-path set”; and evaluating the “partial-path set” with the statistic model and ranking the unvisited web pages with the evaluation results.
    • 通过识别从“种子”URL开始的所有路径,并将访问过的相关网页导向为“良好路径集”,并为每个未访问的网页标识,以提供用于WWW抓取的未访问网页的基于路径的排名 从“种子”URL开始的路径导致它作为“部分路径集”; 对所有访问的网页进行分类,并使用所属类别的标签将每个网页标注; 培养统一模式,推广“好路径”的共同模式; 并用统计模型评估“部分路径集”,并通过评估结果对未访问的网页进行排序。
    • 3. 发明授权
    • Path-based ranking of unvisited web pages
    • 基于路径的未访问网页排名
    • US07979444B2
    • 2011-07-12
    • US12183751
    • 2008-07-31
    • Xiaochuan MaYue PanHui Su
    • Xiaochuan MaYue PanHui Su
    • G06F17/30
    • G06F17/30864Y10S707/99933Y10S707/99936Y10S707/99943
    • Path-based ranking of unvisited Web pages for WWW crawling is provided, via identifying all the paths beginning with a “seed” URL and leading to visited relevant web pages as “good-path set”, and for each unvisited web page, identifying the paths beginning from the “seed” URL leading to it as “partial-path set”; classifying all the visited web pages and labeling each web Page with the labels of a class or classes it belongs to; training a statistic model for generalizing the common patterns among all ones of “good-path set”; and evaluating the “partial-path set” with the statistic model and ranking the unvisited web pages with the evaluation results.
    • 通过识别从“种子”URL开始的所有路径,并将访问过的相关网页导向为“良好路径集”,并为每个未访问的网页标识,以提供用于WWW抓取的未访问网页的基于路径的排名 从“种子”URL开始的路径导致它作为“部分路径集”; 对所有访问的网页进行分类,并使用所属类别的标签将每个网页标注; 培养统一模式,推广“好路径”的共同模式; 并用统计模型评估“部分路径集”,并通过评估结果对未访问的网页进行排序。