专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US09495453B2 Resource download policies based on user browsing statistics 有权
标题翻译：基于用户浏览统计的资源下载策略
公开(公告)号：US09495453B2
公开(公告)日：2016-11-15
申请号：US13114643
申请日：2011-05-24
申请人： Rui Cai , Xiaodong Fan , Lei Zhang
发明人： Rui Cai , Xiaodong Fan , Lei Zhang
IPC分类号： G06F7/00 , G06F17/30 , G06F17/00
CPC分类号： G06F17/30864 , G06F17/30705 , G06F17/30861 , G06F17/3089 , G06F17/30899
摘要： Web crawling polices are generated based on user web browsing statistics. User browsing statistics are aggregated at the granularity of resource identifier patterns (such as URL patterns) that denote groups of resources within a particular domain or website that share syntax at a certain level of granularity. The web crawl policies rank the resource identifier patterns according to their associated aggregated user browsing statistics. A crawl ordering defined by the web crawl polices is used to download and discover new resources within a domain or website.
摘要翻译：基于用户网络浏览统计信息生成Web爬行策略。用户浏览统计信息以资源标识符模式（例如URL模式）的粒度进行聚合，这些资源标识符模式表示特定域或网站中以特定粒度级别共享语法的资源组。网络爬网策略根据其关联的聚合用户浏览统计信息对资源标识符模式进行排序。由网络抓取策略定义的爬网排序用于下载和发现域或网站中的新资源。

2. 发明申请

US20120303606A1 Resource Download Policies Based On User Browsing Statistics 有权
标题翻译：基于用户浏览统计的资源下载策略
公开(公告)号：US20120303606A1
公开(公告)日：2012-11-29
申请号：US13114643
申请日：2011-05-24
申请人： Rui Cai , Xiaodong Fan , Lei Zhang
发明人： Rui Cai , Xiaodong Fan , Lei Zhang
IPC分类号： G06F17/30 , G06F15/173
CPC分类号： G06F17/30864 , G06F17/30705 , G06F17/30861 , G06F17/3089 , G06F17/30899
摘要： Web crawling polices are generated based on user web browsing statistics. User browsing statistics are aggregated at the granularity of resource identifier patterns (such as URL patterns) that denote groups of resources within a particular domain or website that share syntax at a certain level of granularity. The web crawl policies rank the resource identifier patterns according to their associated aggregated user browsing statistics. A crawl ordering defined by the web crawl polices is used to download and discover new resources within a domain or website.
摘要翻译：基于用户网络浏览统计信息生成Web爬行策略。用户浏览统计信息以资源标识符模式（例如URL模式）的粒度进行聚合，这些资源标识符模式表示特定域或网站中以特定粒度级别共享语法的资源组。网络爬网策略根据其关联的聚合用户浏览统计信息对资源标识符模式进行排序。由网络抓取策略定义的爬网排序用于下载和发现域或网站中的新资源。

3. 发明申请

US20110307436A1 PATTERN TREE-BASED RULE LEARNING 有权
标题翻译：基于树的基于规则学习
公开(公告)号：US20110307436A1
公开(公告)日：2011-12-15
申请号：US12813171
申请日：2010-06-10
申请人： Rui Cai , Lei Zhang , Jiang-Ming Yang , Yan Ke , Xiaodong Fan , Wei-Ying Ma
发明人： Rui Cai , Lei Zhang , Jiang-Ming Yang , Yan Ke , Xiaodong Fan , Wei-Ying Ma
IPC分类号： G06N5/02 , G06F17/30
CPC分类号： G06F17/30864 , G06F17/30625
摘要： A pattern tree is constructed based on a plurality of key-value pairs representing portions of a data set. In some implementations, the pattern tree may be used for learning one or more rules for interacting with a source of the data set.
摘要翻译：基于表示数据集的部分的多个键值对来构造模式树。在一些实现中，模式树可以用于学习用于与数据集的源交互的一个或多个规则。

4. 发明授权

US08429110B2 Pattern tree-based rule learning 有权
标题翻译：基于树型的规则学习
公开(公告)号：US08429110B2
公开(公告)日：2013-04-23
申请号：US12813171
申请日：2010-06-10
申请人： Rui Cai , Lei Zhang , Jiang-Ming Yang , Yan Ke , Xiaodong Fan , Wei-Ying Ma
发明人： Rui Cai , Lei Zhang , Jiang-Ming Yang , Yan Ke , Xiaodong Fan , Wei-Ying Ma
IPC分类号： G06F17/00 , G06N5/00
CPC分类号： G06F17/30864 , G06F17/30625
摘要： A pattern tree is constructed based on a plurality of key-value pairs representing portions of a data set. In some implementations, the pattern tree may be used for learning one or more rules for interacting with a source of the data set.
摘要翻译：基于表示数据集的部分的多个键值对来构造模式树。在一些实现中，模式树可以用于学习用于与数据集的源交互的一个或多个规则。

5. 发明授权

US08954425B2 Snippet extraction and ranking 有权
标题翻译：片段提取和排名
公开(公告)号：US08954425B2
公开(公告)日：2015-02-10
申请号：US12796345
申请日：2010-06-08
申请人： Rong Xiao , Qiang Hao , Changhu Wang , Rui Cai , Lei Zhang
发明人： Rong Xiao , Qiang Hao , Changhu Wang , Rui Cai , Lei Zhang
IPC分类号： G06F17/30
CPC分类号： G06F17/30867 , G06F17/30241
摘要： Described herein is a technology that facilitates efficient automated mining of topic-related aspects of user-generated content based on automated analysis of the user-generated content. Locations are automatically learned based on dividing documents into document segments, and decomposing the segments into local topics and global topics. Techniques are described that facilitate automatically extracting snippets. These techniques include, for example, computer annotating travelogues with learned tags and images, performing topic learning to obtain an interest model, performing location matching based on the interest model, calculating geographic and semantic relevance scores, ranking snippets based on the geographic and semantic relevance scores, and searching snippets with a “location+context term” query.
摘要翻译：这里描述了一种技术，其有助于基于对用户生成的内容的自动化分析来有效地自动挖掘用户生成的内容的主题相关方面。根据将文档分割成文档段，自动学习位置，并将段分解为本地主题和全局主题。描述了便于自动提取代码段的技术。这些技术包括例如计算机注释具有学习标签和图像的旅行记录，执行主题学习以获得兴趣模型，基于兴趣模型执行位置匹配，计算地理和语义相关性分数，基于地理和语义相关性来排序片段分数和搜索带有“位置+上下文术语”查询的片段。

6. 发明授权

US08370119B2 Website design pattern modeling 有权
标题翻译：网站设计模式建模
公开(公告)号：US08370119B2
公开(公告)日：2013-02-05
申请号：US12389368
申请日：2009-02-19
申请人： Rui Cai , Jiang-Ming Yang , Lei Zhang , Wei-Ying Ma
发明人： Rui Cai , Jiang-Ming Yang , Lei Zhang , Wei-Ying Ma
IPC分类号： G06G7/48
CPC分类号： G06F17/218 , G06F8/75 , G06F17/27
摘要： Website design pattern modeling technique embodiments are presented that model a website's design patterns. This can be based on the website's layout elements, its URL tokens, or both. When based on both, the design patterns can be modeled separately using first the layout elements and then the URL tokens, or vice versa. Alternately, the modeling can be based on coupled layout and URL token patterns. In operation, the modeling involves first identifying layout elements and/or URL tokens found on at least some of the pages of the website. The website design patterns are then modeled based on the occurrences of the identified layout elements and/or URL tokens in pages of the website. In cases where a coupled modeling scheme is employed, a modeling technique that exploits the correlations between the layout elements and URL tokens is used.
摘要翻译：呈现网站设计模式建模技术实施例，模拟网站的设计模式。这可以基于网站的布局元素，其网址令牌或两者兼而有之。当基于这两者时，可以使用第一个布局元素和URL令牌来单独建模设计模式，反之亦然。或者，建模可以基于耦合的布局和URL令牌模式。在操作中，建模涉及首先识别在网站的至少一些页面上发现的布局元素和/或URL令牌。然后基于网站页面中识别的布局元素和/或URL令牌的出现来对网站设计模式进行建模。在使用耦合建模方案的情况下，使用利用布局元素和URL令牌之间的相关性的建模技术。

7. 发明申请

US20110302162A1 Snippet Extraction and Ranking 有权
标题翻译：代码段提取和排名
公开(公告)号：US20110302162A1
公开(公告)日：2011-12-08
申请号：US12796345
申请日：2010-06-08
申请人： Rong Xiao , Qiang Hao , Changhu Wang , Rui Cai , Lei Zhang
发明人： Rong Xiao , Qiang Hao , Changhu Wang , Rui Cai , Lei Zhang
IPC分类号： G06F17/30
CPC分类号： G06F17/30867 , G06F17/30241
摘要： Described herein is a technology that facilitates efficient automated mining of topic-related aspects of user-generated content based on automated analysis of the user-generated content. Locations are automatically learned based on dividing documents into document segments, and decomposing the segments into local topics and global topics. Techniques are described that facilitate automatically extracting snippets. These techniques include, for example, computer annotating travelogues with learned tags and images, performing topic learning to obtain an interest model, performing location matching based on the interest model, calculating geographic and semantic relevance scores, ranking snippets based on the geographic and semantic relevance scores, and searching snippets with a “location+context term” query.
摘要翻译：这里描述了一种技术，其有助于基于对用户生成的内容的自动化分析来有效地自动挖掘用户生成的内容的主题相关方面。根据将文档分割成文档段，自动学习位置，并将段分解为本地主题和全局主题。描述了便于自动提取代码段的技术。这些技术包括例如计算机注释具有学习标签和图像的旅行记录，执行主题学习以获得兴趣模型，基于兴趣模型执行位置匹配，计算地理和语义相关性分数，基于地理和语义相关性来排序片段分数和搜索带有“位置+上下文术语”查询的片段。

8. 发明申请

US20100211927A1 WEBSITE DESIGN PATTERN MODELING 有权
标题翻译：网站设计图案建模
公开(公告)号：US20100211927A1
公开(公告)日：2010-08-19
申请号：US12389368
申请日：2009-02-19
申请人： Rui Cai , Jiang-Ming Yang , Lei Zhang , Wei-Ying Ma
发明人： Rui Cai , Jiang-Ming Yang , Lei Zhang , Wei-Ying Ma
IPC分类号： G06F9/44
CPC分类号： G06F17/218 , G06F8/75 , G06F17/27
摘要： Website design pattern modeling technique embodiments are presented that model a website's design patterns. This can be based on the website's layout elements, its URL tokens, or both. When based on both, the design patterns can be modeled separately using first the layout elements and then the URL tokens, or vice versa. Alternately, the modeling can be based on coupled layout and URL token patterns. In operation, the modeling involves first identifying layout elements and/or URL tokens found on at least some of the pages of the website. The website design patterns are then modeled based on the occurrences of the identified layout elements and/or URL tokens in pages of the website. In cases where a coupled modeling scheme is employed, a modeling technique that exploits the correlations between the layout elements and URL tokens is used.
摘要翻译：呈现网站设计模式建模技术实施例，模拟网站的设计模式。这可以基于网站的布局元素，其网址令牌或两者兼而有之。当基于这两者时，可以使用第一个布局元素和URL令牌来分别设计设计模式，反之亦然。或者，建模可以基于耦合的布局和URL令牌模式。在操作中，建模涉及首先识别在网站的至少一些页面上发现的布局元素和/或URL令牌。然后基于网站页面中识别的布局元素和/或URL令牌的出现来对网站设计模式进行建模。在使用耦合建模方案的情况下，使用利用布局元素和URL令牌之间的相关性的建模技术。

9. 发明申请

US20100211533A1 EXTRACTING STRUCTURED DATA FROM WEB FORUMS 审中-公开
标题翻译：从网站提取结构化数据
公开(公告)号：US20100211533A1
公开(公告)日：2010-08-19
申请号：US12388517
申请日：2009-02-18
申请人： Jiangming Yang , Rui Cai , Lei Zhang , Wei-Ying Ma
发明人： Jiangming Yang , Rui Cai , Lei Zhang , Wei-Ying Ma
IPC分类号： G06F15/18 , G06N5/02
CPC分类号： G06N20/00 , G06F16/958
摘要： The web forum data extraction technique is designed for the structured data extraction of data on web forums using both page-level information and site-level knowledge. To do this, the technique finds the kinds of page objects a forum site has, which object a page belongs to, and how different page objects are connected with each other. This information can be obtained by re-constructing the sitemap of the target forum which is based on a Data Object Model of the target forum. The web forum data extraction technique collects three kinds of evidence for data extraction: 1) inner-page features which cover both semantic and layout information on an individual page; 2) inter-vertex features which describe linkage-related observations; and 3) inner-vertex features which characterize interrelationships among pages in one vertex. The technique employs Markov Logic Networks to combine the types of evidence statistically for inference and thereby can extract the desired structures.
摘要翻译：网络论坛数据提取技术是为了使用页面级信息和站点级知识，在Web论坛上的数据结构化数据提取。为此，该技术可以找到论坛网站所拥有的页面对象的种类，页面所属的对象以及不同的页面对象如何相互连接。该信息可以通过重新构建基于目标论坛的数据对象模型的目标论坛的站点地图来获得。网络论坛数据提取技术收集了三种数据提取证据：1）内页特征，涵盖单个页面上的语义和布局信息; 2）描述连锁相关观察的顶点间特征; 和3）表示一个顶点中的页面之间的相互关系的内顶点特征。该技术采用马可夫逻辑网络来统计证据的类型，从而推断出所需的结构。

10. 发明申请

US20090281906A1 Music Recommendation using Emotional Allocation Modeling 有权
标题翻译：使用情感分配建模的音乐推荐
公开(公告)号：US20090281906A1
公开(公告)日：2009-11-12
申请号：US12116855
申请日：2008-05-07
申请人： Rui Cai , Lei Zhang , Wei-Ying Ma
发明人： Rui Cai , Lei Zhang , Wei-Ying Ma
IPC分类号： G06Q30/00
CPC分类号： G06Q30/02 , G06Q10/06 , G06Q30/0601 , G06Q30/0603
摘要： An exemplary method includes defining a vocabulary for emotions; extracting descriptions for songs; generating distributions for the songs in an emotion space based at least in part on the vocabulary and the extracted descriptions; extracting salient words from a document; generating a distribution for the document in an emotion space based at least in part on the vocabulary and the extracted salient words; and matching the distribution for the document to one or more of the distributions for the songs. Various other exemplary methods, devices, systems, etc., are also disclosed.
摘要翻译：一种示例性方法包括定义情绪词汇; 提取歌曲的描述; 至少部分地基于词汇和所提取的描述来生成情感空间中的歌曲的分布; 从文档中提取突出的单词; 至少部分地基于词汇和提取的突出词语在情感空间中生成文档的分发; 并将文档的分发与歌曲的一个或多个分发相匹配。还公开了各种其它示例性方法，装置，系统等。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式