专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明申请

US20100318533A1 ENRICHED DOCUMENT REPRESENTATIONS USING AGGREGATED ANCHOR TEXT 审中-公开
标题翻译：使用聚集的锚固文本增强文档表示
公开(公告)号：US20100318533A1
公开(公告)日：2010-12-16
申请号：US12482377
申请日：2009-06-10
申请人： Jasmine Novak , Donald Metzler , Hang Cui , Srihari Reddy , Emre Velipasaoglu
发明人： Jasmine Novak , Donald Metzler , Hang Cui , Srihari Reddy , Emre Velipasaoglu
IPC分类号： G06F17/00 , G06F17/30
CPC分类号： G06F16/958
摘要： A system and method for aggregating anchor text over the web graph and using the aggregated anchor text to enrich document representations. For a target page, its internal inlinks, which point to the target page and are within the site containing the target page, are identified first. Then external anchors that point to the internal inlinks from pages outside of the site are identified. Anchor text of the external anchors are collected, weighted, stored, and used to enrich document presentations. The method not only reduces the number of pages with no anchor text, but also adds lines of anchor text to URLs.
摘要翻译：一种用于在网络图上聚合锚文本并使用聚合锚文本来丰富文档表示的系统和方法。对于目标页面，首先标识其指向目标页面并且在包含目标页面的站点内的内部链接。然后识别指向站点外部的内部链接的外部锚点。收集，加权，存储和使用外部锚点的锚文本来丰富文档演示。该方法不仅减少了没有锚文本的页面数，而且还向URL添加了一些锚文本。

2. 发明申请

US20110035345A1 AUTOMATIC CLASSIFICATION OF SEGMENTED PORTIONS OF WEB PAGES 有权
标题翻译：网页分类部分的自动分类
公开(公告)号：US20110035345A1
公开(公告)日：2011-02-10
申请号：US12538776
申请日：2009-08-10
申请人： Lei Duan , Fan Li , Srinivas Vadrevu , Emre Velipasaoglu , Swapnil Hajela , Deepayan Chakrabarti
发明人： Lei Duan , Fan Li , Srinivas Vadrevu , Emre Velipasaoglu , Swapnil Hajela , Deepayan Chakrabarti
IPC分类号： G06F15/18 , G06F7/06 , G06F17/30
CPC分类号： G06F17/30598 , G06F15/18 , G06F17/30873 , G06K9/6256 , G06N5/04 , G06N99/005 , G06Q10/10
摘要： Exemplary methods and apparatuses are provided which may be used for classifying and indexing segmented portions of web pages and providing related information for use in information extraction and/or information retrieval systems.
摘要翻译：提供了可用于对网页的分段部分进行分类和索引并提供用于信息提取和/或信息检索系统的相关信息的示例性方法和装置。

3. 发明授权

US08255793B2 Automatic visual segmentation of webpages 有权
标题翻译：网页自动视觉分割
公开(公告)号：US08255793B2
公开(公告)日：2012-08-28
申请号：US11971160
申请日：2008-01-08
申请人： Deepayan Chakrabarti , Manav Ratan Mital , Swapnil Hajela , Emre Velipasaoglu
发明人： Deepayan Chakrabarti , Manav Ratan Mital , Swapnil Hajela , Emre Velipasaoglu
IPC分类号： G06N3/00
CPC分类号： G06F17/2229 , G06F17/2247 , G06F17/30707 , G06F17/30864
摘要： To provide valuable information regarding a webpage, the webpage must be divided into distinct semantically coherent segments for analysis. A set of heuristics allow a segmentation algorithm to identify an optimal number of segments for a given webpage or any portion thereof more accurately. A first heuristic estimates the optimal number of segments for any given webpage or portion thereof. A second heuristic coalesces segments where the number of segments identified far exceeds the optimal number recommended. A third heuristic coalesces segments corresponding to a portion of a webpage with much unused whitespace and little content. A fourth heuristic coalesces segments of nodes that have a recommended number of segments below a certain threshold into segments of other nodes. A fifth heuristic recursively analyzes and splits segments that correspond to webpage portions surpassing a certain threshold portion size.
摘要翻译：为了提供关于网页的有价值的信息，网页必须被划分成不同的语义上相关的分段用于分析。一组启发式允许分割算法更精确地识别给定网页或其任何部分的最佳段数。第一启发式估计任何给定网页或其部分的最佳段数。第二个启发式聚合段，其中识别的段数远远超过推荐的最优数。第三个启发式聚合对应于网页的一部分的段，其中使用了许多未使用的空白和少量的内容。第四个启发式将具有低于某个阈值的推荐数量的节点段合并成其他节点的段。第五个启发式递归地分析和分割对应于超过某个阈值部分大小的网页部分的分段。

4. 发明申请

US20090177959A1 AUTOMATIC VISUAL SEGMENTATION OF WEBPAGES 有权
标题翻译：自动视觉分割
公开(公告)号：US20090177959A1
公开(公告)日：2009-07-09
申请号：US11971160
申请日：2008-01-08
申请人： DEEPAYAN CHAKRABARTI , Manav Ratan Mital , Swapnil Hajela , Emre Velipasaoglu
发明人： DEEPAYAN CHAKRABARTI , Manav Ratan Mital , Swapnil Hajela , Emre Velipasaoglu
IPC分类号： G06F17/21
CPC分类号： G06F17/2229 , G06F17/2247 , G06F17/30707 , G06F17/30864
摘要： To provide valuable information regarding a webpage, the webpage must be divided into distinct semantically coherent segments for analysis. A set of heuristics allow a segmentation algorithm to identify an optimal number of segments for a given webpage or any portion thereof more accurately. A first heuristic estimates the optimal number of segments for any given webpage or portion thereof. A second heuristic coalesces segments where the number of segments identified far exceeds the optimal number recommended. A third heuristic coalesces segments corresponding to a portion of a webpage with much unused whitespace and little content. A fourth heuristic coalesces segments of nodes that have a recommended number of segments below a certain threshold into segments of other nodes. A fifth heuristic recursively analyzes and splits segments that correspond to webpage portions surpassing a certain threshold portion size.
摘要翻译：为了提供关于网页的有价值的信息，网页必须被划分成不同的语义上相关的分段用于分析。一组启发式允许分割算法更精确地识别给定网页或其任何部分的最佳段数。第一启发式估计任何给定网页或其部分的最佳段数。第二个启发式聚合段，其中识别的段数远远超过推荐的最优数。第三个启发式聚合对应于网页的一部分的段，其中使用了许多未使用的空白和少量的内容。第四个启发式将具有低于某个阈值的推荐数量的节点段合并成其他节点的段。第五个启发式递归地分析和分割对应于超过某个阈值部分大小的网页部分的分段。

5. 发明申请

US20130110863A1 ASSISTED SEARCHING 有权
标题翻译：辅助搜索
公开(公告)号：US20130110863A1
公开(公告)日：2013-05-02
申请号：US13286204
申请日：2011-10-31
申请人： Larry Lai , Emre Velipasaoglu , David (Ciemo) Ciemiewicz
发明人： Larry Lai , Emre Velipasaoglu , David (Ciemo) Ciemiewicz
IPC分类号： G06F17/30
CPC分类号： G06F17/30867 , G06F17/30967
摘要： Example systems, methods, apparatuses, or articles of manufacture, etc. are disclosed in connection with assisted search results.
摘要翻译：关于辅助搜索结果公开了示例系统，方法，装置或制品等。

6. 发明申请

US20120191745A1 Synthesized Suggestions for Web-Search Queries 审中-公开
标题翻译：网络搜索查询的综合建议
公开(公告)号：US20120191745A1
公开(公告)日：2012-07-26
申请号：US13012795
申请日：2011-01-24
申请人： Emre Velipasaoglu , Alpa Jain , Umut Ozertem
发明人： Emre Velipasaoglu , Alpa Jain , Umut Ozertem
IPC分类号： G06F17/30
CPC分类号： G06F16/3322
摘要： Data-mining software receives a user query as an input and segments the user query into a number of units. The data-mining software then drops terms from a unit using a Conditional Random Field (CRF) model that combines a number of features. At least one of the features is derived from query logs and at least one of the features is derived from web documents. The data-mining software then generates one or more candidate queries by adding terms to the unit. The added terms result from a hybrid method that utilizes query sessions and a web corpus. The data-mining software also scores each candidate query on well-formedness of the candidate query, utility, and relevance to the user query. Then the data-mining software stores the scored candidate queries in a database for subsequent display in a graphical user interface for a search engine.
摘要翻译：数据挖掘软件接收用户查询作为输入，并将用户查询分段成多个单位。然后，数据挖掘软件使用组合许多功能的条件随机场（CRF）模型从单位中删除术语。至少有一个功能来源于查询日志，至少有一个功能是从Web文档派生的。然后，数据挖掘软件通过向单元添加术语来生成一个或多个候选查询。添加的术语来自使用查询会话和Web语料库的混合方法。数据挖掘软件还对候选查询的良好形式，实用程序和与用户查询的相关性进行了分类。然后，数据挖掘软件将得分的候选查询存储在数据库中，以便随后在用于搜索引擎的图形用户界面中显示。

7. 发明申请

US20110035374A1 SEGMENT SENSITIVE QUERY MATCHING OF DOCUMENTS 有权
标题翻译：部分敏感查询文件匹配
公开(公告)号：US20110035374A1
公开(公告)日：2011-02-10
申请号：US12538711
申请日：2009-08-10
申请人： Srinivas Vadrevu , Emre Velipasaoglu
发明人： Srinivas Vadrevu , Emre Velipasaoglu
IPC分类号： G06F17/30
CPC分类号： G06F17/30864
摘要： Exemplary methods and apparatuses are provided which may be used to provide or otherwise support segment sensitive query matching based on segmented portions of web pages and/or providing related information for use in information extraction and/or information retrieval systems.
摘要翻译：提供了可用于提供或以其他方式支持基于网页的分段部分的分段敏感查询匹配和/或提供用于信息提取和/或信息检索系统中的相关信息的示例性方法和装置。

8. 发明授权

US09465872B2 Segment sensitive query matching 有权
标题翻译：分段敏感查询匹配
公开(公告)号：US09465872B2
公开(公告)日：2016-10-11
申请号：US12538711
申请日：2009-08-10
申请人： Srinivas Vadrevu , Emre Velipasaoglu
发明人： Srinivas Vadrevu , Emre Velipasaoglu
IPC分类号： G06F17/30
CPC分类号： G06F17/30864
摘要： Exemplary techniques are provided which may be implemented using various methods, apparatuses, and/or articles of manufacture to provide or otherwise support segment sensitive query matching based on segmented portions of web pages and/or providing related information for use in information extraction and/or information retrieval systems. In certain example implementations techniques may be provided for determining whether a query match exists between a document and obtained query terms based, at least in part, on labeled portion information associated with a plurality of segmented portions of a document.
摘要翻译：提供了可以使用各种方法，装置和/或制品制造来实现的示例性技术，以提供或以其他方式支持基于网页的分段部分的段敏感查询匹配和/或提供用于信息提取和/或信息检索系统。在某些示例实现中，可以提供技术来至少部分地基于与文档的多个分段部分相关联的标记部分信息来确定文档和获得的查询项之间是否存在查询匹配。

9. 发明授权

US08983996B2 Assisted searching 有权
标题翻译：辅助搜索
公开(公告)号：US08983996B2
公开(公告)日：2015-03-17
申请号：US13286204
申请日：2011-10-31
申请人： Larry Lai , Emre Velipasaoglu , David (Ciemo) Ciemiewicz
发明人： Larry Lai , Emre Velipasaoglu , David (Ciemo) Ciemiewicz
IPC分类号： G06F17/30
CPC分类号： G06F17/30867 , G06F17/30967
摘要： Example systems, methods, apparatuses, or articles of manufacture, etc. are disclosed in connection with assisted search results.
摘要翻译：关于辅助搜索结果公开了示例系统，方法，装置或制品等。

10. 发明授权

US08849725B2 Automatic classification of segmented portions of web pages 有权
标题翻译：对网页的分段部分进行自动分类
公开(公告)号：US08849725B2
公开(公告)日：2014-09-30
申请号：US12538776
申请日：2009-08-10
申请人： Lei Duan , Fan Li , Srinivas Vadrevu , Emre Velipasaoglu , Swapnil Hajela , Deepayan Chakrabarti
发明人： Lei Duan , Fan Li , Srinivas Vadrevu , Emre Velipasaoglu , Swapnil Hajela , Deepayan Chakrabarti
IPC分类号： G06F15/18 , G06F17/30 , G06K9/62 , G06N99/00 , G06Q10/10
CPC分类号： G06F17/30598 , G06F15/18 , G06F17/30873 , G06K9/6256 , G06N5/04 , G06N99/005 , G06Q10/10
摘要： Exemplary methods and apparatuses are provided which may be used for classifying and indexing segmented portions of web pages and providing related information for use in information extraction and/or information retrieval systems.
摘要翻译：提供了可用于对网页的分段部分进行分类和索引并提供用于信息提取和/或信息检索系统的相关信息的示例性方法和装置。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式