会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • System and method for detecting a web page
    • 用于检测网页的系统和方法
    • US20080275901A1
    • 2008-11-06
    • US11800236
    • 2007-05-04
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • G06F17/30
    • G06F17/30864
    • An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming.
    • 提供了用于检测网页模板的改进的系统和方法。 可以提供网页模板检测器用于在网页上执行页面级模板检测。 通常,网页模板分类器可以使用自动生成的训练数据进行训练,然后可以将网页模板分类器应用于网页以识别网页模板。 可以通过将网页的片段分类为模板结构,通过将分类分数分配给分类为模板结构的网页的片段,然后通过平滑分配给网页的片段的分类得分来检测网页模板。 通过使用动态规划最小化优化函数,广义等渗回归可以应用于与层次结点相关联的平滑分数。
    • 3. 发明申请
    • System and method for smoothing hierarchical data using isotonic regression
    • 使用等渗回归平滑分层数据的系统和方法
    • US20080275890A1
    • 2008-11-06
    • US11800235
    • 2007-05-04
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • G06F17/30G06F15/00
    • G06F17/3089
    • An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming.
    • 提供了用于检测网页模板的改进的系统和方法。 可以提供网页模板检测器用于在网页上执行页面级模板检测。 通常,网页模板分类器可以使用自动生成的训练数据进行训练,然后可以将网页模板分类器应用于网页以识别网页模板。 可以通过将网页的片段分类为模板结构,通过将分类分数分配给分类为模板结构的网页的片段,然后通过平滑分配给网页的片段的分类得分来检测网页模板。 通过使用动态规划最小化优化函数,广义等渗回归可以应用于与层次结点相关联的平滑分数。
    • 5. 发明申请
    • QUICKLINK SELECTION FOR NAVIGATIONAL QUERY
    • 快速选择导航查询
    • US20100250528A1
    • 2010-09-30
    • US12412252
    • 2009-03-26
    • Kunal PuneraDeepayan ChakrabartiShanmugasundaram Ravikumar
    • Kunal PuneraDeepayan ChakrabartiShanmugasundaram Ravikumar
    • G06F17/30
    • G06F16/957G06F16/9038
    • According to techniques described herein, the best set of quicklinks is picked to maximize the benefits for a majority of the users of a search engine, since the “real estate” on a search results page is constrained and valuable. Quicklinks are navigational shortcuts that are displayed below the website homepage on a search results page. Using user browsing trails obtained from browser toolbars, and a simple probabilistic model, the quicklink selection program is formulated as a combinatorial optimization problem. Two techniques are proposed herein: a greedy technique and a tree-based technique. The tree-based technique finds an optimal solution, but may do so in a greater amount of time than the greedy technique takes to find a solution that is not guaranteed to be optimal. The tree-based technique may incorporate natural constraints on the set of chosen quicklinks.
    • 根据本文描述的技术,由于搜索结果页面上的“房地产”被限制和有价值,所以选择最佳的一组快速链接以使搜索引擎的大多数用户的利益最大化。 快速链接是在搜索结果页面上显示在网站首页下方的导航快捷方式。 使用从浏览器工具栏获得的用户浏览轨迹,以及简单的概率模型,将快速链接选择程序作为组合优化问题。 本文提出了两种技术:贪心技术和基于树的技术。 基于树的技术找到一个最佳解决方案,但可能会比贪婪技术花费更多的时间来找到不能保证是最佳的解决方案。 基于树的技术可以在所选择的快速链接集合上引入自然约束。
    • 6. 发明授权
    • System and method for detecting a web page template
    • 用于检测网页模板的系统和方法
    • US07987417B2
    • 2011-07-26
    • US11800236
    • 2007-05-04
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • G06F17/00
    • G06F17/30864
    • An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming.
    • 提供了用于检测网页模板的改进的系统和方法。 可以提供网页模板检测器用于在网页上执行页面级模板检测。 通常,网页模板分类器可以使用自动生成的训练数据进行训练,然后可以将网页模板分类器应用于网页以识别网页模板。 可以通过将网页的片段分类为模板结构,通过将分类分数分配给分类为模板结构的网页的片段,然后通过平滑分配给网页的片段的分类得分来检测网页模板。 通过使用动态规划最小化优化函数,广义等渗回归可以应用于与层次结点相关联的平滑分数。
    • 7. 发明授权
    • System and method for smoothing hierarchical data using isotonic regression
    • 使用等渗回归平滑分层数据的系统和方法
    • US07870474B2
    • 2011-01-11
    • US11800235
    • 2007-05-04
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • G06F17/00
    • G06F17/3089
    • An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming.
    • 提供了用于检测网页模板的改进的系统和方法。 可以提供网页模板检测器用于在网页上执行页面级模板检测。 通常,网页模板分类器可以使用自动生成的训练数据进行训练,然后可以将网页模板分类器应用于网页以识别网页模板。 可以通过将网页的片段分类为模板结构,通过将分类分数分配给分类为模板结构的网页的片段,然后通过平滑分配给网页的片段的分类得分来检测网页模板。 通过使用动态规划最小化优化函数,广义等渗回归可以应用于与层次结点相关联的平滑分数。
    • 10. 发明授权
    • System and method using hierachical clustering for evolutionary clustering of sequential data sets
    • 使用层次聚类的序列数据集的进化聚类的系统和方法
    • US07734629B2
    • 2010-06-08
    • US11414442
    • 2006-04-29
    • Deepayan ChakrabartiShanmugasundaram RavikumarAndrew Tomkins
    • Deepayan ChakrabartiShanmugasundaram RavikumarAndrew Tomkins
    • G06F7/00
    • G06F17/30705G06K9/6218
    • An improved system and method for evolutionary clustering of sequential data sets is provided. A snapshot cost may be determined for representing the data set for a particular clustering method used and may determine the cost of clustering the data set independently of a series of clusterings of the data sets in the sequence. A history cost may also be determined for measuring the distance between corresponding clusters of the data set and the previous data set in the sequence of data sets to determine a cost of clustering the data set as part of a series of clusterings of the data sets in the sequence. An overall cost may be determined for clustering the data set by minimizing the combination of the snapshot cost and the history cost. Any clustering method may be used, including flat clustering and hierarchical clustering.
    • 提供了一种用于顺序数据集进化聚类的改进的系统和方法。 可以确定用于表示所使用的特定聚类方法的数据集的快照成本,并且可以独立于序列中的数据集的一系列聚类来确定数据集的聚类成本。 还可以确定历史成本用于测量数据集的相应簇之间的距离和数据集序列中的先前数据集之间的距离,以确定数据集的聚类成本,作为数据集的一系列聚类的一部分 序列。 可以通过最小化快照成本和历史成本的组合来确定用于对数据集进行聚类的总体成本。 可以使用任何聚类方法,包括平面聚类和层次聚类。