会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明申请
    • System and method for detecting a web page
    • 用于检测网页的系统和方法
    • US20080275901A1
    • 2008-11-06
    • US11800236
    • 2007-05-04
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • G06F17/30
    • G06F17/30864
    • An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming.
    • 提供了用于检测网页模板的改进的系统和方法。 可以提供网页模板检测器用于在网页上执行页面级模板检测。 通常,网页模板分类器可以使用自动生成的训练数据进行训练,然后可以将网页模板分类器应用于网页以识别网页模板。 可以通过将网页的片段分类为模板结构,通过将分类分数分配给分类为模板结构的网页的片段,然后通过平滑分配给网页的片段的分类得分来检测网页模板。 通过使用动态规划最小化优化函数,广义等渗回归可以应用于与层次结点相关联的平滑分数。
    • 4. 发明申请
    • System and method for smoothing hierarchical data using isotonic regression
    • 使用等渗回归平滑分层数据的系统和方法
    • US20080275890A1
    • 2008-11-06
    • US11800235
    • 2007-05-04
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • G06F17/30G06F15/00
    • G06F17/3089
    • An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming.
    • 提供了用于检测网页模板的改进的系统和方法。 可以提供网页模板检测器用于在网页上执行页面级模板检测。 通常,网页模板分类器可以使用自动生成的训练数据进行训练,然后可以将网页模板分类器应用于网页以识别网页模板。 可以通过将网页的片段分类为模板结构,通过将分类分数分配给分类为模板结构的网页的片段,然后通过平滑分配给网页的片段的分类得分来检测网页模板。 通过使用动态规划最小化优化函数,广义等渗回归可以应用于与层次结点相关联的平滑分数。
    • 6. 发明申请
    • QUICKLINK SELECTION FOR NAVIGATIONAL QUERY
    • 快速选择导航查询
    • US20100250528A1
    • 2010-09-30
    • US12412252
    • 2009-03-26
    • Kunal PuneraDeepayan ChakrabartiShanmugasundaram Ravikumar
    • Kunal PuneraDeepayan ChakrabartiShanmugasundaram Ravikumar
    • G06F17/30
    • G06F16/957G06F16/9038
    • According to techniques described herein, the best set of quicklinks is picked to maximize the benefits for a majority of the users of a search engine, since the “real estate” on a search results page is constrained and valuable. Quicklinks are navigational shortcuts that are displayed below the website homepage on a search results page. Using user browsing trails obtained from browser toolbars, and a simple probabilistic model, the quicklink selection program is formulated as a combinatorial optimization problem. Two techniques are proposed herein: a greedy technique and a tree-based technique. The tree-based technique finds an optimal solution, but may do so in a greater amount of time than the greedy technique takes to find a solution that is not guaranteed to be optimal. The tree-based technique may incorporate natural constraints on the set of chosen quicklinks.
    • 根据本文描述的技术,由于搜索结果页面上的“房地产”被限制和有价值,所以选择最佳的一组快速链接以使搜索引擎的大多数用户的利益最大化。 快速链接是在搜索结果页面上显示在网站首页下方的导航快捷方式。 使用从浏览器工具栏获得的用户浏览轨迹,以及简单的概率模型,将快速链接选择程序作为组合优化问题。 本文提出了两种技术:贪心技术和基于树的技术。 基于树的技术找到一个最佳解决方案,但可能会比贪婪技术花费更多的时间来找到不能保证是最佳的解决方案。 基于树的技术可以在所选择的快速链接集合上引入自然约束。
    • 7. 发明授权
    • Method for summarizing event-related texts to answer search queries
    • 用于总结事件相关文本以回答搜索查询的方法
    • US08666916B2
    • 2014-03-04
    • US13178396
    • 2011-07-07
    • Kunal PuneraDeepayan Chakrabarti
    • Kunal PuneraDeepayan Chakrabarti
    • G06F15/18
    • G06F17/30864G06N7/005
    • A method and apparatus for receiving training data that comprise a plurality of event-and-time-specific texts that are contextually related to a plurality of events; iteratively processing the training data to generate a modified network model that defines a plurality of states; receiving additional data that comprise a plurality of additional event-and-time-specific texts that are contextually related to a particular event; processing the additional data by applying the modified network model to the additional data to identify, within the plurality of additional event-and-time specific texts, a particular set of texts that belong to a particular state of the plurality of states; identifying, within the particular set of texts, one or more texts that are most representative of all texts in the particular set of texts that belong to the particular state; wherein the method is performed by one or more special-purpose computing devices.
    • 一种用于接收训练数据的方法和装置,所述培训数据包括与多个事件相关的多个事件和时间专用文本; 迭代地处理训练数据以生成定义多个状态的修改的网络模型; 接收包括与特定事件上下文相关的多个附加事件和时间特定文本的附加数据; 通过将修改的网络模型应用于附加数据来处理附加数据,以在多个附加事件和时间特定文本内识别属于多个状态的特定状态的特定文本集合; 在特定文本集中确定一个或多个文本,其最具代表属于特定国家的特定文本集中的所有文本; 其中所述方法由一个或多个专用计算设备执行。
    • 8. 发明授权
    • System and method for detecting a web page template
    • 用于检测网页模板的系统和方法
    • US07987417B2
    • 2011-07-26
    • US11800236
    • 2007-05-04
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • G06F17/00
    • G06F17/30864
    • An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming.
    • 提供了用于检测网页模板的改进的系统和方法。 可以提供网页模板检测器用于在网页上执行页面级模板检测。 通常,网页模板分类器可以使用自动生成的训练数据进行训练,然后可以将网页模板分类器应用于网页以识别网页模板。 可以通过将网页的片段分类为模板结构,通过将分类分数分配给分类为模板结构的网页的片段,然后通过平滑分配给网页的片段的分类得分来检测网页模板。 通过使用动态规划最小化优化函数,广义等渗回归可以应用于与层次结点相关联的平滑分数。
    • 9. 发明授权
    • System and method for smoothing hierarchical data using isotonic regression
    • 使用等渗回归平滑分层数据的系统和方法
    • US07870474B2
    • 2011-01-11
    • US11800235
    • 2007-05-04
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • Deepayan ChakrabartiKunal PuneraShanmugasundaram Ravikumar
    • G06F17/00
    • G06F17/3089
    • An improved system and method is provided for detecting a web page template. A web page template detector may be provided for performing page-level template detection on a web page. In general, the web page template classifier may be trained using automatically generated training data, and then the web page template classifier may be applied to web pages to identify web page templates. A web page template may be detected by classifying segments of a web page as template structures, by assigning classification scores to the segments of the web page classified as template structures, and then by smoothing the classification scores assigned to the segments of the web page. Generalized isotonic regression may be applied for smoothing scores associated with the nodes of a hierarchy by minimizing an optimization function using dynamic programming.
    • 提供了用于检测网页模板的改进的系统和方法。 可以提供网页模板检测器用于在网页上执行页面级模板检测。 通常,网页模板分类器可以使用自动生成的训练数据进行训练,然后可以将网页模板分类器应用于网页以识别网页模板。 可以通过将网页的片段分类为模板结构,通过将分类分数分配给分类为模板结构的网页的片段,然后通过平滑分配给网页的片段的分类得分来检测网页模板。 通过使用动态规划最小化优化函数,广义等渗回归可以应用于与层次结点相关联的平滑分数。