会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • APPARATUS AND METHODS FOR CONCEPT-CENTRIC INFORMATION EXTRACTION
    • 概念中心信息提取的装置和方法
    • US20100241639A1
    • 2010-09-23
    • US12408450
    • 2009-03-20
    • Daniel KiferSrujana MeruguAnkur JainSathiya Keerthi SelvarajAlok S. KirpalPhilip L. BohannonRaghu Ramakrishnan
    • Daniel KiferSrujana MeruguAnkur JainSathiya Keerthi SelvarajAlok S. KirpalPhilip L. BohannonRaghu Ramakrishnan
    • G06F17/30
    • G06F16/345G06F16/313
    • Disclosed are methods and apparatus for extracting (or annotating) structured information from web content. Web content of interest from a particular domain is represented as one or more tree instances having a plurality of branching nodes that each correspond to a web object such that the tree instances correspond to one or more structured data instances. The particular domain is associated with domain knowledge that includes one or more presentation rulesets that each specifies a particular structure for a set of data instances, a domain-specific concept labeler, one or more specified properties of the web objects in the tree instances, and a concept schema that specifies a representation of the data to be extracted from the web content. A structured data instance that conforms to the concept schema is extracted from the one or more tree instances based on the domain knowledge for the particular domain. Extraction of the structured data instances is accomplished by (i) using the domain-specific concept labeler to annotate a subset of nodes of the tree instances; and (ii) using a locally adaptive concept annotator to extract the structured data instances based on the annotated segments and the local properties associated with such annotated segments. The extracted structured data instance is stored as structured output records in a database.
    • 公开了从网页内容中提取(或注释)结构化信息的方法和装置。 来自特定域的感兴趣的Web内容被表示为具有多个分支节点的一个或多个树实例,每个分支节点对应于web对象,使得树实例对应于一个或多个结构化数据实例。 特定域与域知识相关联,其包括一个或多个呈现规则集,每个表示规则集指定一组数据实例的特定结构,特定于域的概念标签器,树实例中的web对象的一个​​或多个指定的属性,以及 一个概念模式,指定要从Web内容中提取的数据的表示。 基于特定域的域知识,从一个或多个树实例提取符合概念模式的结构化数据实例。 结构化数据实例的提取是通过(i)使用域特定概念标签器来注释树实例的节点的子集来实现的; 以及(ii)使用本地适应性概念注释器基于所注释的段和与这些注释段相关联的本地属性来提取结构化数据实例。 提取的结构化数据实例作为结构化输出记录存储在数据库中。
    • 2. 发明申请
    • FAULT-TOLERANT WEB CACHING
    • 容错网页缓存
    • US20140372627A1
    • 2014-12-18
    • US13149587
    • 2011-05-31
    • Michael AxelrodAnkur Jain
    • Michael AxelrodAnkur Jain
    • G06F15/173
    • H04L67/327G06F11/3041G06F11/3409G06F11/3466H04L45/126H04L67/02H04L67/2842H04L69/40
    • Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for monitoring capability to process web traffic. At various times, a web proxy announces a most specific route that is received by multiple clients configured to send web traffic for an address to a received most specific route to the address. The web proxy processes web traffic received from one of the clients as a result of announcing the route. When the web proxy determines a decrease in processing capability of the web proxy, the web proxy ceases to announce the most specific route such that one or more of the clients direct web traffic for the address to an alternative less specific route.
    • 方法,系统和装置,包括在计算机存储介质上编码的计算机程序,用于监视处理网络流量的能力。 在不同的时间,Web代理宣布由被配置为将地址的web流量发送到接收到的最特定的到达地址的路由的多个客户端接收的最具体的路由。 Web代理处理从其中一个客户端接收的Web流量作为通告路由的结果。 当Web代理确定Web代理的处理能力降低时,Web代理停止宣布最具体的路由,使得一个或多个客户端将该地址的web流量指向替代的较不具体的路由。
    • 5. 发明申请
    • RESULTS RETURNED FOR LIST-SEEKING QUERIES
    • 结果返回列表查询
    • US20130144870A1
    • 2013-06-06
    • US13310658
    • 2011-12-02
    • Anjani GuptaAnkur Jain
    • Anjani GuptaAnkur Jain
    • G06F17/30
    • G06F17/30675G06F17/30864
    • List-based search results are generated. According to one technique, items are extracted from multiple resources deemed relevant to a user-submitted search query, and a comprehensive master list of those items is compiled and returned to the query-submitted user in response to his submission. According to another technique, lists of items are identified within such query-relevant resources. For each list-containing resource deemed to be relevant to the query terms, a list is extract from that resource and included within that resource's abstract on the search results page returned to the user in response to his submission. Additionally, the resources may be re-ranked for generation of the search results page based on the lists contained within those resources in addition to (or regardless of) occurrences of query terms within those resources.
    • 生成基于列表的搜索结果。 根据一种技术,从与用户提交的搜索查询相关的多个资源中提取项目,并且根据他的提交,将这些项目的综合主列表编译并返回给查询提交的用户。 根据另一技术,在这些查询相关资源内识别项目列表。 对于被认为与查询项相关的每个包含列表的资源,从该资源中提取列表,并将其包含在该资源的摘要中,以响应于他的提交而返回给用户的搜索结果页面。 此外,除了在这些资源内(或不管这些资源)中出现的查询项之外,还可以基于除了这些资源内的列表来生成搜索结果页面,资源可以被重新排序。