    • 1. Invention application
    • TRANSDUCTIVE APPROACH TO CATEGORY-SPECIFIC RECORD ATTRIBUTE EXTRACTION
    • US20100274770A1
    • 2010-10-28
    • US12429442
    • 2009-04-24
    • Rahul Gupta; Sathiya Keerthi Selvaraj; Daniel Kifer; Srujana Merugu
    • Rahul Gupta; Sathiya Keerthi Selvaraj; Daniel Kifer; Srujana Merugu
    • G06F17/30
    • G06F16/951; G06F16/285
    • Disclosed are methods and apparatus for segmenting and labeling a collection of token sequences. A plurality of segments of one or more tokens in a token sequence collection are partially labeled with labels from a set of target labels using high precision domain-specific labelers so as to generate a partially labeled sequence collection having a plurality of labeled segments and a plurality of unlabeled segments. Any label conflicts in the partially labeled sequence collection are resolved. One or more of the labeled segments of the partially labeled sequence collection are expanded so as to cover one or more additional tokens of the partially labeled sequence collection. A statistical model, for labeling segments using local token and segment features of the sequence collection, is trained based on the partially labeled sequence collection. This trained model is then used to label the unlabeled segments and the labeled segments of the sequence collection so as to generate a labeled sequence collection. The labeled sequence collection is then stored as structured output records in a database.
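The first three steps named in the abstract above (partial labeling, conflict resolution, segment expansion) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the regex labelers, the "drop any token claimed by more than one labeler" conflict rule, and the leftward expansion rule are hypothetical and are not taken from the patent. The statistical model training step is omitted.

```python
import re

# Hypothetical high-precision labelers: one token-level pattern per target label.
LABELERS = {
    "PHONE": re.compile(r"^\d{3}-\d{4}$"),
    "ZIP": re.compile(r"^\d{5}$"),
    "PRICE": re.compile(r"^\d+$"),
    "CITY": re.compile(r"^(Boston|Austin)$"),
}

# Hypothetical expansion rule: a PHONE segment may grow left over an area code.
EXPANSION = {"PHONE": re.compile(r"^\(\d{3}\)$")}


def partial_label(tokens):
    """Return, per token, the set of labels whose labeler fired (often empty)."""
    return [{lab for lab, pat in LABELERS.items() if pat.match(tok)} for tok in tokens]


def resolve_conflicts(label_sets):
    """Assumed rule: keep a label only when exactly one labeler fired."""
    return [next(iter(s)) if len(s) == 1 else None for s in label_sets]


def expand(tokens, labels):
    """Grow labeled segments leftward over adjacent unlabeled tokens that match the rule."""
    out = list(labels)
    for i, lab in enumerate(labels):
        pat = EXPANSION.get(lab)
        j = i - 1
        while pat and j >= 0 and out[j] is None and pat.match(tokens[j]):
            out[j] = lab
            j -= 1
    return out


tokens = ["Call", "(617)", "555-1234", "Boston", "02115"]
labels = expand(tokens, resolve_conflicts(partial_label(tokens)))
print(list(zip(tokens, labels)))
```

In this sketch the token "02115" matches both the ZIP and PRICE labelers, so it is left unlabeled; in the described method such tokens would be labeled later by the trained model.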
    • 2. Invention application
    • APPARATUS AND METHODS FOR CONCEPT-CENTRIC INFORMATION EXTRACTION
    • US20100241639A1
    • 2010-09-23
    • US12408450
    • 2009-03-20
    • Daniel Kifer; Srujana Merugu; Ankur Jain; Sathiya Keerthi Selvaraj; Alok S. Kirpal; Philip L. Bohannon; Raghu Ramakrishnan
    • Daniel Kifer; Srujana Merugu; Ankur Jain; Sathiya Keerthi Selvaraj; Alok S. Kirpal; Philip L. Bohannon; Raghu Ramakrishnan
    • G06F17/30
    • G06F16/345; G06F16/313
    • Disclosed are methods and apparatus for extracting (or annotating) structured information from web content. Web content of interest from a particular domain is represented as one or more tree instances having a plurality of branching nodes that each correspond to a web object such that the tree instances correspond to one or more structured data instances. The particular domain is associated with domain knowledge that includes one or more presentation rulesets that each specifies a particular structure for a set of data instances, a domain-specific concept labeler, one or more specified properties of the web objects in the tree instances, and a concept schema that specifies a representation of the data to be extracted from the web content. A structured data instance that conforms to the concept schema is extracted from the one or more tree instances based on the domain knowledge for the particular domain. Extraction of the structured data instances is accomplished by (i) using the domain-specific concept labeler to annotate a subset of nodes of the tree instances; and (ii) using a locally adaptive concept annotator to extract the structured data instances based on the annotated segments and the local properties associated with such annotated segments. The extracted structured data instance is stored as structured output records in a database.
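The two-stage extraction in the abstract above (a domain-specific labeler annotates some tree nodes, then a second pass builds schema-conforming records from the annotated nodes and their local context) can be pictured with a small Python sketch. The `Node` class, the price regex, the two-field schema, and the "first unlabeled sibling is the name" heuristic are all hypothetical stand-ins. The patent's presentation rulesets and locally adaptive annotator are not modeled here.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """A web object in the tree representation of a page (hypothetical structure)."""
    tag: str
    text: str = ""
    children: List["Node"] = field(default_factory=list)
    label: Optional[str] = None


PRICE = re.compile(r"^\$\d+(\.\d{2})?$")


def label_concepts(node):
    """Stage (i): a toy domain-specific labeler that marks price-like nodes."""
    if PRICE.match(node.text):
        node.label = "price"
    for child in node.children:
        label_concepts(child)


def extract(node, schema=("name", "price")):
    """Stage (ii), simplified: pair each labeled price with its first unlabeled text sibling."""
    records = []
    priced = [c for c in node.children if c.label == "price"]
    names = [c for c in node.children if c.label is None and c.text]
    if priced and names:
        records.append(dict(zip(schema, (names[0].text, priced[0].text))))
    for child in node.children:
        records.extend(extract(child, schema))
    return records


page = Node("ul", children=[
    Node("li", children=[Node("span", "Espresso"), Node("span", "$3.50")]),
    Node("li", children=[Node("b", "Latte"), Node("span", "$4")]),
])
label_concepts(page)
print(extract(page))
```

The point of the sketch is that only the price nodes are recognized directly. The name field is inferred from its position relative to an annotated node, which is a much cruder version of using "local properties associated with such annotated segments".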
    • 4. Invention application
    • System for Query Scheduling to Maximize Work Sharing
    • US20090216718A1
    • 2009-08-27
    • US12036956
    • 2008-02-25
    • Parag Agrawal; Daniel Kifer; Chris Olston
    • Parag Agrawal; Daniel Kifer; Chris Olston
    • G06F7/06
    • G06F17/30442; G06F17/30474
    • A system of query scheduling to maximize work sharing. The system schedules queries to account for future queries possessing a sharability component. Included in the system are operations for assigning an incoming query to a query queue based on a sharability characteristic of the incoming query, and evaluating a priority function for each member of a plurality of query queues to identify one highest priority query queue. The priority function accounts for the probability that a future incoming query will contain the sharability characteristic common to a member of the plurality of query queues. The system of query scheduling to maximize work sharing selects a batch of queries from the highest priority query queue, and dispatches the batch to one or more query execution units.
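The scheduling loop described in the abstract above (queue queries by sharability characteristic, score each queue with a priority function that accounts for likely future arrivals, dispatch a batch from the best queue) might look roughly like the following Python sketch. The class name, the `arrival_prob` table, and especially the priority formula `len(queue) * (1 - p)` are assumptions chosen for illustration. The abstract does not state the actual priority function.

```python
from collections import defaultdict, deque


class SharingScheduler:
    """Toy scheduler: one queue per sharability key (for example, the table a query scans)."""

    def __init__(self, arrival_prob):
        # Hypothetical input: key -> probability that a future query shares that key.
        self.arrival_prob = arrival_prob
        self.queues = defaultdict(deque)

    def submit(self, query, share_key):
        """Assign an incoming query to the queue for its sharability key."""
        self.queues[share_key].append(query)

    def priority(self, key):
        # Assumed form: favor long queues, but defer queues that are likely
        # to receive more sharable queries soon (waiting yields a larger batch).
        return len(self.queues[key]) * (1.0 - self.arrival_prob.get(key, 0.0))

    def next_batch(self, max_size):
        """Pick the highest-priority non-empty queue and pop up to max_size queries from it."""
        candidates = [k for k, q in self.queues.items() if q]
        if not candidates:
            return None, []
        best = max(candidates, key=self.priority)
        queue = self.queues[best]
        batch = [queue.popleft() for _ in range(min(max_size, len(queue)))]
        return best, batch


sched = SharingScheduler({"orders": 0.9, "users": 0.1})
for q in ("q1", "q2", "q3"):
    sched.submit(q, "orders")
for q in ("q4", "q5"):
    sched.submit(q, "users")
print(sched.next_batch(max_size=2))
```

Under these assumed probabilities the shorter "users" queue is dispatched first, because more "orders" queries are expected to arrive and could share work with the ones already waiting.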
    • 5. Granted invention patent
    • System for query scheduling to maximize work sharing
    • US07877380B2
    • 2011-01-25
    • US12036956
    • 2008-02-25
    • Parag Agrawal; Daniel Kifer; Chris Olston
    • Parag Agrawal; Daniel Kifer; Chris Olston
    • G06F7/00; G06F17/30
    • G06F17/30442; G06F17/30474
    • A system of query scheduling to maximize work sharing. The system schedules queries to account for future queries possessing a sharability component. Included in the system are operations for assigning an incoming query to a query queue based on a sharability characteristic of the incoming query, and evaluating a priority function for each member of a plurality of query queues to identify one highest priority query queue. The priority function accounts for the probability that a future incoming query will contain the sharability characteristic common to a member of the plurality of query queues. The system of query scheduling to maximize work sharing selects a batch of queries from the highest priority query queue, and dispatches the batch to one or more query execution units.