Quick Patent Search - Fast Search of Global Patents, Free Commercial-Use Patent Database - IPRDB

1. Invention application

US20100274770A1 TRANSDUCTIVE APPROACH TO CATEGORY-SPECIFIC RECORD ATTRIBUTE EXTRACTION
Status: Pending (published)
Publication number: US20100274770A1
Publication date: 2010-10-28
Application number: US12429442
Filing date: 2009-04-24
Applicant: Rahul Gupta, Sathiya Keerthi Selvaraj, Daniel Kifer, Srujana Merugu
Inventor: Rahul Gupta, Sathiya Keerthi Selvaraj, Daniel Kifer, Srujana Merugu
IPC classification: G06F17/30
CPC classification: G06F16/951, G06F16/285
Abstract: Disclosed are methods and apparatus for segmenting and labeling a collection of token sequences. A plurality of segments of one or more tokens in a token sequence collection are partially labeled with labels from a set of target labels using high precision domain-specific labelers so as to generate a partially labeled sequence collection having a plurality of labeled segments and a plurality of unlabeled segments. Any label conflicts in the partially labeled sequence collection are resolved. One or more of the labeled segments of the partially labeled sequence collection are expanded so as to cover one or more additional tokens of the partially labeled sequence collection. A statistical model, for labeling segments using local token and segment features of the sequence collection, is trained based on the partially labeled sequence collection. This trained model is then used to label the unlabeled segments and the labeled segments of the sequence collection so as to generate a labeled sequence collection. The labeled sequence collection is then stored as structured output records in a database.
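The first three steps of the abstract above (partial labeling, conflict resolution, segment expansion) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the regex labelers, the rule that a token with conflicting labels is left unlabeled, and the rule that an unlabeled token between two identically labeled tokens is absorbed. The abstract does not specify any of these, and the statistical model trained afterwards is not shown.

```python
import re
from typing import Callable, List, Optional, Tuple

# (token_index, label) pairs produced by one labeler; all names are hypothetical.
Hit = Tuple[int, str]
Labeler = Callable[[List[str]], List[Hit]]


def regex_labeler(pattern: str, label: str) -> Labeler:
    """Build a labeler that tags every token fully matching `pattern`."""
    rx = re.compile(pattern)

    def run(tokens: List[str]) -> List[Hit]:
        return [(i, label) for i, tok in enumerate(tokens) if rx.fullmatch(tok)]

    return run


def partially_label(tokens: List[str], labelers: List[Labeler]) -> List[Optional[str]]:
    """Apply all labelers; a token keeps a label only if exactly one label was proposed.

    Dropping conflicting labels is an assumed conflict-resolution rule.
    """
    votes = [set() for _ in tokens]
    for labeler in labelers:
        for index, label in labeler(tokens):
            votes[index].add(label)
    return [next(iter(v)) if len(v) == 1 else None for v in votes]


def expand(labels: List[Optional[str]]) -> List[Optional[str]]:
    """Absorb an unlabeled token that sits between two tokens with the same label.

    This gap-filling rule is an assumed stand-in for segment expansion.
    """
    out = list(labels)
    for i in range(1, len(labels) - 1):
        left, right = labels[i - 1], labels[i + 1]
        if labels[i] is None and left is not None and left == right:
            out[i] = left
    return out


tokens = ["Joe", "'s", "Pizza", "Sunnyvale", "408"]
labelers = [
    regex_labeler(r"[A-Z][a-z]+", "NAME"),
    regex_labeler(r"Sunnyvale", "CITY"),
    regex_labeler(r"\d{3}", "PHONE"),
]
partial = partially_label(tokens, labelers)
expanded = expand(partial)
```

In this toy run, "Sunnyvale" is proposed as both NAME and CITY and is therefore left unlabeled, while "'s" is absorbed into the surrounding NAME segment. In the described method, the remaining unlabeled tokens would then be handled by the trained statistical model.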

2. Invention application

US20100241639A1 APPARATUS AND METHODS FOR CONCEPT-CENTRIC INFORMATION EXTRACTION
Status: Pending (published)
Publication number: US20100241639A1
Publication date: 2010-09-23
Application number: US12408450
Filing date: 2009-03-20
Applicant: Daniel Kifer, Srujana Merugu, Ankur Jain, Sathiya Keerthi Selvaraj, Alok S. Kirpal, Philip L. Bohannon, Raghu Ramakrishnan
Inventor: Daniel Kifer, Srujana Merugu, Ankur Jain, Sathiya Keerthi Selvaraj, Alok S. Kirpal, Philip L. Bohannon, Raghu Ramakrishnan
IPC classification: G06F17/30
CPC classification: G06F16/345, G06F16/313
Abstract: Disclosed are methods and apparatus for extracting (or annotating) structured information from web content. Web content of interest from a particular domain is represented as one or more tree instances having a plurality of branching nodes that each correspond to a web object such that the tree instances correspond to one or more structured data instances. The particular domain is associated with domain knowledge that includes one or more presentation rulesets that each specifies a particular structure for a set of data instances, a domain-specific concept labeler, one or more specified properties of the web objects in the tree instances, and a concept schema that specifies a representation of the data to be extracted from the web content. A structured data instance that conforms to the concept schema is extracted from the one or more tree instances based on the domain knowledge for the particular domain. Extraction of the structured data instances is accomplished by (i) using the domain-specific concept labeler to annotate a subset of nodes of the tree instances; and (ii) using a locally adaptive concept annotator to extract the structured data instances based on the annotated segments and the local properties associated with such annotated segments. The extracted structured data instance is stored as structured output records in a database.

3. Invention application

US20090327168A1 PLAYFUL INCENTIVE FOR LABELING CONTENT
Status: Pending (published)
Publication number: US20090327168A1
Publication date: 2009-12-31
Application number: US12147342
Filing date: 2008-06-26
Applicant: Kilian Quirin Weinberger, Anirban Dasgupta, Raghu Ramakrishnan, David Reiley, Martin Andre Monroe Zinkevich, Bo Pang, Daniel Kifer
Inventor: Kilian Quirin Weinberger, Anirban Dasgupta, Raghu Ramakrishnan, David Reiley, Martin Andre Monroe Zinkevich, Bo Pang, Daniel Kifer
IPC classification: G06F3/048
CPC classification: H04L51/12
Abstract: Embodiments are directed towards employing a playful incentive to encourage users to provide feedback that is useable to train a classifier. The classifier being associated with any of a variety of different settings, including but not limited to classifying: messages as ham/spam, images, advertising, bookmarking, music, videos, photographs, shopping, or the like. An animated image, such as a pet, provides an interface to the classifier that encourages and responds to user feedback. Users may share their classifiers or aspects thereof with other users to enable a community of knowledge to be applied to a classification task, while preserving privacy of the user feedback. One form of sharing may be within the context of a competitive game. Various evaluations may be performed on a classifier to indicate user feedback consistency, or quality. Classifiers may also be used to provide users with advertisements, products, or services based on the user's feedback.

4. Invention application

US20090216718A1 System for Query Scheduling to Maximize Work Sharing
Status: Active
Publication number: US20090216718A1
Publication date: 2009-08-27
Application number: US12036956
Filing date: 2008-02-25
Applicant: Parag Agrawal, Daniel Kifer, Chris Olston
Inventor: Parag Agrawal, Daniel Kifer, Chris Olston
IPC classification: G06F7/06
CPC classification: G06F17/30442, G06F17/30474
Abstract: A system of query scheduling to maximize work sharing. The system schedules queries to account for future queries possessing a sharability component. Included in the system are operations for assigning an incoming query to a query queue based on a sharability characteristic of the incoming query, and evaluating a priority function for each member of a plurality of query queues to identify one highest priority query queue. The priority function accounts for the probability that a future incoming query will contain the sharability characteristic common to a member of the plurality of query queues. The system of query scheduling to maximize work sharing selects a batch of queries from the highest priority query queue, and dispatches the batch to one or more query execution units.
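The queue-and-priority flow in the abstract above can be sketched in Python as follows. This is a toy under stated assumptions, not the patented scheduler: the class `SharingScheduler`, the use of a string `share_key` as the sharability characteristic, and the priority formula `pending * (1 - p_future)` are all hypothetical. The abstract says only that the priority function accounts for the probability of future queries sharing the characteristic; here a queue that is likely to receive more sharers scores lower so that it can keep accumulating work.

```python
from collections import defaultdict, deque
from typing import Deque, Dict, List, Tuple


class SharingScheduler:
    """Toy scheduler: one queue per sharability key (hypothetical design)."""

    def __init__(self, arrival_prob: Dict[str, float]) -> None:
        # Assumed estimate, per key, that a future query will share that key.
        self.arrival_prob = arrival_prob
        self.queues: Dict[str, Deque[str]] = defaultdict(deque)

    def submit(self, query: str, share_key: str) -> None:
        """Assign an incoming query to the queue for its sharability key."""
        self.queues[share_key].append(query)

    def priority(self, share_key: str) -> float:
        """Illustrative priority: pending work, discounted when more sharers are likely."""
        pending = len(self.queues[share_key])
        p_future = self.arrival_prob.get(share_key, 0.0)
        return pending * (1.0 - p_future)

    def next_batch(self, max_batch: int) -> Tuple[str, List[str]]:
        """Pick the highest-priority non-empty queue and pop a batch from it."""
        candidates = [key for key, queue in self.queues.items() if queue]
        if not candidates:
            raise LookupError("no pending queries")
        best = max(candidates, key=self.priority)
        queue = self.queues[best]
        batch = [queue.popleft() for _ in range(min(max_batch, len(queue)))]
        return best, batch


scheduler = SharingScheduler({"orders": 0.9, "users": 0.1})
for name in ("q1", "q2", "q3"):
    scheduler.submit(name, "orders")
for name in ("q4", "q5"):
    scheduler.submit(name, "users")
first = scheduler.next_batch(max_batch=2)
```

With these made-up probabilities, the "users" queue is dispatched first even though "orders" holds more queries, because further "orders" queries are expected to arrive and share the same work. The returned batch would then be handed to a query execution unit, which is outside this sketch.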

5. Granted invention patent

US07877380B2 System for query scheduling to maximize work sharing
Status: Active
Publication number: US07877380B2
Publication date: 2011-01-25
Application number: US12036956
Filing date: 2008-02-25
Applicant: Parag Agrawal, Daniel Kifer, Chris Olston
Inventor: Parag Agrawal, Daniel Kifer, Chris Olston
IPC classification: G06F7/00, G06F17/30
CPC classification: G06F17/30442, G06F17/30474
Abstract: A system of query scheduling to maximize work sharing. The system schedules queries to account for future queries possessing a sharability component. Included in the system are operations for assigning an incoming query to a query queue based on a sharability characteristic of the incoming query, and evaluating a priority function for each member of a plurality of query queues to identify one highest priority query queue. The priority function accounts for the probability that a future incoming query will contain the sharability characteristic common to a member of the plurality of query queues. The system of query scheduling to maximize work sharing selects a batch of queries from the highest priority query queue, and dispatches the batch to one or more query execution units.
