专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US07606797B2 Reverse value attribute extraction 失效
标题翻译：反向值属性提取
公开(公告)号：US07606797B2
公开(公告)日：2009-10-20
申请号：US11357289
申请日：2006-02-16
申请人： Keiron McCammon , Manish Chandra
发明人： Keiron McCammon , Manish Chandra
IPC分类号： G06F17/30
CPC分类号： G06F17/30622 , Y10S707/99934 , Y10S707/99935 , Y10S707/99936 , Y10S707/99937
摘要： An attribute manager extracts attribute values from formatted data. The attribute manager maintains information concerning a plurality of attributes, such as matching names and values for attributes. Formatted data is parsed into a plurality of elements comprising a canonical representation of the data, independent of the data format. The formatted data can be, for example, a web page, a portable document format document or a word processor document. The attribute manager scans the elements for occurrences of attribute values. Based upon value occurrence distribution and frequency within the data, and maintained information concerning attributes, the attribute manager infers occurrence of specific attributes in the formatted data and assigns the most appropriate occurring values to the specific attributes. In some embodiments, the attribute manager stores attributes and their assigned values, and uses this information to automatically prepare summaries of input data.
摘要翻译：属性管理器从格式化数据中提取属性值。属性管理器维护关于多个属性的信息，例如匹配属性的名称和值。格式化的数据被解析为包括数据的规范表示的多个元素，与数据格式无关。格式化的数据可以是例如网页，便携式文档格式文档或文字处理器文档。属性管理器扫描元素以发生属性值。基于数据中的价值发生分布和频率，并且维护关于属性的信息，属性管理器推断格式化数据中的特定属性的发生，并将最适合的发生值分配给特定属性。在一些实施例中，属性管理器存储属性及其分配的值，并且使用该信息来自动准备输入数据的摘要。

2. 发明申请

US20060190684A1 Reverse value attribute extraction 失效
标题翻译：反向值属性提取
公开(公告)号：US20060190684A1
公开(公告)日：2006-08-24
申请号：US11357289
申请日：2006-02-16
申请人： Keiron McCammon , Manish Chandra
发明人： Keiron McCammon , Manish Chandra
IPC分类号： G06F13/28
CPC分类号： G06F17/30622 , Y10S707/99934 , Y10S707/99935 , Y10S707/99936 , Y10S707/99937
摘要： An attribute manager extracts attribute values from formatted data. The attribute manager maintains information concerning a plurality of attributes, such as matching names and values for attributes. Formatted data is parsed into a plurality of elements comprising a canonical representation of the data, independent of the data format. The formatted data. can be, for example, a web page, a portable document format document or a word processor document. The attribute manager scans the elements for occurrences of attribute values. Based upon value occurrence distribution and frequency within the data, and maintained information concerning attributes, the attribute manager infers occurrence of specific attributes in the formatted data and assigns the most appropriate occurring values to the specific attributes. In some embodiments, the attribute manager stores attributes and their assigned values, and uses this information to automatically prepare summaries of input data.
摘要翻译：属性管理器从格式化数据中提取属性值。属性管理器维护关于多个属性的信息，例如匹配属性的名称和值。格式化的数据被解析为包括数据的规范表示的多个元素，与数据格式无关。格式化数据。可以是例如网页，便携式文档格式文档或文字处理器文档。属性管理器扫描元素以发生属性值。基于数据中的价值发生分布和频率，并且维护关于属性的信息，属性管理器推断格式化数据中的特定属性的发生，并将最适合的发生值分配给特定属性。在一些实施例中，属性管理器存储属性及其分配的值，并且使用该信息来自动准备输入数据的摘要。

3. 发明申请

US20060200457A1 Extracting information from formatted sources 失效
公开(公告)号：US20060200457A1
公开(公告)日：2006-09-07
申请号：US11357656
申请日：2006-02-16
申请人： Keiron McCammon , Manish Chandra
发明人： Keiron McCammon , Manish Chandra
IPC分类号： G06F17/30
CPC分类号： G06F17/2785 , G06F17/30719 , G06F17/30864 , Y10S707/99932 , Y10S707/99933 , Y10S707/99937
摘要： An extraction manager extracts information from formatted input. The input is annotated with presentation information, and parsed into a set of elements comprising a canonical representation thereof. An information analyzer analyzes the elements in order to glean additional information. An entity extractor determines entities to extract from the input. The entity extractor analyzes elements according to specific entities to be extracted, and creates entity specific observations for analyzed elements. These observations comprise possible values for the relevant entities. A heuristics processor maintains a collection of entity specific heuristics, each comprising a test to help determine the suitability of data as a value for the corresponding entity. The heuristics processor selects heuristics for the entities to be extracted, and tests observations for these entities against the selected heuristics. Responsive to this testing, ordered possible values for entities to extract are determined.

4. 发明授权

US07630968B2 Extracting information from formatted sources 失效
标题翻译：从格式化源中提取信息
公开(公告)号：US07630968B2
公开(公告)日：2009-12-08
申请号：US11357656
申请日：2006-02-16
申请人： Keiron McCammon , Manish Chandra
发明人： Keiron McCammon , Manish Chandra
IPC分类号： G06F17/30
CPC分类号： G06F17/2785 , G06F17/30719 , G06F17/30864 , Y10S707/99932 , Y10S707/99933 , Y10S707/99937
摘要： An extraction manager extracts information from formatted input. The input is annotated with presentation information, and parsed into a set of elements comprising a canonical representation thereof. An information analyzer analyzes the elements in order to glean additional information. An entity extractor determines entities to extract from the input. The entity extractor analyzes elements according to specific entities to be extracted, and creates entity specific observations for analyzed elements. These observations comprise possible values for the relevant entities. A heuristics processor maintains a collection of entity specific heuristics, each comprising a test to help determine the suitability of data as a value for the corresponding entity. The heuristics processor selects heuristics for the entities to be extracted, and tests observations for these entities against the selected heuristics. Responsive to this testing, ordered possible values for entities to extract are determined.
摘要翻译：提取管理器从格式化输入中提取信息。该输入用呈现信息注释，并被解析成包括其规范表示的一组元素。信息分析器分析元素以收集附加信息。实体提取器确定从输入中提取的实体。实体提取器根据要提取的特定实体分析元素，并为分析元素创建实体特定观察值。这些意见包括相关实体的可能价值。启发式处理器维护一个实体特定的启发式的集合，每个都包含测试，以帮助确定数据作为相应实体的值的适用性。启发式处理器为要提取的实体选择启发式，并根据所选择的启发式测试这些实体的观察结果。响应于此测试，确定要提取的实体的可能值。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式