    • 1. Granted invention patent
    • Reverse value attribute extraction
    • US07606797B2
    • 2009-10-20
    • US11357289
    • 2006-02-16
    • Keiron McCammon; Manish Chandra
    • Keiron McCammon; Manish Chandra
    • G06F17/30
    • G06F17/30622; Y10S707/99934; Y10S707/99935; Y10S707/99936; Y10S707/99937
    • An attribute manager extracts attribute values from formatted data. The attribute manager maintains information concerning a plurality of attributes, such as matching names and values for attributes. Formatted data is parsed into a plurality of elements comprising a canonical representation of the data, independent of the data format. The formatted data can be, for example, a web page, a portable document format document or a word processor document. The attribute manager scans the elements for occurrences of attribute values. Based upon value occurrence distribution and frequency within the data, and maintained information concerning attributes, the attribute manager infers occurrence of specific attributes in the formatted data and assigns the most appropriate occurring values to the specific attributes. In some embodiments, the attribute manager stores attributes and their assigned values, and uses this information to automatically prepare summaries of input data.
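The abstract above describes scanning parsed elements for known attribute values and then inferring which attribute each value belongs to. A minimal Python sketch of that idea follows. The abstract does not give a scoring rule, so the one used here (highest frequency, ties broken by earliest occurrence) is an assumption, and the `KNOWN_VALUES` table and the `extract_attributes` name are hypothetical, for illustration only.

```python
from collections import Counter

# Hypothetical attribute knowledge: attribute name -> known values.
# Illustrative only; not taken from the patent.
KNOWN_VALUES = {
    "color": {"red", "blue", "green"},
    "size": {"small", "medium", "large"},
}


def extract_attributes(elements, known_values=KNOWN_VALUES):
    """Assign to each attribute the known value that occurs most often.

    `elements` is a list of strings standing in for the canonical,
    format-independent representation of the parsed input.
    Ties are broken in favour of the value seen earliest (an assumption).
    """
    counts = {name: Counter() for name in known_values}
    first_seen = {}  # (attribute, value) -> position of first occurrence
    position = 0
    for element in elements:
        for token in element.lower().split():
            token = token.strip(".,;:!?()")
            # Start from the value: which attributes could it belong to?
            for name, values in known_values.items():
                if token in values:
                    counts[name][token] += 1
                    first_seen.setdefault((name, token), position)
            position += 1

    result = {}
    for name, counter in counts.items():
        if counter:  # the attribute is inferred to occur in the input
            result[name] = min(
                counter,
                key=lambda v: (-counter[v], first_seen[(name, v)]),
            )
    return result
```

Called with `["Red shirt, size large.", "Also available in blue.", "The red one ships today."]`, the sketch picks `red` over `blue` for `color` because it occurs twice, and omits any attribute whose values never appear.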
    • 2. Invention patent application
    • Reverse value attribute extraction
    • US20060190684A1
    • 2006-08-24
    • US11357289
    • 2006-02-16
    • Keiron McCammon; Manish Chandra
    • Keiron McCammon; Manish Chandra
    • G06F13/28
    • G06F17/30622; Y10S707/99934; Y10S707/99935; Y10S707/99936; Y10S707/99937
    • An attribute manager extracts attribute values from formatted data. The attribute manager maintains information concerning a plurality of attributes, such as matching names and values for attributes. Formatted data is parsed into a plurality of elements comprising a canonical representation of the data, independent of the data format. The formatted data can be, for example, a web page, a portable document format document or a word processor document. The attribute manager scans the elements for occurrences of attribute values. Based upon value occurrence distribution and frequency within the data, and maintained information concerning attributes, the attribute manager infers occurrence of specific attributes in the formatted data and assigns the most appropriate occurring values to the specific attributes. In some embodiments, the attribute manager stores attributes and their assigned values, and uses this information to automatically prepare summaries of input data.
    • 4. Granted invention patent
    • Extracting information from formatted sources
    • US07630968B2
    • 2009-12-08
    • US11357656
    • 2006-02-16
    • Keiron McCammon; Manish Chandra
    • Keiron McCammon; Manish Chandra
    • G06F17/30
    • G06F17/2785; G06F17/30719; G06F17/30864; Y10S707/99932; Y10S707/99933; Y10S707/99937
    • An extraction manager extracts information from formatted input. The input is annotated with presentation information, and parsed into a set of elements comprising a canonical representation thereof. An information analyzer analyzes the elements in order to glean additional information. An entity extractor determines entities to extract from the input. The entity extractor analyzes elements according to specific entities to be extracted, and creates entity specific observations for analyzed elements. These observations comprise possible values for the relevant entities. A heuristics processor maintains a collection of entity specific heuristics, each comprising a test to help determine the suitability of data as a value for the corresponding entity. The heuristics processor selects heuristics for the entities to be extracted, and tests observations for these entities against the selected heuristics. Responsive to this testing, ordered possible values for entities to extract are determined.
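The last abstract describes testing candidate values ("observations") for an entity against entity-specific heuristics and returning them in ranked order. The sketch below illustrates only that ranking step. The abstract does not say how test results are combined, so summing scores is an assumption, and the `HEURISTICS` table, its individual tests, and the `rank_observations` name are hypothetical.

```python
from typing import Callable, Dict, List

# A heuristic is a test that scores how suitable a candidate value is
# for an entity; here 1.0 means "passes" and 0.0 means "fails".
Heuristic = Callable[[str], float]

# Hypothetical entity-specific heuristics. Illustrative only.
HEURISTICS: Dict[str, List[Heuristic]] = {
    "price": [
        lambda v: 1.0 if v.startswith("$") else 0.0,
        lambda v: 1.0 if any(c.isdigit() for c in v) else 0.0,
    ],
    "title": [
        lambda v: 1.0 if v[:1].isupper() else 0.0,
        lambda v: 1.0 if 2 <= len(v.split()) <= 12 else 0.0,
    ],
}


def rank_observations(
    entity: str,
    observations: List[str],
    heuristics: Dict[str, List[Heuristic]] = HEURISTICS,
) -> List[str]:
    """Order candidate values for `entity` from most to least suitable.

    Each observation is scored by summing the results of the heuristics
    selected for the entity (an assumed combination rule). The sort is
    stable, so equally scored values keep their input order.
    """
    tests = heuristics.get(entity, [])
    scored = [(sum(test(value) for test in tests), value)
              for value in observations]
    scored.sort(key=lambda pair: -pair[0])
    return [value for _, value in scored]
```

For the entity `price` and the observations `["free shipping", "$19.99", "20 items"]`, the value `$19.99` passes both tests, `20 items` passes one, and `free shipping` passes none, so they come back in that order.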