会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 72. 发明授权
    • SYSTEM AND METHOD FOR SEARCH, INDEX, PARSING DOCUMENT DATABASE INCLUDING SUBJECT DOCUMENT HAVING NESTED FIELDS ASSOCIATED START AND END META WORDS WHERE EACH META WORD IDENTIFY LOCATION AND NESTING LEVEL
    • 用于搜索,索引,分配文档数据的系统和方法,包括具有相关字段的子文档相关的开始和结束元素每个元素字识别位置和核心级别
    • US06963869B2
    • 2005-11-08
    • US10042028
    • 2002-01-07
    • Michael Burrows
    • Michael Burrows
    • G06F17/30
    • G06F17/30911Y10S707/99934
    • An indexer indexes a database of documents, and a search engine searches the database of documents. Nesting level information stored in index entries is used to identify, and match together, start and end meta words comprising fields at assorted nesting levels within a document. Based on a query specifying words to be found within fields, spatial criteria are applied to the identified meta words to determine if the specified words are found within the specified fields. A subset of the documents have nested fields, and each nested field has an associated start meta word and end meta word. Each meta word has an associated nesting level. Each document is indexed by parsing the document to determine locations within the document of words and meta words, as well as the nesting level associated with each meta word. An index is generated that has word entries, meta word entries, and generic meta word entries. The meta word entries indicate locations within the documents of an identified meta word, as well as the nesting level of the meta word. The generic meta word entries identify locations within the document of a class of meta words, including meta words at all nesting levels of the meta words within the document. For each identified location within the generic meta word entry, the generic meta word entry also includes nesting level information associated with the meta word at the identified location.
    • 索引器索引文档的数据库,搜索引擎在数据库中搜索文档。 存储在索引条目中的嵌套级别信息用于标识和匹配起始和结束元文字,其中包含文档中各种嵌套级别的字段。 基于指定要在字段内找到的字的查询,将空间标准应用于所识别的元字,以确定在指定字段内是否找到指定的字。 文档的一个子集具有嵌套字段,并且每个嵌套字段都具有关联的开始元字和结束元字。 每个元词具有相关联的嵌套级别。 通过解析文档来索引每个文档,以确定单词和元词的文档内的位置,以及与每个元词相关联的嵌套级别。 生成具有单词条目,元词条目和通用元词条目的索引。 元字条目指示所识别的元字的文档内的位置以及元字词的嵌套级别。 通用元词条目标识了一类元词的文档内的位置,包括文档中元词的所有嵌套级别的元词。 对于通用元词条目中的每个识别的位置,通用元词条目还包括与所识别的位置处的元词相关联的嵌套级别信息。
    • 73. 发明授权
    • Web page connectivity server
    • 网页连接服务器
    • US06598051B1
    • 2003-07-22
    • US09664647
    • 2000-09-19
    • Janet L. WienerRaymond P. StataMichael Burrows
    • Janet L. WienerRaymond P. StataMichael Burrows
    • G06F1700
    • G06F17/3089Y10S707/99932Y10S707/99943Y10S707/99945
    • A connectivity server for a collecting, arranging and representing data defining the interconnection of pages on the World Wide Web (Web). A URL Database stores URLs and associates a fingerprint and CS_id with each URL. The URL Database interface is operable to translate between any two of a URL, a fingerprint, and a Host_id. A Host Database associates a Host_id with each distinct hostname in the URL Database. The Host Database interface is operable to accept a Host_id and return a number equal to the number of URLs on the respective host and to return the CS_ids of those URLs. A Link Database stores links between source URLs and destination URLs. The Link Database interface is operable to retrieve, for a given CS_id, the number of inlinks to and outlinks from the URL corresponding to the CS_id. In an embodiment characterized by a single processor, with access to all databases stored in RAM links and URL information may be retrieved sufficiently rapidly so that applications that touch every link, even multiple times, may execute in real time, in a few minutes or hours. Representative applications enabled by the connectivity server include static ranking of pages (eigenranks), query precomputation, mirror-site detection, and related-page identification.
    • 用于收集,排列和表示在万维网(Web)上定义页面互连的数据的连接服务器。 URL数据库存储URL并将指纹和CS_id与每个URL相关联。 URL数据库接口可用于在URL,指纹和Host_id中的任意两个之间进行转换。 主机数据库将Host_id与URL数据库中的每个不同的主机名相关联。 主机数据库接口可操作以接受Host_id并返回等于相应主机上的URL数量并返回这些URL的CS_id的数字。 链接数据库存储源URL和目标URL之间的链接。 链接数据库接口可用于针对给定的CS_id检索与对应于CS_id的URL的链接的数量和从外部链接的数量。 在以单个处理器为特征的实施例中,对存储在RAM链接中的所有数据库的访问和URL信息的访问可以被足够快速地检索,使得甚至多次触摸每个链路的应用可以在几分钟或几小时内实时执行 。 由连通性服务器启用的代表性应用程序包括页面(本征排列)的静态排名,查询预计算,镜像站点检测和相关页面标识。
    • 74. 发明授权
    • Memory storing an integrated index of database records
    • 存储数据库记录的集成索引的内存
    • US6078923A
    • 2000-06-20
    • US991179
    • 1997-12-16
    • Michael Burrows
    • Michael Burrows
    • G06F17/30
    • G06F17/30622G06F17/30631Y10S707/99942
    • A data processing system includes at least one central processor for executing instructions of software programs. In addition the data processing system includes a memory containing a data structure common to the software programs. The common data structure includes a compressed index data structure. The index structure stores index entries referencing a database. The database includes multiple records, each having a unique address in the database. Each index entry includes a word entry if the index entry represents a compressed encoding of a unique portion of information sequentially parsed from the database. The word entry is followed by one or more location entries which reference occurrences of the portions of information. Each index entry includes a metaword entry if the index entry represents a unique attribute of one or more related words. The metaword entry is followed by one or more location entries referencing occurrences of the attributes. The index entries are sequentially organized according to a collating order of the words and metawords, and the location entries organized according to their collating order. The memory also stores a summary data structure including summary entries generated by periodically sampling the location entries. Each summary entry includes an encoding of the word entry associated with the sampled location entry, an encoding of the sampled location entry, and a pointer to a next location entry following the sampled entry.
    • 数据处理系统包括用于执行软件程序指令的至少一个中央处理器。 此外,数据处理系统包括包含软件程序共同的数据结构的存储器。 公共数据结构包括压缩索引数据结构。 索引结构存储引用数据库的索引条目。 数据库包括多个记录,每个记录在数据库中都有唯一的地址。 如果索引条目表示从数据库顺序解析的唯一部分信息的压缩编码,则每个索引条目包括单词条目。 词条后面是一个或多个引用信息部分的位置条目。 如果索引条目表示一个或多个相关词的唯一属性,则每个索引条目包括元词条目。 元词条目后跟一个或多个引用属性出现的位置条目。 索引条目根据单词和元词汇的整理顺序顺序组织,并且根据其对照顺序组织的位置条目。 存储器还存储总结数据结构,包括通过周期性地对位置条目进行采样而生成的摘要条目。 每个摘要条目包括与采样的位置条目相关联的词条目的编码,采样的位置条目的编码,以及指向采样条目之后的下一个位置条目的指针。
    • 75. 发明授权
    • Connectivity server for locating linkage information between Web pages
    • 用于在网页之间查找链接信息的连接服务器
    • US6073135A
    • 2000-06-06
    • US37350
    • 1998-03-10
    • Andrei Z. BroderMichael BurrowsMonika H. HenzingerSanjay GhemawatPuneet KumarSuresh Venkatasubramanian
    • Andrei Z. BroderMichael BurrowsMonika H. HenzingerSanjay GhemawatPuneet KumarSuresh Venkatasubramanian
    • G06F17/30
    • G06F17/30882G06F17/30873Y10S707/99932Y10S707/99933Y10S707/99937
    • A server computer is provided for representing and navigating the connectivity of Web pages. The Web pages include links to other Web pages. The links and Web page s have associated names (URLs). The names of the Web pages are sorted in a memory of the connectivity server. The sorted names are delta encoded while periodically storing full names as checkpoints in the memory. Each delta encoded name and checkpoint has a unique identification. A list of pairs of identifications representing existent links is sorted twice, first according to the first identification of each pair to produce an inlist, and second according to the second identification of each pair to produce an outlist. An array of elements is stored in the memory, there is one array element for each Web page. Each element includes a first pointer to one of the checkpoints, a second pointer to an associated inlist of the Web page, and a third pointer to an associated outlist of the Web page. The array is indexed by a particular identification to locate connected Web pages.
    • 提供服务器计算机用于表示和浏览网页的连接。 网页包含指向其他网页的链接。 链接和网页都有相关联的名称(URL)。 网页的名称在连接服务器的内存中排序。 排序的名称是增量编码的,同时周期性地将全名作为检查点存储在内存中。 每个delta编码的名称和检查点都有唯一的标识。 代表存在的链接的标识对的列表被分类两次,首先根据每对的第一个标识来产生一个列表,其次是根据每一对的第二个标识来产生一个列表。 元素数组存储在内存中,每个网页有一个数组元素。 每个元素包括指向其中一个检查点的第一指针,指向该网页的相关联列表的第二指针,以及指向该网页的相关联的列表的第三指针。 该阵列由特定的标识索引,以定位连接的网页。
    • 78. 发明授权
    • Method for encoding and decoding a list of variable size integers to
reduce branch mispredicts
    • 用于对可变大小整数列表进行编码和解码以减少分支错误预测的方法
    • US6005503A
    • 1999-12-21
    • US31882
    • 1998-02-27
    • Michael Burrows
    • Michael Burrows
    • H03M7/30H03M7/40
    • H03M7/3084H03M7/40
    • In a computer implemented method, a list of variable size integers is encoded in a memory. Each variable size integer is expressed as a set of a minimum number of bytes. A fixed number the bytes of the sets are grouped with an associated bit map into a logical memory word unit. Each bit map has one continuation bit for each of the fixed number of bytes. Each continuation bit indicating whether or not a particular variable size integer continues into a following byte. An entry is stored in an array for each possible pattern of continuation bits of the bit maps. Each entry including a plurality of fields. There is one field for each of the fixed number of bytes in each group. Each field stories a length of a corresponding set of bytes expressing a particular variable size integer in the group. The entries provide a decoding table that is indexable by the bit maps to recover the list of variable size integers.
    • 在计算机实现的方法中,可变大小整数的列表被编码在存储器中。 每个可变大小的整数表示为一组最小字节数。 固定数量的集合的字节与关联的位图分组成逻辑存储器字单元。 每个位图每个固定数量的字节都有一个连续位。 每个连续位指示特定可变大小整数是否继续进入后续字节。 对于位图的连续位的每个可能的模式,将条目存储在数组中。 每个条目包括多个字段。 每组中每个固定数量的字节有一个字段。 每个字段描述在组中表示特定可变大小整数的相应字节集合的长度。 这些条目提供可由位图索引的解码表,以恢复可变大小整数的列表。
    • 79. 发明授权
    • Method and apparatus for generating and searching range-based index of
word locations
    • 用于生成和搜索基于范围的单词位置索引的方法和装置
    • US5915251A
    • 1999-06-22
    • US54445
    • 1998-04-03
    • Michael BurrowsAndrew L. Hisgen
    • Michael BurrowsAndrew L. Hisgen
    • G06F17/30
    • G06F17/30622G06F17/30722Y10S707/99932Y10S707/99933Y10S707/99935Y10S707/99943
    • A method indexes a database so that records including literal values, and numeric values within a specified range can be located. The range is selected from an interval of values. Each record of the database is indexed by storing index entries in a memory. Each index entry includes a literal word entry representing a unique portion of information of the database, and one or more numeric location entries indicating occurrences of the portion of information. A plurality of sets of subintervals are generated from the numeric interval of values. Each set of subintervals includes all of the values of the interval. A first set includes one subinterval for each value of the interval, and each next set includes one interval for a group of adjacent subintervals of a previous set. The subinterval of the last set is the entire interval of numeric values. Each subinterval is represented by a unique literal which can be indexed as a word entry in the index. While indexing the records, some portions of information are identified as range-based values. The range-based values are indexed by storing word and location entries in the index such that there is one word entry for each subinterval which includes the range-based values, the locations indicating occurrences of the range-based portions of information.
    • 方法对数据库进行索引,以便可以找到包含文字值和指定范围内的数值的记录。 范围从值的间隔中选择。 通过将索引条目存储在内存中来索引数据库的每个记录。 每个索引条目包括表示数据库的唯一信息部分的文字字条目,以及指示该部分信息的出现的一个或多个数字位置条目。 从值的数值间隔生成多组子区间。 每组子区间包括间隔的所有值。 第一组包括对于间隔的每个值的一个子间隔,并且每个下一组包括对于先前集合的一组相邻子间隔的一个间隔。 最后一组的子间隔是数值的整个间隔。 每个子间隔由独特的文字表示,可以在索引中作为单词条目进行索引。 在索引记录时,信息的一些部分被标识为基于范围的值。 通过在索引中存储单词和位置条目来索引基于范围的值,使得对于每个子区间存在包含基于范围的值的一个单词条目,该位置指示基于范围的信息部分的出现。
    • 80. 发明授权
    • Method for encoding delta values
    • 编码增量值的方法
    • US5914679A
    • 1999-06-22
    • US032826
    • 1998-02-27
    • Michael Burrows
    • Michael Burrows
    • H03M7/30H03M7/40H03M3/00
    • H03M7/30H03M7/40
    • In a computer implemented method for encoding digital values that are arranged in a successively increasing order, a delta value is determined for each pair of immediately successive values. The delta values are the differences between the pair of immediately successive values. For each delta value which can be encoded as a single byte, a logical zero is stored in the least significant bit of the single byte, and the delta value is stored in the most significant bits of the single byte. Otherwise, for each delta value which must be encoded as a plurality of bytes, a logical one is stored in the least significant bit of the first byte of the plurality of bytes, and a first portion of the delta value is stored in the most significant bits of the first byte. In this case, a logical zero is stored in the most significant bit of the next byte, and a next portion of the delta value is stored in the least significant bits of the next byte. If the next portion is the last portion of the delta value, a logical zero is stored in the most significant bit of the last byte, and the last portion of the delta value is stored in the least significant bits of the last byte.
    • 在用于编码按顺序排列的数字值的计算机实现的方法中,对于每对直接连续的值确定增量值。 增量值是两对立即连续值之间的差异。 对于可以编码为单个字节的每个增量值,逻辑零存储在单个字节的最低有效位中,增量值存储在单个字节的最高有效位中。 否则,对于必须被编码为多个字节的每个增量值,逻辑1被存储在多个字节的第一个字节的最低有效位中,增量值的第一部分被存储在最高有效位 第一个字节的位。 在这种情况下,逻辑零存储在下一个字节的最高有效位中,增量值的下一部分被存储在下一个字节的最低有效位中。 如果下一部分是增量值的最后一部分,则逻辑零存储在最后一个字节的最高有效位中,增量值的最后一部分存储在最后一个字节的最低有效位中。