![一种为半结构化数据构建NoSQL数据库索引的方法及装置](/CN/2014/1/5/images/201410025080.jpg)
基本信息:
- 专利标题: 一种为半结构化数据构建NoSQL数据库索引的方法及装置
- 专利标题(英):Method and device for establishing NoSQL database index for semi-structured data
- 申请号:CN201410025080.6 申请日:2014-01-20
- 公开(公告)号:CN104794123A 公开(公告)日:2015-07-22
- 发明人: 周琦 , 孙廷韬 , 蔡华 , 林豪
- 申请人: 阿里巴巴集团控股有限公司
- 申请人地址: 英属开曼群岛大开曼资本大厦一座四层847号邮箱
- 专利权人: 阿里巴巴集团控股有限公司
- 当前专利权人: 阿里巴巴集团控股有限公司
- 当前专利权人地址: 英属开曼群岛大开曼资本大厦一座四层847号邮箱
- 代理机构: 北京市清华源律师事务所
- 代理人: 沈泳; 李赞坚
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
The invention provides a method for establishing an NoSQL database index for semi-structured data. The method comprises the steps of preprocessing semi-structured source data; storing text segments obtained after preprocessing in a data sheet, wherein the data sheet comprises a first major key assembly including a structured thread major key and a sequential value major key, a structured thread is divided into multiple continuous intervals according to a determined sequence, and a specific key value is allocated to each interval to serve as the key values of the structured thread major key; establishing a reverse index table for the text segments obtained after preprocessing, wherein the reverse index table comprises a second major key assembly including a structured thread major key and a keyword major key, and relevant text segment sequence tags are recorded to serve as index values corresponding to the key values of the major keys. The index values with the same keyword major key and different structured thread major key values are located on different lines in the reverse index table. In this way, database index query efficiency is improved, and updating is facilitated. Furthermore, the invention provides a device for establishing an NoSQL database index for semi-structured data.
公开/授权文献:
- CN104794123B 一种为半结构化数据构建NoSQL数据库索引的方法及装置 公开/授权日:2018-07-27
IPC结构图谱:
G | 物理 |
--G06 | 计算;推算;计数 |
----G06F | 电数字数据处理 |
------G06F17/00 | 特别适用于特定功能的数字计算设备或数据处理设备或数据处理方法 |
--------G06F17/30 | .信息检索;及其数据库结构 |