
基本信息:
- 专利标题: 一种基于可读性指标的信息检索方法
- 专利标题(英):Readability indicator based information retrieval method
- 申请号:CN201510976829.X 申请日:2015-12-21
- 公开(公告)号:CN105630940A 公开(公告)日:2016-06-01
- 发明人: 张程 , 宋大为 , 张鹏 , 王博 , 张文雅
- 申请人: 天津大学
- 申请人地址: 天津市南开区卫津路92号
- 专利权人: 天津大学
- 当前专利权人: 天津大学
- 当前专利权人地址: 天津市南开区卫津路92号
- 代理机构: 天津市北洋有限责任专利代理事务所
- 代理人: 李丽萍
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
The present invention discloses a readability indicator based information retrieval method. The method comprises: in a searching process by using a search engine, sorting documents, which meet a search condition, according to a relevance between the documents and a query keyword; and organizing the documents that meet the search condition, a relevance sorting and a readability score into a page and returning the page to a user, wherein a text readability score equals to M*(N*average Chinese stroke number+(1-N)*difficult Chinese word frequency)+(1-M)*(P*average English character number+(1-P)*difficult English word frequency), M adjusts a weight proportion of Chinese and English readability, N adjusts weight proportions between an average Chinese stroke number indicator and a difficult Chinese word frequency indicator, and P adjusts a weight proportion between an average English character number indicator and a difficult English word frequency indicator. According to the method disclosed by the present invention, the readability score of the document is returned after retrieval, so that the user can conveniently and rapidly extract a relatively readable portion from the documents with a relatively high relevance, so that retrieval efficiency is improved.
公开/授权文献:
- CN105630940B 一种基于可读性指标的信息检索方法 公开/授权日:2019-03-22
IPC结构图谱:
G | 物理 |
--G06 | 计算;推算;计数 |
----G06F | 电数字数据处理 |
------G06F17/00 | 特别适用于特定功能的数字计算设备或数据处理设备或数据处理方法 |
--------G06F17/30 | .信息检索;及其数据库结构 |