![一种科技新闻的增量学习多层次二分类方法](/CN/2015/1/128/images/201510642902.jpg)
基本信息:
- 专利标题: 一种科技新闻的增量学习多层次二分类方法
- 专利标题(英):Incremental learning multi-level binary-classification method of scientific news
- 申请号:CN201510642902.X 申请日:2015-10-08
- 公开(公告)号:CN105205163A 公开(公告)日:2015-12-30
- 发明人: 朱全银 , 潘禄 , 刘文儒 , 李翔 , 周泓 , 胡荣林 , 丁瑾 , 金鹰 , 邵武杰 , 唐海波
- 申请人: 淮阴工学院
- 申请人地址: 江苏省淮安市高教园区枚乘东路1号
- 专利权人: 淮阴工学院
- 当前专利权人: 淮阴工学院
- 当前专利权人地址: 江苏省淮安市高教园区枚乘东路1号
- 主分类号: G06F17/30
- IPC分类号: G06F17/30
The invention discloses an incremental learning multi-level binary-classification method of scientific news. The method includes the steps that article titles, article contents and key words in the property of news are utilized in combination with a text weighted method and a text similarity calculation method under a vector space model, preprocessing and feature weighing are firstly conducted on marked information and full-text information collected in a marked news documents; an intermediate result is calculated and stored; then, the similarity between a new text and scientific news classification and non-scientific news classification is calculated through cosine similarity on the aspects of feature information and a full text, and accordingly the classification of the new text is judged. Sensitiveness to scientific and technical vocabularies through the multi-level judgment method and the incremental learning method, the number of texts of news unrelated to the scientific news can be reduced through the binary-classification method, and thus the text multi-classification accuracy is improved. The incremental learning multi-level binary-classification method of the scientific news is used for improving the use value of news information extracted from Web pages and improving the classification accuracy of the scientific news.
公开/授权文献:
- CN105205163B 一种科技新闻的增量学习多层次二分类方法 公开/授权日:2018-08-10
IPC结构图谱:
G | 物理 |
--G06 | 计算;推算;计数 |
----G06F | 电数字数据处理 |
------G06F17/00 | 特别适用于特定功能的数字计算设备或数据处理设备或数据处理方法 |
--------G06F17/30 | .信息检索;及其数据库结构 |