![一种基于随机森林模型的输变电可疑数据筛查方法和设备](/CN/2019/1/58/images/201910293642.jpg)
基本信息:
- 专利标题: 一种基于随机森林模型的输变电可疑数据筛查方法和设备
- 专利标题(英):Power transmission and transformation suspicious data screening method and device based on random forest model
- 申请号:CN201910293642.8 申请日:2019-04-12
- 公开(公告)号:CN110110757A 公开(公告)日:2019-08-09
- 发明人: 高尚 , 李慧辉 , 翟明玉 , 孙世明 , 陈玉慧 , 许寒阳 , 陈宁 , 季堃 , 马洁 , 唐元合
- 申请人: 国电南瑞科技股份有限公司 , 国家电网有限公司 , 国网江苏省电力有限公司 , 国电南瑞南京控制系统有限公司 , 南瑞集团有限公司
- 申请人地址: 江苏省南京市江宁区诚信大道19号
- 专利权人: 国电南瑞科技股份有限公司,国家电网有限公司,国网江苏省电力有限公司,国电南瑞南京控制系统有限公司,南瑞集团有限公司
- 当前专利权人: 国电南瑞科技股份有限公司,国家电网有限公司,国网江苏省电力有限公司,国电南瑞南京控制系统有限公司,南瑞集团有限公司
- 当前专利权人地址: 江苏省南京市江宁区诚信大道19号
- 代理机构: 南京苏高专利商标事务所
- 代理人: 李淑静
- 主分类号: G06K9/62
- IPC分类号: G06K9/62 ; G06Q50/06
The invention provides a power transmission and transformation suspicious data screening method and device based on a random forest model, and the method comprises the steps: S1, selecting data of multiple dimensions according to the category and periodicity rule of power transmission and transformation equipment, and building a data feature item; S2, distributing different weights for the data according to the sampling time, respectively marking known normal data and abnormal data as positive and negative samples, and dividing a data set into K parts; S3, training a random forest model by adopting a K-fold cross validation method, iteratively adjusting the number T of trees in the random forest by taking an average value of positive and negative sample accuracy as a target, and obtaininga value of T when an index is optimal; and S4, screening suspicious data by using the trained model. Power transmission and transformation equipment is used as an object to construct a suspicious datascreening object, a random forest model of an optimization training set is used for learning data rules from a large amount of historical sampling data, power transmission and transformation suspicious data identification and screening are achieved, the workload of manual screening is reduced, and the data quality of an electric power regulation and control system is improved.
公开/授权文献:
- CN110110757B 一种基于随机森林模型的输变电可疑数据筛查方法和设备 公开/授权日:2021-02-05
IPC结构图谱:
G | 物理 |
--G06 | 计算;推算;计数 |
----G06K | 数据识别;数据表示;记录载体;记录载体的处理 |
------G06K9/00 | 用于阅读或识别印刷或书写字符或者用于识别图形,例如,指纹的方法或装置 |
--------G06K9/62 | .应用电子设备进行识别的方法或装置 |