![一种基于远程监督的同义词提取方法](/CN/2018/1/310/images/201811554878.jpg)
基本信息:
- 专利标题: 一种基于远程监督的同义词提取方法
- 申请号:CN201811554878.4 申请日:2018-12-19
- 公开(公告)号:CN109740149B 公开(公告)日:2019-12-13
- 发明人: 张涛 , 刘前卫 , 盛兴 , 聂庆 , 谢秋学 , 贺芳 , 雍志娟 , 孙金 , 吴培培 , 常秀 , 张楠 , 商莹楠 , 滕家雨 , 赵生传 , 张婷婷 , 田书然
- 申请人: 英大传媒投资集团有限公司 , 国家电网有限公司 , 南瑞集团有限公司 , 国网山东省电力公司烟台供电公司
- 申请人地址: 北京市东城区北京站西街19号
- 专利权人: 英大传媒投资集团有限公司,国家电网有限公司,南瑞集团有限公司,国网山东省电力公司烟台供电公司
- 当前专利权人: 英大传媒投资集团有限公司,国家电网有限公司,南瑞集团有限公司,国网山东省电力公司烟台供电公司
- 当前专利权人地址: 北京市东城区北京站西街19号
- 代理机构: 南京苏高专利商标事务所
- 代理人: 李淑静
- 优先权: 201811511588.1 2018.12.11 CN
- 主分类号: G06F17/27
- IPC分类号: G06F17/27
The invention discloses a synonym extraction method based on remote supervision, and belongs to the technical field of natural language processing. The method comprises the following steps: establishing a vocabulary syntax mode model of synonyms in the field; constructing a remote supervision neural network learning model based on LSTM and CRF, and training by using domain entries to obtain a sentence sequence annotation set discovered by synonyms; and according to the annotation set, annotating and pairing candidate entities in statements in the corpus, and extracting the entities to obtain synonyms. According to the method, the corresponding vocabularies of the domain synonyms are combined by utilizing the entry characteristics based on the encyclopedic knowledge base; In the syntax mode, domain synonyms are obtained through remote supervised learning and machine autonomous learning, and the method takes machine processing as a main part and manual processing as an auxiliary part, sothat the efficiency of obtaining the synonyms is improved, and the labor cost is greatly reduced under the condition that the precision is not reduced. New words can be found through regular entry learning of online encyclopedia and analysis of hidden synonyms.
公开/授权文献:
- CN109740149A 一种基于远程监督的同义词提取方法 公开/授权日:2019-05-10