
基本信息:
- 专利标题: SYSTEMS AND METHODS FOR DETERMINING DOCUMENT SECTION TYPES
- 申请号:US17160712 申请日:2021-01-28
- 公开(公告)号:US20220237210A1 公开(公告)日:2022-07-28
- 发明人: Deya Banisakher , Naphtali Rishe , Mark Finlayson
- 申请人: Deya Banisakher , Naphtali Rishe , Mark Finlayson
- 申请人地址: US FL Miami; US FL Miami; US FL North Bay Village
- 专利权人: Deya Banisakher,Naphtali Rishe,Mark Finlayson
- 当前专利权人: Deya Banisakher,Naphtali Rishe,Mark Finlayson
- 当前专利权人地址: US FL Miami; US FL Miami; US FL North Bay Village
- 主分类号: G06F16/28
- IPC分类号: G06F16/28 ; G06F17/18 ; G06F7/14 ; G06N20/00 ; G16H10/60 ; G16H15/00
摘要:
Systems and methods for discovering and/or determining section types for a given document class in a data-driven manner are provided. A modified Bayesian model merging algorithm can be used, along with extending an Analogical Story Merging (ASM) algorithm. The systems and methods can learn the section structure of documents without a pre-existing ontology of sections or time-intensive annotation efforts.
公开/授权文献:
- US11494418B2 Systems and methods for determining document section types 公开/授权日:2022-11-08
IPC结构图谱:
G | 物理 |
--G06 | 计算;推算;计数 |
----G06F | 电数字数据处理 |
------G06F16/00 | 信息检索;数据库结构;文件系统结构 |
--------G06F16/10 | .文件系统;文件服务器 |
----------G06F16/28 | ..以数据库模型为特征的数据库,例如,关系或对象模型 |