会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Segmentation of strings into structured records
    • 将字符串分割成结构化记录
    • US07627567B2
    • 2009-12-01
    • US10825488
    • 2004-04-14
    • Venkatesh GantiVassilakis TheodoreYevgeny Agichtein
    • Venkatesh GantiVassilakis TheodoreYevgeny Agichtein
    • G06F17/30
    • G06F17/30569Y10S707/99933Y10S707/99935
    • An system for segmenting strings into component parts for use with a database management system. A reference table of string records are segmented into multiple substrings corresponding to database attributes. The substrings within an attribute are analyzed to provide a state model that assumes a beginning, a middle and an ending token topology for that attribute. A null token takes into account an empty attribute component and copying of states allows for erroneous token insertions and misordering. Once the model is created from the clean data, the process breaks or parses an input record into a sequence of tokens. The process then determines a most probable segmentation of the input record by comparing the tokens of the input record with a state models derived for attributes from the reference table.
    • 用于将字符串分割成用于数据库管理系统的组件的系统。 字符串记录的引用表被分割成与数据库属性对应的多个子字符串。 分析属性中的子串以提供假定该属性的开始,中间和结束令牌拓扑的状态模型。 空标记考虑了空属性组件,状态复制允许错误的标记插入和错误。 一旦从干净的数据创建了模型,该过程会将输入记录分解或解析成一系列令牌。 该过程然后通过将输入记录的令牌与从参考表导出的属性的状态模型进行比较来确定输入记录的最可能的分割。