会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 33. 发明申请
    • LEVERAGING CROSS-DOCUMENT CONTEXT TO LABEL ENTITY
    • 将交叉文档引向标签实体
    • US20090282012A1
    • 2009-11-12
    • US12114824
    • 2008-05-05
    • Arnd Christian KonigVenkatesh Ganti
    • Arnd Christian KonigVenkatesh Ganti
    • G06F7/06G06F17/30
    • G06F17/278G06F17/2785Y10S707/962
    • Entities, such as people, places and things, are labeled based on information collected across a possibly large number of documents. One or more documents are scanned to recognize the entities, and features are extracted from the context in which those entities occur in the documents. Observed entity-feature pairs are stored either in an in-memory store or an external store. A store manager optimizes use of the limited amount of space for an in-memory store by determining which store to put an entity-feature pair in, and when to evict features from the in-memory store to make room for new pairs. Feature that may be observed in an entity's context may take forms such as specific word sequences or membership in a particular list.
    • 诸如人物,地点和事物等实体根据可能大量文件收集的信息进行标注。 扫描一个或多个文档以识别实体,并且从文档中出现这些实体的上下文提取特征。 观察到的实体特征对存储在内存存储或外部存储中。 存储管理器通过确定哪个存储放置实体特征对,以及何时从存储器内存存储器中删除特征以为新的对腾出空间来优化对存储器存储器中的有限数量的空间的使用。 可能在实体的上下文中观察到的特征可以采取诸如特定单词序列或特定列表中的成员资格的形式。
    • 34. 发明申请
    • DATA PROFILE COMPUTATION
    • 数据配置文件计算
    • US20090006392A1
    • 2009-01-01
    • US11769050
    • 2007-06-27
    • Zhimin ChenVenkatesh GantiGunjan JhaShriraghav KaushikVivek Narasayya
    • Zhimin ChenVenkatesh GantiGunjan JhaShriraghav KaushikVivek Narasayya
    • G06F7/06G06F17/30
    • G06F17/30536
    • Architecture that provides a data profile computation technique which employs key profile computation and data pattern profile computation. Key profile computation in a data table includes both exact keys as well as approximate keys, and is based on key strengths. A key strength of 100% is an exact key, and any other percentage in an approximate key. The key strength is estimated based on the number of table rows that have duplicated attribute values. Only column sets that exceed a threshold value are returned. Pattern profiling identifies a small set of regular expression patterns which best describe the patterns within a given set of attribute values. Pattern profiling includes three phases: a first phases for determining token regular expressions, a second phase for determining candidate regular expressions, and a third phase for identifying the best regular expressions of the candidates that match the attribute values.
    • 提供采用关键轮廓计算和数据模式轮廓计算的数据轮廓计算技术的架构。 数据表中的关键轮廓计算包括精密键和近似键,并且基于关键优点。 100%的关键优势是一个确切的关​​键,其中一个关键的任何其他百分比。 基于具有重复的属性值的表行的数量来估计关键强度。 只返回超过阈值的列集。 模式分析标识一组最佳描述一组给定属性值中的模式的正则表达式模式。 模式分析包括三个阶段:用于确定令牌正则表达式的第一阶段,用于确定候选正则表达式的第二阶段,以及用于识别与属性值匹配的候选的最佳正则表达式的第三阶段。
    • 36. 发明申请
    • Segmentation of strings into structured records
    • 将字符串分割成结构化记录
    • US20050234906A1
    • 2005-10-20
    • US10825488
    • 2004-04-14
    • Venkatesh GantiTheodore VassilakisYevgeny Agichtein
    • Venkatesh GantiTheodore VassilakisYevgeny Agichtein
    • G06F7/00G06F17/30
    • G06F17/30569Y10S707/99933Y10S707/99935
    • An system for segmenting strings into component parts for use with a database management system. A reference table of string records are segmented into multiple substrings corresponding to database attributes. The substrings within an attribute are analyzed to provide a state model that assumes a beginning, a middle and an ending token topology for that attribute. A null token takes into account an empty attribute component and copying of states allows for erroneous token insertions and misordering. Once the model is created from the clean data, the process breaks or parses an input record into a sequence of tokens. The process then determines a most probable segmentation of the input record by comparing the tokens of the input record with a state models derived for attributes from the reference table.
    • 用于将字符串分割成用于数据库管理系统的组件的系统。 字符串记录的引用表被分割成与数据库属性对应的多个子字符串。 分析属性中的子串以提供假定该属性的开始,中间和结束令牌拓扑的状态模型。 空标记考虑了空属性组件,状态复制允许错误的标记插入和错误。 一旦从干净的数据创建了模型,该过程会将输入记录分解或解析成令牌序列。 该过程然后通过将输入记录的令牌与从参考表导出的属性的状态模型进行比较来确定输入记录的最可能的分割。
    • 39. 发明授权
    • Assisted query formation, validation, and result previewing in a database having a complex schema
    • 在具有复杂模式的数据库中辅助查询形成,验证和结果预览
    • US08965915B2
    • 2015-02-24
    • US14058189
    • 2013-10-18
    • Venkatesh GantiAaron KalbFeng NiuSatyen Sangani
    • Venkatesh GantiAaron KalbFeng NiuSatyen Sangani
    • G06F17/30
    • G06F17/30554G06F17/30292G06F17/30309G06F17/30646
    • Disclosed are a method, a device and/or a system of assisted query formation, validation, and result previewing in a database having a complex schema. In one aspect, a method of a query editor includes generating a data profile which includes a set of characteristics captured at various granularities of an initial result set generated from an initial query using a processor and a memory. The method determines what a user expects in the initial result set of the initial query and/or a subsequent result set of a subsequent query based on the data profile and/or a heuristically estimated data profile. The method includes enabling the user to evaluate a semantic accuracy of the subsequent query based on the likely expectation of the user as determined through the set of characteristics of the data profile. The set of characteristics may include metadata of the initial query.
    • 公开了具有复杂模式的数据库中的辅助查询形成,验证和结果预览的方法,设备和/或系统。 一方面,一种查询编辑器的方法包括生成数据简档,其包括使用处理器和存储器从初始查询生成的初始结果集的各种粒度捕获的一组特征。 该方法基于数​​据简档和/或启发式估计的数据简档确定初始查询的初始结果集中的用户期望值和/或后续查询的后续结果集。 该方法包括使得用户能够基于通过数据简档的特征集确定的用户的可能期望来评估后续查询的语义准确性。 该特征集可以包括初始查询的元数据。
    • 40. 发明授权
    • Curated answers community automatically populated through user query monitoring
    • 策略响应社区通过用户查询监控自动填充
    • US08935272B2
    • 2015-01-13
    • US14058206
    • 2013-10-18
    • Venkatesh GantiAaron KalbFeng NiuSatyen Sangani
    • Venkatesh GantiAaron KalbFeng NiuSatyen Sangani
    • G06F7/00G06F17/30
    • G06F17/30554G06F17/30292G06F17/30309G06F17/30646
    • In one embodiment, a method of a curated answers system includes automatically populating a profile markup page of a user with information describing an initial query of a database that the user has generated using a processor and a memory, determining that another user of the database has submitted a similar query that is semantically proximate to the initial query of the database that the user has generated, and presenting the profile markup page of the user to the other user. The method of the curated answers system may include enabling the other user to communicate with the user through a communication channel on the profile markup page. A question of the other user may be published to the user on the profile markup page of the user, and/or other profile markup page of the other user. The question may be associated as being posted by the other user.
    • 在一个实施例中,策展答案系统的方法包括使用描述用户使用处理器和存储器生成的数据库的初始查询的信息自动填充用户的简档标记页面,确定数据库的另一用户具有 提交了与用户已经生成的数据库的初始查询语义上接近的类似查询,并将用户的简档标记页面呈现给另一个用户。 策展答案系统的方法可以包括允许其他用户通过简档标记页面上的通信通道与用户通信。 可以在用户的​​简档标记页面和/或其他用户的其他简档标记页面上向用户发布另一用户的问题。 该问题可能会被另一个用户发布。