会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • CACHE OPTIMIZATION FOR DATA PREPARATION
    • 数据准备的缓存优化
    • WO2017065886A1
    • 2017-04-20
    • PCT/US2016/049312
    • 2016-08-29
    • PAXATA, INC.
    • BREWSTER, DaveTSO, Victor, Tze-Yeuan
    • G01R31/3183G06F9/44G06F9/45G06F11/22G06F15/00G06F17/30
    • G06F17/30345G06F17/30433G06F17/30477G06F17/30554
    • Cache optimization for data preparation includes: generating a data traversal program that represents a result of a set of sequenced data preparation operations performed on one or more sets of data, wherein the data traversal program indicates how to assemble one or more affected columns in the one or more sets of data to derive the result; in response to receiving a specification of the set of sequenced operations to be performed on the one or more sets of data, accessing the data traversal program that represents the result or a stored copy of the data traversal program that represents the result; assembling the one or more affected columns in the one or more sets of data according to the data traversal program to re-generate the result; and outputting the result.
    • 用于数据准备的高速缓存优化包括:生成数据遍历程序,该数据遍历程序表示对一个或多个数据集执行的一组有序数据准备操作的结果,其中数据遍历程序指示如何组装 一组或多组数据中的一个或多个受影响的列以得出结果; 响应于接收到对所述一个或多个数据集执行的所述一组有序操作的指定,访问表示所述结果的所述数据遍历程序或表示所述结果的所述数据遍历程序的存储副本; 根据所述数据遍历程序将所述一个或多个受影响的列组装在所述一个或多个数据集合中以重新生成所述结果; 并输出结果。
    • 2. 发明申请
    • AUTOMATIC CONTENT-BASED APPEND DETECTION
    • 基于内容的自动APPEND检测
    • WO2017160340A1
    • 2017-09-21
    • PCT/US2016/052271
    • 2016-09-16
    • PAXATA, INC.
    • BREWSTER, DavidTSO, Victor, Tze-YeuanJIN, Ashley PingTA, Quan ChuongSANKAR, Lakshman RoyKWOK, Whitman
    • G06F9/44G06F15/16G06F17/30H04L29/06
    • G06F16/2365G06F16/221
    • Automatic append includes: identifying, based at least in part on contents of a first data set comprising a first plurality of columns and contents of a second data set comprising a second plurality of columns, a plurality of matching columns and a plurality of non-matching columns. The matching columns comprise one or more columns among the first plurality of columns; and corresponding one or more matching columns among the second plurality of columns. The non- matching columns comprise: one or more columns among the first plurality of columns that do not match with any columns among the second plurality of columns; and one or more columns among the second plurality of columns that do not match with any columns among the first plurality of columns. Automatic append further includes obtaining a user specification of a first one or more non-matching columns to be appended to a second one or more non-matching columns, the first one or more non-matching columns and the second one or more non-matching columns being selected among the plurality of non-matching columns; and appending the first data set and the second data set according to at least the identified plurality of matching columns and the user specification.
    • 自动追加包括:至少部分地基于包括第一多个列的第一数据集的内容和包括第二多个列的第二数据集的内容来识别多个匹配 列和多个不匹配的列。 匹配列包括第一多个列中的一个或多个列; 以及第二多个列中的对应的一个或多个匹配列。 不匹配的列包括:第一多个列中的一个或多个列与第二多个列中的任何列不匹配; 以及第二多个列中的一个或多个列与第一多个列中的任何列不匹配。 自动附加进一步包括获得要附加到第二一个或多个非匹配列的第一个或多个不匹配列的用户规范,第一个或多个非匹配列和第二个或多个非匹配列 从多个不匹配列中选择列; 并且至少根据所识别的多个匹配列和用户规范附加第一数据集和第二数据集。
    • 3. 发明申请
    • STEP EDITOR FOR DATA PREPARATION
    • 数据准备步骤编辑器
    • WO2017065888A1
    • 2017-04-20
    • PCT/US2016/049314
    • 2016-08-29
    • PAXATA, INC.
    • BARDOLIWALLA, Nenshad, DinshawMATTHEWS, MichaelTIMOURIAN, IanCHEN, JingGUTNIK, LiliaKWOK, WhitmanBREWSTER, DaveTSO, Victor, Tze-Yeuan
    • G06F3/08G06F12/08G06F17/30
    • G06F17/30345G06F17/30457G06F17/30554
    • Using a step editor for data preparation includes: receiving an indication of a user input with respect to at least some of a set of sequenced data preparation operations on a set of data; generating, using one or more processors, a signature based at least in part on the set of sequenced data preparation operations, references to the set of data, and the user input; using the generated signature to determine whether there exists a cached result associated with the set of sequenced data preparation operations, the references to the set of data, and the user input; based at least in part on the determination, obtaining a data traversal program representing a result associated with the set of sequenced operations, the references to the set of data, and the user input; and providing output based at least in part on the result represented by the obtained data traversal program.
    • 使用用于数据准备的步骤编辑器包括:接收关于在一组数据上的一组有序数据准备操作中的至少一些的用户输入的指示; 使用一个或多个处理器至少部分地基于所述一组有序数据准备操作,对所述一组数据和所述用户输入来生成签名; 使用所生成的签名来确定是否存在与所述一组有序数据准备操作相关联的缓存结果,对所述一组数据的参考以及所述用户输入; 至少部分基于所述确定来获得表示与所述一组有序操作,所述一组数据和所述用户输入相关联的结果的数据遍历程序; 并且至少部分地基于由所获得的数据遍历程序表示的结果来提供输出。
    • 4. 发明申请
    • SIGNATURE-BASED CACHE OPTIMIZATION FOR DATA PREPARATION
    • 基于签名的数据准备缓存优化
    • WO2017065887A1
    • 2017-04-20
    • PCT/US2016/049313
    • 2016-08-29
    • PAXATA, INC.
    • BREWSTER, DaveTSO, Victor, Tze-Yeuan
    • G06F17/30G06F19/18G06F19/24G06F21/00
    • G06F17/30345G06F17/30457G06F17/30554
    • Signature-based cache optimization for data preparation includes: performing a first set of sequenced data preparation operations on one or more sets of data to generate a plurality of transformation results; caching one or more of the plurality of transformation results and one or more corresponding operation signatures, a cached operation signature being derived based at least in part on a subset of sequenced operations that generated a corresponding result; receiving a specification of a second set of sequenced operations; determining an operation signature associated with the second set of sequenced operations; identifying a cached result among the cached results based at least in part on the determined operation signature; and outputting the cached result.
    • 用于数据准备的基于签名的高速缓存优化包括:对一组或多组数据执行第一组有序数据准备操作以产生多个变换结果; 缓存所述多个变换结果和一个或多个对应的操作签名中的一个或多个,缓存的操作签名至少部分地基于生成相应结果的有序操作的子集来导出; 接收第二组有序操作的规范; 确定与第二组有序操作相关的操作签名; 至少部分地基于所确定的操作签名来标识所高速缓存的结果中的高速缓存的结果; 并输出缓存结果。