会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 4. 发明申请
    • Unstructured data in a mining model language
    • 挖掘模型语言中的非结构化数据
    • US20070214164A1
    • 2007-09-13
    • US11373319
    • 2006-03-10
    • C. MacLennanIoan CrivatZhaoHui TangRaman Iyer
    • C. MacLennanIoan CrivatZhaoHui TangRaman Iyer
    • G06F7/00
    • G06F17/30943G06F2216/03Y10S707/99933
    • A standard mechanism for directly accessing unstructured data types (e.g., image, audio, video, gene sequencing and text data) in accordance with data mining operations is provided. The subject innovation can enable access to unstructured data directly from within the data mining engine or tool. Accordingly, the innovation enables multiple vendors to provide algorithms for mining unstructured data on a data mining platform (e.g., an SQL-brand server), thereby increasing adoption. As well, the subject innovation allows users to directly mine unstructured data that is not fixed-length, without pre-processing and tokenizing the data external to the data mining engine. In accordance therewith, the innovation can provide a mechanism to expand declarative language content types to include an “unstructured” data type thereby enabling a user and/or application to affirmatively designate mining data as an unstructured type.
    • 提供了一种用于根据数据挖掘操​​作直接访问非结构化数据类型(例如,图像,音频,视频,基因排序和文本数据)的标准机制。 主题创新可以直接从数据挖掘引擎或工具中访问非结构化数据。 因此,该创新使得多个供应商能够提供用于在数据挖掘平台(例如,SQL品牌服务器)上挖掘非结构化数据的算法,从而增加采用。 此外,本创新允许用户直接挖掘不固定长度的非结构化数据,而不需要对数据挖掘引擎外部的数据进行预处理和标记。 根据此,创新可以提供一种机制来扩展声明性语言内容类型以包括“非结构化”数据类型,从而使得用户和/或应用程序肯定地将挖掘数据指定为非结构化类型。
    • 5. 发明申请
    • Automatic training of data mining models
    • 数据挖掘模型的自动训练
    • US20070220034A1
    • 2007-09-20
    • US11377024
    • 2006-03-16
    • Raman IyerC. MacLennanIoan Crivat
    • Raman IyerC. MacLennanIoan Crivat
    • G06F7/00
    • G06F16/2465
    • A realtime training model update architecture for data mining models. The architecture facilitates automatic update processes with respect to evolving source/training data. Additionally, model update training can be performed at times other than in realtime. Scheduling can be invoked, for periodic and incremental updates, and refresh intervals applied through the training parameters for the mining structure and/or model. Training can also be triggered by user-defined events such as database notifications, and/or alerts from other operational systems. In support thereof, a data mining model component is provided for training a data mining model on a dataset in realtime, and an update component for incrementally training the data mining model according to predetermined criteria. Additionally, model versioning and version comparison can be employed to detect significant changes and retain updated models. Training data aging/weighting of training data can be applied.
    • 数据挖掘模型的实时训练模型更新架构。 该架构有助于针对不断变化的源/训练数据的自动更新过程。 此外,模型更新培训可以在实时以外的时间进行。 可以调用计划,用于定期和增量更新,以及通过采矿结构和/或模型的训练参数应用的刷新间隔。 培训也可以由用户定义的事件(例如数据库通知)和/或来自其他操作系统的警报触发。 为了支持这一点,提供了一种数据挖掘模型组件,用于实时地对数据集上的数据挖掘模型进行训练;以及更新部件,用于根据预定标准逐步地训练数据挖掘模型。 此外,可以使用模型版本控制和版本比较来检测重大变化并保留更新的模型。 训练数据的老化/加权训练数据可以被应用。
    • 6. 发明申请
    • Using a rowset as a query parameter
    • 使用行集作为查询参数
    • US20060010112A1
    • 2006-01-12
    • US11069121
    • 2005-02-28
    • Ioan CrivatC. MacLennanRaman IyerMarius Dumitru
    • Ioan CrivatC. MacLennanRaman IyerMarius Dumitru
    • G06F17/30
    • G06F17/30539G06F17/30421G06F17/30595Y10S707/99932Y10S707/99934Y10S707/99944
    • Architecture that facilitates syntax processing for data mining statements. The system includes a syntax engine that receives as an input a query statement which, for example, is a data mining request. The statement can be generated from many different sources, e.g., a client application and/or a server application, and requests query processing of a data source (e.g., a relational database) to return a result set. The syntax engine includes a binding component that converts the query statement into an encapsulated statement in accordance with a predefined grammar. The encapsulated statement includes both data and data operations to be performed on the data of the data source, and which is understood by the data source. An execution component processes the encapsulated statement against the data source to return the desired result set.
    • 促进数据挖掘语句的语法处理的架构。 该系统包括语法引擎,其作为输入接收诸如数据挖掘请求的查询语句。 语句可以从许多不同的来源(例如客户端应用程序和/或服务器应用程序)生成,并且请求数据源(例如,关系数据库)的查询处理以返回结果集。 语法引擎包括一个绑定组件,它根据预定义的语法将查询语句转换成封装语句。 封装语句包括要对数据源的数据执行的数据和数据操作,数据源可以理解。 执行组件根据数据源处理封装语句以返回所需的结果集。
    • 7. 发明申请
    • Partitioning of data mining training set
    • 数据挖掘训练集分区
    • US20070214135A1
    • 2007-09-13
    • US11371477
    • 2006-03-09
    • Ioan CrivatRaman IyerC. MdcLennan
    • Ioan CrivatRaman IyerC. MdcLennan
    • G06F17/30
    • G06F17/30539
    • A system that effectuates fetching a complete set of relational data into a mining services server and subsequently defining desired partitions upon the fetched data is provided. In accordance with the innovation, the data can be locally cached and partitioned therefrom. Accordingly, upon the same mining structure (e.g., cache) that has been partitioned, the novel innovation can build mining models for each partition. In other words, the innovation can employ the concept of mining structure as a data cache while manipulating only partitions of this cache in certain operations. The innovation can be employed in scenarios where a user wants to train a mining model using only data points that satisfy a particular Boolean condition, a user wants to split the training set into multiple partitions (e.g., training/testing) and/or a user wants to perform a data mining procedure known as “N-fold cross validation.”
    • 提供了一种能够将完整的关系数据集提取到采矿服务服务器中并随后在获取的数据上定义所需分区的系统。 根据创新,数据可以被本地缓存并从中分割。 因此,在已经被划分的相同挖掘结构(例如,高速缓存)上,新颖的创新可以为每个分区建立挖掘模型。 换句话说,创新可以采用挖掘结构的概念作为数据高速缓存,同时在某些操作中仅操纵该高速缓存的分区。 该创新可以在用户想要仅使用满足特定布尔条件的数据点来训练挖掘模型的情况下使用,用户希望将训练集合分成多个分区(例如,训练/测试)和/或用户 想要执行称为“N-fold交叉验证”的数据挖掘过程。
    • 8. 发明申请
    • Analyzing mining pattern evolutions using a data mining algorithm
    • 使用数据挖掘算法分析挖掘模式演化
    • US20070219990A1
    • 2007-09-20
    • US11376993
    • 2006-03-16
    • Ioan CrivatElena CristoforC. MacLennan
    • Ioan CrivatElena CristoforC. MacLennan
    • G06F17/30
    • G06F17/30592
    • Architecture for analyzing pattern shifts in data patterns of data mining models and outputting the results. This allows comparing and describing differences between two semantically similar sets of patterns (or mining models), and for analyzing historical changes in versions of the same model or differences in patterns found by two or more different algorithms applied to the same data. The architecture can also facilitate explaining data patterns that shift over time and over different data populations, and between versions of the same model that use different algorithms. A model component is employed for storing data mining models have respective sets of data patterns obtained from a dataset, and an analysis component analyzes the sets of the data patterns for difference data therebetween. The dataset can be a subsample of a larger set of data and can be analyzed by the analysis component over a time period.
    • 用于分析数据挖掘模型的数据模式的模式转换并输出结果的架构。 这允许比较和描述两种语义上相似的模式集合(或挖掘模型)之间的差异,并且用于分析相同模型的版本中的历史变化或由应用于相同数据的两个或多个不同算法所发现的模式差异。 该架构还可以促进解释随时间和不同数据群体以及使用不同算法的同一模型版本之间的数据模式。 使用模型组件来存储数据挖掘模型,其具有从数据集获得的相应数据模式集合,并且分析组件分析其间的差分数据的数据模式集合。 数据集可以是较大数据集的子样本,可以在一段时间内由分析组件进行分析。