会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • System and method for classification with effective use of manual data input and crowdsourcing
    • 有效使用人工数据输入和众包的分类系统和方法
    • US09195910B2
    • 2015-11-24
    • US13868808
    • 2013-04-23
    • Wal-Mart Stores, Inc.
    • Nikesh Lucky GareraNarasimhan RampalliDintyala Venkata Subrahmanya RavikantSrikanth SubramaniamChong SunHeather Dawn Yalin
    • G06K9/62G06F17/30
    • G06K9/6263G06F17/3087
    • Systems and methods are disclosed herein for classifying records, such as product records, using a machine learning algorithm. After training a classification model according to a machine learning algorithm using an initial training set, records are classified and high confidence classifications identified. Remaining classifications are submitted to a crowdsourcing forum that validates or invalidates the classifications or marks them as to unclear to evaluate. Invalidated classifications are automatically analyzed to identify one or both of classification values and categories having a high proportion of invalidated classifications. Requests are transmitted to analysts to generate training data that is added to the training set. The process of classifying records and obtaining crowdsourced validation thereof may then repeat. High confidence classifications may be identified using an accuracy model trained to relate an accuracy percentage to a confidence score output by the classification model.
    • 本文公开的系统和方法用于使用机器学习算法对诸如产品记录的记录进行分类。 根据使用初始训练集的机器学习算法训练分类模型后,对记录进行分类并确定高置信度分类。 剩余分类被提交给众包论坛,验证或使分类无效或标记为不清楚评估。 自动分析无效分类,以识别具有高比例无效分类的分类值和类别中的一个或两个。 将请求传送给分析人员,以生成添加到训练集中的训练数据。 然后可以重复对记录进行分类和获得众包验证的过程。 可以使用训练有关精度百分比与由分类模型输出的置信分数相关联的精度模型来识别高置信度分类。
    • 4. 发明授权
    • Product record normalization system with efficient and scalable methods for discovering, validating, and using schema mappings
    • 产品记录规范化系统,用于发现,验证和使用模式映射的高效且可扩展的方法
    • US09311372B2
    • 2016-04-12
    • US13907243
    • 2013-05-31
    • Wal-Mart Stores, Inc.
    • Nikesh Lucky GareraNarasimhan RampalliDintyala Venkata Subrahmanya RavikantSrikanth SubramaniamChong SunHeather Dawn Yalin
    • G06F7/00G06F17/30
    • G06F17/30569
    • Systems and methods are disclosed herein for generating a normalized record from an import record, the normalized record having attribute-value pairs corresponding to a native schema. In import records, a plurality of attribute-value are identified each having an attribute label not found in a native schema. One or more attribute labels in the native schema having as possible values one or more values corresponding to the values of the plurality of attribute-value pairs are also identified. The computer system generates one or more normalization rules relating one or more attribute labels of the plurality of attribute-value pairs to at least a portion of the one or more attribute labels in the native schema. Normalization rules may be validated by crowdsourcing. Normalization rules may be applied by identifying implicated rules by classifying the import record and identifying rules applicable to the classification.
    • 本文公开的系统和方法用于从导入记录生成归一化记录,归一化记录具有对应于本机模式的属性值对。 在导入记录中,识别多个属性值,每个属性值都具有在本机模式中未找到的属性标签。 本地模式中的一个或多个属性标签也具有与多个属性值对对应的值相对应的一个或多个值的可能值。 计算机系统生成将多个属性值对中的一个或多个属性标签与本机模式中的一个或多个属性标签的至少一部分相关联的一个或多个规范化规则。 规范化规则可以通过众包来验证。 可以通过对导入记录进行分类和识别适用于分类的规则来识别牵连规则来应用规范化规则。
    • 5. 发明申请
    • DETERMINATION OF PRODUCT ATTRIBUTES AND VALUES USING A PRODUCT ENTITY GRAPH
    • 使用产品实体图来确定产品属性和价值
    • US20150347572A1
    • 2015-12-03
    • US14293997
    • 2014-06-02
    • Wal-Mart Stores, Inc.
    • Fan YangNarasimhan RampalliDigvijay Lamba
    • G06F17/30
    • G06F17/30958G06F17/30342G06F17/30587G06F17/30705
    • A method of determining structured product information for a product from a product description using a product entity graph. The product graph can include a plurality of nodes. Each of the plurality of nodes can include an entity value key, one or more entity names, and an entity name count for each of the one or more entity names. The method can include determining k-grams of the product description. The method also can include, for each k-gram of the product description, determining a matching node of the plurality of nodes of the product entity graph that corresponds to the k-gram and determining a derived entity name for the product from the one or more entity names of the matching node based at least in part on the entity name counts corresponding to the one or more entity names. Other embodiments of related systems and methods are also disclosed.
    • 使用产品实体图从产品描述确定产品的结构化产品信息的方法。 产品图可以包括多个节点。 多个节点中的每一个可以包括一个或多个实体名称中的每一个的实体值密钥,一个或多个实体名称和实体名称计数。 该方法可以包括确定产品描述的k-gram。 该方法还可以包括对于产品描述的每个k-gram,确定与k-gram相对应的产品实体图的多个节点中的匹配节点,并且从该 至少部分地基于对应于一个或多个实体名称的实体名称计数的匹配节点的更多实体名称。 还公开了相关系统和方法的其它实施例。
    • 6. 发明申请
    • Product Record Normalization System With Efficient And Scalable Methods For Discovering, Validating, And Using Schema Mappings
    • 用于发现,验证和使用模式映射的高效可扩展方法的产品记录归一化系统
    • US20140358931A1
    • 2014-12-04
    • US13907243
    • 2013-05-31
    • Wal-Mart Stores, Inc.
    • Nikesh Lucky GareraNarasimhan RampalliDintyala Venkata Subrahmanya RavikantSirkanth SubramaniamChong SunHeather Dawn Yalin
    • G06F17/30
    • G06F17/30569
    • Systems and methods are disclosed herein for generating a normalized record from an import record, the normalized record having attribute-value pairs corresponding to a native schema. In import records, a plurality of attribute-value are identified each having an attribute label not found in a native schema. One or more attribute labels in the native schema having as possible values one or more values corresponding to the values of the plurality of attribute-value pairs are also identified. The computer system generates one or more normalization rules relating one or more attribute labels of the plurality of attribute-value pairs to at least a portion of the one or more attribute labels in the native schema. Normalization rules may be validated by crowdsourcing. Normalization rules may be applied by identifying implicated rules by classifying the import record and identifying rules applicable to the classification.
    • 本文公开的系统和方法用于从导入记录生成归一化记录,归一化记录具有对应于本机模式的属性值对。 在导入记录中,识别多个属性值,每个属性值都具有在本机模式中未找到的属性标签。 本地模式中的一个或多个属性标签也具有与多个属性值对对应的值相对应的一个或多个值的可能值。 计算机系统生成将多个属性值对中的一个或多个属性标签与本机模式中的一个或多个属性标签的至少一部分相关联的一个或多个规范化规则。 规范化规则可以通过众包来验证。 可以通过对导入记录进行分类和识别适用于分类的规则来识别牵连规则来应用规范化规则。
    • 7. 发明申请
    • SYSTEM AND METHOD FOR CLASSIFICATION WITH EFFECTIVE USE OF MANUAL DATA INPUT
    • 有效使用手动数据输入的分类系统和方法
    • US20140314311A1
    • 2014-10-23
    • US13868808
    • 2013-04-23
    • WAL-MART STORES, INC.
    • Nikesh Lucky GareraNarasimhan RampalliDintyala Venkata Subrahmanya RavikantSrikanth SubramaniamChong SunHeather Dawn Yalin
    • G06K9/62
    • G06K9/6263G06F17/3087
    • Systems and methods are disclosed herein for classifying records, such as product records, using a machine learning algorithm. After training a classification model according to a machine learning algorithm using an initial training set, records are classified and high confidence classifications identified. Remaining classifications are submitted to a crowdsourcing forum that validates or invalidates the classifications or marks them as to unclear to evaluate. Invalidated classifications are automatically analyzed to identify one or both of classification values and categories having a high proportion of invalidated classifications. Requests are transmitted to analysts to generate training data that is added to the training set. The process of classifying records and obtaining crowdsourced validation thereof may then repeat. High confidence classifications may be identified using an accuracy model trained to relate an accuracy percentage to a confidence score output by the classification model.
    • 本文公开的系统和方法用于使用机器学习算法对诸如产品记录的记录进行分类。 根据使用初始训练集的机器学习算法训练分类模型后,记录被分类并确定高置信度分类。 剩余的分类被提交给众包论坛,验证或使分类无效或标记为不清楚评估。 自动分析无效分类,以识别具有高比例无效分类的分类值和类别中的一个或两个。 将请求传送给分析人员,以生成添加到训练集中的训练数据。 然后可以重复对记录进行分类和获得众包验证的过程。 可以使用训练有关精度百分比与由分类模型输出的置信分数相关联的精度模型来识别高置信度分类。
    • 10. 发明申请
    • PRODUCT CLASSIFICATION DATA TRANSFER AND MANAGEMENT
    • 产品分类数据传输与管理
    • US20150379115A1
    • 2015-12-31
    • US14847944
    • 2015-09-08
    • WAL-MART STORES, INC.
    • Nikesh Lucky GareraNarasimhan RampalliDintyala Venkata Subrahmanya RavikantSrikanth SubramaniamChong SunHeather Dawn Yalin
    • G06F17/30G06Q10/08
    • G06F16/285G06Q10/087
    • Computerized data processing and electronic file management methods of organizing and indexing electronic records in an electronic database for categorizing new products that are being added to an existing database of product offerings and computerized digital data processing methods of transferring digital information between a plurality of computers and employing computer instructions to categorize new products that are being added to an existing database of product offerings. Multiple classification models classify a description of a particular product and the classifications are compared, and if found to be equivalent, are added to the existing database of product offerings. If the classifications from the models are not equivalent, then the description is sent to multiple people for classification and the classifications from the people are compared, and if found to be equivalent, are added to the existing database of product offerings.
    • 电子数据处理和电子文件管理方法,用于对电子数据库中的电子记录进行组织和编制索引,将新增的产品分类到现有的产品数据库和计算机化的数字数据处理方法中,数字信息在多台计算机之间传输和采用 计算机指令,用于对正在添加到现有产品数据库的新产品进行分类。 多个分类模型对特定产品的描述进行分类,并比较分类,如果发现相等,则添加到现有的产品数据库中。 如果模型的分类不相等,则将描述发送给多个人进行分类,并将来自人员的分类进行比较,如果发现相当于添加到现有的产品数据库中。