会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Classification using a cascade approach
    • 使用级联方法分类
    • US07693806B2
    • 2010-04-06
    • US11766434
    • 2007-06-21
    • Wen-tau YihJoshua T. GoodmanGeoffrey J. Hulten
    • Wen-tau YihJoshua T. GoodmanGeoffrey J. Hulten
    • G06F15/18G06N3/08
    • H04L51/12G06K9/6256G06Q10/06G06Q10/10
    • A system and method that facilitates and effectuates optimizing a classifier for greater performance in a specific region of classification that is of interest, such as a low false positive rate or a low false negative rate. A two-stage classification model can be trained and employed, where the first stage classification is optimized over the entire classification region and the second stage classifier is optimized for the specific region of interest. During training the entire set of training data is employed by a first stage classifier. Only data that is classified by the first stage classifier or by cross validation to fall within a region of interest is used to train the second stage classifier. During classification, data that is classified within the region of interest by the first classification is given the first stage classifier's classification value, otherwise the classification value for the instance of data from the second stage classifier is used.
    • 促进并实现分类器在特定感兴趣区域中的更高性能的系统和方法,例如低假阳性率或低假阴性率。 可以训练和采用两阶段分类模型,其中对整个分类区域优化第一阶段分类,并针对特定的兴趣区域优化第二阶段分类器。 在训练期间,整套训练数据由第一阶段分类器采用。 仅使用由第一阶段分类器分类的数据或通过交叉验证落入感兴趣区域内的数据来训练第二阶段分类器。 在分类期间,通过第一分类对分类在感兴趣区域内的数据给予第一阶段分类器的分类值,否则使用来自第二阶段分类器的数据实例的分类值。
    • 2. 发明授权
    • Training filters for detecting spasm based on IP addresses and text-related features
    • 培训过滤器,用于根据IP地址和文本相关功能检测痉挛
    • US07464264B2
    • 2008-12-09
    • US10809163
    • 2004-03-25
    • Joshua T. GoodmanRobert L. RounthwaiteGeoffrey J. HultenWen-tau Yih
    • Joshua T. GoodmanRobert L. RounthwaiteGeoffrey J. HultenWen-tau Yih
    • H04L9/00G06F21/00
    • H04L51/12G06Q10/107
    • The subject invention provides for an intelligent quarantining system and method that facilitates detecting and preventing spam. In particular, the invention employs a machine learning filter specifically trained using origination features such as an IP address as well as destination feature such as a URL. Moreover, the system and method involve training a plurality of filters using specific feature data for each filter. The filters are trained independently each other, thus one feature may not unduly influence another feature in determining whether a message is spam. Because multiple filters are trained and available to scan messages either individually or in combination (at least two filters), the filtering or spam detection process can be generalized to new messages having slightly modified features (e.g., IP address). The invention also involves locating the appropriate IP addresses or URLs in a message as well as guiding filters to weigh origination or destination features more than text-based features.
    • 本发明提供了一种便于检测和防止垃圾邮件的智能隔离系统和方法。 特别地,本发明采用使用诸如IP地址之类的发起特征以及目的地特征(例如URL)专门训练的机器学习滤波器。 此外,该系统和方法涉及使用针对每个滤波器的特定特征数据来训练多个滤波器。 滤波器被彼此独立地训练,因此在确定消息是否是垃圾邮件时,一个特征可能不会不适当地影响另一特征。 由于多个过滤器被训练并可用于单独或组合扫描消息(至少两个过滤器),因此过滤或垃圾邮件检测过程可以推广到具有稍微修改的特征(例如,IP地址)的新消息。 本发明还涉及在消息中定位适当的IP地址或URL,以及引导过滤器比基于文本的特征更重要的起始或目的地特征。
    • 4. 发明申请
    • CLASSIFICATION USING A CASCADE APPROACH
    • 使用CASCADE方法进行分类
    • US20080319932A1
    • 2008-12-25
    • US11766434
    • 2007-06-21
    • Wen-tau YihJoshua T. GoodmanGeoffrey J. Hulten
    • Wen-tau YihJoshua T. GoodmanGeoffrey J. Hulten
    • G06F15/18
    • H04L51/12G06K9/6256G06Q10/06G06Q10/10
    • A system and method that facilitates and effectuates optimizing a classifier for greater performance in a specific region of classification that is of interest, such as a low false positive rate or a low false negative rate. A two-stage classification model can be trained and employed, where the first stage classification is optimized over the entire classification region and the second stage classifier is optimized for the specific region of interest. During training the entire set of training data is employed by a first stage classifier. Only data that is classified by the first stage classifier or by cross validation to fall within a region of interest is used to train the second stage classifier. During classification, data that is classified within the region of interest by the first classification is given the first stage classifier's classification value, otherwise the classification value for the instance of data from the second stage classifier is used.
    • 促进并实现分类器在特定感兴趣区域中的更高性能的系统和方法,例如低假阳性率或低假阴性率。 可以训练和采用两阶段分类模型,其中对整个分类区域优化第一阶段分类,并针对特定的兴趣区域优化第二阶段分类器。 在训练期间,整套训练数据由第一阶段分类器采用。 仅使用由第一阶段分类器分类的数据或通过交叉验证落入感兴趣区域内的数据来训练第二阶段分类器。 在分类期间,通过第一分类对分类在感兴趣区域内的数据给予第一阶段分类器的分类值,否则使用来自第二阶段分类器的数据实例的分类值。
    • 8. 发明授权
    • Trees of classifiers for detecting email spam
    • 用于检测电子邮件垃圾邮件的分类树
    • US07930353B2
    • 2011-04-19
    • US11193691
    • 2005-07-29
    • David M. ChickeringGeoffrey J. HultenRobert L. RounthwaiteChristopher A. MeekDavid E. HeckermanJoshua T. Goodman
    • David M. ChickeringGeoffrey J. HultenRobert L. RounthwaiteChristopher A. MeekDavid E. HeckermanJoshua T. Goodman
    • G06F15/16
    • H04L51/12
    • Decision trees populated with classifier models are leveraged to provide enhanced spam detection utilizing separate email classifiers for each feature of an email. This provides a higher probability of spam detection through tailoring of each classifier model to facilitate in more accurately determining spam on a feature-by-feature basis. Classifiers can be constructed based on linear models such as, for example, logistic-regression models and/or support vector machines (SVM) and the like. The classifiers can also be constructed based on decision trees. “Compound features” based on internal and/or external nodes of a decision tree can be utilized to provide linear classifier models as well. Smoothing of the spam detection results can be achieved by utilizing classifier models from other nodes within the decision tree if training data is sparse. This forms a base model for branches of a decision tree that may not have received substantial training data.
    • 利用分类器模型填充的决策树利用电子邮件的每个功能使用单独的电子邮件分类器来提供增强的垃圾邮件检测。 这通过定制每个分类器模型提供了更高的垃圾邮件检测的概率,以便于在逐个特征的基础上更准确地确定垃圾邮件。 分类器可以基于诸如逻辑回归模型和/或支持向量机(SVM)等线性模型来构建。 分类器也可以基于决策树构建。 基于决策树的内部和/或外部节点的“复合特征”也可以用于提供线性分类器模型。 垃圾邮件检测结果的平滑可以通过使用来自决策树内的其他节点的分类器模型来实现,如果训练数据是稀疏的。 这形成了可能没有接收到大量训练数据的决策树的分支的基本模型。
    • 9. 发明授权
    • Intelligent quarantining for spam prevention
    • 智能隔离垃圾邮件防范
    • US07543053B2
    • 2009-06-02
    • US10779295
    • 2004-02-13
    • Joshua T. GoodmanRobert L. RounthwaiteGeoffrey J. HultenDerek Hazeur
    • Joshua T. GoodmanRobert L. RounthwaiteGeoffrey J. HultenDerek Hazeur
    • G06F15/173
    • G06Q10/107H04L51/12
    • The subject invention provides for an intelligent quarantining system and method that facilitates a more robust classification system in connection with spam prevention. The invention involves holding back some messages that appear to be questionable, suspicious, or untrustworthy from classification (as spam or good). In particular, the filter lacks information about these messages and thus classification is temporarily delayed. This provides more time for a filter update to arrive with a more accurate classification. The suspicious messages can be quarantined for a determined time period to allow more data to be collected regarding these messages. A number of factors can be employed to determine whether messages are more likely to be flagged for further analysis. User feedback by way of a feedback loop system can also be utilized to facilitate classification of the messages. After some time period, classification of the messages can be resumed.
    • 本发明提供了一种智能隔离系统和方法,其有助于与防止垃圾邮件相关联的更强大的分类系统。 本发明涉及阻止一些似乎是有疑问的,可疑的或不可分类的消息(作为垃圾邮件或好的)。 特别地,过滤器缺少关于这些消息的信息,因此分类被暂时延迟。 这样可以提供更多的时间来进行更新,以更精确的分类。 可疑邮件可以隔离一段确定的时间段,以便收集有关这些邮件的更多数据。 可以采用许多因素来确定消息是否更有可能标记为进一步分析。 通过反馈回路系统的用户反馈也可以用来促进消息的分类。 一段时间后,可以恢复消息分类。