会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • System and method for website categorization
    • 网站分类的系统和方法
    • US09311423B1
    • 2016-04-12
    • US14180249
    • 2014-02-13
    • Go Daddy Operating Company, LLC
    • Robert BrownTapan KamdarRyan KirkishWei-Cheng LaiJeff McLellan
    • G06F17/30
    • G06F17/30598G06F17/2705G06F17/30864G06F17/30867G06F17/30873G06F17/3089G06Q30/0201G06Q30/0278H04L61/2076
    • Systems and methods for the categorization of websites are presented. A website is categorized using one or a combination of its domain name and its web page content. The domain name is tokenized, and the tokens compared to categories in a category structure to determine probabilities that the token belongs to each category. Combinations of tokens are similarly compared to the categories. A category may be determined with reference to a vector space in which a training set of websites having known categories is converted according to a methodology into reference vectors containing keyword frequencies. A target website is converted to a target vector using the same methodology, and a distance score of the target vector to each reference vector is calculated. The website represented by the target vector is assigned the category of the reference vector having the lowest distance score.
    • 介绍了网站分类的系统和方法。 网站使用一个或其域名及其网页内容的组合进行分类。 域名被标记化,令牌与类别结构中的类别进行比较,以确定令牌属于每个类别的概率。 令牌的组合与类别相似。 可以参考向量空间来确定类别,其中将具有已知类别的网站的训练集合根据方法转换成包含关键字频率的参考向量。 使用相同的方法将目标网站转换为目标矢量,并计算目标矢量到每个参考矢量的距离分数。 由目标矢量表示的网站被分配具有最低距离得分的参考矢量的类别。
    • 3. 发明授权
    • System and method for identifying website verticals
    • 用于识别网站纵向的系统和方法
    • US09330168B1
    • 2016-05-03
    • US14180273
    • 2014-02-13
    • Go Daddy Operating Company, LLC
    • Robert BrownTapan KamdarRyan KirkishWei-Cheng LaiJeff McLellan
    • G06F17/30
    • G06F17/3071G06F17/2705G06F17/30867G06F17/30873G06Q30/0278G06Q30/0283
    • Systems and methods for the categorization of websites are presented. A website is categorized using one or a combination of its domain name and its web page content. The domain name is tokenized, and the tokens compared to categories in a category structure to determine probabilities that the token belongs to each category. Combinations of tokens are similarly compared to the categories. A category may be determined with reference to a vector space in which a training set of websites having known categories is converted according to a methodology into reference vectors containing keyword frequencies. A target website is converted to a target vector using the same methodology, and a distance score of the target vector to each reference vector is calculated. The website represented by the target vector is assigned the category of the reference vector having the lowest distance score.
    • 介绍了网站分类的系统和方法。 网站使用一个或其域名及其网页内容的组合进行分类。 域名被标记化,令牌与类别结构中的类别进行比较,以确定令牌属于每个类别的概率。 令牌的组合与类别相似。 可以参考向量空间来确定类别,其中将具有已知类别的网站的训练集合根据方法转换成包含关键字频率的参考向量。 使用相同的方法将目标网站转换为目标矢量,并计算目标矢量到每个参考矢量的距离分数。 由目标矢量表示的网站被分配具有最低距离得分的参考矢量的类别。