    • 1. Granted invention patent
    • Supervised semantic indexing and its extensions
    • US08359282B2
    • 2013-01-22
    • US12562840
    • 2009-09-18
    • Bing Bai; Jason Weston; Ronan Collorbert; David Grangier
    • G06F15/18; G06F7/00
    • G06F17/30663; G06F17/30616
    • A system and method for determining a similarity between a document and a query includes providing a frequently used dictionary and an infrequently used dictionary in storage memory. For each word or gram in the infrequently used dictionary, n words or grams are correlated from the frequently used dictionary based on a first score. Features for a vector of the infrequently used words or grams are replaced with features from a vector of the correlated words or grams from the frequently used dictionary when the features from a vector of the correlated words or grams meet a threshold value. A similarity score is determined between weight vectors of a query and one or more documents in a corpus by employing the features from the vector of the correlated words or grams that met the threshold value.
    • 2. Invention patent application
    • SUPERVISED SEMANTIC INDEXING AND ITS EXTENSIONS
    • US20100185659A1
    • 2010-07-22
    • US12562840
    • 2009-09-18
    • BING BAI; JASON WESTON; RONAN COLLORBERT; DAVID GRANGIER
    • G06F17/30
    • G06F17/30663; G06F17/30616
    • A system and method for determining a similarity between a document and a query includes providing a frequently used dictionary and an infrequently used dictionary in storage memory. For each word or gram in the infrequently used dictionary, n words or grams are correlated from the frequently used dictionary based on a first score. Features for a vector of the infrequently used words or grams are replaced with features from a vector of the correlated words or grams from the frequently used dictionary when the features from a vector of the correlated words or grams meet a threshold value. A similarity score is determined between weight vectors of a query and one or more documents in a corpus by employing the features from the vector of the correlated words or grams that met the threshold value.
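
The abstract shared by both records describes a pipeline: correlate each infrequent word with n frequent words, substitute features when the score meets a threshold, then score a query against documents. The following is only a minimal sketch of that idea, not the patented method. It assumes cosine similarity over small per-word vectors as the "first score", raw term counts as weights, and cosine similarity as the final score; none of these choices is stated in the abstract, and the names `correlate`, `weight_vector`, and `similarity` are hypothetical.

```python
import math
from collections import Counter


def cosine(u, v):
    """Cosine similarity between two equal-length dense vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def correlate(rare_vecs, freq_vecs, n, threshold):
    """For each infrequent word, keep at most n frequent words whose
    score (assumed here: cosine similarity) meets the threshold."""
    mapping = {}
    for rare, rv in rare_vecs.items():
        scored = sorted(
            ((cosine(rv, fv), word) for word, fv in freq_vecs.items()),
            reverse=True,
        )
        mapping[rare] = [(word, s) for s, word in scored[:n] if s >= threshold]
    return mapping


def weight_vector(tokens, freq_vocab, mapping):
    """Build a sparse weight vector over the frequent dictionary only.
    Infrequent tokens contribute through their correlated frequent words."""
    weights = Counter()
    for tok in tokens:
        if tok in freq_vocab:
            weights[tok] += 1.0
        else:
            for word, score in mapping.get(tok, []):
                weights[word] += score
    return weights


def similarity(q, d):
    """Cosine similarity between two sparse weight vectors (dicts)."""
    dot = sum(v * d.get(k, 0.0) for k, v in q.items())
    nq = math.sqrt(sum(v * v for v in q.values()))
    nd = math.sqrt(sum(v * v for v in d.values()))
    return dot / (nq * nd) if nq and nd else 0.0


# Toy data (invented for illustration).
freq_vecs = {"car": [1.0, 0.0], "fruit": [0.0, 1.0]}
rare_vecs = {"sedan": [0.9, 0.1], "kumquat": [0.1, 0.9]}
mapping = correlate(rare_vecs, freq_vecs, n=1, threshold=0.8)

query = weight_vector(["sedan"], freq_vecs, mapping)
doc_cars = weight_vector(["car", "car"], freq_vecs, mapping)
doc_fruit = weight_vector(["fruit"], freq_vecs, mapping)
print(similarity(query, doc_cars), similarity(query, doc_fruit))
```

In this toy setup the rare query word "sedan" is represented through the frequent word "car", so it can match a document that never contains "sedan" itself. Restricting the feature space to the frequent dictionary is what keeps the weight vectors small.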