会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • Learning Discriminative Projections for Text Similarity Measures
    • 用于文本相似度量度的学习判别预测
    • US20120323968A1
    • 2012-12-20
    • US13160485
    • 2011-06-14
    • Wen-tau YihKristina N. ToutanovaChristopher A. MeekJohn C. Platt
    • Wen-tau YihKristina N. ToutanovaChristopher A. MeekJohn C. Platt
    • G06F17/30
    • G06F16/31
    • A model for mapping the raw text representation of a text object to a vector space is disclosed. A function is defined for computing a similarity score given two output vectors. A loss function is defined for computing an error based on the similarity scores and the labels of pairs of vectors. The parameters of the model are tuned to minimize the loss function. The label of two vectors indicates a degree of similarity of the objects. The label may be a binary number or a real-valued number. The function for computing similarity scores may be a cosine, Jaccard, or differentiable function. The loss function may compare pairs of vectors to their labels. Each element of the output vector is a linear or non-linear function of the terms of an input vector. The text objects may be different types of documents and two different models may be trained concurrently.
    • 公开了将文本对象的原始文本表示映射到向量空间的模型。 定义了一个功能,用于计算给定两个输出向量的相似度得分。 定义了一种损失函数,用于计算基于相似度得分和向量对的标签的误差。 调整模型的参数以最小化损失函数。 两个向量的标签表示对象的相似度。 标签可以是二进制数字或实数值。 用于计算相似性分数的函数可以是余弦,Jaccard或可微分函数。 损失函数可以将向量对与其标签进行比较。 输出向量的每个元素是输入向量的项的线性或非线性函数。 文本对象可以是不同类型的文档,并且可以同时训练两个不同的模型。
    • 2. 发明授权
    • Consistent phrase relevance measures
    • 一致的短语相关性度量
    • US08996515B2
    • 2015-03-31
    • US13609257
    • 2012-09-11
    • Wen-tau YihChristopher A. Meek
    • Wen-tau YihChristopher A. Meek
    • G06F7/00G06F17/30G06Q30/02
    • G06F17/30687G06Q30/02
    • Two methods for measuring keyword-document relevance are described. The methods receive a keyword and a document as input and output a probability value for the keyword. The first method is a similarity-based approach which uses techniques for measuring similarity between two short-text segments to measure relevance between the keyword and the document. The second method is a regression-based approach based on an assumption that if an out-of-document phrase (the keyword) is semantically similar to an in-document phrase, then relevance scores of the in and out-of document phrases should be close to each other.
    • 描述了两种衡量关键字 - 文档相关性的方法。 方法接收关键字和文档作为输入,并输出关键字的概率值。 第一种方法是基于相似性的方法,其使用用于测量两个短文本段之间的相似性的技术来测量关键字和文档之间的相关性。 第二种方法是基于回归的方法,基于一个假设,如果文档外短语(关键字)在语义上类似于文档内短语,则文本内和外的短语的相关性分数应为 彼此接近
    • 3. 发明申请
    • CONSISTENT PHRASE RELEVANCE MEASURES
    • 一致性相关措施
    • US20120330978A1
    • 2012-12-27
    • US13609257
    • 2012-09-11
    • Wen-tau YihChristopher A. Meek
    • Wen-tau YihChristopher A. Meek
    • G06F17/30
    • G06F17/30687G06Q30/02
    • Two methods for measuring keyword-document relevance are described. The methods receive a keyword and a document as input and output a probability value for the keyword. The first method is a similarity-based approach which uses techniques for measuring similarity between two short-text segments to measure relevance between the keyword and the document. The second method is a regression-based approach based on an assumption that if an out-of-document phrase (the keyword) is semantically similar to an in-document phrase, then relevance scores of the in and out-of document phrases should be close to each other.
    • 描述了两种衡量关键字 - 文档相关性的方法。 方法接收关键字和文档作为输入,并输出关键字的概率值。 第一种方法是基于相似性的方法,其使用用于测量两个短文本段之间的相似性的技术来测量关键字和文档之间的相关性。 第二种方法是基于回归的方法,基于一个假设,如果文档外短语(关键字)在语义上类似于文档内短语,则文本内和外的短语的相关性分数应为 彼此接近
    • 4. 发明申请
    • SIMILIARITY MEASURES FOR SHORT SEGMENTS OF TEXT
    • 短篇短文的类似措施
    • US20090240498A1
    • 2009-09-24
    • US12051183
    • 2008-03-19
    • Wen-tau YihAlexei V. BocharovChristopher A. Meek
    • Wen-tau YihAlexei V. BocharovChristopher A. Meek
    • G10L15/08
    • G06F17/2211G06F16/35
    • Systems and methods to perform short text segment similarity measures. Illustratively, a short text segment similarity environment comprises a short text engine operative to process data representative of short segments of text and an instruction set comprising at least one instruction to instruct the short text engine to process data representative of short text segment inputs according to a selected short text similarity identification paradigm. Illustratively, two or more short text segments can be received as input by the short text engine and a request to identify similarities among the two or more short text segments. Responsive to the request and data input, the short text engine executes a selected similarity identification technique in accordance with the sort text similarity identification paradigm to process the received data and to identify similarities between the short text segment inputs.
    • 执行短文本段相似性度量的系统和方法。 示例性地,短文本段相似性环境包括用于处理代表短段文本的数据的短文本引擎和包括至少一个指令的指令集,以指示短文本引擎根据以下内容来处理代表短文本段输入的数据 选择短文本相似性识别范式。 说明性地,可以接收短文本引擎的两个或多个短文本段作为输入,以及用于标识两个或更多个短文本段之间的相似性的请求。 响应于请求和数据输入,短文本引擎根据排序文本相似性识别范例来执行所选择的相似性识别技术,以处理接收到的数据并识别短文本段输入之间的相似性。
    • 5. 发明授权
    • Consistent phrase relevance measures
    • 一致的短语相关性度量
    • US08290946B2
    • 2012-10-16
    • US12144647
    • 2008-06-24
    • Wen-tau YihChristopher A. Meek
    • Wen-tau YihChristopher A. Meek
    • G06F7/00G06F17/30
    • G06F17/30687G06Q30/02
    • Two methods for measuring keyword-document relevance are described. The methods receive a keyword and a document as input and output a probability value for the keyword. The first method is a similarity-based approach which uses techniques for measuring similarity between two short-text segments to measure relevance between the keyword and the document. The second method is a regression-based approach based on an assumption that if an out-of-document phrase (the keyword) is semantically similar to an in-document phrase, then relevance scores of the in and out-of document phrases should be close to each other.
    • 描述了两种衡量关键字 - 文档相关性的方法。 方法接收关键字和文档作为输入,并输出关键字的概率值。 第一种方法是基于相似性的方法,其使用用于测量两个短文本段之间的相似性的技术来测量关键字和文档之间的相关性。 第二种方法是基于回归的方法,基于一个假设,如果文档外短语(关键字)在语义上类似于文档内短语,则文本内和外的短语的相关性分数应为 彼此接近
    • 7. 发明申请
    • CONSISTENT PHRASE RELEVANCE MEASURES
    • 一致性相关措施
    • US20090319508A1
    • 2009-12-24
    • US12144647
    • 2008-06-24
    • Wen-tau YihChristopher A. Meek
    • Wen-tau YihChristopher A. Meek
    • G06F7/10G06F17/30
    • G06F17/30687G06Q30/02
    • Two methods for measuring keyword-document relevance are described. The methods receive a keyword and a document as input and output a probability value for the keyword. The first method is a similarity-based approach which uses techniques for measuring similarity between two short-text segments to measure relevance between the keyword and the document. The second method is a regression-based approach based on an assumption that if an out-of-document phrase (the keyword) is semantically similar to an in-document phrase, then relevance scores of the in and out-of document phrases should be close to each other.
    • 描述了两种衡量关键字 - 文档相关性的方法。 方法接收关键字和文档作为输入,并输出关键字的概率值。 第一种方法是基于相似性的方法,其使用用于测量两个短文本段之间的相似性的技术来测量关键字和文档之间的相关性。 第二种方法是基于回归的方法,基于一个假设,如果文档外短语(关键字)在语义上类似于文档内短语,则文本内和外的短语的相关性分数应为 彼此接近