    • 1. Granted invention patent
    • Search results ranking using editing distance and document information
    • US08812493B2
    • 2014-08-19
    • US12101951
    • 2008-04-11
    • Vladimir Tankovich; Hang Li; Dmitriy Meyerzon; Jun Xu
    • G06F7/00
    • G06F17/2211; G06F17/30864
    • Architecture for extracting document information from documents received as search results based on a query string, and computing an edit distance between the data string and the query string. The edit distance is employed in determining relevance of the document as part of result ranking by detecting near-matches of a whole query or part of the query. The edit distance evaluates how close the query string is to a given data stream that includes document information such as TAUC (title, anchor text, URL, clicks) information, etc. The architecture includes the index-time splitting of compound terms in the URL to allow the more effective discovery of query terms. Additionally, index-time filtering of anchor text is utilized to find the top N anchors of one or more of the document results. The TAUC information can be input to a neural network (e.g., 2-layer) to improve relevance metrics for ranking the search results.
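The abstract above describes scoring a document by the edit distance between the query string and a data string such as the title. The following is a minimal sketch of that idea, not the patented method: it substitutes the classic unit-cost Levenshtein distance, applied to whole terms rather than characters, and the function names `edit_distance`, `term_distance`, and `rank_by_title` are hypothetical.

```python
def edit_distance(a, b):
    """Classic Levenshtein distance between two sequences (unit costs)."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def term_distance(query, data):
    """Edit distance measured in whole terms (assumption: whitespace tokens)."""
    return edit_distance(query.lower().split(), data.lower().split())


def rank_by_title(query, titles):
    """Order titles from nearest to farthest match of the query."""
    return sorted(titles, key=lambda title: term_distance(query, title))


if __name__ == "__main__":
    titles = ["completely unrelated page title", "search results ranking"]
    print(rank_by_title("search ranking", titles))
```

A smaller distance means a nearer match, so a title that differs from the query by one inserted term sorts ahead of one that shares no terms. A real ranker would presumably combine this signal with the other TAUC fields rather than use it alone.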
    • 2. Invention patent application
    • SEARCH RESULTS RANKING USING EDITING DISTANCE AND DOCUMENT INFORMATION
    • US20090259651A1
    • 2009-10-15
    • US12101951
    • 2008-04-11
    • Vladimir Tankovich; Hang Li; Dmitriy Meyerzon; Jun Xu
    • G06F17/30
    • G06F17/2211; G06F17/30864
    • Architecture for extracting document information from documents received as search results based on a query string, and computing an edit distance between the data string and the query string. The edit distance is employed in determining relevance of the document as part of result ranking by detecting near-matches of a whole query or part of the query. The edit distance evaluates how close the query string is to a given data stream that includes document information such as TAUC (title, anchor text, URL, clicks) information, etc. The architecture includes the index-time splitting of compound terms in the URL to allow the more effective discovery of query terms. Additionally, index-time filtering of anchor text is utilized to find the top N anchors of one or more of the document results. The TAUC information can be input to a neural network (e.g., 2-layer) to improve relevance metrics for ranking the search results.
    • 4. Granted invention patent
    • Ranking search results using feature extraction
    • US07716198B2
    • 2010-05-11
    • US11019091
    • 2004-12-21
    • Dmitriy Meyerzon; Hang Li
    • G06F17/30
    • G06F17/30684
    • Methods and computer-readable media are provided for ranking search results using feature extraction data. Each of the results of a search engine query is parsed to obtain data, such as text, formatting information, metadata, and the like. The text, the formatting information and the metadata are passed through a feature extraction application to extract data that may be used to improve a ranking of the search results based on relevance of the search results to the search engine query. The feature extraction application extracts features, such as titles, found in any of the text based on formatting information applied to or associated with the text. The extracted titles, the text, the formatting information and the metadata for any given search results item are processed according to a field weighting application for determining a ranking of the given search results item. Ranked search results items may then be displayed according to ranking.
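The abstract above says that extracted titles, text, and metadata are processed by a field weighting application to rank each result. The sketch below illustrates one simple form of field weighting under stated assumptions: the field names, the weights in `FIELD_WEIGHTS`, and the term-count scoring formula are all hypothetical and are not taken from the patent.

```python
from collections import Counter

# Hypothetical weights: a hit in an extracted title counts more than a body hit.
FIELD_WEIGHTS = {"extracted_title": 3.0, "metadata": 1.5, "text": 1.0}


def field_weighted_score(query, fields, weights=FIELD_WEIGHTS):
    """Sum query-term occurrences per field, scaled by that field's weight."""
    terms = query.lower().split()
    score = 0.0
    for name, content in fields.items():
        counts = Counter(content.lower().split())
        hits = sum(counts[term] for term in terms)
        score += weights.get(name, 0.0) * hits  # unknown fields contribute nothing
    return score


def rank_results(query, results):
    """Return results ordered from highest to lowest field-weighted score."""
    return sorted(results,
                  key=lambda fields: field_weighted_score(query, fields),
                  reverse=True)


if __name__ == "__main__":
    doc_a = {"extracted_title": "annual report", "text": "figures"}
    doc_b = {"extracted_title": "meeting notes",
             "text": "annual report mentioned once"}
    for doc in rank_results("annual report", [doc_b, doc_a]):
        print(field_weighted_score("annual report", doc), doc["extracted_title"])
```

Here both documents contain the query terms twice, but the document whose extracted title matches scores higher because the title field carries the larger weight. This is the effect the abstract attributes to feature extraction: recovering a title from formatting lets the ranker treat it as a distinct, more heavily weighted field.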
    • 6. Invention patent application
    • Ranking search results using feature extraction
    • US20060136411A1
    • 2006-06-22
    • US11019091
    • 2004-12-21
    • Dmitriy Meyerzon; Hang Li
    • G06F17/30
    • G06F17/30684
    • Methods and computer-readable media are provided for ranking search results using feature extraction data. Each of the results of a search engine query is parsed to obtain data, such as text, formatting information, metadata, and the like. The text, the formatting information and the metadata are passed through a feature extraction application to extract data that may be used to improve a ranking of the search results based on relevance of the search results to the search engine query. The feature extraction application extracts features, such as titles, found in any of the text based on formatting information applied to or associated with the text. The extracted titles, the text, the formatting information and the metadata for any given search results item are processed according to a field weighting application for determining a ranking of the given search results item. Ranked search results items may then be displayed according to ranking.