会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • QUERY IDENTIFICATION AND NORMALIZATION FOR WEB SEARCH
    • 网页搜索的查询和标准化
    • US20100257150A1
    • 2010-10-07
    • US12818036
    • 2010-06-17
    • Yumao LuNawaaz AhmedFuchun PengMarco Zagha
    • Yumao LuNawaaz AhmedFuchun PengMarco Zagha
    • G06F17/30
    • G06F17/30867G06F17/3064
    • A computer-implemented method for processing user entered query data to improve results of a search of pages using a local search database, when searching the internet, is disclosed. The method includes receiving the user entered query data and parsing each word of the query data and segmenting words using a probabilistic dictionary to determine a likelihood that the word is for a particular name. And, associating the particular names with a name tag to create one or more tagged name terms. Then, normalizing each of the tagged name terms and the normalizing including boosting information if found in the local search database and determining proximity between selected ones of the tagged name terms. The method then generates an optimized search query that incorporates normalized terms and operators. The optimized search query being applied to the internet to enable search results to be produced and displayed to the user in response to the entered query data.
    • 公开了一种用于处理用户输入的查询数据以便在搜索互联网时使用本地搜索数据库搜索页面的结果的计算机实现的方法。 该方法包括接收用户输入的查询数据和解析查询数据的每个单词并使用概率词典分割单词,以确定单词对于特定名称的可能性。 并且,将特定名称与名称标签相关联以创建一个或多个标记名称术语。 然后,对每个标记的名称术语进行归一化,并且如果在本地搜索数据库中找到,则包括增强信息的归一化,并且确定所选标记的名称术语之间的接近度。 该方法然后生成一个优化的搜索查询,其中包含标准化术语和运算符。 优化的搜索查询被应用于互联网,以响应于输入的查询数据来产生和显示给用户的搜索结果。
    • 2. 发明授权
    • Techniques for navigational query identification
    • 导航查询识别技术
    • US07693865B2
    • 2010-04-06
    • US11514076
    • 2006-08-30
    • Yumao LuFuchun PengXin LiNawaaz Ahmed
    • Yumao LuFuchun PengXin LiNawaaz Ahmed
    • G06F17/00
    • G06K9/623G06F17/30707G06F17/30864G06K9/6278
    • To accurately classify a query as navigational, thousands of available features are explored, extracted from major commercial search engine results, user Web search click data, query log, and the whole Web's relational content. To obtain the most useful features for navigational query identification, a three level system is used which integrates feature generation, feature integration, and feature selection in a pipeline. Because feature selection plays a key role in classification methodologies, the best feature selection method is coupled with the best classification approach to achieve the best performance for identifying navigational queries. According to one embodiment, linear Support Vector Machine (SVM) is used to rank features and the top ranked features are fed into a Stochastic Gradient Boosting Tree (SGBT) classification method for identifying whether or not a particular query is a navigational query.
    • 为了将查询精确地分类为导航,从主要商业搜索引擎结果,用户Web搜索点击数据,查询日志和整个Web的关系内容中提取出数千种可用功能。 为了获得导航查询识别最有用的功能,使用了一个三级系统,将特征生成,特征集成和特征选择集成在一条流水线中。 因为特征选择在分类方法中起着关键作用,因此最好的特征选择方法与最佳分类方法相结合,以实现识别导航查询的最佳性能。 根据一个实施例,使用线性支持向量机(SVM)对特征进行排序,并且将顶级特征馈送到用于识别特定查询是否是导航查询的随机渐变增强树(SGBT)分类方法中。
    • 3. 发明授权
    • Query identification and normalization for web search
    • 网页搜索的查询识别和归一化
    • US07974971B2
    • 2011-07-05
    • US12818036
    • 2010-06-17
    • Yumao LuNawaaz AhmedFuchun PengMarco Zagha
    • Yumao LuNawaaz AhmedFuchun PengMarco Zagha
    • G06F17/30
    • G06F17/30867G06F17/3064
    • A computer-implemented method for processing user entered query data to improve results of a search of pages using a local search database, when searching the internet, is disclosed. The method includes receiving the user entered query data and parsing each word of the query data and segmenting words using a probabilistic dictionary to determine a likelihood that the word is for a particular name. And, associating the particular names with a name tag to create one or more tagged name terms. Then, normalizing each of the tagged name terms and the normalizing including boosting information if found in the local search database and determining proximity between selected ones of the tagged name terms. The method then generates an optimized search query that incorporates normalized terms and operators. The optimized search query being applied to the internet to enable search results to be produced and displayed to the user in response to the entered query data.
    • 公开了一种用于处理用户输入的查询数据以在搜索互联网时改进使用本地搜索数据库搜索页面的结果的计算机实现的方法。 该方法包括接收用户输入的查询数据和解析查询数据的每个单词并使用概率词典分割单词,以确定单词对于特定名称的可能性。 并且,将特定名称与名称标签相关联以创建一个或多个标记名称术语。 然后,对每个标记的名称术语进行归一化,并且如果在本地搜索数据库中找到,则包括增强信息的归一化,并且确定所选标记的名称术语之间的接近度。 该方法然后生成一个优化的搜索查询,其中包含标准化术语和运算符。 优化的搜索查询被应用于互联网,以响应于输入的查询数据来产生和显示给用户的搜索结果。
    • 4. 发明申请
    • NAME VERIFICATION USING MACHINE LEARNING
    • 使用机器学习的名称验证
    • US20090248595A1
    • 2009-10-01
    • US12060154
    • 2008-03-31
    • Yumao LuNawaaz AhmedFuchun PengBenoit Dumoulin
    • Yumao LuNawaaz AhmedFuchun PengBenoit Dumoulin
    • G06F15/18
    • G06F17/2765
    • Computer-enabled methods, apparatus, and computer-readable media are provided for verifying that a given network name, such as a URL, is an official, e.g., registered, approved, or otherwise officially recognized, network name that refers to or identifies a principal, such as a business. These techniques involve receiving a principal name and a given network name, receiving at least one feature attribute from at least one database of feature attributes, wherein the at least one feature attribute comprises a characteristic of the principal name or a characteristic of the network name, and invoking a logistic regression method to generate a probability, based upon the at least one feature attribute, that the given network name is an official network name for the principal name. The logistic regression method may include a gradient boosting tree model that generates the probability based upon the at least one feature attribute.
    • 提供了计算机启用的方法,装置和计算机可读介质,用于验证给定的网络名称(例如URL)是官方的,例如,已注册,批准或以其他官方认可的网络名称,其指代或识别 校长,如企业。 这些技术包括接收主体名称和给定的网络名称,从特征属性的至少一个数据库接收至少一个特征属性,其中所述至少一个特征属性包括主体名称的特征或网络名称的特性, 以及调用逻辑回归方法以基于所述至少一个特征属性生成所述给定网络名称是所述主体名称的正式网络名称的概率。 逻辑回归方法可以包括基于至少一个特征属性生成概率的梯度增强树模型。
    • 5. 发明申请
    • LOCAL QUERY IDENTIFICATION AND NORMALIZATION FOR WEB SEARCH
    • WEB查询的本地查询标识和正常化
    • US20090182729A1
    • 2009-07-16
    • US12015448
    • 2008-01-16
    • Yumao LuNawaaz AhmedFuchun PengMarco Zagha
    • Yumao LuNawaaz AhmedFuchun PengMarco Zagha
    • G06F7/06
    • G06F17/30867G06F17/3064
    • Computer-implemented methods and systems for processing user entered query data to improve results of a search of pages using a local search database are provided, when searching the internet. The method includes receiving the user entered query data and parsing each word of the query data and examining each word to determine if the word is associated with one of a business name, a city name or a state name. The examining uses probabilistic dictionaries to determine a likelihood that the word is one of the business name, the city name or the state name. Then, associating the words that were determined to be: (i) the business name with a business name tag to create one or more tagged business terms; (ii) the city name with a city name tag to create one or more tagged city terms; and (iii) the state name with a state name tag to create one or more tagged state terms. The method further includes normalizing each of the tagged business terms, the tagged city terms and the tagged state terms. The normalizing includes boosting information if found in the local search database and determining proximity between selected ones of the tagged business, city or state terms. Then, generating an optimized internal search query that incorporates constraints and ranking based on at least the boosting information and the determined proximity between the selected tagged business, city or state terms. The optimized internal search query is applied to the internet to enable search results to be produced and displayed to the user in response to the entered query data.
    • 当搜索互联网时,提供了用于处理用户输入的查询数据以改进使用本地搜索数据库的页面搜索结果的计算机实现的方法和系统。 该方法包括接收用户输入的查询数据并解析查询数据的每个单词并检查每个单词以确定该单词是否与商务名称,城市名称或州名称之一相关联。 检查使用概率词典来确定该词是商业名称,城市名称或州名称之一的可能性。 然后,将确定为:(i)商业名称与商家名称标签相关联的字词,以创建一个或多个标记的商业条款; (ii)具有城市名称标签的城市名称,以创建一个或多个标记的城市条款; 和(iii)具有状态名称标签的状态名称以创建一个或多个标记状态项。 该方法还包括对每个标记的业务术语,标记的城市术语和标记的状态项进行归一化。 标准化包括在本地搜索数据库中找到增强信息,并确定所标记的业务,城市或州条款之间的接近度。 然后,生成优化的内部搜索查询,该内部搜索查询至少基于提升信息和确定的所选标记的商业,城市或状态条件之间的接近度来结合约束和排序。 优化的内部搜索查询被应用于互联网,以便响应于输入的查询数据而产生并显示给用户的搜索结果。
    • 7. 发明申请
    • Word pluralization handling in query for web search
    • 在Web搜索查询中的Word复数处理
    • US20080189262A1
    • 2008-08-07
    • US11701736
    • 2007-02-01
    • Fuchun PengNawaaz AhmedXin LiYumao Lu
    • Fuchun PengNawaaz AhmedXin LiYumao Lu
    • G06F17/30
    • G06F17/30864
    • Techniques for determining when and how to transform words in a query to its plural or non-plural form in order to provide the most relevant search results while minimizing computational overhead are provided. A dictionary is generated based upon the words used in a specified number of previous most frequent search queries and comprises lists of transformations from plural to singular and singular to plural. Unnecessary transformations are removed from the dictionary based upon language modeling. The word to transform is determined by finding the last non-stop re-writable word of the query. The context of the transformed word is confirmed in the search documents and a version of the query is executed using both the original form of the word and the transformation of the word.
    • 提供了用于确定何时以及如何将查询中的单词转换为多个或非复数形式的技术,以便在最小化计算开销的同时提供最相关的搜索结果。 基于在指定数量的先前最频繁的搜索查询中使用的词来生成字典,并且包括从多个到单数和单数到多个的变换的列表。 基于语言建模,从字典中删除不必要的转换。 要转换的词是通过查找查询的最后一个不间断的可重写词来确定的。 在搜索文档中确认转换词的上下文,并且使用单词的原始形式和单词的转换来执行查询的版本。
    • 8. 发明申请
    • Query serving infrastructure
    • 查询服务基础设施
    • US20070198484A1
    • 2007-08-23
    • US11360793
    • 2006-02-22
    • Nawaaz AhmedRobert TravisChad Walters
    • Nawaaz AhmedRobert TravisChad Walters
    • G06F17/30
    • G06F17/30587
    • A system and method for dynamically composing a class is provided. A query is parsed and a compiler object tree is generated that corresponds to the operators in the query. A set of feature requests and responses are initiated and a set of feature objects are generated that are executed at runtime. The language of the query may be extended by registering a new operator, and optionally a new feature for a query language, with a search engine without modifying the source code of the search engine. The new operator may be specified in a plug-in and provided to the search engine via the plug-in. An application converts an end user query into an application query that conforms to the query language supported by the search engine and includes the new operator. The application query may also include references to traditional programming constructs and search engine primitives.
    • 提供了一种用于动态组合类的系统和方法。 解析查询,并生成与查询中的运算符相对应的编译器对象树。 启动一组功能请求和响应,并生成一组在运行时执行的功能对象。 可以通过向搜索引擎注册新的运算符以及可选的用于查询语言的新特征而不修改搜索引擎的源代码来扩展查询的语言。 新的操作员可以在插件中指定并通过插件提供给搜索引擎。 应用程序将最终用户查询转换为符合搜索引擎支持的查询语言的应用程序查询,并包含新的运算符。 应用程序查询还可以包括对传统编程结构和搜索引擎原语的引用。
    • 10. 发明申请
    • QUERY PROCESSING FOR WEB SEARCH
    • 用于WEB搜索的查询处理
    • US20110264647A1
    • 2011-10-27
    • US13175797
    • 2011-07-01
    • Yumao LuNawaaz AhmedFuchun PengMarco Zagha
    • Yumao LuNawaaz AhmedFuchun PengMarco Zagha
    • G06F17/30
    • G06F17/30867G06F17/3064
    • A computer-implemented method for processing user entered query data to improve results of a search of pages using a database, when searching the internet, is disclosed. The method includes receiving the user entered query data and parsing each word of the query data and segmenting words using probability to determine a likelihood that the word is for a particular name. And, associating the particular names with a name tag to create one or more tagged name terms. Then, normalizing each of the tagged name terms and the normalizing including boosting information if found in the database and determining proximity between selected ones of the tagged name terms. The method then generates an optimized search query that incorporates normalized terms and operators. The optimized search query being applied to the internet to enable search results to be produced and displayed to the user in response to the entered query data.
    • 公开了一种用于处理用户输入的查询数据以在搜索互联网时改进使用数据库的页面的搜索结果的计算机实现的方法。 该方法包括接收用户输入的查询数据和解析查询数据的每个单词,并使用概率来分割单词以确定单词用于特定名称的可能性。 并且,将特定名称与名称标签相关联以创建一个或多个标记名称术语。 然后,对每个标记的名称术语进行归一化,并且如果在数据库中找到,则标准化包括增强信息,并确定所选择的标记名称术语之间的接近度。 该方法然后生成一个优化的搜索查询,其中包含标准化术语和运算符。 优化的搜索查询被应用于互联网,以响应于输入的查询数据来产生和显示给用户的搜索结果。