    • 2. Patent application
    • LEVERAGING CROSS-DOCUMENT CONTEXT TO LABEL ENTITY
    • US20090282012A1
    • 2009-11-12
    • US12114824
    • 2008-05-05
    • Arnd Christian Konig; Venkatesh Ganti
    • Arnd Christian Konig; Venkatesh Ganti
    • G06F7/06; G06F17/30
    • G06F17/278; G06F17/2785; Y10S707/962
    • Entities, such as people, places and things, are labeled based on information collected across a possibly large number of documents. One or more documents are scanned to recognize the entities, and features are extracted from the context in which those entities occur in the documents. Observed entity-feature pairs are stored either in an in-memory store or an external store. A store manager optimizes use of the limited amount of space for an in-memory store by determining which store to put an entity-feature pair in, and when to evict features from the in-memory store to make room for new pairs. Features that may be observed in an entity's context may take forms such as specific word sequences or membership in a particular list.
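The store manager described in this abstract can be illustrated with a minimal Python sketch. The abstract does not say which eviction policy is used, so the least-recently-used policy here is an assumption, as are the names `StoreManager`, `observe`, and `count`, the capacity measured in pairs, and the plain dictionary standing in for the external store.

```python
from collections import OrderedDict


class StoreManager:
    """Routes entity-feature pairs between a bounded in-memory store
    and an external store (both hypothetical stand-ins)."""

    def __init__(self, capacity):
        self.capacity = capacity      # assumed: maximum number of pairs kept in memory
        self.memory = OrderedDict()   # (entity, feature) -> count, least recently used first
        self.external = {}            # stand-in for a disk- or database-backed store

    def observe(self, entity, feature):
        pair = (entity, feature)
        if pair in self.memory:
            # Already in memory: bump the count and mark as recently used.
            self.memory[pair] += 1
            self.memory.move_to_end(pair)
            return
        if len(self.memory) >= self.capacity:
            # Assumed policy: evict the least recently used pair to the external store.
            old_pair, old_count = self.memory.popitem(last=False)
            self.external[old_pair] = self.external.get(old_pair, 0) + old_count
        self.memory[pair] = 1

    def count(self, entity, feature):
        # Total observations across both stores.
        pair = (entity, feature)
        return self.memory.get(pair, 0) + self.external.get(pair, 0)


manager = StoreManager(capacity=2)
manager.observe("Paris", "in_list:cities")
manager.observe("Paris", "in_list:cities")
manager.observe("Konig", "preceded_by:Dr.")
manager.observe("Seattle", "in_list:cities")   # memory is full, so the Paris pair is evicted
manager.observe("Paris", "in_list:cities")     # the Konig pair is evicted, Paris returns to memory
```

Counts are summed across both stores, so a pair that was evicted and later observed again keeps its full history.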
    • 3. Granted patent
    • Leveraging cross-document context to label entity
    • US07970808B2
    • 2011-06-28
    • US12114824
    • 2008-05-05
    • Arnd Christian Konig; Venkatesh Ganti
    • Arnd Christian Konig; Venkatesh Ganti
    • G06F17/30
    • G06F17/278; G06F17/2785; Y10S707/962
    • Entities, such as people, places and things, are labeled based on information collected across a possibly large number of documents. One or more documents are scanned to recognize the entities, and features are extracted from the context in which those entities occur in the documents. Observed entity-feature pairs are stored either in an in-memory store or an external store. A store manager optimizes use of the limited amount of space for an in-memory store by determining which store to put an entity-feature pair in, and when to evict features from the in-memory store to make room for new pairs. Features that may be observed in an entity's context may take forms such as specific word sequences or membership in a particular list.
    • 4. Patent application
    • QUERY CLASSIFICATION USING SEARCH RESULT TAG RATIOS
    • US20110125791A1
    • 2011-05-26
    • US12625594
    • 2009-11-25
    • Arnd Christian Konig; Venkatesh Ganti; Xiao Li
    • Arnd Christian Konig; Venkatesh Ganti; Xiao Li
    • G06F17/30
    • G06F16/951
    • Techniques are described herein for classifying a search query with respect to query intent using search result tag ratios. A tag is a character or a combination of characters (e.g., one or more words) that indicates a property of a document, such as a topic of the document, a type of entity (i.e., subject matter) the document references, etc. A search result tag ratio is defined as a fraction (e.g., a proportion, a percentage, etc.) of the documents in a search result that includes a respective tag. A search query may be classified based on back-off ratios, which are tag ratios of search queries that are related to the search query to be classified. Tag ratios may be pre-computed (i.e., calculated before the corresponding search queries are received from users).
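The tag-ratio definition in this abstract can be written out as a short Python sketch. The ratio itself follows the abstract (the fraction of documents in a search result that include a tag). How back-off ratios are combined and how a ratio becomes a class label are not stated there, so the averaging over related queries, the `threshold` parameter, and the names `tag_ratios` and `classify` are assumptions made for illustration.

```python
def tag_ratios(results):
    """results: one set of tags per document in a search result.
    Returns, for each tag, the fraction of documents that include it."""
    if not results:
        return {}
    counts = {}
    for tags in results:
        for tag in tags:
            counts[tag] = counts.get(tag, 0) + 1
    return {tag: n / len(results) for tag, n in counts.items()}


def classify(query, precomputed, related, threshold=0.5):
    """Return the tags whose ratio meets the (assumed) threshold.
    Falls back to the ratios of related queries when the query itself
    has no pre-computed ratios."""
    ratios = precomputed.get(query)
    if ratios is None:
        # Back-off (assumed form): average the ratios of related queries.
        known = [precomputed[q] for q in related.get(query, []) if q in precomputed]
        if not known:
            return set()
        tags = set().union(*known)
        ratios = {t: sum(r.get(t, 0.0) for r in known) / len(known) for t in tags}
    return {t for t, r in ratios.items() if r >= threshold}


# Ratios computed ahead of time for a query with known search results.
precomputed = {
    "canon eos": tag_ratios([{"product"}, {"product", "review"}, {"news"}, {"product"}]),
}
# Hypothetical mapping from an unseen query to related queries.
related = {"canon eos 5d lens cap": ["canon eos"]}

labels = classify("canon eos 5d lens cap", precomputed, related)
```

Here the unseen query inherits the ratios of its one related query, so only the tag present in at least half of that query's results is returned.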
    • 5. Patent application
    • CLICK-THROUGH PREDICTION FOR NEWS QUERIES
    • US20100299350A1
    • 2010-11-25
    • US12469692
    • 2009-05-21
    • Arnd Christian Konig; Michael Gamon; Qiang Wu; Roger P. Menezes; Monwhea Jeng
    • Arnd Christian Konig; Michael Gamon; Qiang Wu; Roger P. Menezes; Monwhea Jeng
    • G06F17/30
    • G06F17/30864
    • Described is estimating whether an online search query is a news-related query, and if so, outputting news-related results in association with other search results returned in response to the query. The query is processed into features, including by accessing corpora that correspond to relatively current events, e.g., recently crawled from news and blog articles. A corpus of static reference data, such as an online encyclopedia, may be used to help determine whether the query is less likely to be about current events. Features include frequency-related data and context-related data corresponding to frequency and context information maintained in the corpora. Additional features may be obtained by processing text of the query itself, e.g., “query-only” features.
    • 6. Patent application
    • LOCAL SEARCH USING FEATURE BACKOFF
    • US20120158705A1
    • 2012-06-21
    • US12970928
    • 2010-12-16
    • Arnd Christian Konig; Klaus L. Berberich; Dimitrios Lymberopoulos
    • Arnd Christian Konig; Klaus L. Berberich; Dimitrios Lymberopoulos
    • G06F17/30
    • G06F16/9537; G06F16/58
    • A local search system is described herein that provides a framework for the integration of various external sources to improve local search ranking. The framework provided by the local search system described herein uses a notion of backoff. The system uses a generalization of the concept of backoff to improve local search results that incorporate a variety of data features. The system can apply backoff in multiple dimensions at the same time to generate features for local search ranking. The system integrates various additional data sources, such as web access logs, driving direction request logs, reviews, and so forth, to quantify popularity and distance (or distance sensitivity) into a framework for local search ranking. Thus, the system provides search results that are more relevant by incorporating a number of data sources into the ranking in a manner that handles abnormalities in the data well.
    • 8. Granted patent
    • Database configuration analysis
    • US07805443B2
    • 2010-09-28
    • US11275657
    • 2006-01-20
    • Arnd Christian Konig; Shubha Umesh Nabar
    • Arnd Christian Konig; Shubha Umesh Nabar
    • G06F7/00
    • G06F17/30306
    • To determine a configuration for a database system, a plurality of queries may be sampled from a representative workload using statistical inference to compute the probability of correctly selecting one of a plurality of evaluation configurations. The probability of correctly selecting may determine which and/or how many queries to sample, and/or may be compared to a target probability threshold to determine if more queries must be sampled. The configuration from the plurality of configurations with the lowest estimated cost of executing the representative workload may be determined based on the probability of selecting correctly. Estimator variance may be reduced through a stratified sampling scheme that leverages commonality, such as an average cost of execution, between queries based on query templates. The applicability of the Central Limit Theorem may be verified and used to determine which and/or how many queries to sample.
    • 9. Granted patent
    • Database monitoring system
    • US07194451B2
    • 2007-03-20
    • US10788077
    • 2004-02-26
    • Surajit Chaudhuri; Arnd Christian Konig; Vivek Narasayya
    • Surajit Chaudhuri; Arnd Christian Konig; Vivek Narasayya
    • G06F17/30
    • G06F17/30368; Y10S707/955; Y10S707/962; Y10S707/99932
    • A framework is provided within a database system for specifying database monitoring rules that will be evaluated as part of the execution code path of database events being monitored. The occurrence of a selected database event triggers a rule that evaluates some parameter of an object related to the event against a condition in the rule. If the condition is met, a specified action is taken that can alter the execution of the database event or database system performance. Lightweight aggregation tables are utilized to enable aggregation of object parameter values so that presently occurring events can be compared to a summary of the object parameter values from previously occurring database events. Signatures are assigned to queries based on the structure of the query plan so that information in the lightweight aggregation tables can be grouped according to query signature.
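The event, condition, and action flow in this abstract can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the patented design: the names `Rule`, `Monitor`, and `on_event`, the `"query_commit"` event name, the `duration` parameter, and the use of a plain string as a query-plan signature are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Rule:
    event: str                                  # event name that triggers the rule
    condition: Callable[[dict, dict], bool]     # (object parameters, aggregate summary) -> bool
    action: Callable[[dict], None]              # taken when the condition holds


class Monitor:
    def __init__(self):
        self.rules = []
        # Lightweight aggregation table: signature -> running count and total duration.
        self.aggregates = {}

    def add_rule(self, rule):
        self.rules.append(rule)

    def on_event(self, event, obj):
        # Compare the current event against the summary of earlier events
        # that share the same query signature.
        summary = self.aggregates.get(obj["signature"], {"count": 0, "total": 0.0})
        for rule in self.rules:
            if rule.event == event and rule.condition(obj, summary):
                rule.action(obj)
        # Fold the current event into the aggregate after the rules have run.
        self.aggregates[obj["signature"]] = {
            "count": summary["count"] + 1,
            "total": summary["total"] + obj["duration"],
        }


def much_slower_than_usual(obj, summary):
    # Hypothetical condition: more than 3x the average duration seen so far.
    return summary["count"] > 0 and obj["duration"] > 3 * summary["total"] / summary["count"]


flagged = []
monitor = Monitor()
monitor.add_rule(Rule("query_commit", much_slower_than_usual, flagged.append))

for duration in (1.0, 1.2, 10.0):
    monitor.on_event("query_commit", {"signature": "scan+join", "duration": duration})
```

Only the third event is flagged, because it is the only one whose duration exceeds three times the running average for its signature.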
    • 10. Granted patent
    • Click-through prediction for news queries
    • US08719298B2
    • 2014-05-06
    • US12469692
    • 2009-05-21
    • Arnd Christian Konig; Michael Gamon; Qiang Wu; Roger P. Menezes; Monwhea Jeng
    • Arnd Christian Konig; Michael Gamon; Qiang Wu; Roger P. Menezes; Monwhea Jeng
    • G06F17/30
    • G06F17/30864
    • Described is estimating whether an online search query is a news-related query, and if so, outputting news-related results in association with other search results returned in response to the query. The query is processed into features, including by accessing corpora that correspond to relatively current events, e.g., recently crawled from news and blog articles. A corpus of static reference data, such as an online encyclopedia, may be used to help determine whether the query is less likely to be about current events. Features include frequency-related data and context-related data corresponding to frequency and context information maintained in the corpora. Additional features may be obtained by processing text of the query itself, e.g., “query-only” features.