会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 94. 发明申请
    • MINING NEW WORDS FROM A QUERY LOG FOR INPUT METHOD EDITORS
    • 从输入法编辑器的查询记录中挖掘新的词
    • US20100088303A1
    • 2010-04-08
    • US12244774
    • 2008-10-03
    • Weizhu ChenQian Xun LiLi JuZheng ChenDong LIZhikai Fan
    • Weizhu ChenQian Xun LiLi JuZheng ChenDong LIZhikai Fan
    • G06F17/30
    • G06F17/30731
    • Described is a technology in which new words (including a phrase or set of Chinese characters) are mined from a query log. The new words may be added to (or otherwise supplement) an IME dictionary. A set of candidate queries may be selected from the log based upon market (e.g., the Chinese market) and/or by language. From this set, various filtering steps are performed to locate only new words that are frequently in used. For example, only frequent queries are kept for further processing, which may include filtering out queries based on length (e.g., less than two or greater than eight Chinese characters), and/or filtering out queries based on too many stop-words in the query. Processing may also include filtering out a query that is a substring of a larger query, or Vice-versa. Also described is Pinyin-based clustering and filtering, and filtering out queries already handled in the dictionary.
    • 描述了从查询日志中挖出新词(包括短语或一组汉字)的技术。 新词可能会添加到(或以其他方式补充)IME词典。 可以基于市场(例如,中国市场)和/或按语言从日志中选择一组候选查询。 从该集合中,执行各种过滤步骤以仅定位经常使用的新词。 例如,只有频繁的查询被保留用于进一步的处理,其可以包括基于长度(例如,少于两个或大于八个汉字)过滤掉查询,和/或基于过多的停止词过滤掉查询 查询。 处理还可以包括过滤掉作为较大查询的子串的查询,反之亦然。 还描述了基于拼音的群集和过滤,并且过滤掉已经在字典中处理的查询。
    • 95. 发明申请
    • Internet Visualization System and Related User Interfaces
    • 互联网可视化系统和相关用户界面
    • US20080256444A1
    • 2008-10-16
    • US11972073
    • 2008-01-10
    • Min WangWeizhu ChenBenyu ZhangZheng ChenJian Wang
    • Min WangWeizhu ChenBenyu ZhangZheng ChenJian Wang
    • G06F3/00
    • G06F17/30864G06F2216/03
    • Systems and methods are described for an Internet visualization system and related user interfaces. In one implementation, the system analyzes Internet search logs to determine most popular search queries across the world at a current time. A user interface displays a keyword of each of the most popular queries in a single visual display that relates each query to a geographical location of greatest popularity. The system can also filter queries according to demographics. In one implementation the user interface provides a 3-dimensional Internet visualization that adopts an ocean or seascape theme. The ocean floor displays a map of the world, and query bubbles rise from geographical locations on the map. The size and duration of each query bubble denotes the relative popularity of a given query.
    • 为互联网可视化系统和相关的用户界面描述了系统和方法。 在一个实现中,系统分析互联网搜索日志以确定当前世界上最流行的搜索查询。 用户界面在单个视觉显示中显示每个最流行的查询的关键字,其将每个查询与最受欢迎的地理位置相关联。 该系统还可以根据人口特征来过滤查询。 在一个实现中,用户界面提供采用海洋或海景主题的三维互联网可视化。 海底显示世界地图,查询气泡从地图上的地理位置上升。 每个查询气泡的大小和持续时间表示给定查询的相对受欢迎程度。
    • 96. 发明申请
    • CLICK MODELING FOR URL PLACEMENTS IN QUERY RESPONSE PAGES
    • 点击建模查询响应页面中的网址
    • US20110302031A1
    • 2011-12-08
    • US12795631
    • 2010-06-07
    • Weizhu ChenGang WangZheng ChenZhikai FanThomas Minka
    • Weizhu ChenGang WangZheng ChenZhikai FanThomas Minka
    • G06F15/18G06Q30/00G06N5/02
    • G06Q30/02G06Q30/0255
    • A “General Click Model” (GCM) is constructed using a Bayesian network that is inherently capable of modeling “tail queries” by building the model on multiple attribute values that are shared across queries. More specifically, the GCM learns and predicts user click behavior towards URLs displayed on a query results page returned by a search engine. Unlike conventional click modeling approaches that learn models based on individual queries, the GCM learns click models from multiple attributes, with the influence of different attribute values being measured by Bayesian inference. This provides an advantage in learning that enables the GCM to achieve improved generalization and results, especially for tail queries, than conventional click models. In addition, most conventional click models consider only position and the identity of URLs when learning the model. In contrast, the GCM considers more session-specific attributes in making a final prediction for anticipated or expected user click behaviors.
    • 使用贝叶斯网络构建“通用点击模型”(GCM),该贝叶斯网络本质上能够通过在查询之间共享的多个属性值上构建模型来建模“尾部查询”。 更具体地说,GCM学习并预测用户对搜索引擎返回的查询结果页面上显示的URL的点击行为。 不同于传统的点击建模方法,基于个别查询的模型,GCM从多个属性学习点击模型,不同属性值的影响是通过贝叶斯推理来衡量的。 这提供了学习的优势,使得GCM能够实现改进的泛化和结果,特别是尾部查询,而不是传统的点击模型。 此外,大多数传统的点击模型只在学习模型时考虑URL的位置和身份。 相比之下,GCM考虑更多的会话特定属性来对预期或预期的用户点击行为进行最终预测。
    • 98. 发明授权
    • Constructing web query hierarchies from click-through data
    • 从点击型数据构建Web查询层次结构
    • US07870132B2
    • 2011-01-11
    • US12020574
    • 2008-01-28
    • Weizhu ChenBenyu ZhangZheng ChenJian WangDou Shen
    • Weizhu ChenBenyu ZhangZheng ChenJian WangDou Shen
    • G06F17/30
    • G06F17/30864
    • The claimed subject matter is directed to constructing query hierarchies in response to a query request. To construct a query hierarchy, a list of related candidate queries is generated in response to the received query request. The list of related candidate queries is generated by determining the relative coverage of information shared by the candidate queries and the query request. Relationships between the submitted query request and the candidate queries in the list are determined based upon the extent of relative coverage of information shared by the candidate queries and the query request. A query hierarchy is then constructed to reflect the determined relationships between the query request and the candidate queries.
    • 所要求保护的主题涉及响应于查询请求构建查询层次结构。 为了构建查询层次结构,响应于接收的查询请求生成相关候选查询的列表。 通过确定候选查询和查询请求共享的信息的相对覆盖率来生成相关候选查询的列表。 基于候选查询和查询请求共享的信息的相对覆盖范围确定列表中提交的查询请求与候选查询之间的关系。 然后构建查询层次结构以反映所确定的查询请求和候选查询之间的关系。