    • 5. Granted invention patent
    • Snippet extraction and ranking
    • US08954425B2
    • 2015-02-10
    • US12796345
    • 2010-06-08
    • Rong Xiao, Qiang Hao, Changhu Wang, Rui Cai, Lei Zhang
    • G06F17/30
    • G06F17/30867, G06F17/30241
    • Described herein is a technology that facilitates efficient automated mining of topic-related aspects of user-generated content based on automated analysis of the user-generated content. Locations are automatically learned based on dividing documents into document segments, and decomposing the segments into local topics and global topics. Techniques are described that facilitate automatically extracting snippets. These techniques include, for example, computer annotating travelogues with learned tags and images, performing topic learning to obtain an interest model, performing location matching based on the interest model, calculating geographic and semantic relevance scores, ranking snippets based on the geographic and semantic relevance scores, and searching snippets with a “location+context term” query.
    • 6. Granted invention patent
    • Website design pattern modeling
    • US08370119B2
    • 2013-02-05
    • US12389368
    • 2009-02-19
    • Rui Cai, Jiang-Ming Yang, Lei Zhang, Wei-Ying Ma
    • G06G7/48
    • G06F17/218, G06F8/75, G06F17/27
    • Website design pattern modeling technique embodiments are presented that model a website's design patterns. This can be based on the website's layout elements, its URL tokens, or both. When based on both, the design patterns can be modeled separately using first the layout elements and then the URL tokens, or vice versa. Alternately, the modeling can be based on coupled layout and URL token patterns. In operation, the modeling involves first identifying layout elements and/or URL tokens found on at least some of the pages of the website. The website design patterns are then modeled based on the occurrences of the identified layout elements and/or URL tokens in pages of the website. In cases where a coupled modeling scheme is employed, a modeling technique that exploits the correlations between the layout elements and URL tokens is used.
    • 7. Invention patent application
    • Snippet Extraction and Ranking
    • US20110302162A1
    • 2011-12-08
    • US12796345
    • 2010-06-08
    • Rong Xiao, Qiang Hao, Changhu Wang, Rui Cai, Lei Zhang
    • G06F17/30
    • G06F17/30867, G06F17/30241
    • Described herein is a technology that facilitates efficient automated mining of topic-related aspects of user-generated content based on automated analysis of the user-generated content. Locations are automatically learned based on dividing documents into document segments, and decomposing the segments into local topics and global topics. Techniques are described that facilitate automatically extracting snippets. These techniques include, for example, computer annotating travelogues with learned tags and images, performing topic learning to obtain an interest model, performing location matching based on the interest model, calculating geographic and semantic relevance scores, ranking snippets based on the geographic and semantic relevance scores, and searching snippets with a “location+context term” query.
    • 8. Invention patent application
    • WEBSITE DESIGN PATTERN MODELING
    • US20100211927A1
    • 2010-08-19
    • US12389368
    • 2009-02-19
    • Rui Cai, Jiang-Ming Yang, Lei Zhang, Wei-Ying Ma
    • G06F9/44
    • G06F17/218, G06F8/75, G06F17/27
    • Website design pattern modeling technique embodiments are presented that model a website's design patterns. This can be based on the website's layout elements, its URL tokens, or both. When based on both, the design patterns can be modeled separately using first the layout elements and then the URL tokens, or vice versa. Alternately, the modeling can be based on coupled layout and URL token patterns. In operation, the modeling involves first identifying layout elements and/or URL tokens found on at least some of the pages of the website. The website design patterns are then modeled based on the occurrences of the identified layout elements and/or URL tokens in pages of the website. In cases where a coupled modeling scheme is employed, a modeling technique that exploits the correlations between the layout elements and URL tokens is used.
    • 9. Invention patent application
    • EXTRACTING STRUCTURED DATA FROM WEB FORUMS
    • US20100211533A1
    • 2010-08-19
    • US12388517
    • 2009-02-18
    • Jiangming Yang, Rui Cai, Lei Zhang, Wei-Ying Ma
    • G06F15/18, G06N5/02
    • G06N20/00, G06F16/958
    • The web forum data extraction technique is designed for the structured data extraction of data on web forums using both page-level information and site-level knowledge. To do this, the technique finds the kinds of page objects a forum site has, which object a page belongs to, and how different page objects are connected with each other. This information can be obtained by re-constructing the sitemap of the target forum which is based on a Data Object Model of the target forum. The web forum data extraction technique collects three kinds of evidence for data extraction: 1) inner-page features which cover both semantic and layout information on an individual page; 2) inter-vertex features which describe linkage-related observations; and 3) inner-vertex features which characterize interrelationships among pages in one vertex. The technique employs Markov Logic Networks to combine the types of evidence statistically for inference and thereby can extract the desired structures.
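The sitemap idea in the web forum abstract (which kinds of page objects a site has, which object a page belongs to, and how the objects link to each other) can be illustrated with a small sketch. This is not the patented method: it uses no Markov Logic Network and none of the three feature types from the abstract. It assumes, purely for illustration, that pages sharing a coarse URL pattern belong to the same vertex. The names `url_signature`, `build_sitemap`, and `PAGES`, and the `forum.example` URLs, are hypothetical.

```python
from __future__ import annotations

import re
from collections import Counter, defaultdict
from urllib.parse import parse_qsl, urlparse


def url_signature(url: str) -> str:
    """Collapse a URL into a coarse pattern (assumed heuristic, not from the patent).

    Digit runs in the path become '<num>'; query values are dropped and
    only the sorted parameter names are kept.
    """
    parts = urlparse(url)
    path = re.sub(r"\d+", "<num>", parts.path)
    keys = sorted(k for k, _ in parse_qsl(parts.query, keep_blank_values=True))
    return path + ("?" + "&".join(keys) if keys else "")


def build_sitemap(pages: dict[str, list[str]]):
    """Group pages into vertices by URL signature and count links between vertices.

    `pages` maps each crawled URL to the list of URLs it links to.
    Links to pages outside the crawl are ignored.
    """
    vertices: dict[str, list[str]] = defaultdict(list)
    for url in pages:
        vertices[url_signature(url)].append(url)

    edges: Counter = Counter()
    for src, links in pages.items():
        for dst in links:
            if dst in pages:
                edges[(url_signature(src), url_signature(dst))] += 1
    return dict(vertices), edges


# Hypothetical toy crawl of a forum: one index page, two boards, three threads.
PAGES = {
    "http://forum.example/index.php": [
        "http://forum.example/board.php?id=1",
        "http://forum.example/board.php?id=2",
    ],
    "http://forum.example/board.php?id=1": [
        "http://forum.example/thread.php?tid=10",
        "http://forum.example/thread.php?tid=11",
    ],
    "http://forum.example/board.php?id=2": [
        "http://forum.example/thread.php?tid=12",
    ],
    "http://forum.example/thread.php?tid=10": [],
    "http://forum.example/thread.php?tid=11": [],
    "http://forum.example/thread.php?tid=12": [],
}

if __name__ == "__main__":
    vertices, edges = build_sitemap(PAGES)
    for signature, members in vertices.items():
        print(signature, len(members))
    for (src, dst), count in edges.items():
        print(src, "->", dst, count)
```

On the toy crawl this yields three vertices (index, board, thread) with links running index to board and board to thread. A real forum would need the layout and linkage evidence the abstract describes, since URL patterns alone often fail to separate page types.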