    • 5. Invention application
    • Techniques for Generating Balanced and Class-Independent Training Data From Unlabeled Data Set
    • US20130097103A1
    • 2013-04-18
    • US13274002
    • 2011-10-14
    • Suresh N. Chari; Ian Michael Molloy; Youngja Park; Zijie Qi
    • Suresh N. Chari; Ian Michael Molloy; Youngja Park; Zijie Qi
    • G06F15/18; G06F17/30
    • G06N20/00
    • Techniques for creating training sets for predictive modeling are provided. In one aspect, a method for generating training data from an unlabeled data set is provided which includes the following steps. A small initial set of data is selected from the unlabeled data set. Labels are acquired for the initial set of data selected from the unlabeled data set resulting in labeled data. The data in the unlabeled data set is clustered using a semi-supervised clustering process along with the labeled data to produce data clusters. Data samples are chosen from each of the clusters to use as the training data. The selecting, presenting, clustering and choosing steps are repeated with one or more additional sets of data selected from the unlabeled data set until a desired amount of training data has been obtained, wherein at each iteration an amount of the labeled data is increased.
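The iterative loop described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented method: the data are one-dimensional numbers, `oracle` is a hypothetical stand-in for whoever supplies labels, and nearest-labeled-centroid assignment is swapped in as a very simple substitute for the semi-supervised clustering step. All function and parameter names are invented for the example.

```python
import random

def seeded_clusters(points, labeled):
    """Assign every point to the label whose labeled mean is nearest.
    (A simple stand-in for semi-supervised clustering; an assumption.)"""
    by_label = {}
    for x, y in labeled.items():
        by_label.setdefault(y, []).append(x)
    centroids = {y: sum(xs) / len(xs) for y, xs in by_label.items()}
    clusters = {y: [] for y in centroids}
    for x in points:
        nearest = min(centroids, key=lambda y: abs(x - centroids[y]))
        clusters[nearest].append(x)
    return clusters

def build_training_set(points, oracle, target_size, batch=4, per_cluster=2, seed=0):
    """Repeat select -> label -> cluster -> choose until enough labeled data exists."""
    rng = random.Random(seed)
    labeled = {}                                 # point -> label
    candidates = rng.sample(points, batch)       # small initial selection
    while True:
        for x in candidates:                     # acquire labels for the selection
            labeled[x] = oracle(x)
        if len(labeled) >= target_size:
            return labeled
        clusters = seeded_clusters(points, labeled)
        candidates = []
        for members in clusters.values():        # same quota from every cluster
            pool = [x for x in members if x not in labeled]
            candidates += rng.sample(pool, min(per_cluster, len(pool)))
        if not candidates:                       # nothing left to label
            return labeled

if __name__ == "__main__":
    # Hypothetical imbalanced data: 90 "common" points, 10 "rare" points.
    data = [i * 0.1 for i in range(90)] + [100.0 + i for i in range(10)]
    training = build_training_set(data, lambda x: "rare" if x > 50 else "common", 12)
    print(len(training), "labeled samples")
```

Drawing the same quota from every cluster is what pushes the result toward balance, but only once each class has at least one labeled seed. A rare class that the first random selection misses stays invisible in this simplified version.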
    • 6. Granted invention patent
    • System and method for semantic video segmentation based on joint audiovisual and text analysis
    • US08121432B2
    • 2012-02-21
    • US12055023
    • 2008-03-25
    • Chitra Dorai; Ying Li; Youngja Park
    • Chitra Dorai; Ying Li; Youngja Park
    • G06K9/36
    • G06F17/30787; G06F17/30796
    • System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.
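The merging step in this abstract, which groups temporally adjacent segments that are semantically related, can be illustrated with a small sketch. It assumes each homogeneous segment has already been reduced to a set of keywords, and it uses Jaccard overlap with an arbitrary threshold as the relatedness test. Both choices are assumptions made for illustration; the audio and visual analysis that the abstract also relies on is left out entirely.

```python
def merge_segments(segments, threshold=0.25):
    """Merge each segment into the previous unit when their keyword sets
    overlap enough (Jaccard similarity >= threshold); otherwise start a new unit."""
    units = []
    for keywords in segments:
        keywords = set(keywords)
        if units:
            prev = units[-1]
            union = prev | keywords
            if union and len(prev & keywords) / len(union) >= threshold:
                units[-1] = union        # extend the current semantic unit
                continue
        units.append(keywords)           # topic changed: open a new unit
    return units

if __name__ == "__main__":
    # Hypothetical keyword sets for four consecutive segments.
    segments = [
        {"loan", "rate"},
        {"rate", "bank"},
        {"soccer", "goal"},
        {"goal", "league"},
    ]
    for unit in merge_segments(segments):
        print(sorted(unit))
```

Comparing each segment only with the unit immediately before it preserves temporal adjacency: two segments on the same topic are never merged across an unrelated segment that sits between them.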
    • 8. Granted invention patent
    • System, method, program product, and networking use for recognizing words and their parts of speech in one or more natural languages
    • US07680649B2
    • 2010-03-16
    • US10173931
    • 2002-06-17
    • Youngja Park
    • Youngja Park
    • G06F17/21; G06F17/20
    • G06F17/2755; G06F17/278
    • A system, method, and computer program are disclosed for recognizing one or more words not listed in a dictionary database. One or more sequences of characters in the word are checked to determine a probability that the word is valid. A prefix removal process removes any prefixes from a word, and obtains information about the removed prefix. A suffix removal process removes any suffixes from the word, and obtains information about the removed suffix. A root process obtains information about a root word from the dictionary database. A combination process then determines if the prefix, the root, and the suffix can be combined into a valid word as defined by one or more combination rules, obtains one or more of the possible parts of speech of the valid word, and stores the parts of speech with the valid word in the dictionary database.
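The prefix, suffix, root, and combination steps in this abstract can be sketched on toy data. Everything below is hypothetical: the dictionary, the affix tables, and the single combination rule (a suffix requires a particular part of speech on the root and produces a new one) are assumptions chosen to keep the example short. The character-sequence probability check mentioned in the abstract is omitted.

```python
# Hypothetical toy data; none of these tables come from the patent.
DICTIONARY = {"happy": {"adj"}, "kind": {"adj"}, "read": {"verb"}}
PREFIXES = {"un": "negation", "re": "repetition"}
# suffix -> (part of speech required of the root, part of speech produced)
SUFFIXES = {"ness": ("adj", "noun"), "able": ("verb", "adj"), "ly": ("adj", "adv")}

def recognize(word):
    """Return the possible parts of speech of `word`, or an empty set.
    Newly recognized words are stored back into DICTIONARY."""
    if word in DICTIONARY:
        return set(DICTIONARY[word])
    # Try the word with no prefix removed, then with each matching prefix removed.
    for prefix in [""] + [p for p in PREFIXES if word.startswith(p)]:
        rest = word[len(prefix):]
        for suffix in [""] + [s for s in SUFFIXES if rest.endswith(s)]:
            if not prefix and not suffix:
                continue                      # nothing was removed
            root = rest[:len(rest) - len(suffix)]
            root_pos = DICTIONARY.get(root)
            if not root_pos:
                continue                      # root is not a known word
            if suffix:
                required, produced = SUFFIXES[suffix]
                if required not in root_pos:
                    continue                  # combination rule rejects this split
                pos = {produced}
            else:
                pos = set(root_pos)           # a bare prefix keeps the root's tags
            DICTIONARY[word] = pos            # remember the word for next time
            return set(pos)
    return set()

if __name__ == "__main__":
    for w in ["unkindness", "readable", "unhappy", "xyzzy"]:
        print(w, sorted(recognize(w)))
```

Trying every prefix and suffix split, rather than stripping greedily, avoids misreading a word such as "readable" as the prefix "re" plus an unknown remainder.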
    • 9. Invention application
    • SYSTEM AND METHOD FOR SEMANTIC VIDEO SEGMENTATION BASED ON JOINT AUDIOVISUAL AND TEXT ANALYSIS
    • US20080175556A1
    • 2008-07-24
    • US12055023
    • 2008-03-25
    • Chitra Dorai; Ying Li; Youngja Park
    • Chitra Dorai; Ying Li; Youngja Park
    • H04N5/93
    • G06F17/30787; G06F17/30796
    • System and method for partitioning a video into a series of semantic units where each semantic unit relates to a generally complete thematic topic. A computer implemented method for partitioning a video into a series of semantic units wherein each semantic unit relates to a theme or a topic, comprises dividing a video into a plurality of homogeneous segments, analyzing audio and visual content of the video, extracting a plurality of keywords from the speech content of each of the plurality of homogeneous segments of the video, and detecting and merging a plurality of groups of semantically related and temporally adjacent homogeneous segments into a series of semantic units in accordance with the results of both the audio and visual analysis and the keyword extraction. The present invention can be applied to generate important table-of-contents as well as index tables for videos to facilitate efficient video topic searching and browsing.
    • 10. Invention application
    • Role Mining With User Attribution Using Generative Models
    • US20120246098A1
    • 2012-09-27
    • US13411174
    • 2012-03-02
    • Suresh N. Chari; Ian Michael Molloy; Youngja Park
    • Suresh N. Chari; Ian Michael Molloy; Youngja Park
    • G06F15/18
    • G06N99/005; G06F21/604
    • Applications of machine learning techniques such as Latent Dirichlet Allocation (LDA) and author-topic models (ATM) to the problems of mining of user roles to specify access control policies from entitlement as well as logs which contain record of the usage of these entitlements are provided. In one aspect, a method for performing role mining given a plurality of users and a plurality of permissions is provided. The method includes the following steps. At least one generative machine learning technique, e.g., LDA, is used to obtain a probability distribution θ for user-to-role assignments and a probability distribution β for role-to-permission assignments. The probability distribution θ for user-to-role assignments and the probability distribution β for role-to-permission assignments are used to produce a final set of roles, including user-to-role assignments and role-to-permission assignments.
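The abstract does not say how the distributions θ and β are turned into a final set of roles. One simple possibility, shown here purely as an assumption, is to threshold them: a user receives every role whose probability in θ is high enough, and a role receives every permission whose probability in β is high enough. The thresholds, the data layout, and the function name are all hypothetical, and θ and β are taken as given rather than fitted with LDA.

```python
def discretize(theta, beta, user_threshold=0.3, perm_threshold=0.2):
    """Turn probabilistic assignments into discrete ones by thresholding.
    theta: user -> list of role probabilities (index = role id)
    beta:  list (index = role id) of {permission: probability}"""
    user_roles = {
        user: {role for role, p in enumerate(dist) if p >= user_threshold}
        for user, dist in theta.items()
    }
    role_perms = {
        role: {perm for perm, p in dist.items() if p >= perm_threshold}
        for role, dist in enumerate(beta)
    }
    return user_roles, role_perms

if __name__ == "__main__":
    # Hypothetical distributions for two users and two roles.
    theta = {"alice": [0.9, 0.1], "bob": [0.5, 0.5]}
    beta = [
        {"read": 0.6, "write": 0.35, "admin": 0.05},
        {"read": 0.1, "admin": 0.9},
    ]
    user_roles, role_perms = discretize(theta, beta)
    print(user_roles)
    print(role_perms)
```

Lower thresholds grant broader access, while higher ones can leave a user with no role at all, so a real deployment would need to choose them against the observed entitlements.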