会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明申请
    • METHOD FOR SEGMENTING COMMUNICATION TRANSCRIPTS USING UNSUPERVSED AND SEMI-SUPERVISED TECHNIQUES
    • 使用不间断和半监督技术分隔通信转录的方法
    • US20090112588A1
    • 2009-04-30
    • US11931806
    • 2007-10-31
    • Krishna KummamuruDeepak S. PadmanabhanShourya RoyL. Venkata Subramaniam
    • Krishna KummamuruDeepak S. PadmanabhanShourya RoyL. Venkata Subramaniam
    • G10L15/06
    • G10L15/04G06F16/355
    • A method is provided for forming discrete segment clusters of one or more sequential sentences from a corpus of communication transcripts of transactional communications that comprises dividing the communication transcripts of the corpus into a first set of sentences spoken by a caller and a second set of sentences spoken by a responder; generating a specified number of sentence clusters by grouping the first and second sets of sentences according to a measure of lexical similarity using an unsupervised partitional clustering method; generating a collection of sequences of sentence types by assigning a distinct sentence type to each sentence cluster and representing each sentence of each communication transcript of the corpus with the sentence type assigned to the sentence cluster into which the sentence is grouped; and generating a specified number of discrete segment clusters by successively merging sentence clusters according to a proximity-based measure between the sentence types assigned to the sentence clusters within sequences of the collection.
    • 提供了一种用于从事务通信的通信转录语料库形成一个或多个顺序句子的离散段聚类的方法,其包括将语料库的通信记录分成由呼叫者说出的第一组句子和第二组句子 由答复者 通过使用无监督分数聚类方法,根据词汇相似度的度量,对第一和第二组句子进行分组,从而产生指定数目的句子群; 通过为每个句子集分配不同的句子类型并以分配给句子分组的句子集合的句子类型表示语料库的每个通信录音的每个句子来生成句子序列的集合; 以及通过根据在集合的序列内分配给句子集群的句子类型之间的基于邻近度的度量连续地合并语句集群来生成指定数量的离散分段集群。
    • 7. 发明授权
    • Method for segmenting communication transcripts using unsupervised and semi-supervised techniques
    • 使用无监督和半监督技术分割沟通成绩单的方法
    • US07912714B2
    • 2011-03-22
    • US12060469
    • 2008-04-01
    • Krishna KummamuruDeepak S. PadmanabanShourya RoyL. Venkata Subramaniam
    • Krishna KummamuruDeepak S. PadmanabanShourya RoyL. Venkata Subramaniam
    • G10L15/06
    • G06F17/3071G10L15/04
    • A method is provided for forming discrete segment clusters of one or more sequential sentences from a corpus of communication transcripts of transactional communications that comprises dividing the communication transcripts of the corpus into a first set of sentences spoken by a caller and a second set of sentences spoken by a responder; generating a set of sentence clusters by grouping the first and second sets of sentences according to a measure of lexical similarity using an unsupervised partitional clustering method; generating a collection of sequences of sentence types by assigning a distinct sentence type to each sentence cluster and representing each sentence of each communication transcript of the corpus with the sentence type assigned to the sentence cluster into which the sentence is grouped; and generating a specified number of discrete segment clusters by successively merging sentence clusters according to a proximity-based measure between the sentence types assigned to the sentence clusters within sequences of the collection.
    • 提供了一种用于从事务通信的通信转录语料库形成一个或多个顺序句子的离散段聚类的方法,其包括将语料库的通信记录分成由呼叫者说出的第一组句子和第二组句子 由答复者 通过使用无监督分数聚类方法,根据词汇相似度的度量,对第一和第二组句子进行分组,从而生成一组句子群; 通过为每个句子集分配不同的句子类型并以分配给句子分组的句子集合的句子类型表示语料库的每个通信录音的每个句子来生成句子序列的集合; 以及通过根据在集合的序列内分配给句子集群的句子类型之间的基于邻近度的度量连续地合并语句集群来生成指定数量的离散分段集群。
    • 8. 发明授权
    • Clustering data including those with asymmetric relationships
    • 聚类数据,包括具有不对称关系的数据
    • US06925460B2
    • 2005-08-02
    • US09815616
    • 2001-03-23
    • Krishna KummamuruRaghuram KrishnapuramPradeep Kumar Dubey
    • Krishna KummamuruRaghuram KrishnapuramPradeep Kumar Dubey
    • G06F7/00G06F17/30
    • G06F17/30719G06F17/3071Y10S707/99933Y10S707/99935
    • The present invention relates to a method, system and computer program product for clustering data points and its application to text summarization, customer profiling for web personalization and product cataloging.The method for clustering data points with defined quantified relationships between them comprises the steps of obtaining lead value for each data point either by deriving from said quantified relationships or as given input, ranking each data point in a lead value sequence list in descending order of lead value, assigning the first data point in said lead value sequence list as the leader of the first cluster, and considering each subsequent data point in said lead value sequence list as a leader of a new cluster if its relationship with the leaders of each of the previous clusters is less than a defined threshold value or as a member of one or more clusters where its relationship with the cluster leader is more than or equal to said threshold value. The said relationships between data points are symmetric or asymmetric. Similarly, system and computer program product have also been claimed.
    • 本发明涉及用于聚类数据点的方法,系统和计算机程序产品及其应用于文本摘要,用于web个性化和产品编目的客户分析。 用于对其中具有定义的量化关系的数据点进行聚类的方法包括以下步骤:通过从所述量化关系导出或作为给定输入来获取每个数据点的引导值,以引导值序列表中的每个数据点按铅的降序排列 将所述引导值序列列表中的第一数据点分配为第一簇的引导符,并且如果其与每个的引导者的关系,则将所述引导值序列列表中的每个后续数据点视为新簇的引导者 先前的簇小于定义的阈值,或作为其与簇首的关系大于或等于所述阈值的一个或多个簇的成员。 数据点之间的关系是对称的或不对称的。 类似地,系统和计算机程序产品也被要求。
    • 10. 发明申请
    • METHOD FOR SEGMENTING COMMUNICATION TRANSCRIPTS USING UNSUPERVISED AND SEMI-SUPERVISED TECHNIQUES
    • 使用不受限制的和受监督的技术分隔通信转录的方法
    • US20090112571A1
    • 2009-04-30
    • US12060469
    • 2008-04-01
    • Krishna KummamuruDeepak S. PadmanabhanShourya RoyL. Venkata Subramaniam
    • Krishna KummamuruDeepak S. PadmanabhanShourya RoyL. Venkata Subramaniam
    • G06F17/20
    • G06F17/3071G10L15/04
    • A method is provided for forming discrete segment clusters of one or more sequential sentences from a corpus of communication transcripts of transactional communications that comprises dividing the communication transcripts of the corpus into a first set of sentences spoken by a caller and a second set of sentences spoken by a responder; generating a set of sentence clusters by grouping the first and second sets of sentences according to a measure of lexical similarity using an unsupervised partitional clustering method; generating a collection of sequences of sentence types by assigning a distinct sentence type to each sentence cluster and representing each sentence of each communication transcript of the corpus with the sentence type assigned to the sentence cluster into which the sentence is grouped; and generating a specified number of discrete segment clusters by successively merging sentence clusters according to a proximity-based measure between the sentence types assigned to the sentence clusters within sequences of the collection.
    • 提供了一种用于从事务通信的通信转录语料库形成一个或多个顺序句子的离散段聚类的方法,其包括将语料库的通信记录分成由呼叫者说出的第一组句子和第二组句子 由答复者 通过使用无监督分数聚类方法,根据词汇相似度的度量,对第一和第二组句子进行分组,从而生成一组句子群; 通过为每个句子集分配不同的句子类型并以分配给句子分组的句子集合的句子类型表示语料库的每个通信录音的每个句子来生成句子序列的集合; 以及通过根据在集合的序列内分配给句子集群的句子类型之间的基于邻近度的度量连续地合并语句集群来生成指定数量的离散分段集群。