    • 7. Granted invention patent
    • Method of speaker clustering for unknown speakers in conversational audio data
    • US5598507A
    • 1997-01-28
    • US226523
    • 1994-04-12
    • Donald G. Kimber; Lynn D. Wilcox; Francine R. Chen
    • Donald G. Kimber; Lynn D. Wilcox; Francine R. Chen
    • G10L15/06; G10L15/10; G10L15/14; G10L17/00; H04R3/00; G10L5/06
    • G10L15/07; G10L15/10; G10L15/142; G10L2015/0631
    • A method for clustering speaker data from a plurality of unknown speakers. The method includes the steps of providing a portion of audio data containing speech from at least all the speakers in the audio data and dividing the portion into data clusters. A pairwise distance between each pair of clusters is computed, the pairwise distance being based on a likelihood that the two clusters were created by the same speaker, the likelihood measurement being biased by the prior probability of speaker changes. The two clusters with the minimum pairwise distance are combined into a new cluster, and speaker models are trained for each of the remaining clusters, including the new cluster. The likelihood that two clusters were created by the same speaker may be biased by a Markov duration model based on speaker changes over the length of the initial data clusters.
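
The loop described in the abstract (split into initial clusters, score every pair, merge the closest pair, refit, repeat) can be sketched as below. This is only an illustrative sketch, not the patented implementation: it assumes each cluster is modeled by a single diagonal-covariance Gaussian, uses a generalized likelihood ratio as the pairwise distance, and stands in for the Markov duration model with a symmetric two-state Markov chain whose per-interval change probability is `p_change`. The names `cluster_speakers`, `glr_distance`, `same_speaker_prior`, and `p_change` are hypothetical and do not come from the patent record.

```python
import numpy as np

def gaussian_loglik(x):
    """Log-likelihood of the rows of x under a diagonal Gaussian fit to x itself."""
    mu = x.mean(axis=0)
    var = x.var(axis=0) + 1e-6  # small floor keeps the log finite
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mu) ** 2 / var)

def glr_distance(x, y):
    """Generalized likelihood ratio: small when one model explains both clusters."""
    merged = np.vstack([x, y])
    return gaussian_loglik(x) + gaussian_loglik(y) - gaussian_loglik(merged)

def same_speaker_prior(gap, p_change):
    """Assumed prior: symmetric two-state Markov chain, same state after `gap` steps."""
    return 0.5 * (1.0 + (1.0 - 2.0 * p_change) ** gap)

def cluster_speakers(segments, n_speakers, p_change=0.1):
    """Agglomerative clustering of equal-length feature segments (frames x dims)."""
    assert 0.0 < p_change < 0.5, "sketch assumes a change probability in (0, 0.5)"
    clusters = [{"idx": [i], "data": seg} for i, seg in enumerate(segments)]
    while len(clusters) > n_speakers:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Temporal gap between the closest initial segments of the two clusters.
                gap = min(abs(i - j) for i in clusters[a]["idx"] for j in clusters[b]["idx"])
                prior = same_speaker_prior(gap, p_change)
                # Bias the likelihood-based distance by the prior odds of "same speaker".
                dist = glr_distance(clusters[a]["data"], clusters[b]["data"])
                dist -= np.log(prior / (1.0 - prior))
                if best is None or dist < best[0]:
                    best = (dist, a, b)
        _, a, b = best
        # Merge the closest pair; its Gaussian is refit on the next distance pass.
        clusters[a] = {
            "idx": clusters[a]["idx"] + clusters[b]["idx"],
            "data": np.vstack([clusters[a]["data"], clusters[b]["data"]]),
        }
        del clusters[b]
    labels = [0] * len(segments)
    for label, cluster in enumerate(clusters):
        for i in cluster["idx"]:
            labels[i] = label
    return labels
```

Calling `cluster_speakers(segments, n_speakers=2)` on a list of per-interval feature matrices returns one cluster label per interval. In this sketch the prior only lowers the distance between temporally close intervals, reflecting the idea that adjacent intervals are more likely to come from the same speaker; the stopping rule (a fixed `n_speakers`) is also an assumption.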