会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 13. 发明申请
    • SPOKEN MAN-MACHINE INTERFACE WITH SPEAKER IDENTIFICATION
    • SPOKEN人机界面与扬声器识别
    • US20080319747A1
    • 2008-12-25
    • US12195136
    • 2008-08-20
    • Ralf KompeThomas Kemp
    • Ralf KompeThomas Kemp
    • G10L15/06
    • G10L15/065G10L17/02G10L17/04G10L2015/223
    • The method of operating a man-machine interface unit includes classifying at least one utterance of a speaker to be of a first type or of a second type. If the utterance is classified to be of the first type, the utterance belongs to a known speaker of a speaker data base, and if the utterance is classified to be of the second type, the utterance belongs to an unknown speaker that is not included in the speaker data base. The method also includes storing a set of utterances of the second type, clustering the set of utterances into clusters, wherein each cluster comprises utterances having similar features, and automatically adding a new speaker to the speaker data base based on utterances of one of the clusters.
    • 操作人机接口单元的方法包括将扬声器的至少一个话语分类为第一类型或第二类型。 如果发音被分类为第一类型,话语属于扬声器数据库的已知扬声器,并且如果话语被分类为第二类型,话语属于未包括在未知扬声器中的未知扬声器 扬声器数据库。 该方法还包括存储第二类型的话语集合,将该组语音聚类成群集,其中每个群集包括具有相似特征的话语,并且基于该群集中的一个的话语自动地将新的扬声器添加到该扬声器数据库 。
    • 15. 发明申请
    • Apparatus and method for segmentation of audio data into meta patterns
    • 将音频数据分割为元模式的装置和方法
    • US20050114388A1
    • 2005-05-26
    • US10985615
    • 2004-11-10
    • Silke GoronzyThomas KempRalf KompeYin LamKrzysztof MarasekRaquel Tato
    • Silke GoronzyThomas KempRalf KompeYin LamKrzysztof MarasekRaquel Tato
    • G10L25/00G06F17/00
    • G10L25/00
    • An audio data segmentation apparatus for segmenting of audio data comprises audio data input means for supplying audio data, audio data clipping means for dividing the audio data supplied by the audio data input means into audio clips of a predetermined length, class discrimination means for discriminating the audio clips supplied by the audio data clipping means into predetermined audio classes, the audio classes identifying a kind of audio data included in the respective audio clip and segmenting means for segmenting the audio data into audio meta patterns based on a sequence of audio classes of consecutive audio clips, each meta pattern being allocated to a predetermined type of contents of the audio data. It is difficult to achieve good results with known methods for segmentation of audio data into meta patterns since the rules for the allocation of the meta patterns are dissatisfying. This problem is solved by the inventive audio data segmentation apparatus further comprising a programme database comprising programme data units to identify a certain kind of programme, a plurality of respective audio meta patterns being allocated to each programme data unit, wherein the segmenting means segments the audio data into corresponding audio meta patterns on the basis of the programme data units of the programme database 5.
    • 用于分割音频数据的音频数据分割装置包括用于提供音频数据的音频数据输入装置,用于将由音频数据输入装置提供的音频数据分割成预定长度的音频剪辑的音频数据剪辑装置,用于鉴别 由音频数据剪辑装置提供的音频剪辑成预定的音频类别,音频类别标识包括在各个音频剪辑中的一种音频数据,以及分割装置,用于基于连续的音频类别序列将音频数据分割成音频元模式 音频剪辑,每个元模式被分配给音频数据的预定类型的内容。 由于元模式的分配规则不满意,因此将音频数据分割为元模式的已知方法难以获得良好的结果。 本发明的音频数据分割装置还包括程序数据库,该程序数据库包括用于识别某种程序的程序数据单元,分配给每个程序数据单元的多个各自的音频元模式,其中分段装置分割音频 基于程序数据库5的程序数据单元将数据转换成相应的音频元模式。
    • 16. 发明授权
    • Method for processing speech using absolute loudness
    • 使用绝对响度处理语音的方法
    • US08200488B2
    • 2012-06-12
    • US10731929
    • 2003-12-10
    • Thomas KempRalf KompeRaquel Tato
    • Thomas KempRalf KompeRaquel Tato
    • G10L15/00
    • G10L17/02G10L17/26
    • The invention provides a method for processing speech comprising the steps of receiving a speech input (SI) of a speaker, generating speech parameters (SP) from said speech input (SI), determining parameters describing an absolute loudness (L) of said speech input (SI), and evaluating (EV) said speech input (SI) and/or said speech parameters (SP) using said parameters describing the absolute loudness (L). In particular, the step of evaluation (EV) comprises a step of emotion recognition and/or speaker identification. Further, a microphone array comprising a plurality of microphones is used for determining said parameters describing the absolute loudness. With a microphone array the distance of the speaker from the microphone array can be determined and the loudness can be normalized by the distance. Thus, the absolute loudness becomes independent from the distance of the speaker to the microphone, and absolute loudness can now be used as an input parameter for emotion recognition and/or speaker identification.
    • 本发明提供了一种处理语音的方法,包括以下步骤:接收讲话者的语音输入(SI),从所述语音输入(SI)产生语音参数(SP),确定描述所述语音输入的绝对响度(L)的参数 (SI),以及使用描述绝对响度(L)的参数来评估(EV)所述语音输入(SI)和/或所述语音参数(SP)。 特别地,评估步骤(EV)包括情绪识别和/或说话人识别的步骤。 此外,包括多个麦克风的麦克风阵列用于确定描述绝对响度的所述参数。 使用麦克风阵列,可以确定扬声器与麦克风阵列的距离,并且响度可以通过距离归一化。 因此,绝对响度与扬声器与麦克风的距离无关,现在可以将绝对响度用作用于情绪识别和/或扬声器识别的输入参数。
    • 17. 发明授权
    • Spoken man-machine interface with speaker identification
    • 口语人机界面,带扬声器识别
    • US07769588B2
    • 2010-08-03
    • US12195136
    • 2008-08-20
    • Ralf KompeThomas Kemp
    • Ralf KompeThomas Kemp
    • G10L15/06
    • G10L15/065G10L17/02G10L17/04G10L2015/223
    • The method of operating a man-machine interface unit includes classifying at least one utterance of a speaker to be of a first type or of a second type. If the utterance is classified to be of the first type, the utterance belongs to a known speaker of a speaker data base, and if the utterance is classified to be of the second type, the utterance belongs to an unknown speaker that is not included in the speaker data base. The method also includes storing a set of utterances of the second type, clustering the set of utterances into clusters, wherein each cluster comprises utterances having similar features, and automatically adding a new speaker to the speaker data base based on utterances of one of the clusters.
    • 操作人机接口单元的方法包括将扬声器的至少一个话语分类为第一类型或第二类型。 如果发音被分类为第一类型,话语属于扬声器数据库的已知扬声器,并且如果话语被分类为第二类型,话语属于未包括在未知扬声器中的未知扬声器 扬声器数据库。 该方法还包括存储第二类型的话语集合,将该组语音聚类成群集,其中每个群集包括具有相似特征的话语,并且基于该群集中的一个的话语自动地将新的扬声器添加到该扬声器数据库 。