会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data
    • 多媒体搜索装置及使用音频数据的扬声器检测来搜索多媒体内容的方法
    • US06317710B1
    • 2001-11-13
    • US09353192
    • 1999-07-14
    • Qian HuangIvan Magrin-ChagnolleauSarangarajan ParthasarathyAaron Edward Rosenberg
    • Qian HuangIvan Magrin-ChagnolleauSarangarajan ParthasarathyAaron Edward Rosenberg
    • G01L1700
    • G10L17/00
    • A multimedia search apparatus and method for searching multimedia content using speaker detection to segment the multimedia content. The multimedia search apparatus receives a search request from a user device. The search request identifies the target speaker for which the search is to be conducted. Based on the search request, the multimedia search apparatus retrieves multimedia content from a multimedia database. The multimedia search apparatus retrieves models, such as Gaussian Mixture Models (GMMs), from a model storage device, corresponding to the target speaker and background data. Based on the retrieved models, the multimedia search device searches the audio data of the multimedia content and segments the audio data. The segments are identified by calculating an average normalized score for a block of frames of the audio data and determining if the average normalized score for the block of frames exceeds one or more predetermined thresholds.
    • 一种多媒体搜索装置和方法,用于使用说话者检测来搜索多媒体内容来分割多媒体内容。 多媒体搜索装置从用户装置接收搜索请求。 搜索请求标识要进行搜索的目标扬声器。 基于搜索请求,多媒体搜索装置从多媒体数据库检索多媒体内容。 多媒体搜索装置从对应于目标说话者和背景数据的模型存储装置中检索诸如高斯混合模型(GMM)的模型。 基于所检索的模型,多媒体搜索装置搜索多媒体内容的音频数据并对音频数据进行分段。 通过计算音频数据的帧块的平均归一化分数并确定帧块的平均归一化分数是否超过一个或多个预定阈值来识别段。
    • 2. 发明授权
    • Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data
    • 多媒体搜索装置及使用音频数据的扬声器检测来搜索多媒体内容的方法
    • US06405166B1
    • 2002-06-11
    • US09976023
    • 2001-10-15
    • Qian HuangIvan Magrin-ChagnolleauSarangarajan ParthasarathyAaron Edward Rosenberg
    • Qian HuangIvan Magrin-ChagnolleauSarangarajan ParthasarathyAaron Edward Rosenberg
    • G10L1700
    • G10L17/00
    • A multimedia search apparatus and method for searching multimedia content using speaker detection to segment the multimedia content. The multimedia search apparatus receives a search request from a user device. The search request identifies the target speaker for which the search is to be conducted. Based on the search request, the multimedia search apparatus retrieves multimedia content from a multimedia database. The multimedia search apparatus retrieves models, such as Gaussian Mixture Models (GMMs), from a model storage device, corresponding to the target speaker and background data. Based on the retrieved models, the multimedia search device searches the multimedia data of the multimedia content and segments the multimedia data. The segments are identified by calculating an average normalized score for a block of frames of the multimedia data and determining if the average normalized score for the block of frames exceeds one or more predetermined thresholds.
    • 一种多媒体搜索装置和方法,用于使用说话者检测来搜索多媒体内容来分割多媒体内容。 多媒体搜索装置从用户装置接收搜索请求。 搜索请求标识要进行搜索的目标扬声器。 基于搜索请求,多媒体搜索装置从多媒体数据库检索多媒体内容。 多媒体搜索装置从对应于目标说话者和背景数据的模型存储装置中检索诸如高斯混合模型(GMM)的模型。 基于所检索的模型,多媒体搜索装置搜索多媒体内容的多媒体数据并分割多媒体数据。 通过计算多媒体数据的帧块的平均归一化分数并确定帧块的平均归一化分数是否超过一个或多个预定阈值来标识段。
    • 10. 发明授权
    • Speaker identification with user-selected password phrases
    • 用户选择的密码短语的扬声器识别
    • US5913192A
    • 1999-06-15
    • US916662
    • 1997-08-22
    • Sarangarajan ParthasarathyAaron Edward Rosenberg
    • Sarangarajan ParthasarathyAaron Edward Rosenberg
    • G10L15/00G10L17/00G10L5/06G10L9/00
    • G10L17/24G10L15/1815G10L2015/085
    • A speaker identification system includes a speaker-independent phrase recognizer. The speaker-independent phrase recognizer scores a password utterance against all the sets of phonetic transcriptions in a lexicon database to determine the N best speaker-independent scores, determines the N best sets of phonetic transcriptions based on the N best speaker-independent scores, and determines the N best possible identities. A speaker-dependent phrase recognizer retrieves the hidden Markov model corresponding to each of the N best possible identities, and scores the password utterance against each of the N hidden Markov models to generate a speaker-dependent score for each of the N best possible identities. A score processor coupled to the outputs of the speaker-independent phrase recognizer and the speaker-dependent phrase recognizer determines a putative identity. A verifier coupled to the score processor authenticates the determined putative identity.
    • 扬声器识别系统包括与扬声器无关的短语识别器。 与扬声器无关的短语识别器对词典数据库中的所有语音转录集进行口令发音评分,以确定N个最佳的独立于演讲者的得分,基于N个最佳的独立于演讲者的得分确定N个最佳语音转录集,以及 确定N最好的身份。 与扬声器相关的短语识别器检索与N个最佳可能身份中的每一个相对应的隐马尔可夫模型,并且对每个N个隐马尔可夫模型对密码发音进行评分,以产生针对N个最佳可能身份中的每一个的说话者相关得分。 耦合到与扬声器无关的短语识别器和与扬声器相关的短语识别器的输出的分数处理器确定推定的身份。 耦合到评分处理器的验证器对所确定的推定身份进行认证。