    • 6. Invention application
    • SYSTEM, APPARATUS, AND METHOD FOR PERFORMING SPEAKER VERIFICATION USING A UNIVERSAL BACKGROUND MODEL
    • WO2017157423A1
    • 2017-09-21
    • PCT/EP2016/055564
    • 2016-03-15
    • TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    • GRANCHAROV, Volodya; KARLSSON, Erlendur; SVERRISSON, Sigurdur; POBLOTH, Harald
    • G10L17/06; G10L17/04
    • G10L17/06; G10L17/04
    • An apparatus (102/108), machine-implemented method (300), and computer program product for determining whether an utterance, U, was spoken by a hypothesized speaker, S, are presented. The apparatus includes a module (110a/110b) that obtains data corresponding to an utterance captured by a microphone. It also obtains a set of two or more Gaussian Mixture Models (GMMs), where each GMM comprises a set of parameter pairs. The module further selects, based on the obtained data corresponding to the captured utterance U, a subset of a first set of parameter pairs of the first GMM and a subset of a second set of parameter pairs of the second GMM. The two subsets are combined to form a Universal Background Model (UBM), λ_UBM. The module obtains a GMM model, λ_S, associated with the hypothesized speaker, S. The models λ_S and λ_UBM, together with the obtained data corresponding to U, are used to determine whether U was spoken by S.
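The flow in the WO2017157423A1 abstract above (select a subset of each background GMM based on the utterance, merge the subsets into λ_UBM, then score against λ_S) can be sketched as follows. This is a minimal illustration, not the patented method. The abstract does not state the selection criterion or the decision rule, so the sketch assumes that the top-k components by likelihood are kept and that the decision is a thresholded average log-likelihood ratio, the conventional GMM-UBM test. The scalar features, the (weight, mean, variance) representation, and all function names are hypothetical.

```python
import math
from typing import List, Tuple

# Hypothetical representation: a GMM is a list of (weight, mean, variance)
# components over scalar features. Real systems use multi-dimensional features.
Component = Tuple[float, float, float]
GMM = List[Component]


def gauss_pdf(x: float, mean: float, var: float) -> float:
    """Density of a 1-D Gaussian at x."""
    return math.exp(-((x - mean) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def avg_log_likelihood(frames: List[float], gmm: GMM) -> float:
    """Average per-frame log-likelihood of the frames under the GMM."""
    total = 0.0
    for x in frames:
        p = sum(w * gauss_pdf(x, m, v) for w, m, v in gmm)
        total += math.log(max(p, 1e-300))  # floor avoids log(0)
    return total / len(frames)


def select_components(frames: List[float], gmm: GMM, k: int) -> GMM:
    """Assumed criterion: keep the k components that best explain the frames."""
    def score(component: Component) -> float:
        w, m, v = component
        return sum(w * gauss_pdf(x, m, v) for x in frames)
    return sorted(gmm, key=score, reverse=True)[:k]


def build_ubm(frames: List[float], gmms: List[GMM], k: int) -> GMM:
    """Merge the selected subsets of every background GMM and renormalise weights."""
    selected = [c for g in gmms for c in select_components(frames, g, k)]
    total_w = sum(w for w, _, _ in selected)
    return [(w / total_w, m, v) for w, m, v in selected]


def verify(frames: List[float], speaker_gmm: GMM, gmms: List[GMM],
           k: int = 1, threshold: float = 0.0) -> Tuple[bool, float]:
    """Accept when the speaker model beats the utterance-specific UBM (assumed rule)."""
    ubm = build_ubm(frames, gmms, k)
    llr = avg_log_likelihood(frames, speaker_gmm) - avg_log_likelihood(frames, ubm)
    return llr > threshold, llr
```

Because the UBM is rebuilt per utterance, only the k selected components of each background model are evaluated during scoring, at the cost of one selection pass over the frames.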
    • 8. Invention application
    • AUTHENTICATION METHOD
    • WO2016113521A1
    • 2016-07-21
    • PCT/GB2015/052771
    • 2015-09-24
    • VALIDSOFT UK LIMITED
    • PETERSEN, John; THORNHILL, Daniel
    • G10L17/04; G10L17/22; G10L17/10; G06F21/32; H04L9/32; H04L29/06
    • G06F21/32; G10L17/04; G10L17/10; G10L17/22; H04L9/3231; H04L63/0861; H04L2463/082
    • An authentication method. The method comprises comparing user voice data received via an electronic device to a stored voice template to determine a voice authentication parameter. A voice authentication threshold is determined and the voice authentication parameter is compared to the voice authentication threshold to determine whether to authenticate the user. Determining the voice authentication threshold comprises determining a current value of an enrolment counter, then comparing the current value of the enrolment counter to an enrolment counter threshold and determining whether the stored voice template is fully enrolled according to the result. If the stored voice template is fully enrolled, the voice authentication threshold is set to a first voice authentication threshold. If the stored voice template is not fully enrolled then a device attribute received from the electronic device is compared to a stored device attribute. If the received device attribute matches the stored device attribute, the voice authentication threshold is set to a second voice authentication threshold determined by the current value of the enrolment counter. If the received device attribute does not match the stored device attribute, the voice authentication threshold is set to a third voice authentication threshold.
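The threshold-selection logic in the WO2016113521A1 abstract above is a three-way decision, which the sketch below spells out. The abstract gives no numeric values, does not say how the second threshold depends on the enrolment counter, and does not fix the direction of the comparisons. The sketch therefore assumes a 0 to 100 score scale, a linear relaxation of the second threshold as the counter grows, and "greater than or equal" comparisons. `ThresholdPolicy` and its field names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ThresholdPolicy:
    # All values are illustrative assumptions on a 0-100 score scale.
    enrolment_counter_threshold: int = 5   # counter value treated as "fully enrolled"
    first_threshold: int = 80              # used when the template is fully enrolled
    third_threshold: int = 95              # used when the device attribute does not match
    second_base: int = 90                  # starting point for the second threshold
    second_step: int = 2                   # assumed relaxation per enrolment


def select_threshold(policy: ThresholdPolicy, enrolment_counter: int,
                     received_device_attr: str, stored_device_attr: str) -> int:
    """Pick the voice authentication threshold following the abstract's branches."""
    if enrolment_counter >= policy.enrolment_counter_threshold:
        return policy.first_threshold
    if received_device_attr == stored_device_attr:
        # Second threshold: determined by the current enrolment counter value.
        return policy.second_base - policy.second_step * enrolment_counter
    return policy.third_threshold


def authenticate(voice_score: int, policy: ThresholdPolicy, enrolment_counter: int,
                 received_device_attr: str, stored_device_attr: str) -> bool:
    """Authenticate when the voice score reaches the selected threshold."""
    threshold = select_threshold(policy, enrolment_counter,
                                 received_device_attr, stored_device_attr)
    return voice_score >= threshold
```

With these assumed values, a partially enrolled user on a known device needs a score of 86 after two enrolments, while the same user on an unrecognised device needs 95.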
    • 9. Invention application
    • SPEAKER IDENTIFICATION AND UNSUPERVISED SPEAKER ADAPTATION TECHNIQUES
    • WO2016053523A1
    • 2016-04-07
    • PCT/US2015/047281
    • 2015-08-27
    • APPLE INC.
    • KIM, Yoon; KAJAKEKAR, Sachin, S.
    • G10L17/04; G10L17/06; G10L15/18; G10L15/26
    • G10L17/26; G10L15/1822; G10L15/26; G10L17/04; G10L17/06
    • Systems and processes for generating a speaker profile for use in performing speaker identification for a virtual assistant are provided. One example process can include receiving an audio input including user speech and determining whether a speaker of the user speech is a predetermined user based on a speaker profile for the predetermined user. In response to determining that the speaker of the user speech is the predetermined user, the user speech can be added to the speaker profile and operation of the virtual assistant can be triggered. In response to determining that the speaker of the user speech is not the predetermined user, the user speech can be added to an alternate speaker profile and operation of the virtual assistant may not be triggered. In some examples, contextual information can be used to verify results produced by the speaker identification process.
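The unsupervised adaptation loop in the WO2016053523A1 abstract above routes each utterance to one of two profiles depending on the identification result. The sketch below illustrates only that routing. The abstract does not describe how speech is represented or scored, so the fixed-length feature vectors, the cosine similarity against the profile centroid, the 0.8 threshold, and the names `SpeakerProfile` and `handle_utterance` are all assumptions made for illustration.

```python
import math
from typing import List

Vector = List[float]


class SpeakerProfile:
    """Hypothetical profile: a growing collection of utterance feature vectors."""

    def __init__(self) -> None:
        self.samples: List[Vector] = []

    def add(self, features: Vector) -> None:
        self.samples.append(features)

    def centroid(self) -> Vector:
        # Assumes the profile already holds at least one sample.
        n = len(self.samples)
        return [sum(column) / n for column in zip(*self.samples)]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity, or 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def handle_utterance(features: Vector, user_profile: SpeakerProfile,
                     alternate_profile: SpeakerProfile, threshold: float = 0.8) -> bool:
    """Route the utterance to a profile; True means the assistant is triggered."""
    is_user = cosine(features, user_profile.centroid()) >= threshold
    if is_user:
        user_profile.add(features)       # adapt the predetermined user's profile
    else:
        alternate_profile.add(features)  # keep impostor speech in the alternate profile
    return is_user
```

The user profile must be seeded with at least one enrolment sample before the first call, since the centroid of an empty profile is undefined in this sketch.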
    • 10. Invention application
    • SPEAKER VERIFICATION
    • WO2015161240A2
    • 2015-10-22
    • PCT/US2015/026453
    • 2015-04-17
    • QUALCOMM INCORPORATED
    • VISSER, Erik; KIM, Lae-Hoon; JIN, Minho; GUO, Yinyi
    • G10L17/04
    • G06F3/167; G10L15/22; G10L17/04; G10L17/08; G10L17/22
    • A device includes a memory, a receiver, a processor, and a display. The memory is configured to store a speaker model. The receiver is configured to receive an input audio signal. The processor is configured to determine a first confidence level associated with a first portion of the input audio signal based on the speaker model. The processor is also configured to determine a second confidence level associated with a second portion of the input audio signal based on the speaker model. The display is configured to present a graphical user interface associated with the first confidence level or associated with the second confidence level.