    • 6. Granted invention patent
    • System and method for dynamic facial features for speaker recognition
    • US09218815B2
    • 2015-12-22
    • US14551907
    • 2014-11-24
    • AT&T Intellectual Property I, L.P.
    • Ann K. Syrdal, Sumit Chopra, Patrick Haffner, Taniya Mishra, Ilija Zeljkovic, Eric Zavesky
    • G06K9/00; G10L17/24; G06F21/32
    • G10L15/25; G06F21/32; G06F2221/2103; G06K9/00255; G06K9/00281; G06K9/00288; G06K9/00315; G06K9/00335; G10L17/24; G10L21/06
    • Disclosed herein are systems, methods, and non-transitory computer-readable storage media for performing speaker verification. A system configured to practice the method receives a request to verify a speaker, generates a text challenge that is unique to the request, and, in response to the request, prompts the speaker to utter the text challenge. Then the system records a dynamic image feature of the speaker as the speaker utters the text challenge, and performs speaker verification based on the dynamic image feature and the text challenge. Recording the dynamic image feature of the speaker can include recording video of the speaker while speaking the text challenge. The dynamic feature can include a movement pattern of head, lips, mouth, eyes, and/or eyebrows of the speaker. The dynamic image feature can relate to phonetic content of the speaker speaking the challenge, speech prosody, and the speaker's facial expression responding to content of the challenge.
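The verification flow in the abstract above (issue a unique text challenge, then check both what was said and how the face moved while saying it) could be sketched roughly as follows. This is an illustrative assumption, not the patented implementation: the word list, the function names, the representation of a dynamic image feature as a plain numeric vector, the cosine-similarity comparison, and the 0.8 threshold are all hypothetical.

```python
import math
import secrets

# Hypothetical vocabulary for building challenges; any word list would do.
WORDS = ("amber", "river", "castle", "violet", "meadow", "copper", "lantern", "harbor")


def generate_challenge(issued, n_words=4):
    """Return a text challenge that has not been issued before.

    `issued` is a set of earlier challenges; it is updated in place.
    """
    while True:
        text = " ".join(secrets.choice(WORDS) for _ in range(n_words))
        if text not in issued:
            issued.add(text)
            return text


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def verify_speaker(challenge, transcript, observed, enrolled, threshold=0.8):
    """Accept only if the challenge was spoken AND the motion features match.

    `transcript` stands in for speech-recognition output; `observed` and
    `enrolled` stand in for dynamic image features (assumed here to be vectors
    summarizing lip or head movement) from the session and from enrollment.
    """
    text_ok = transcript.lower().split() == challenge.split()
    return text_ok and cosine_similarity(observed, enrolled) >= threshold
```

A caller would keep one `issued` set per verification service, call `generate_challenge(issued)` for each request, and pass the recognized transcript and the extracted feature vector to `verify_speaker`. Because the challenge changes with every request, a replayed recording of an earlier session would fail the text check in this sketch.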
    • 10. Granted invention patent
    • System and method for tightly coupling automatic speech recognition and search
    • US09431009B2
    • 2016-08-30
    • US14479980
    • 2014-09-08
    • AT&T Intellectual Property I, L.P.
    • Srinivas Bangalore, Taniya Mishra
    • G10L15/18; G06F17/30; G10L15/08
    • G10L15/18; G06F17/30637; G06F17/30663; G10L15/083
    • Systems, methods, and computer-readable storage media relate to performing a search. A system configured to practice the method first receives from an automatic speech recognition (ASR) system a word lattice based on speech query and receives indexed documents from an information repository. The system composes, based on the word lattice and the indexed documents, at least one triple including a query word, selected indexed document, and weight. The system generates an N-best path through the word lattice based on the at least one triple and re-ranks ASR output based on the N-best path. The system aggregates each weight across the query words to generate N-best listings and returns search results to the speech query based on the re-ranked ASR output and the N-best listings. The lattice can be a confusion network, the arc density of which can be adjusted for a desired performance level.
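The coupling of speech recognition and search described in the abstract above can be illustrated with a toy example. The sketch below is a simplified assumption, not the patented method: it models the confusion network as a list of slots holding (word, ASR score) arcs, models the indexed documents as a word-to-document weight map, and enumerates paths by brute force. All function names, scores, and document IDs are hypothetical.

```python
from collections import defaultdict
from itertools import product


def prune(network, max_arcs):
    """Reduce arc density by keeping only the top-scoring arcs in each slot."""
    return [sorted(slot, key=lambda arc: -arc[1])[:max_arcs] for slot in network]


def compose_triples(network, index):
    """Build (query word, document, weight) triples for lattice words found in the index."""
    words = {word for slot in network for word, _ in slot}
    return [(w, doc, wt) for w in sorted(words) for doc, wt in sorted(index.get(w, {}).items())]


def aggregate(words, triples):
    """Sum weights per document across the given query words, best document first."""
    totals = defaultdict(float)
    for word, doc, weight in triples:
        if word in words:
            totals[doc] += weight
    return sorted(totals.items(), key=lambda item: (-item[1], item[0]))


def rerank(network, index, n=2):
    """Return the n best (score, words, document listing) paths through the network.

    A path's score is its summed ASR score plus the aggregated weight of its
    best-matching document, so search evidence can reorder the ASR hypotheses.
    """
    triples = compose_triples(network, index)
    scored = []
    for path in product(*network):
        words = [word for word, _ in path]
        asr_score = sum(score for _, score in path)
        listing = aggregate(set(words), triples)
        search_score = listing[0][1] if listing else 0.0
        scored.append((asr_score + search_score, words, listing))
    scored.sort(key=lambda item: -item[0])
    return scored[:n]


# Toy inputs: higher scores and weights are better.
network = [[("cheap", 0.6), ("sheep", 0.4)], [("flights", 0.7), ("lights", 0.3)]]
index = {
    "cheap": {"doc_travel": 1.0},
    "flights": {"doc_travel": 1.5},
    "sheep": {"doc_farm": 0.8},
    "lights": {"doc_lamps": 2.5},
}
best = rerank(network, index)
```

With these toy inputs the top-ranked path is "cheap flights", with `doc_travel` leading its document listing. Brute-force enumeration grows exponentially with the number of slots, so it is only suitable for illustration; calling `prune` first shows how lowering arc density trades recall for speed.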