    • 3. Granted patent
    • System and method for dynamic facial features for speaker recognition
    • US09218815B2
    • 2015-12-22
    • US14551907
    • 2014-11-24
    • AT&T Intellectual Property I, L.P.
    • Ann K. Syrdal; Sumit Chopra; Patrick Haffner; Taniya Mishra; Ilija Zeljkovic; Eric Zavesky
    • G06K9/00; G10L17/24; G06F21/32
    • G10L15/25; G06F21/32; G06F2221/2103; G06K9/00255; G06K9/00281; G06K9/00288; G06K9/00315; G06K9/00335; G10L17/24; G10L21/06
    • Disclosed herein are systems, methods, and non-transitory computer-readable storage media for performing speaker verification. A system configured to practice the method receives a request to verify a speaker, generates a text challenge that is unique to the request, and, in response to the request, prompts the speaker to utter the text challenge. Then the system records a dynamic image feature of the speaker as the speaker utters the text challenge, and performs speaker verification based on the dynamic image feature and the text challenge. Recording the dynamic image feature of the speaker can include recording video of the speaker while speaking the text challenge. The dynamic feature can include a movement pattern of head, lips, mouth, eyes, and/or eyebrows of the speaker. The dynamic image feature can relate to phonetic content of the speaker speaking the challenge, speech prosody, and the speaker's facial expression responding to content of the challenge.
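The challenge-and-verify flow described in this abstract can be sketched roughly as follows. This is only an illustrative sketch, not the patented method: the word list, the function names `generate_challenge` and `verify_speaker`, the threshold, and the idea of reducing the dynamic image feature to a single score in [0, 1] are all assumptions made for the example. Random draws also make repeated challenges unlikely rather than impossible.

```python
import secrets

# Hypothetical word list; the abstract does not say how challenges are built.
WORDS = ["amber", "river", "seven", "candle", "orbit", "maple", "tiger", "window"]

def generate_challenge(n_words=4):
    """Draw a fresh random text challenge for one verification request."""
    return " ".join(secrets.choice(WORDS) for _ in range(n_words))

def verify_speaker(challenge, recognized_text, dynamic_feature_score, threshold=0.8):
    """Accept only if the uttered text matches the challenge AND the
    dynamic image feature score (assumed to lie in [0, 1]) meets the threshold."""
    text_ok = recognized_text.strip().lower() == challenge.lower()
    return text_ok and dynamic_feature_score >= threshold
```

In a real system the score would presumably come from a model of head, lip, or eyebrow movement recorded while the challenge is spoken; here it is simply passed in as a number.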
    • 8. Patent application
    • System and Method for Adapting Automatic Speech Recognition Pronunciation by Acoustic Model Restructuring
    • US20150243282A1
    • 2015-08-27
    • US14698183
    • 2015-04-28
    • AT&T Intellectual Property I, L.P.
    • Andrej Ljolje; Alistair D. Conkie; Ann K. Syrdal
    • G10L15/187; G10L15/06; G10L15/14
    • G10L17/14; G10L15/063; G10L15/07; G10L15/14; G10L15/187; G10L15/265; G10L15/30; G10L2015/025
    • Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for recognizing speech by adapting automatic speech recognition pronunciation by acoustic model restructuring. The method identifies an acoustic model and a matching pronouncing dictionary trained on typical native speech in a target dialect. The method collects speech from a new speaker resulting in collected speech and transcribes the collected speech to generate a lattice of plausible phonemes. Then the method creates a custom speech model for representing each phoneme used in the pronouncing dictionary by a weighted sum of acoustic models for all the plausible phonemes, wherein the pronouncing dictionary does not change, but the model of the acoustic space for each phoneme in the dictionary becomes a weighted sum of the acoustic models of phonemes of the typical native speech. Finally the method includes recognizing via a processor additional speech from the target speaker using the custom speech model.
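The "weighted sum of acoustic models" idea in this abstract can be illustrated with a toy example. The sketch below assumes, purely for illustration, that each native phoneme model is a one-dimensional Gaussian and that the weights come from how often each plausible phoneme appeared in the transcription lattice; the phoneme labels, parameters, and function names are hypothetical and are not taken from the patent.

```python
import math

def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-(x - mean) ** 2 / (2 * var)) / math.sqrt(2 * math.pi * var)

# Hypothetical native-speech models: phoneme -> (mean, variance).
NATIVE_MODELS = {"ih": (0.0, 1.0), "iy": (2.0, 1.0)}

def weights_from_counts(counts):
    """Normalize lattice occurrence counts into weights that sum to 1."""
    total = sum(counts.values())
    return {phoneme: c / total for phoneme, c in counts.items()}

def custom_likelihood(x, weights):
    """Likelihood of observation x under the weighted sum of native models."""
    return sum(w * gaussian_pdf(x, *NATIVE_MODELS[p]) for p, w in weights.items())
```

For a speaker who realizes a dictionary "ih" mostly like native "iy", counts such as `{"ih": 1, "iy": 3}` shift the custom model toward the "iy" region while the dictionary entry itself stays unchanged.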
    • 9. Granted patent
    • System and method for synthetic voice generation and modification
    • US08965767B2
    • 2015-02-24
    • US14282035
    • 2014-05-20
    • AT&T Intellectual Property I, L.P.
    • Alistair D. Conkie; Ann K. Syrdal
    • G10L13/00; G10L13/08; G10L13/027; G10L13/06; H04B7/04; H04B7/06; H04W72/04
    • G10L13/043; G10L13/027; G10L13/047; G10L13/06; G10L25/63; H04B7/0404; H04B7/0697; H04W72/0413
    • Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating a synthetic voice. A system configured to practice the method combines a first database of a first text-to-speech voice and a second database of a second text-to-speech voice to generate a combined database, selects from the combined database, based on a policy, voice units of a phonetic category for the synthetic voice to yield selected voice units, and synthesizes speech based on the selected voice units. The system can synthesize speech without parameterizing the first text-to-speech voice and the second text-to-speech voice. A policy can define, for a particular phonetic category, from which text-to-speech voice to select voice units. The combined database can include multiple text-to-speech voices from different speakers. The combined database can include voices of a single speaker speaking in different styles. The combined database can include voices of different languages.
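A minimal sketch of the combine-then-select step in this abstract might look like the following. The record layout, the two tiny example databases, and the names `combine_databases`, `select_units`, and `POLICY` are assumptions made for illustration; the abstract only states that a policy decides, per phonetic category, which voice the units come from.

```python
def combine_databases(first_db, second_db):
    """Merge the unit inventories of two text-to-speech voices."""
    return list(first_db) + list(second_db)

def select_units(combined_db, policy, category):
    """Return units of one phonetic category from the voice the policy names."""
    voice = policy[category]
    return [u for u in combined_db
            if u["voice"] == voice and u["category"] == category]

# Hypothetical example data: each record is one voice unit.
VOICE_A = [{"voice": "A", "category": "vowel", "unit": "a1"},
           {"voice": "A", "category": "consonant", "unit": "a2"}]
VOICE_B = [{"voice": "B", "category": "vowel", "unit": "b1"},
           {"voice": "B", "category": "consonant", "unit": "b2"}]

# Hypothetical policy: vowels from voice A, consonants from voice B.
POLICY = {"vowel": "A", "consonant": "B"}
```

Because units are picked directly from the merged inventory, nothing in this sketch parameterizes either voice, which mirrors the abstract's point that synthesis can proceed without parameterization.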
    • 10. Granted patent
    • System and method for generalized preselection for unit selection synthesis
    • US09564121B2
    • 2017-02-07
    • US14454123
    • 2014-08-07
    • AT&T Intellectual Property I, L.P.
    • Alistair D. Conkie; Mark Beutnagel; Yeon-Jun Kim; Ann K. Syrdal
    • G10L13/06; G10L13/047; G10L13/00
    • G10L13/06; G10L13/00; G10L13/047
    • Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for unit selection synthesis. The method causes a computing device to add a supplemental phoneset to a speech synthesizer front end having an existing phoneset, modify a unit preselection process based on the supplemental phoneset, preselect units from the supplemental phoneset and the existing phoneset based on the modified unit preselection process, and generate speech based on the preselected units. The supplemental phoneset can be a variation of the existing phoneset, can include a word boundary feature, can include a cluster feature where initial consonant clusters and some word boundaries are marked with diacritics, can include a function word feature which marks units as originating from a function word or a content word, and/or can include a pre-vocalic or post-vocalic feature. The speech synthesizer front end can incorporate the supplemental phoneset as an extra feature.
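One way to picture preselection with a supplemental phoneset is sketched below. It assumes, for illustration only, that every unit carries both its existing phone label and one supplemental label (here a word-position feature), and that preselection prefers units matching both labels before falling back to units matching the phone alone. The ranking rule, the field names, and `preselect` are hypothetical; the abstract does not specify them.

```python
def preselect(units, phone, supplemental, limit=3):
    """Pick candidate units for a target phone.

    Units matching both the phone and the supplemental label come first;
    units matching only the phone follow as a fallback.
    """
    exact = [u for u in units
             if u["phone"] == phone and u["supp"] == supplemental]
    fallback = [u for u in units
                if u["phone"] == phone and u["supp"] != supplemental]
    return (exact + fallback)[:limit]

# Hypothetical unit inventory with a word-boundary style supplemental label.
UNITS = [
    {"id": 1, "phone": "t", "supp": "word_initial"},
    {"id": 2, "phone": "t", "supp": "word_final"},
    {"id": 3, "phone": "d", "supp": "word_initial"},
    {"id": 4, "phone": "t", "supp": "word_initial"},
]
```

Calling `preselect(UNITS, "t", "word_initial")` ranks the two word-initial "t" units ahead of the word-final one, so the later unit selection search starts from candidates that already respect the extra feature.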