Quick Patent Search - Quickly search global patents, free commercial patent database - IPRDB

1. Invention application

US20160329045A1 System and Method for Optimizing Speech Recognition and Natural Language Parameters with User Feedback (Active)
Publication No.: US20160329045A1
Publication date: 2016-11-10
Application No.: US15212908
Filing date: 2016-07-18
Applicant: AT&T Intellectual Property I, L.P.
Inventors: Andrej Ljolje, Diamantino Antonio Caseiro, Mazin Gilbert, Vincent Goffin, Taniya Mishra
IPC classification: G10L15/06, G10L15/01, G10L15/18, G10L15/26
CPC classification: G10L15/063, G10L15/01, G10L15/18, G10L15/26, G10L2015/0635
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.

2. Granted invention patent

US09396725B2 System and method for optimizing speech recognition and natural language parameters with user feedback (Active)
Publication No.: US09396725B2
Publication date: 2016-07-19
Application No.: US14287866
Filing date: 2014-05-27
Applicant: AT&T Intellectual Property I, L.P.
Inventors: Andrej Ljolje, Diamantino Antonio Caseiro, Mazin Gilbert, Vincent Goffin, Taniya Mishra
IPC classification: G10L15/18, G10L15/26
CPC classification: G10L15/063, G10L15/01, G10L15/18, G10L15/26, G10L2015/0635
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for assigning saliency weights to words of an ASR model. The saliency values assigned to words within an ASR model are based on human perception judgments of previous transcripts. These saliency values are applied as weights to modify an ASR model such that the results of the weighted ASR model in converting a spoken document to a transcript provide a more accurate and useful transcription to the user.

3. Granted invention patent

US10319370B2 System and method for data-driven socially customized models for language generation (Active)
Publication No.: US10319370B2
Publication date: 2019-06-11
Application No.: US15978529
Filing date: 2018-05-14
Applicant: AT&T Intellectual Property I, L.P.
Inventors: Taniya Mishra, Alistair D. Conkie, Svetlana Stoyanchev
IPC classification: G10L13/04, G10L15/07, G10L13/027, G10L15/18, G10L17/00, G10L15/183, G10L15/02, G10L17/22
Abstract: Systems, methods, and computer-readable storage devices for generating speech using a presentation style specific to a user, and in particular the user's social group. Systems configured according to this disclosure can then use the resulting, personalized, text and/or speech in a spoken dialogue or presentation system to communicate with the user. For example, a system practicing the disclosed method can receive speech from a user, identify the user, and respond to the received speech by applying a personalized natural language generation model. The personalized natural language generation model provides communications which can be specific to the identified user.

4. Granted invention patent

US10089985B2 Smart interactive media content guide (Active)
Publication No.: US10089985B2
Publication date: 2018-10-02
Application No.: US14267540
Filing date: 2014-05-01
Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.
Inventors: Taniya Mishra, Dimitrios Dimitriadis, Diane Kearns
IPC classification: G06F3/00, G06F13/00, H04N5/445, G10L15/26, H04N21/482, H04N21/41, H04N21/414, H04N21/81, H04N21/472, G10L13/00
Abstract: Television content is provided upon request. A search request for television content is received from a user on a user device. Listings for television content that meet the search request are determined based on the search request. Text describing the listings is converted to corresponding speech describing the listings. Speech describing the listings is provided audibly.

5. Granted invention patent

US09972309B2 System and method for data-driven socially customized models for language generation (Active)
Publication No.: US09972309B2
Publication date: 2018-05-15
Application No.: US15229368
Filing date: 2016-08-05
Applicant: AT&T Intellectual Property I, L.P.
Inventors: Taniya Mishra, Alistair D. Conkie, Svetlana Stoyanchev
IPC classification: G10L13/04, G10L15/07, G10L13/027, G10L15/18, G10L17/00, G10L15/183, G10L15/02, G10L17/22
CPC classification: G10L15/07, G10L13/027, G10L15/02, G10L15/1815, G10L15/183, G10L17/00, G10L17/22
Abstract: Systems, methods, and computer-readable storage devices for generating speech using a presentation style specific to a user, and in particular the user's social group. Systems configured according to this disclosure can then use the resulting, personalized, text and/or speech in a spoken dialog or presentation system to communicate with the user. For example, a system practicing the disclosed method can receive speech from a user, identify the user, and respond to the received speech by applying a personalized natural language generation model. The personalized natural language generation model provides communications which can be specific to the identified user.

6. Granted invention patent

US09218815B2 System and method for dynamic facial features for speaker recognition (Active)
Publication No.: US09218815B2
Publication date: 2015-12-22
Application No.: US14551907
Filing date: 2014-11-24
Applicant: AT&T Intellectual Property I, L.P.
Inventors: Ann K. Syrdal, Sumit Chopra, Patrick Haffner, Taniya Mishra, Ilija Zeljkovic, Eric Zavesky
IPC classification: G06K9/00, G10L17/24, G06F21/32
CPC classification: G10L15/25, G06F21/32, G06F2221/2103, G06K9/00255, G06K9/00281, G06K9/00288, G06K9/00315, G06K9/00335, G10L17/24, G10L21/06
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for performing speaker verification. A system configured to practice the method receives a request to verify a speaker, generates a text challenge that is unique to the request, and, in response to the request, prompts the speaker to utter the text challenge. Then the system records a dynamic image feature of the speaker as the speaker utters the text challenge, and performs speaker verification based on the dynamic image feature and the text challenge. Recording the dynamic image feature of the speaker can include recording video of the speaker while speaking the text challenge. The dynamic feature can include a movement pattern of head, lips, mouth, eyes, and/or eyebrows of the speaker. The dynamic image feature can relate to phonetic content of the speaker speaking the challenge, speech prosody, and the speaker's facial expression responding to content of the challenge.

7. Invention publication

US20230142720A1 SMART INTERACTIVE MEDIA CONTENT GUIDE (Under examination, published)
Publication No.: US20230142720A1
Publication date: 2023-05-11
Application No.: US18150460
Filing date: 2023-01-05
Applicant: AT&T Intellectual Property I, L.P.
Inventors: Taniya Mishra, Dimitrios Dimitriadis, Diane Kearns
IPC classification: G10L15/26, H04N21/482, H04N21/414, H04N21/81, H04N21/472, H04N21/41
CPC classification: G10L15/26, H04N21/4828, H04N21/41407, H04N21/8173, H04N21/8186, H04N21/47202, H04N21/4825, H04N21/472, H04N21/482, H04N21/41265, G10L13/00
Abstract: Methods, apparatuses and media for providing content upon request are provided. A search request for content is received from a user. A first filter is applied to the search request to modify the search request before a search algorithm searches for the content to return in response to the search request. Items of content are determined based on the search request to which the first filter is applied. A second filter is applied to the items of content to determine search results. The search results are provided to the user.

8. Granted invention patent

US11594225B2 Smart interactive media content guide (Active)
Publication No.: US11594225B2
Publication date: 2023-02-28
Application No.: US16107196
Filing date: 2018-08-21
Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.
Inventors: Taniya Mishra, Dimitrios Dimitriadis, Diane Kearns
IPC classification: G10L15/26, H04N21/482, H04N21/414, H04N21/81, H04N21/472, H04N21/41, G10L13/00
Abstract: Methods, apparatuses and media for providing content upon request are provided. A search request for content is received from a user. A first filter is applied to the search request to modify the search request before a search algorithm searches for the content to return in response to the search request. Items of content are determined based on the search request to which the first filter is applied. A second filter is applied to the items of content to determine search results. The search results are provided to the user.

9. Granted invention patent

US10042877B2 Personal customer care agent (Active)
Publication No.: US10042877B2
Publication date: 2018-08-07
Application No.: US14732282
Filing date: 2015-06-05
Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.
Inventors: Junlan Feng, Srinivas Bangalore, Michael James Robert Johnston, Taniya Mishra
IPC classification: G06Q30/02, G06Q30/06, G06F17/30, G06Q10/00, G06Q30/00, G06Q30/04, H04L29/08
Abstract: Information is aggregated and made available to users. A system monitors over the internet a first set of external information sources for a first user based on instructions from a first user profile that specifies information to aggregate for the first user. The system detects, based on the monitoring, new data at one of the first set of information sources. The system obtains the new data at the one of the first set of information sources, independent of preferences of the one of the first set of information sources. The system updates aggregated information for the first user with the new data from the one of the first set of information sources. The updated aggregated information for the first user is made available to the first user.

10. Granted invention patent

US09431009B2 System and method for tightly coupling automatic speech recognition and search (Active)
Publication No.: US09431009B2
Publication date: 2016-08-30
Application No.: US14479980
Filing date: 2014-09-08
Applicant: AT&T Intellectual Property I, L.P.
Inventors: Srinivas Bangalore, Taniya Mishra
IPC classification: G10L15/18, G06F17/30, G10L15/08
CPC classification: G10L15/18, G06F17/30637, G06F17/30663, G10L15/083
Abstract: Systems, methods, and computer-readable storage media relate to performing a search. A system configured to practice the method first receives from an automatic speech recognition (ASR) system a word lattice based on speech query and receives indexed documents from an information repository. The system composes, based on the word lattice and the indexed documents, at least one triple including a query word, selected indexed document, and weight. The system generates an N-best path through the word lattice based on the at least one triple and re-ranks ASR output based on the N-best path. The system aggregates each weight across the query words to generate N-best listings and returns search results to the speech query based on the re-ranked ASR output and the N-best listings. The lattice can be a confusion network, the arc density of which can be adjusted for a desired performance level.
