Quick Patent Search - Fast search of global patents, free patent database for commercial use - IPRDB

1. Granted patent

US09767221B2 User profile and its location in a clustered profile landscape (In force)
Publication (announcement) No.: US09767221B2
Publication (announcement) date: 2017-09-19
Application No.: US12901075
Filing date: 2010-10-08
Applicants: Srinivas Bangalore, Junlan Feng, Michael James Robert Johnston, Taniya Mishra
Inventors: Srinivas Bangalore, Junlan Feng, Michael James Robert Johnston, Taniya Mishra
IPC classification: G06F17/30
CPC classification: G06F17/30976, G06F17/30345, G06F17/30997
Abstract: Delivering targeted content includes collecting, via at least one tangible processor, user activity data for users during a specified time period. Questions asked by the users during the specified time period are extracted from the user activity data, via the at least one tangible processor, and stored in user profiles for the users. The user profiles are clustered, via the at least one tangible processor, based on the questions asked. Targeted content is delivered, via the at least one tangible processor, to a subset of the users based on the clustering.
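
The clustering step in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: the word-overlap (Jaccard) similarity, the greedy single-pass grouping, the 0.2 threshold, and all function names and sample questions are assumptions chosen only to show profiles being grouped by the questions users asked.

```python
import re

def tokens(question):
    """Lowercase word set for one question."""
    return set(re.findall(r"[a-z0-9]+", question.lower()))

def profile_terms(questions):
    """Union of the word sets of all questions stored in one user profile."""
    terms = set()
    for q in questions:
        terms |= tokens(q)
    return terms

def jaccard(a, b):
    """Jaccard similarity of two sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def cluster_profiles(profiles, threshold=0.2):
    """Greedy single-pass clustering (an assumption, not the patent's algorithm):
    a user joins the first cluster whose seed profile is at least `threshold`
    similar, otherwise the user starts a new cluster."""
    clusters = []  # list of (seed_terms, [user_ids])
    for user_id, questions in profiles.items():
        terms = profile_terms(questions)
        for seed_terms, members in clusters:
            if jaccard(terms, seed_terms) >= threshold:
                members.append(user_id)
                break
        else:
            clusters.append((terms, [user_id]))
    return [members for _, members in clusters]

# Hypothetical user profiles: user id -> questions asked in the time period.
profiles = {
    "u1": ["How do I reset my router?", "Why is my router slow?"],
    "u2": ["Best pizza near me?"],
    "u3": ["How do I reboot my router?"],
    "u4": ["Where is good pizza near downtown?"],
}
clusters = cluster_profiles(profiles)
print(clusters)
```

Targeted content would then be delivered per cluster, for example router troubleshooting material to the first group only.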

2. Granted patent

US10002608B2 System and method for using prosody for voice-enabled search (In force)
Publication (announcement) No.: US10002608B2
Publication (announcement) date: 2018-06-19
Application No.: US12884959
Filing date: 2010-09-17
Applicants: Srinivas Bangalore, Junlan Feng, Michael Johnston, Taniya Mishra
Inventors: Srinivas Bangalore, Junlan Feng, Michael Johnston, Taniya Mishra
IPC classification: G10L15/18, G10L25/54, G10L25/63, G10L15/22
CPC classification: G10L15/1807, G10L25/54, G10L25/63, G10L2015/226, G10L2015/227
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating relevant responses to a user query with voice-enabled search. A system practicing the method receives a word lattice generated by an automatic speech recognizer based on a user speech and a prosodic analysis of the user speech, generates a reweighted word lattice based on the word lattice and the prosodic analysis, approximates based on the reweighted word lattice one or more relevant responses to the query, and presents to a user the responses to the query. The prosodic analysis examines metalinguistic information of the user speech and can identify the most salient subject matter of the speech, assess how confident a speaker is in the content of his or her speech, and identify the attitude, mood, emotion, sentiment, etc. of the speaker. Other information not described in the content of the speech can also be used.
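
As a rough illustration of the reweighting idea in this abstract, the Python sketch below treats the lattice as a simple list of slots, each holding candidate words with recognizer scores, and boosts each slot by a prosodic prominence value. The slot representation, the `prominence` scores, the `alpha` parameter, and the multiplicative formula are all assumptions made for the example; the abstract does not specify them.

```python
def reweight_lattice(lattice, prominence, alpha=1.0):
    """Scale every arc score in a slot by (1 + alpha * prominence of that slot).

    lattice:    list of slots, each a list of (word, score) arcs
    prominence: one value in [0, 1] per slot (hypothetical prosodic analysis output)
    """
    if len(lattice) != len(prominence):
        raise ValueError("need exactly one prominence value per lattice slot")
    return [
        [(word, score * (1.0 + alpha * p)) for word, score in slot]
        for slot, p in zip(lattice, prominence)
    ]

def query_terms(lattice):
    """Best word per slot, sorted so the heaviest (most salient) terms come first."""
    best = [max(slot, key=lambda arc: arc[1]) for slot in lattice]
    return sorted(best, key=lambda arc: arc[1], reverse=True)

# Hypothetical recognizer output for "cheap flights to BOSTON" (stress on the city).
lattice = [
    [("cheap", 0.6), ("chip", 0.4)],
    [("flights", 0.9), ("lights", 0.1)],
    [("to", 0.95)],
    [("boston", 0.7), ("austin", 0.3)],
]
prominence = [0.1, 0.2, 0.0, 0.9]

terms = query_terms(reweight_lattice(lattice, prominence))
print([word for word, _ in terms])
```

A search back end could then weight "boston" more heavily than the function word "to" when retrieving responses.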

3. Patent application

US20120072219A1 SYSTEM AND METHOD FOR ENHANCING VOICE-ENABLED SEARCH BASED ON AUTOMATED DEMOGRAPHIC IDENTIFICATION (In force)
Publication (announcement) No.: US20120072219A1
Publication (announcement) date: 2012-03-22
Application No.: US12888012
Filing date: 2010-09-22
Applicants: Michael JOHNSTON, Srinivas Bangalore, Junlan Feng, Taniya Mishra
Inventors: Michael JOHNSTON, Srinivas Bangalore, Junlan Feng, Taniya Mishra
IPC classification: G10L15/04
CPC classification: G06F17/30026, G06F17/30976, G06F17/30979, G10L15/22, G10L2015/227
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating responses to a user speech query in voice-enabled search based on metadata that include demographic features of the speaker. A system practicing the method recognizes received speech from a speaker to generate recognized speech, identifies metadata about the speaker from the received speech, and feeds the recognized speech and the metadata to a question-answering engine. Identifying the metadata about the speaker is based on voice characteristics of the received speech. The demographic features can include age, gender, socio-economic group, nationality, and/or region. The metadata identified about the speaker from the received speech can be combined with or override self-reported speaker demographic information.

4. Patent application

US20120072217A1 SYSTEM AND METHOD FOR USING PROSODY FOR VOICE-ENABLED SEARCH (In force)
Publication (announcement) No.: US20120072217A1
Publication (announcement) date: 2012-03-22
Application No.: US12884959
Filing date: 2010-09-17
Applicants: Srinivas BANGALORE, Junlan Feng, Michael Johnston, Taniya Mishra
Inventors: Srinivas BANGALORE, Junlan Feng, Michael Johnston, Taniya Mishra
IPC classification: G10L15/06
CPC classification: G10L15/1807, G10L25/54, G10L25/63, G10L2015/226, G10L2015/227
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating relevant responses to a user query with voice-enabled search. A system practicing the method receives a word lattice generated by an automatic speech recognizer based on a user speech and a prosodic analysis of the user speech, generates a reweighted word lattice based on the word lattice and the prosodic analysis, approximates based on the reweighted word lattice one or more relevant responses to the query, and presents to a user the responses to the query. The prosodic analysis examines metalinguistic information of the user speech and can identify the most salient subject matter of the speech, assess how confident a speaker is in the content of his or her speech, and identify the attitude, mood, emotion, sentiment, etc. of the speaker. Other information not described in the content of the speech can also be used.

5. Granted patent

US09076146B2 Personal customer care agent (In force)
Publication (announcement) No.: US09076146B2
Publication (announcement) date: 2015-07-07
Application No.: US12905172
Filing date: 2010-10-15
Applicants: Junlan Feng, Srinivas Bangalore, Michael James Robert Johnston, Taniya Mishra
Inventors: Junlan Feng, Srinivas Bangalore, Michael James Robert Johnston, Taniya Mishra
IPC classification: G06Q30/00, G06Q10/00, G06Q30/02, G06Q30/06
CPC classification: G06F17/30345, G06Q10/00, G06Q30/00, G06Q30/014, G06Q30/0282, G06Q30/04, G06Q30/0601, G06Q30/0631, H04L67/22, H04L67/306
Abstract: Aggregating information includes configuring, by at least one processor, a user profile that indicates user preferences for aggregated information. The at least one processor monitors information sources including the World Wide Web, business websites of interest, and online social media, based on the user preferences. Data obtained from the information sources is presented, based on the monitoring, by the at least one processor, in accordance with a presentation format, as the aggregated information, based on the user preferences. The at least one processor triggers updating of the presented aggregated information based on a change to the data at least one of the information sources and a change to the user profile.

6. Granted patent

US08831944B2 System and method for tightly coupling automatic speech recognition and search (In force)
Publication (announcement) No.: US08831944B2
Publication (announcement) date: 2014-09-09
Application No.: US12638649
Filing date: 2009-12-15
Applicants: Srinivas Bangalore, Taniya Mishra
Inventors: Srinivas Bangalore, Taniya Mishra
IPC classification: G10L15/14, G10L15/08, G06F17/30
CPC classification: G10L15/18, G06F17/30637, G06F17/30663, G10L15/083
Abstract: Disclosed herein are systems, methods, and computer-readable storage media for performing a search. A system configured to practice the method first receives from an automatic speech recognition (ASR) system a word lattice based on speech query and receives indexed documents from an information repository. The system composes, based on the word lattice and the indexed documents, at least one triple including a query word, selected indexed document, and weight. The system generates an N-best path through the word lattice based on the at least one triple and re-ranks ASR output based on the N-best path. The system aggregates each weight across the query words to generate N-best listings and returns search results to the speech query based on the re-ranked ASR output and the N-best listings. The lattice can be a confusion network, the arc density of which can be adjusted for a desired performance level.
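
The triple composition and weight aggregation described in this abstract can be sketched in Python as follows. This covers only those two steps, not the N-best path search or the ASR re-ranking. The confusion-network layout, the toy inverted index, the product used as the triple weight, and every name in the code are assumptions for illustration rather than details taken from the patent.

```python
from collections import defaultdict

def compose_triples(lattice, index):
    """Build (query_word, doc_id, weight) triples for lattice words found in the index.

    lattice: list of slots, each a list of (word, posterior) arcs
    index:   word -> {doc_id: term_weight}  (hypothetical inverted index)
    The weight is posterior * term_weight, which is an assumed combination rule.
    """
    triples = []
    for slot in lattice:
        for word, posterior in slot:
            for doc_id, term_weight in index.get(word, {}).items():
                triples.append((word, doc_id, posterior * term_weight))
    return triples

def n_best_listings(triples, n=3):
    """Sum the weights per document across all query words and keep the top n."""
    totals = defaultdict(float)
    for _, doc_id, weight in triples:
        totals[doc_id] += weight
    return sorted(totals.items(), key=lambda item: item[1], reverse=True)[:n]

# Hypothetical confusion network for the spoken query "pizza hut".
lattice = [
    [("pizza", 0.6), ("piazza", 0.4)],
    [("hut", 0.7), ("hat", 0.3)],
]
index = {
    "pizza": {"d1": 1.0, "d2": 0.5},
    "piazza": {"d4": 1.0},
    "hut": {"d1": 0.8},
    "hat": {"d3": 0.5},
}

ranked = n_best_listings(compose_triples(lattice, index))
print([doc_id for doc_id, _ in ranked])
```

Because every lattice arc contributes, a document that matches several plausible words can outrank one that matches only the single best recognition hypothesis.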

7. Granted patent

US08401853B2 System and method for enhancing voice-enabled search based on automated demographic identification (In force)
Publication (announcement) No.: US08401853B2
Publication (announcement) date: 2013-03-19
Application No.: US12888012
Filing date: 2010-09-22
Applicants: Michael Johnston, Srinivas Bangalore, Junlan Feng, Taniya Mishra
Inventors: Michael Johnston, Srinivas Bangalore, Junlan Feng, Taniya Mishra
IPC classification: G10L15/04
CPC classification: G06F17/30026, G06F17/30976, G06F17/30979, G10L15/22, G10L2015/227
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating responses to a user speech query in voice-enabled search based on metadata that include demographic features of the speaker. A system practicing the method recognizes received speech from a speaker to generate recognized speech, identifies metadata about the speaker from the received speech, and feeds the recognized speech and the metadata to a question-answering engine. Identifying the metadata about the speaker is based on voice characteristics of the received speech. The demographic features can include age, gender, socio-economic group, nationality, and/or region. The metadata identified about the speaker from the received speech can be combined with or override self-reported speaker demographic information.

8. Granted patent

US08897500B2 System and method for dynamic facial features for speaker recognition (In force)
Publication (announcement) No.: US08897500B2
Publication (announcement) date: 2014-11-25
Application No.: US13101704
Filing date: 2011-05-05
Applicants: Ann K. Syrdal, Sumit Chopra, Patrick Haffner, Taniya Mishra, Ilija Zeljkovic, Eric Zavesky
Inventors: Ann K. Syrdal, Sumit Chopra, Patrick Haffner, Taniya Mishra, Ilija Zeljkovic, Eric Zavesky
IPC classification: G06K9/00, G10L17/24, G06F21/32
CPC classification: G10L15/25, G06F21/32, G06F2221/2103, G06K9/00255, G06K9/00281, G06K9/00288, G06K9/00315, G06K9/00335, G10L17/24, G10L21/06
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for performing speaker verification. A system configured to practice the method receives a request to verify a speaker, generates a text challenge that is unique to the request, and, in response to the request, prompts the speaker to utter the text challenge. Then the system records a dynamic image feature of the speaker as the speaker utters the text challenge, and performs speaker verification based on the dynamic image feature and the text challenge. Recording the dynamic image feature of the speaker can include recording video of the speaker while speaking the text challenge. The dynamic feature can include a movement pattern of head, lips, mouth, eyes, and/or eyebrows of the speaker. The dynamic image feature can relate to phonetic content of the speaker speaking the challenge, speech prosody, and the speaker's facial expression responding to content of the challenge.
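
The challenge-and-verify flow in this abstract might be sketched in Python as below. The word list, the request-id binding, the 0.8 threshold, and in particular `face_motion_score` (standing in for the output of some video model that compares recorded facial movement with the enrolled speaker) are hypothetical. Random word choice also makes a repeated phrase unlikely but does not strictly guarantee uniqueness per request.

```python
import secrets

# Hypothetical vocabulary from which challenge phrases are drawn.
WORDS = ["amber", "river", "seven", "candle", "orbit", "maple", "tiger", "violet"]

def make_challenge(request_id, n_words=4):
    """Create a random phrase tied to a single verification request."""
    phrase = " ".join(secrets.choice(WORDS) for _ in range(n_words))
    return {"request_id": request_id, "phrase": phrase}

def verify(challenge, request_id, spoken_text, face_motion_score, threshold=0.8):
    """Accept only if the request matches, the recognized speech equals the
    challenge phrase, and the (assumed) facial-motion score is high enough."""
    same_request = challenge["request_id"] == request_id
    same_text = spoken_text.strip().lower() == challenge["phrase"]
    return same_request and same_text and face_motion_score >= threshold

challenge = make_challenge("req-001")
accepted = verify(challenge, "req-001", challenge["phrase"], face_motion_score=0.93)
replayed = verify(challenge, "req-002", challenge["phrase"], face_motion_score=0.93)
print(accepted, replayed)
```

Tying the phrase to one request is what makes a replayed recording from an earlier session fail, since the old video would show the speaker saying a different phrase.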

9. Patent application

US20120281885A1 SYSTEM AND METHOD FOR DYNAMIC FACIAL FEATURES FOR SPEAKER RECOGNITION (In force)
Publication (announcement) No.: US20120281885A1
Publication (announcement) date: 2012-11-08
Application No.: US13101704
Filing date: 2011-05-05
Applicants: Ann K. SYRDAL, Sumit Chopra, Patrick Haffner, Taniya Mishra, Ilija Zeljkovic, Eric Zavesky
Inventors: Ann K. SYRDAL, Sumit Chopra, Patrick Haffner, Taniya Mishra, Ilija Zeljkovic, Eric Zavesky
IPC classification: G06K9/00
CPC classification: G10L15/25, G06F21/32, G06F2221/2103, G06K9/00255, G06K9/00281, G06K9/00288, G06K9/00315, G06K9/00335, G10L17/24, G10L21/06
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for performing speaker verification. A system configured to practice the method receives a request to verify a speaker, generates a text challenge that is unique to the request, and, in response to the request, prompts the speaker to utter the text challenge. Then the system records a dynamic image feature of the speaker as the speaker utters the text challenge, and performs speaker verification based on the dynamic image feature and the text challenge. Recording the dynamic image feature of the speaker can include recording video of the speaker while speaking the text challenge. The dynamic feature can include a movement pattern of head, lips, mouth, eyes, and/or eyebrows of the speaker. The dynamic image feature can relate to phonetic content of the speaker speaking the challenge, speech prosody, and the speaker's facial expression responding to content of the challenge.

10. Granted patent

US09240180B2 System and method for low-latency web-based text-to-speech without plugins (In force)
Publication (announcement) No.: US09240180B2
Publication (announcement) date: 2016-01-19
Application No.: US13308860
Filing date: 2011-12-01
Applicants: Alistair D. Conkie, Mark Charles Beutnagel, Taniya Mishra
Inventors: Alistair D. Conkie, Mark Charles Beutnagel, Taniya Mishra
IPC classification: G10L13/00, G10L13/08, G10L13/10
CPC classification: G10L13/04, G10L13/10
Abstract: Disclosed herein are systems, methods, and non-transitory computer-readable storage media for reducing latency in web-browsing TTS systems without the use of a plug-in or Flash® module. A system configured according to the disclosed methods allows the browser to send prosodically meaningful sections of text to a web server. A TTS server then converts intonational phrases of the text into audio and responds to the browser with the audio file. The system saves the audio file in a cache, with the file indexed by a unique identifier. As the system continues converting text into speech, when identical text appears the system uses the cached audio corresponding to the identical text without the need for re-synthesis via the TTS server.
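
The caching behaviour described in this abstract can be illustrated with a short Python sketch. It assumes that the unique identifier is a SHA-256 hash of the phrase text and uses `fake_tts` as a stand-in for the TTS server; both are choices made for the example, since the abstract does not say how the identifier is derived.

```python
import hashlib

class TTSCache:
    """Cache synthesized audio per phrase so identical text is synthesized once."""

    def __init__(self, synthesize):
        self._synthesize = synthesize  # callable: text -> audio bytes (TTS server stand-in)
        self._audio = {}               # identifier -> audio bytes
        self.synth_calls = 0           # counts how often the TTS back end was actually used

    @staticmethod
    def key(phrase):
        """Identifier for a phrase; hashing the exact text is an assumption."""
        return hashlib.sha256(phrase.encode("utf-8")).hexdigest()

    def get_audio(self, phrase):
        """Return cached audio for the phrase, synthesizing only on a cache miss."""
        identifier = self.key(phrase)
        if identifier not in self._audio:
            self._audio[identifier] = self._synthesize(phrase)
            self.synth_calls += 1
        return self._audio[identifier]

def fake_tts(text):
    """Placeholder synthesizer that returns tagged bytes instead of real audio."""
    return ("AUDIO:" + text).encode("utf-8")

cache = TTSCache(fake_tts)
phrases = ["Welcome back.", "Your balance is low.", "Welcome back."]
clips = [cache.get_audio(p) for p in phrases]
print(cache.synth_calls)
```

The third phrase repeats the first, so it is served from the cache and the synthesizer runs only twice.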
