    • 4. Invention application
    • SYSTEM AND METHOD FOR USING PROSODY FOR VOICE-ENABLED SEARCH
    • US20120072217A1
    • 2012-03-22
    • US12884959
    • 2010-09-17
    • Srinivas Bangalore; Junlan Feng; Michael Johnston; Taniya Mishra
    • Srinivas Bangalore; Junlan Feng; Michael Johnston; Taniya Mishra
    • G10L15/06
    • G10L15/1807; G10L25/54; G10L25/63; G10L2015/226; G10L2015/227
    • Disclosed herein are systems, methods, and non-transitory computer-readable storage media for approximating relevant responses to a user query with voice-enabled search. A system practicing the method receives a word lattice generated by an automatic speech recognizer based on a user speech and a prosodic analysis of the user speech, generates a reweighted word lattice based on the word lattice and the prosodic analysis, approximates based on the reweighted word lattice one or more relevant responses to the query, and presents to a user the responses to the query. The prosodic analysis examines metalinguistic information of the user speech and can identify the most salient subject matter of the speech, assess how confident a speaker is in the content of his or her speech, and identify the attitude, mood, emotion, sentiment, etc. of the speaker. Other information not described in the content of the speech can also be used.
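The reweighting step described in the abstract above can be illustrated with a minimal Python sketch. It is not the patented method: the lattice layout (integer states in topological order, arcs as `(start, end, word, logprob)` tuples), the per-arc `prosody_bonus` scores, and the additive `alpha` scaling are all assumptions made for illustration.

```python
def reweight_lattice(arcs, prosody_bonus, alpha=1.0):
    """Add a scaled prosodic bonus to each arc's log-probability.

    arcs: list of (start, end, word, logprob) tuples.
    prosody_bonus: dict mapping arc index -> bonus (hypothetical output
    of a prosodic analyzer; arcs not listed get no bonus).
    """
    return [
        (s, e, w, lp + alpha * prosody_bonus.get(i, 0.0))
        for i, (s, e, w, lp) in enumerate(arcs)
    ]


def best_path(arcs, start, end):
    """Return the highest-scoring word sequence from start to end.

    Assumes integer states numbered so that every arc goes from a
    lower state to a higher one (a topological order).
    """
    best = {start: (0.0, [])}
    for s, e, w, lp in sorted(arcs, key=lambda a: a[0]):
        if s not in best:
            continue  # state not reachable from start
        score = best[s][0] + lp
        if e not in best or score > best[e][0]:
            best[e] = (score, best[s][1] + [w])
    return best[end][1]


# Toy lattice: the recognizer slightly prefers "piazza" over "pizza".
arcs = [
    (0, 1, "find", -0.1),
    (1, 2, "pizza", -0.7),
    (1, 2, "piazza", -0.6),
    (2, 3, "nearby", -0.2),
]
before = best_path(arcs, 0, 3)
# Assumed prosodic evidence favouring arc 1 ("pizza").
after = best_path(reweight_lattice(arcs, {1: 0.3}), 0, 3)
```

In this toy case the bonus is large enough to flip the best path from the "piazza" hypothesis to the "pizza" hypothesis; a real system would derive the bonus from acoustic features rather than a hand-written dictionary.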
    • 6. Granted invention patent
    • System and method for tightly coupling automatic speech recognition and search
    • US08831944B2
    • 2014-09-09
    • US12638649
    • 2009-12-15
    • Srinivas Bangalore; Taniya Mishra
    • Srinivas Bangalore; Taniya Mishra
    • G10L15/14; G10L15/08; G06F17/30
    • G10L15/18; G06F17/30637; G06F17/30663; G10L15/083
    • Disclosed herein are systems, methods, and computer-readable storage media for performing a search. A system configured to practice the method first receives from an automatic speech recognition (ASR) system a word lattice based on speech query and receives indexed documents from an information repository. The system composes, based on the word lattice and the indexed documents, at least one triple including a query word, selected indexed document, and weight. The system generates an N-best path through the word lattice based on the at least one triple and re-ranks ASR output based on the N-best path. The system aggregates each weight across the query words to generate N-best listings and returns search results to the speech query based on the re-ranked ASR output and the N-best listings. The lattice can be a confusion network, the arc density of which can be adjusted for a desired performance level.
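The triple composition and weight aggregation mentioned in the abstract above might look roughly like the following Python sketch. The data shapes are assumptions, not taken from the patent: the lattice is flattened to a hypothetical `word -> posterior` dictionary, the index is a hypothetical `word -> {doc_id: relevance}` dictionary, and multiplying posterior by relevance is just one plausible way to form a weight. The re-ranking of ASR output along the N-best path is not shown.

```python
import heapq
from collections import defaultdict


def compose_triples(lattice_words, index):
    """Yield (query_word, doc_id, weight) for lattice words found in the index.

    lattice_words: dict word -> ASR posterior (assumed flattening of the lattice).
    index: dict word -> {doc_id: relevance} (assumed, e.g. TF-IDF scores).
    """
    for word, posterior in lattice_words.items():
        for doc_id, relevance in index.get(word, {}).items():
            yield word, doc_id, posterior * relevance


def n_best_listings(triples, n):
    """Sum weights per document across query words and keep the top n."""
    totals = defaultdict(float)
    for _word, doc_id, weight in triples:
        totals[doc_id] += weight
    return heapq.nlargest(n, totals.items(), key=lambda item: item[1])


# Toy data: competing ASR hypotheses and a tiny business-listing index.
lattice_words = {"pizza": 0.6, "piazza": 0.4, "hut": 0.9}
index = {
    "pizza": {"Pizza Hut": 2.0, "Pizza Palace": 1.5},
    "hut": {"Pizza Hut": 1.0},
    "piazza": {"Piazza Cafe": 1.0},
}
top = n_best_listings(compose_triples(lattice_words, index), n=2)
top_ids = [doc_id for doc_id, _score in top]
```

Because "Pizza Hut" collects weight from two query words, it ranks first even though each individual word hypothesis is uncertain, which is the intuition behind aggregating across query words.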
    • 10. Granted invention patent
    • System and method for low-latency web-based text-to-speech without plugins
    • US09240180B2
    • 2016-01-19
    • US13308860
    • 2011-12-01
    • Alistair D. Conkie; Mark Charles Beutnagel; Taniya Mishra
    • Alistair D. Conkie; Mark Charles Beutnagel; Taniya Mishra
    • G10L13/00; G10L13/08; G10L13/10
    • G10L13/04; G10L13/10
    • Disclosed herein are systems, methods, and non-transitory computer-readable storage media for reducing latency in web-browsing TTS systems without the use of a plug-in or Flash® module. A system configured according to the disclosed methods allows the browser to send prosodically meaningful sections of text to a web server. A TTS server then converts intonational phrases of the text into audio and responds to the browser with the audio file. The system saves the audio file in a cache, with the file indexed by a unique identifier. As the system continues converting text into speech, when identical text appears the system uses the cached audio corresponding to the identical text without the need for re-synthesis via the TTS server.
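The caching behaviour described in the abstract above can be sketched in Python as follows. This is an illustration under assumptions, not the patented implementation: the class name `PhraseAudioCache`, the use of a SHA-256 digest of the phrase as the "unique identifier", and the `synthesize` callable standing in for the TTS server are all hypothetical.

```python
import hashlib


class PhraseAudioCache:
    """Cache synthesized audio per phrase, keyed by a hash of the text."""

    def __init__(self, synthesize):
        # synthesize: callable text -> bytes, a stand-in for the TTS server.
        self._synthesize = synthesize
        self._cache = {}
        self.server_calls = 0

    @staticmethod
    def key(phrase):
        """Derive an identifier from the exact phrase text (assumed scheme)."""
        return hashlib.sha256(phrase.encode("utf-8")).hexdigest()

    def audio_for(self, phrase):
        """Return cached audio, calling the TTS stand-in only on a miss."""
        k = self.key(phrase)
        if k not in self._cache:
            self.server_calls += 1
            self._cache[k] = self._synthesize(phrase)
        return self._cache[k]


def fake_tts(text):
    """Placeholder synthesizer that returns tagged bytes instead of audio."""
    return ("AUDIO:" + text).encode("utf-8")


cache = PhraseAudioCache(fake_tts)
phrases = ["Welcome back,", "your order has shipped.", "Welcome back,"]
clips = [cache.audio_for(p) for p in phrases]
```

Here the third phrase repeats the first, so only two calls reach the synthesizer stand-in. Keying on the exact text keeps the sketch faithful to the abstract's "identical text" condition; any normalization (case folding, whitespace trimming) would be a further design choice.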