会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明申请
    • System and Method for Generalized Preselection for Unit Selection Synthesis
    • 单位选择综合广义预选系统与方法
    • US20140350940A1
    • 2014-11-27
    • US14454123
    • 2014-08-07
    • AT&T Intellectual Property I, L.P.
    • Alistair D. CONKIEMark BEUTNAGELYeon-Jun KIMAnn K. SYRDAL
    • G10L13/06G10L13/047
    • G10L13/06G10L13/00G10L13/047
    • Disclosed herein are systems, computer-implemented methods, and computer-readable storage media for unit selection synthesis. The method causes a computing device to add a supplemental phoneset to a speech synthesizer front end having an existing phoneset, modify a unit preselection process based on the supplemental phoneset, preselect units from the supplemental phoneset and the existing phoneset based on the modified unit preselection process, and generate speech based on the preselected units. The supplemental phoneset can be a variation of the existing phoneset, can include a word boundary feature, can include a cluster feature where initial consonant clusters and some word boundaries are marked with diacritics, can include a function word feature which marks units as originating from a function word or a content word, and/or can include a pre-vocalic or post-vocalic feature. The speech synthesizer front end can incorporates the supplemental phoneset as an extra feature.
    • 本文公开了用于单元选择合成的系统,计算机实现的方法和计算机可读存储介质。 该方法使得计算设备将辅助电话机添加到具有现有电话机的语音合成器前端,基于补充电话机修改单元预选过程,基于修改的单位预选过程从辅助电话机和现有电话机中预选单元 ,并根据预选单位产生语音。 补充手机可以是现有手机的变体,可以包括字边界特征,可以包括其中初始辅音簇和一些字边界用变音符标记的群集特征,可以包括将单位标记为源自于 功能词或内容词,和/或可以包括语音前或后声部特征。 语音合成器前端可以将补充的电话机作为额外的功能。
    • 6. 发明申请
    • SYSTEM AND METHOD FOR DATA-DRIVEN INTONATION GENERATION
    • 用于数据驱动产生的系统和方法
    • US20150149178A1
    • 2015-05-28
    • US14087840
    • 2013-11-22
    • AT&T Intellectual Property I, L.P.
    • Yeon-Jun KIMMark Charles BEUTNAGELAlistair D. CONKIETaniya MISHRA
    • G10L13/02
    • G10L13/10
    • Systems, methods, and computer-readable storage media for text-to-speech processing having an improved intonation. The system first receives text to be converted to speech, the text having a first segment and a second segment. The system then compares the text to a database of stored utterances, identifying in the database a first utterance corresponding to the first segment and determining an intonation of the first utterance. When the database does not contain a second utterance corresponding to the second segment, the system generates the speech corresponding to the text by combining the first utterance with a generated second utterance corresponding to the second segment, the generated second utterance having the intonation matching, or based on, the first utterance. These actions lead to an improved, smoother, more human-like synthetic speech output from the system.
    • 用于具有改进的语调的文本到语音处理的系统,方法和计算机可读存储介质。 系统首先接收要转换为语音的文本,该文本具有第一段和第二段。 然后,系统将文本与存储的话语的数据库进行比较,在数据库中标识对应于第一段的第一个发音,并确定第一个发音的语调。 当数据库不包含对应于第二段的第二话语时,系统通过将第一个发音与对应于第二个段的所生成的第二个发音组合,生成具有语调匹配的第二个话语,或者 基于第一个话语。 这些动作导致系统的改进,更平滑,更人性化的合成语音输出。
    • 8. 发明申请
    • System and Method for Cloud-Based Text-to-Speech Web Services
    • 基于云的文本到语音Web服务的系统和方法
    • US20150221298A1
    • 2015-08-06
    • US14684893
    • 2015-04-13
    • AT&T Intellectual Property I, L.P.
    • Mark Charles BEUTNAGELAlistair D. CONKIEYeon-Jun KIMHorst Juergen SCHROETER
    • G10L13/04
    • G10L13/04G10L13/00G10L13/043
    • Disclosed herein are systems, methods, and non-transitory computer-readable storage media for generating speech. One variation of the method is from a server side, and another variation of the method is from a client side. The server side method, as implemented by a network-based automatic speech processing system, includes first receiving, from a network client independent of knowledge of internal operations of the system, a request to generate a text-to-speech voice. The request can include speech samples, transcriptions of the speech samples, and metadata describing the speech samples. The system extracts sound units from the speech samples based on the transcriptions and generates an interactive demonstration of the text-to-speech voice based on the sound units, the transcriptions, and the metadata, wherein the interactive demonstration hides a back end processing implementation from the network client. The system provides access to the interactive demonstration to the network client.
    • 本文公开了用于产生语音的系统,方法和非暂时的计算机可读存储介质。 该方法的一个变体是来自服务器端,并且该方法的另一变体是来自客户端。 由基于网络的自动语音处理系统实现的服务器端方法包括首先从网络客户端接收与系统的内部操作相关的知识,生成文本到语音语音的请求。 该请求可以包括语音样本,语音样本的转录以及描述语音样本的元数据。 该系统基于转录从语音样本中提取声音单元,并基于声音单元,转录和元数据生成文本到语音语音的交互式演示,其中交互式演示隐藏了后端处理实现 网络客户端。 该系统提供对网络客户端的交互式演示的访问。