专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

71. 发明申请

US20140324438A1 SCREEN READER HAVING CONCURRENT COMMUNICATION OF NON-TEXTUAL INFORMATION 有权
标题翻译：具有非文本信息的同时通信的屏幕阅读器
公开(公告)号：US20140324438A1
公开(公告)日：2014-10-30
申请号：US14329543
申请日：2014-07-11
申请人： Freedom Scientific, Inc.
发明人： Christian D. Hofstader , Glen Gordon , Eric Damery , Ralph Ocampo , David Baker , Joseph K. Stephen
IPC分类号： G10L13/033 , G10L13/08 , G10L13/04
CPC分类号： G10L13/0335 , G09B21/006 , G10L13/033 , G10L13/043 , G10L13/08 , G10L2013/083
摘要： A screen reader software product for low-vision users, the software having a reader module collecting textual and non-textual display information generated by a web browser or word processor. Font styling, interface layout information and the like are communicated to the end user by sounds broadcast simultaneously rather than serially with the synthesized speech to improve the speed and efficiency in which information may be digested by the end user.
摘要翻译：一种用于低视力用户的屏幕阅读器软件产品，该软件具有收集由网页浏览器或文字处理器生成的文本和非文本显示信息的阅读器模块。字体样式，界面布局信息等通过同时声音广播而不是与合成语音串行地传送给最终用户，以提高最终用户可以消化信息的速度和效率。

72. 发明申请

US20140305288A1 SYNTHETIC SIMULATION OF A MEDIA RECORDING 审中-公开
标题翻译：媒体记录的合成模拟
公开(公告)号：US20140305288A1
公开(公告)日：2014-10-16
申请号：US14313874
申请日：2014-06-24
申请人： Hank Risan
发明人： Hank Risan
IPC分类号： G10H7/00
CPC分类号： G10H7/00 , G10H7/02 , G10H2240/145 , G10L13/033 , G10L25/48
摘要： A method and system for generating a synthetic simulation of a media recording is disclosed. One embodiment accesses a sound reference archive and heuristically creates a new sound that is matched against at least one sound in the sound reference archive. The media recording is analyzed and a synthetic sound based on the analyzing of the media recording is generated.
摘要翻译：公开了一种用于产生媒体记录的合成模拟的方法和系统。一个实施例访问声音参考存档，并且启发式地创建与声音参考存档中的至少一个声音匹配的新声音。分析介质记录，并且生成基于介质记录分析的合成声音。

73. 发明授权

US08825486B2 Method and apparatus for generating synthetic speech with contrastive stress 有权
公开(公告)号：US08825486B2
公开(公告)日：2014-09-02
申请号：US14161535
申请日：2014-01-22
申请人： Nuance Communications, Inc.
发明人： Darren C. Meyer , Stephen R. Springer
IPC分类号： G10L13/00 , G10L13/02
CPC分类号： G10L13/02 , G10L13/00 , G10L13/033 , G10L13/04
摘要： Techniques for generating synthetic speech with contrastive stress. In one aspect, a speech-enabled application generates a text input including a text transcription of a desired speech output, and inputs the text input to a speech synthesis system. The synthesis system generates an audio speech output corresponding to at least a portion of the text input, with at least one portion carrying contrastive stress, and provides the audio speech output for the speech-enabled application. In another aspect, a speech-enabled application inputs a plurality of text strings, each corresponding to a portion of a desired speech output, to a software module for rendering contrastive stress. The software module identifies a plurality of audio recordings that render at least one portion of at least one of the text strings as speech carrying contrastive stress. The speech-enabled application generates an audio speech output corresponding to the desired speech output using the audio recordings.

74. 发明授权

US08812324B2 Coding, modification and synthesis of speech segments 有权
标题翻译：语音段的编码，修改和综合
公开(公告)号：US08812324B2
公开(公告)日：2014-08-19
申请号：US13254479
申请日：2010-12-21
申请人： Miguel Angel Rodriguez Crespo , Jose Gregorio Escalada Sardina , Ana Armenta Lopez de Vicuna
发明人： Miguel Angel Rodriguez Crespo , Jose Gregorio Escalada Sardina , Ana Armenta Lopez de Vicuna
IPC分类号： G10L13/00
CPC分类号： G10L13/033 , G10L13/06 , G10L19/093
摘要： The invention relates to a method for speech signal analysis, modification and synthesis comprising a phase for the location of analysis windows by means of an iterative process for the determination of the phase of the first sinusoidal component and comparison between the phase value of said component and a predetermined value, a phase for the selection of analysis frames corresponding to an allophone and readjustment of the duration and the fundamental frequency according to certain thresholds and a phase for the generation of synthetic speech from synthesis frames taking the information of the closest analysis frame as spectral information of the synthesis frame and taking as many synthesis frames as periods that the synthetic signal has. The method allows a coherent location of the analysis windows within the periods of the signal and the exact generation of the synthesis instants in a manner synchronous with the fundamental period.
摘要翻译：本发明涉及一种用于语音信号分析，修改和合成的方法，其包括通过用于确定第一正弦分量的相位的迭代过程用于分析窗口的位置的相位以及所述分量的相位值与预定值，用于选择对应于异音素的分析帧的相位，以及根据某些阈值重新调整持续时间和基本频率的相位，以及使用最接近的分析帧的信息从综合帧产生合成语音的相位作为合成帧的频谱信息，并且获取与合成信号具有的周期一样多的合成帧。该方法允许分析窗口在信号的周期内以与基本周期同步的方式精确地产生合成时刻的相干位置。

75. 发明授权

US08793128B2 Speech signal processing system, speech signal processing method and speech signal processing method program using noise environment and volume of an input speech signal at a time point 有权
标题翻译：语音信号处理系统，语音信号处理方法和语音信号处理方法程序，在时间点上使用噪声环境和输入语音信号的音量
公开(公告)号：US08793128B2
公开(公告)日：2014-07-29
申请号：US13365848
申请日：2012-02-03
申请人： Kiyokazu Miki
发明人： Kiyokazu Miki
IPC分类号： G10L15/00 , G10L15/20
CPC分类号： G10L13/033 , G10L21/003 , G10L25/84
摘要： A speech signal processing system that includes a speech input unit for inputting a speech signal; input speech storage unit for storing an input speech signal that is the speech signal inputted through the speech input unit; characteristic estimation unit for referring to the input speech signal stored in the input speech storage unit, and estimating characteristics of an input speech indicated by the input speech signal, the characteristics including an environmental sound included in the input speech signal; reference speech output unit for causing a predetermined speech signal that becomes a reference speech, to output; and characteristic adding unit for adding the characteristics of the input speech estimated by the characteristic estimation unit, in a reference speech signal that is the speech signal caused to output by the reference speech output unit.
摘要翻译：一种语音信号处理系统，包括用于输入语音信号的语音输入单元; 输入语音存储单元，用于存储作为通过语音输入单元输入的语音信号的输入语音信号; 特征估计单元，用于参考存储在输入语音存储单元中的输入语音信号，以及估计由输入语音信号指示的输入语音的特性，包括包括在输入语音信号中的环境声音的特性; 用于使作为参考语音的预定语音信号的参考语音输出单元输出; 以及特征添加单元，用于将由特征估计单元估计的输入语音的特性相加在由参考语音输出单元输出的语音信号的参考语音信号中。

76. 发明申请

US20140188480A1 SYSTEM AND METHOD FOR GENERATING CUSTOMIZED TEXT-TO-SPEECH VOICES 有权
标题翻译：用于生成定制的文本到语音的系统和方法
公开(公告)号：US20140188480A1
公开(公告)日：2014-07-03
申请号：US14196578
申请日：2014-03-04
申请人： AT&T Intellectual Property II, L.P.
发明人： Srinivas BANGALORE , Junlan Feng , Mazin G. Rahim , Juergen Schroeter , Ann K. Syrdal , David Schulz
IPC分类号： G10L13/02
CPC分类号： G10L13/033 , G10L13/00 , G10L13/02 , G10L13/06 , G10L13/08 , G10L15/197
摘要： A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.
摘要翻译：公开了用于为特定应用产生定制的文本到语音语音的系统和方法。该方法包括通过选择用于生成与域相关联的自定义文本到语音语音的语音来生成自定义文本到语音语音，从预先存在的文本数据源收集与域相关联的文本数据，并使用收集的文本数据，通过搜索合成语音单元的预先存在的库存来选择适合于该域的语音单元，或者通过记录所选合成质量水平的最小库存来生成合成语音单元的域内库存。使用合成语音单元的域内库存来生成域的文本到语音定制语音。还可以使用主动学习技术来识别问题短语，其中只需要几分钟的记录数据来传送高质量的TTS定制语音。

77. 发明申请

US20140179281A1 MOBILE TERMINAL HAVING AUTO ANSWERING FUNCTION AND AUTO ANSWERING METHOD FOR USE IN THE MOBILE TERMINAL 有权
标题翻译：具有自动应答功能的移动终端和用于移动终端的自动应答方法
公开(公告)号：US20140179281A1
公开(公告)日：2014-06-26
申请号：US14076672
申请日：2013-11-11
申请人： LG Electronics Inc.
发明人： Kyounghwa KIM , Miyoung KIM
IPC分类号： H04M3/493 , G10L25/48 , H04W4/16
CPC分类号： H04M3/4936 , G06F3/167 , G10L13/033 , G10L15/26 , G10L25/15 , G10L25/48 , H04M1/57 , H04M1/645 , H04M1/72566 , H04M1/72569 , H04M1/72597 , H04M3/493 , H04M3/53391 , H04M2201/40 , H04M2250/74 , H04W4/16
摘要： A method for an auto answering function for use in a mobile terminal of a user including receiving an incoming call from a calling party, answering the incoming call when the mobile terminal is in an auto answering mode, providing a first audio output to the calling party when the calling party is identified as a target user, and receiving a first response input from the calling party after providing the first audio output. The method further includes modifying information of a first application of a plurality of applications based on the first response input, wherein the first application is identified based on content of the first response input.
摘要翻译：一种用于在用户的移动终端中使用的自动应答功能的方法，包括从主叫方接收来话呼叫，当移动终端处于自动应答模式时应答呼入，向主叫方提供第一音频输出当主叫方被识别为目标用户，并且在提供第一音频输出之后从主叫方接收到第一响应输入。该方法还包括基于第一响应输入修改多个应用的第一应用的信息，其中基于第一响应输入的内容识别第一应用。

78. 发明申请

US20140142952A1 ENHANCED INTERFACE FOR USE WITH SPEECH RECOGNITION 有权
标题翻译：使用语音识别的增强接口
公开(公告)号：US20140142952A1
公开(公告)日：2014-05-22
申请号：US14076776
申请日：2013-11-11
申请人： Verizon Services Corp.
发明人： James Mark Kondziela
IPC分类号： G10L21/16
CPC分类号： G10L21/16 , G10L13/033 , G10L15/22
摘要： Improved methods of presenting speech prompts to a user as part of an automated system that employs speech recognition or other voice input are described. The invention improves the user interface by providing in combination with at least one user prompt seeking a voice response, an enhanced user keyword prompt intended to facilitate the user selecting a keyword to speak in response to the user prompt. The enhanced keyword prompts may be the same words as those a user can speak as a reply to the user prompt but presented using a different audio presentation method, e.g., speech rate, audio level, or speaker voice, than used for the user prompt. In some cases, the user keyword prompts are different words from the expected user response keywords, or portions of words, e.g., truncated versions of keywords.
摘要翻译：描述了将采用语音识别或其他语音输入的自动化系统的一部分向用户呈现语音提示的改进方法。本发明通过与寻求语音响应的至少一个用户提示一起提供用户界面来改进用户界面，增强的用户关键字提示旨在促进用户响应于用户提示来选择关键字来说话。增强的关键词提示可以是与用户可以说话作为对用户提示的答复相同的单词，而是使用与为用户提示所使用的不同的音频呈现方法（例如语音速率，音频电平或扬声器语音）呈现。在某些情况下，用户关键字提示是与预期的用户响应关键字或单词的部分，例如关键字的截断版本不同的单词。

79. 发明申请

US20140074482A1 VOICE GUIDANCE SYSTEM AND ELECTRONIC EQUIPMENT 有权
标题翻译：语音指导系统和电子设备
公开(公告)号：US20140074482A1
公开(公告)日：2014-03-13
申请号：US13972959
申请日：2013-08-22
申请人： Renesas Electronics Corporation
发明人： Kazuyuki Ohno
IPC分类号： G10L21/06
CPC分类号： G10L21/06 , G06F3/167 , G10L13/033 , G10L13/04 , G10L21/047 , H04N21/482
摘要： A voice guidance system is provided in which the voice guidance is enabled to easily follow a trend of change intervals, a rapid change of change intervals, etc. in a menu operation. The voice guidance system is configured with an input analyzing unit which inputs and analyzes an operation instruction signal of a menu item, a voice guidance control unit which controls voice guidance of the menu item according to the analysis result by the input analyzing unit, and a textual guidance control unit which performs display control of the menu item according to the analysis result by the input analyzing unit. The voice guidance control unit determines reproduction speed of the voice guidance according to the analysis result, on the basis of a speed trend obtained from a speed history as a set of plural pieces of reproduction speed information.
摘要翻译：提供了语音引导系统，其中语音引导能够容易地跟随菜单操作中的变化间隔的趋势，变化间隔的快速变化等。语音引导系统配置有输入分析菜单项的操作指示信号的输入分析单元，根据输入分析单元的分析结果来控制菜单项的语音指导的语音引导控制单元，以及文本引导控制单元，其根据输入分析单元的分析结果执行菜单项的显示控制。语音引导控制单元根据分析结果，基于从速度历史获得的速度趋势作为多个再现速度信息的集合来确定语音引导的再现速度。

80. 发明授权

US08666746B2 System and method for generating customized text-to-speech voices 有权
标题翻译：用于生成定制的文本到语音语音的系统和方法
公开(公告)号：US08666746B2
公开(公告)日：2014-03-04
申请号：US10845364
申请日：2004-05-13
申请人： Srinivas Bangalore , Junlan Feng , Mazin G. Rahim , Juergen Schroeter , David Eugene Schulz , Ann K. Syrdal
发明人： Srinivas Bangalore , Junlan Feng , Mazin G. Rahim , Juergen Schroeter , David Eugene Schulz , Ann K. Syrdal
IPC分类号： G10L13/00 , G10L13/08
CPC分类号： G10L13/033 , G10L13/00 , G10L13/02 , G10L13/06 , G10L13/08 , G10L15/197
摘要： A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.
摘要翻译：公开了用于为特定应用产生定制的文本到语音语音的系统和方法。该方法包括通过选择用于生成与域相关联的自定义文本到语音语音的语音来生成自定义文本到语音语音，从预先存在的文本数据源收集与域相关联的文本数据，并使用收集的文本数据，通过搜索合成语音单元的预先存在的库存来选择适合于该域的语音单元，或者通过记录所选合成质量水平的最小库存来生成合成语音单元的域内库存。使用合成语音单元的域内库存来生成域的文本到语音定制语音。还可以使用主动学习技术来识别问题短语，其中只需要几分钟的记录数据来传送高质量的TTS定制语音。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式