会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 74. 发明授权
    • Coding, modification and synthesis of speech segments
    • 语音段的编码,修改和综合
    • US08812324B2
    • 2014-08-19
    • US13254479
    • 2010-12-21
    • Miguel Angel Rodriguez CrespoJose Gregorio Escalada SardinaAna Armenta Lopez de Vicuna
    • Miguel Angel Rodriguez CrespoJose Gregorio Escalada SardinaAna Armenta Lopez de Vicuna
    • G10L13/00
    • G10L13/033G10L13/06G10L19/093
    • The invention relates to a method for speech signal analysis, modification and synthesis comprising a phase for the location of analysis windows by means of an iterative process for the determination of the phase of the first sinusoidal component and comparison between the phase value of said component and a predetermined value, a phase for the selection of analysis frames corresponding to an allophone and readjustment of the duration and the fundamental frequency according to certain thresholds and a phase for the generation of synthetic speech from synthesis frames taking the information of the closest analysis frame as spectral information of the synthesis frame and taking as many synthesis frames as periods that the synthetic signal has. The method allows a coherent location of the analysis windows within the periods of the signal and the exact generation of the synthesis instants in a manner synchronous with the fundamental period.
    • 本发明涉及一种用于语音信号分析,修改和合成的方法,其包括通过用于确定第一正弦分量的相位的迭代过程用于分析窗口的位置的相位以及所述分量的相位值与 预定值,用于选择对应于异音素的分析帧的相位,以及根据某些阈值重新调整持续时间和基本频率的相位,以及使用最接近的分析帧的信息从综合帧产生合成语音的相位作为 合成帧的频谱信息,并且获取与合成信号具有的周期一样多的合成帧。 该方法允许分析窗口在信号的周期内以与基本周期同步的方式精确地产生合成时刻的相干位置。
    • 75. 发明授权
    • Speech signal processing system, speech signal processing method and speech signal processing method program using noise environment and volume of an input speech signal at a time point
    • 语音信号处理系统,语音信号处理方法和语音信号处理方法程序,在时间点上使用噪声环境和输入语音信号的音量
    • US08793128B2
    • 2014-07-29
    • US13365848
    • 2012-02-03
    • Kiyokazu Miki
    • Kiyokazu Miki
    • G10L15/00G10L15/20
    • G10L13/033G10L21/003G10L25/84
    • A speech signal processing system that includes a speech input unit for inputting a speech signal; input speech storage unit for storing an input speech signal that is the speech signal inputted through the speech input unit; characteristic estimation unit for referring to the input speech signal stored in the input speech storage unit, and estimating characteristics of an input speech indicated by the input speech signal, the characteristics including an environmental sound included in the input speech signal; reference speech output unit for causing a predetermined speech signal that becomes a reference speech, to output; and characteristic adding unit for adding the characteristics of the input speech estimated by the characteristic estimation unit, in a reference speech signal that is the speech signal caused to output by the reference speech output unit.
    • 一种语音信号处理系统,包括用于输入语音信号的语音输入单元; 输入语音存储单元,用于存储作为通过语音输入单元输入的语音信号的输入语音信号; 特征估计单元,用于参考存储在输入语音存储单元中的输入语音信号,以及估计由输入语音信号指示的输入语音的特性,包括包括在输入语音信号中的环境声音的特性; 用于使作为参考语音的预定语音信号的参考语音输出单元输出; 以及特征添加单元,用于将由特征估计单元估计的输入语音的特性相加在由参考语音输出单元输出的语音信号的参考语音信号中。
    • 76. 发明申请
    • SYSTEM AND METHOD FOR GENERATING CUSTOMIZED TEXT-TO-SPEECH VOICES
    • 用于生成定制的文本到语音的系统和方法
    • US20140188480A1
    • 2014-07-03
    • US14196578
    • 2014-03-04
    • AT&T Intellectual Property II, L.P.
    • Srinivas BANGALOREJunlan FengMazin G. RahimJuergen SchroeterAnn K. SyrdalDavid Schulz
    • G10L13/02
    • G10L13/033G10L13/00G10L13/02G10L13/06G10L13/08G10L15/197
    • A system and method are disclosed for generating customized text-to-speech voices for a particular application. The method comprises generating a custom text-to-speech voice by selecting a voice for generating a custom text-to-speech voice associated with a domain, collecting text data associated with the domain from a pre-existing text data source and using the collected text data, generating an in-domain inventory of synthesis speech units by selecting speech units appropriate to the domain via a search of a pre-existing inventory of synthesis speech units, or by recording the minimal inventory for a selected level of synthesis quality. The text-to-speech custom voice for the domain is generated utilizing the in-domain inventory of synthesis speech units. Active learning techniques may also be employed to identify problem phrases wherein only a few minutes of recorded data is necessary to deliver a high quality TTS custom voice.
    • 公开了用于为特定应用产生定制的文本到语音语音的系统和方法。 该方法包括通过选择用于生成与域相关联的自定义文本到语音语音的语音来生成自定义文本到语音语音,从预先存在的文本数据源收集与域相关联的文本数据,并使用收集的 文本数据,通过搜索合成语音单元的预先存在的库存来选择适合于该域的语音单元,或者通过记录所选合成质量水平的最小库存来生成合成语音单元的域内库存。 使用合成语音单元的域内库存来生成域的文本到语音定制语音。 还可以使用主动学习技术来识别问题短语,其中只需要几分钟的记录数据来传送高质量的TTS定制语音。
    • 78. 发明申请
    • ENHANCED INTERFACE FOR USE WITH SPEECH RECOGNITION
    • 使用语音识别的增强接口
    • US20140142952A1
    • 2014-05-22
    • US14076776
    • 2013-11-11
    • Verizon Services Corp.
    • James Mark Kondziela
    • G10L21/16
    • G10L21/16G10L13/033G10L15/22
    • Improved methods of presenting speech prompts to a user as part of an automated system that employs speech recognition or other voice input are described. The invention improves the user interface by providing in combination with at least one user prompt seeking a voice response, an enhanced user keyword prompt intended to facilitate the user selecting a keyword to speak in response to the user prompt. The enhanced keyword prompts may be the same words as those a user can speak as a reply to the user prompt but presented using a different audio presentation method, e.g., speech rate, audio level, or speaker voice, than used for the user prompt. In some cases, the user keyword prompts are different words from the expected user response keywords, or portions of words, e.g., truncated versions of keywords.
    • 描述了将采用语音识别或其他语音输入的自动化系统的一部分向用户呈现语音提示的改进方法。 本发明通过与寻求语音响应的至少一个用户提示一起提供用户界面来改进用户界面,增强的用户关键字提示旨在促进用户响应于用户提示来选择关键字来说话。 增强的关键词提示可以是与用户可以说话作为对用户提示的答复相同的单词,而是使用与为用户提示所使用的不同的音频呈现方法(例如语音速率,音频电平或扬声器语音)呈现。 在某些情况下,用户关键字提示是与预期的用户响应关键字或单词的部分,例如关键字的截断版本不同的单词。
    • 79. 发明申请
    • VOICE GUIDANCE SYSTEM AND ELECTRONIC EQUIPMENT
    • 语音指导系统和电子设备
    • US20140074482A1
    • 2014-03-13
    • US13972959
    • 2013-08-22
    • Renesas Electronics Corporation
    • Kazuyuki Ohno
    • G10L21/06
    • G10L21/06G06F3/167G10L13/033G10L13/04G10L21/047H04N21/482
    • A voice guidance system is provided in which the voice guidance is enabled to easily follow a trend of change intervals, a rapid change of change intervals, etc. in a menu operation. The voice guidance system is configured with an input analyzing unit which inputs and analyzes an operation instruction signal of a menu item, a voice guidance control unit which controls voice guidance of the menu item according to the analysis result by the input analyzing unit, and a textual guidance control unit which performs display control of the menu item according to the analysis result by the input analyzing unit. The voice guidance control unit determines reproduction speed of the voice guidance according to the analysis result, on the basis of a speed trend obtained from a speed history as a set of plural pieces of reproduction speed information.
    • 提供了语音引导系统,其中语音引导能够容易地跟随菜单操作中的变化间隔的趋势,变化间隔的快速变化等。 语音引导系统配置有输入分析菜单项的操作指示信号的输入分析单元,根据输入分析单元的分析结果来控制菜单项的语音指导的语音引导控制单元,以及 文本引导控制单元,其根据输入分析单元的分析结果执行菜单项的显示控制。 语音引导控制单元根据分析结果,基于从速度历史获得的速度趋势作为多个再现速度信息的集合来确定语音引导的再现速度。