会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • SYSTEMS AND METHODS FOR ENCODING AUDIO SIGNALS
    • 编码音频信号的系统和方法
    • US20160300580A1
    • 2016-10-13
    • US14680360
    • 2015-04-07
    • Nuance Communications, Inc.
    • Slava ShechtmanAlexander Sorin
    • G10L19/02
    • G10L19/02G10L19/032G10L19/038
    • Some embodiments relate to techniques for encoding an audio signal represented by a plurality of frames including a first frame. The techniques include using at least one computer hardware processor to perform: obtaining an initial discrete spectral representation of the first frame; obtaining a primary discrete spectral representation of the initial discrete spectral representation at least in part by estimating a phase envelope of the initial discrete spectral representation and evaluating the estimated phase envelope at a discrete set of frequencies; calculating a residual discrete spectral representation of the initial discrete spectral representation based on the initial discrete spectral representation and the primary discrete spectral representation; and encoding the residual discrete spectral representation using a plurality of codewords.
    • 一些实施例涉及用于编码由包括第一帧的多个帧表示的音频信号的技术。 这些技术包括使用至少一个计算机硬件处理器来执行:获得第一帧的初始离散频谱表示; 至少部分地通过估计初始离散频谱表示的相位包络并以离散频率集合估计估计的相位包络来获得初始离散频谱表示的主离散频谱表示; 基于初始离散频谱表示和主离散频谱表示来计算初始离散频谱表示的残差离散频谱表示; 以及使用多个码字对残差离散频谱表示进行编码。
    • 2. 发明授权
    • Deriving geographic distribution of physiological or psychological conditions of human speakers while preserving personal privacy
    • 导出人类生理或心理状况的地理分布,同时保护个人隐私
    • US09159323B2
    • 2015-10-13
    • US13953527
    • 2013-07-29
    • Nuance Communications, Inc.
    • Slava ShechtmanRaphael Steinberg
    • G10L25/66G10L25/45G10L25/72G10L17/00G10L25/00
    • G10L17/005G10L25/00G10L2015/227
    • A method including: obtaining, via a plurality of communication devices, a plurality of speech signals respectively associated with human speakers, the speech signals including verbal components and non-verbal components; identifying a plurality of geographical locations, each geographic location associated with a respective one of the plurality of the communication devices; extracting the non-verbal components from the obtained speech signals; deducing physiological or psychological conditions of the human speakers by analyzing, over a specified period, the extracted non-verbal components, using predefined relations between characteristics of the non-verbal components and physiological or psychological conditions of the human speakers; and providing a geographical distribution of the deduced physiological or psychological conditions of the human speakers by associating the deduced physiological or psychological conditions of the human speakers with geographical locations thereof.
    • 一种方法,包括:通过多个通信设备获得分别与人类说话者相关联的多个语音信号,所述语音信号包括语言分量和非语言分量; 识别多个地理位置,每个地理位置与所述多个所述通信设备中的相应一个相关联; 从所获得的语音信号中提取非语言分量; 通过使用非语言成分的特征与人类说话者的生理或心理状态之间的预定义关系,在指定的时间段内分析所提取的非言语成分来推断人的说话者的生理或心理状况; 并通过将推断的人类发言者的生理或心理状况与其地理位置相关联来提供人类发言者的推导的生理或心理状况的地理分布。
    • 3. 发明申请
    • DERIVING GEOGRAPHIC DISTRIBUTION OF PHYSIOLOGICAL OR PSYCHOLOGICAL CONDITIONS OF HUMAN SPEAKERS WHILE RESERVING PERSONAL PRIVACY
    • 在保留个人隐私的情况下,提供人类生理或心理学条件的地理分布
    • US20130317825A1
    • 2013-11-28
    • US13953527
    • 2013-07-29
    • Nuance Communications, Inc.
    • Slava ShechtmanRaphael Steinberg
    • G10L17/00
    • G10L17/005G10L25/00G10L2015/227
    • A method including: obtaining, via a plurality of communication devices, a plurality of speech signals respectively associated with human speakers, the speech signals including verbal components and non-verbal components; identifying a plurality of geographical locations, each geographic location associated with a respective one of the plurality of the communication devices; extracting the non-verbal components from the obtained speech signals; deducing physiological or psychological conditions of the human speakers by analyzing, over a specified period, the extracted non-verbal components, using predefined relations between characteristics of the non-verbal components and physiological or psychological conditions of the human speakers; and providing a geographical distribution of the deduced physiological or psychological conditions of the human speakers by associating the deduced physiological or psychological conditions of the human speakers with geographical locations thereof.
    • 一种方法,包括:通过多个通信设备获得分别与人类说话者相关联的多个语音信号,所述语音信号包括语言分量和非语言分量; 识别多个地理位置,每个地理位置与所述多个所述通信设备中的相应一个相关联; 从所获得的语音信号中提取非语言分量; 通过使用非语言成分的特征与人类说话者的生理或心理状态之间的预定义关系,在指定的时间段内分析所提取的非言语成分来推断人的说话者的生理或心理状况; 并通过将推断的人类发言者的生理或心理状况与其地理位置相关联来提供人类发言者的推导的生理或心理状况的地理分布。
    • 4. 发明授权
    • Systems and methods for encoding audio signals
    • 用于编码音频信号的系统和方法
    • US09564140B2
    • 2017-02-07
    • US14680360
    • 2015-04-07
    • Nuance Communications, Inc.
    • Slava ShechtmanAlexander Sorin
    • G10L19/00G10L19/02
    • G10L19/02G10L19/032G10L19/038
    • Some embodiments relate to techniques for encoding an audio signal represented by a plurality of frames including a first frame. The techniques include using at least one computer hardware processor to perform: obtaining an initial discrete spectral representation of the first frame; obtaining a primary discrete spectral representation of the initial discrete spectral representation at least in part by estimating a phase envelope of the initial discrete spectral representation and evaluating the estimated phase envelope at a discrete set of frequencies; calculating a residual discrete spectral representation of the initial discrete spectral representation based on the initial discrete spectral representation and the primary discrete spectral representation; and encoding the residual discrete spectral representation using a plurality of codewords.
    • 一些实施例涉及用于编码由包括第一帧的多个帧表示的音频信号的技术。 这些技术包括使用至少一个计算机硬件处理器来执行:获得第一帧的初始离散频谱表示; 至少部分地通过估计初始离散频谱表示的相位包络并以离散频率集合估计估计的相位包络来获得初始离散频谱表示的主离散频谱表示; 基于初始离散频谱表示和主离散频谱表示来计算初始离散频谱表示的残差离散频谱表示; 以及使用多个码字对残差离散频谱表示进行编码。
    • 5. 发明授权
    • Method and apparatus for detecting synthesized speech
    • 用于检测合成语音的方法和装置
    • US09484036B2
    • 2016-11-01
    • US14012081
    • 2013-08-28
    • Nuance Communications, Inc.
    • Zvi KonsHagai AronowitzSlava Shechtman
    • G10L17/22G10L25/51G10L17/26
    • G10L17/22G10L17/26G10L25/51
    • Computer systems employing speaker verification as a security approach to prevent un-authorized access by intruders may be tricked by a synthetic speech with voice characteristics similar to those of an authorized user of the computer system. According to at least one example embodiment, a method and corresponding apparatus for detecting a synthetic speech signal include extracting a plurality of speech features from multiple segments of the speech signal; analyzing the plurality of speech features to determine whether the plurality of speech features exhibit periodic variation behavior; and determining whether the speech signal is a synthetic speech signal or a natural speech signal based on whether or not a periodic variation behavior of the plurality of speech features is detected. The embodiments of synthetic speech detection result in security enhancement of the computer system employing speaker verification.
    • 使用说话人验证作为防止入侵者的未授权访问的安全方法的计算机系统可能被具有类似于计算机系统的授权用户的语音特征的合成语音所欺骗。 根据至少一个示例性实施例,一种用于检测合成语音信号的方法和相应装置包括从语音信号的多个部分中提取多个语音特征; 分析所述多个语音特征以确定所述多个语音特征是否呈现周期性变化行为; 以及基于是否检测到所述多个语音特征的周期性变化行为来确定所述语音信号是合成语音信号还是自然语音信号。 合成语音检测的实施例导致使用说话者验证的计算机系统的安全性增强。
    • 6. 发明申请
    • Method and Apparatus for Detecting Synthesized Speech
    • 用于检测合成语音的方法和装置
    • US20150066512A1
    • 2015-03-05
    • US14012081
    • 2013-08-28
    • NUANCE COMMUNICATIONS, INC.
    • Zvi KonsHagai AronowitzSlava Shechtman
    • G10L17/22
    • G10L17/22G10L17/26G10L25/51
    • Computer systems employing speaker verification as a security approach to prevent un-authorized access by intruders may be tricked by a synthetic speech with voice characteristics similar to those of an authorized user of the computer system. According to at least one example embodiment, a method and corresponding apparatus for detecting a synthetic speech signal include extracting a plurality of speech features from multiple segments of the speech signal; analyzing the plurality of speech features to determine whether the plurality of speech features exhibit periodic variation behavior; and determining whether the speech signal is a synthetic speech signal or a natural speech signal based on whether or not a periodic variation behavior of the plurality of speech features is detected. The embodiments of synthetic speech detection result in security enhancement of the computer system employing speaker verification.
    • 使用说话人验证作为防止入侵者的未授权访问的安全方法的计算机系统可能被具有类似于计算机系统的授权用户的语音特征的合成语音所欺骗。 根据至少一个示例性实施例,一种用于检测合成语音信号的方法和相应装置包括从语音信号的多个部分中提取多个语音特征; 分析所述多个语音特征以确定所述多个语音特征是否呈现周期性变化行为; 以及基于是否检测到所述多个语音特征的周期性变化行为来确定所述语音信号是合成语音信号还是自然语音信号。 合成语音检测的实施例导致使用说话者验证的计算机系统的安全性增强。