Quick Patent Search - Fast search of global patents, free patent database for commercial use - IPRDB

1. Invention application

US20160300580A1 SYSTEMS AND METHODS FOR ENCODING AUDIO SIGNALS [In force]
Publication number: US20160300580A1
Publication date: 2016-10-13
Application number: US14680360
Filing date: 2015-04-07
Applicant: Nuance Communications, Inc.
Inventors: Slava Shechtman, Alexander Sorin
IPC classification: G10L19/02
CPC classification: G10L19/02, G10L19/032, G10L19/038
Abstract: Some embodiments relate to techniques for encoding an audio signal represented by a plurality of frames including a first frame. The techniques include using at least one computer hardware processor to perform: obtaining an initial discrete spectral representation of the first frame; obtaining a primary discrete spectral representation of the initial discrete spectral representation at least in part by estimating a phase envelope of the initial discrete spectral representation and evaluating the estimated phase envelope at a discrete set of frequencies; calculating a residual discrete spectral representation of the initial discrete spectral representation based on the initial discrete spectral representation and the primary discrete spectral representation; and encoding the residual discrete spectral representation using a plurality of codewords.
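
The steps named in the abstract above (a primary representation obtained from an estimated phase envelope, a residual, and codeword encoding of that residual) can be illustrated with a minimal Python sketch. This is not the patented method: the piecewise-linear envelope estimator, the `step` parameter, the scalar codebook, and every function name below are assumptions chosen only to make the data flow concrete.

```python
import math

def wrap(angle):
    """Wrap an angle in radians to the interval [-pi, pi)."""
    return (angle + math.pi) % (2 * math.pi) - math.pi

def estimate_envelope(freqs, phases, step=4):
    """Assumed estimator: keep every `step`-th sample (plus the last) as a knot."""
    idx = list(range(0, len(freqs), step))
    if idx[-1] != len(freqs) - 1:
        idx.append(len(freqs) - 1)
    return [(freqs[i], phases[i]) for i in idx]

def evaluate_envelope(knots, freqs):
    """Evaluate the piecewise-linear envelope at a discrete set of frequencies.

    Assumes at least two knots, strictly increasing knot frequencies, and
    every requested frequency inside the knot range.
    """
    values = []
    for f in freqs:
        for (f0, p0), (f1, p1) in zip(knots, knots[1:]):
            if f0 <= f <= f1:
                values.append(p0 + (f - f0) / (f1 - f0) * (p1 - p0))
                break
    return values

def encode_frame(freqs, phases, codebook, step=4):
    """Return the envelope knots and one codeword index per frequency."""
    knots = estimate_envelope(freqs, phases, step)
    primary = evaluate_envelope(knots, freqs)                   # primary representation
    residual = [wrap(p - q) for p, q in zip(phases, primary)]   # residual representation
    indices = [min(range(len(codebook)), key=lambda k: abs(r - codebook[k]))
               for r in residual]                               # nearest codeword
    return knots, indices

def decode_frame(freqs, knots, indices, codebook):
    """Rebuild approximate phases from the envelope plus quantized residual."""
    primary = evaluate_envelope(knots, freqs)
    return [q + codebook[k] for q, k in zip(primary, indices)]

# Hypothetical frame: nine frequencies with a smooth phase curve.
freqs = [100.0 * k for k in range(1, 10)]
phases = [0.01 * k * k for k in range(9)]
codebook = [-0.4, -0.2, 0.0, 0.2, 0.4]

knots, indices = encode_frame(freqs, phases, codebook)
decoded = decode_frame(freqs, knots, indices, codebook)
```

Because the envelope already captures most of the phase curve, the residual stays small and a coarse codebook is enough in this toy case; a real codec would likely use vector codebooks and a different envelope model.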

2. Invention grant

US09159323B2 Deriving geographic distribution of physiological or psychological conditions of human speakers while preserving personal privacy [In force]
Publication number: US09159323B2
Publication date: 2015-10-13
Application number: US13953527
Filing date: 2013-07-29
Applicant: Nuance Communications, Inc.
Inventors: Slava Shechtman, Raphael Steinberg
IPC classification: G10L25/66, G10L25/45, G10L25/72, G10L17/00, G10L25/00
CPC classification: G10L17/005, G10L25/00, G10L2015/227
Abstract: A method including: obtaining, via a plurality of communication devices, a plurality of speech signals respectively associated with human speakers, the speech signals including verbal components and non-verbal components; identifying a plurality of geographical locations, each geographic location associated with a respective one of the plurality of the communication devices; extracting the non-verbal components from the obtained speech signals; deducing physiological or psychological conditions of the human speakers by analyzing, over a specified period, the extracted non-verbal components, using predefined relations between characteristics of the non-verbal components and physiological or psychological conditions of the human speakers; and providing a geographical distribution of the deduced physiological or psychological conditions of the human speakers by associating the deduced physiological or psychological conditions of the human speakers with geographical locations thereof.
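
The final two steps of the abstract above (deducing a condition from non-verbal features, then associating it with a location) amount to a grouped count. The Python sketch below is only an illustration under stated assumptions: the feature names `cough_rate` and `pitch_jitter`, the thresholds, the condition labels, and the region names are all hypothetical, and the input is assumed to carry no speaker identity.

```python
from collections import Counter, defaultdict

def deduce_condition(features):
    """Hypothetical 'predefined relation' between non-verbal features and a condition."""
    if features["cough_rate"] > 0.5:
        return "respiratory"
    if features["pitch_jitter"] > 0.3:
        return "stress"
    return "none"

def geographic_distribution(observations):
    """Count deduced conditions per location.

    `observations` is an iterable of (location, features) pairs; no speaker
    identifier is passed in or stored, only the aggregate counts.
    """
    counts = defaultdict(Counter)
    for location, features in observations:
        counts[location][deduce_condition(features)] += 1
    return {location: dict(c) for location, c in counts.items()}

# Hypothetical observations collected over one analysis period.
observations = [
    ("region-A", {"cough_rate": 0.8, "pitch_jitter": 0.1}),
    ("region-A", {"cough_rate": 0.1, "pitch_jitter": 0.4}),
    ("region-B", {"cough_rate": 0.0, "pitch_jitter": 0.0}),
    ("region-A", {"cough_rate": 0.9, "pitch_jitter": 0.5}),
]
distribution = geographic_distribution(observations)
```

Keeping only per-location counts is one simple way to report a distribution without retaining per-speaker records; the abstract does not specify the privacy mechanism, so this choice is an assumption.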

3. Invention application

US20130317825A1 DERIVING GEOGRAPHIC DISTRIBUTION OF PHYSIOLOGICAL OR PSYCHOLOGICAL CONDITIONS OF HUMAN SPEAKERS WHILE RESERVING PERSONAL PRIVACY [In force]
Publication number: US20130317825A1
Publication date: 2013-11-28
Application number: US13953527
Filing date: 2013-07-29
Applicant: Nuance Communications, Inc.
Inventors: Slava Shechtman, Raphael Steinberg
IPC classification: G10L17/00
CPC classification: G10L17/005, G10L25/00, G10L2015/227
Abstract: A method including: obtaining, via a plurality of communication devices, a plurality of speech signals respectively associated with human speakers, the speech signals including verbal components and non-verbal components; identifying a plurality of geographical locations, each geographic location associated with a respective one of the plurality of the communication devices; extracting the non-verbal components from the obtained speech signals; deducing physiological or psychological conditions of the human speakers by analyzing, over a specified period, the extracted non-verbal components, using predefined relations between characteristics of the non-verbal components and physiological or psychological conditions of the human speakers; and providing a geographical distribution of the deduced physiological or psychological conditions of the human speakers by associating the deduced physiological or psychological conditions of the human speakers with geographical locations thereof.

4. Invention grant

US09564140B2 Systems and methods for encoding audio signals [In force]
Publication number: US09564140B2
Publication date: 2017-02-07
Application number: US14680360
Filing date: 2015-04-07
Applicant: Nuance Communications, Inc.
Inventors: Slava Shechtman, Alexander Sorin
IPC classification: G10L19/00, G10L19/02
CPC classification: G10L19/02, G10L19/032, G10L19/038
Abstract: Some embodiments relate to techniques for encoding an audio signal represented by a plurality of frames including a first frame. The techniques include using at least one computer hardware processor to perform: obtaining an initial discrete spectral representation of the first frame; obtaining a primary discrete spectral representation of the initial discrete spectral representation at least in part by estimating a phase envelope of the initial discrete spectral representation and evaluating the estimated phase envelope at a discrete set of frequencies; calculating a residual discrete spectral representation of the initial discrete spectral representation based on the initial discrete spectral representation and the primary discrete spectral representation; and encoding the residual discrete spectral representation using a plurality of codewords.

5. Invention grant

US09484036B2 Method and apparatus for detecting synthesized speech [In force]
Publication number: US09484036B2
Publication date: 2016-11-01
Application number: US14012081
Filing date: 2013-08-28
Applicant: Nuance Communications, Inc.
Inventors: Zvi Kons, Hagai Aronowitz, Slava Shechtman
IPC classification: G10L17/22, G10L25/51, G10L17/26
CPC classification: G10L17/22, G10L17/26, G10L25/51
Abstract: Computer systems employing speaker verification as a security approach to prevent un-authorized access by intruders may be tricked by a synthetic speech with voice characteristics similar to those of an authorized user of the computer system. According to at least one example embodiment, a method and corresponding apparatus for detecting a synthetic speech signal include extracting a plurality of speech features from multiple segments of the speech signal; analyzing the plurality of speech features to determine whether the plurality of speech features exhibit periodic variation behavior; and determining whether the speech signal is a synthetic speech signal or a natural speech signal based on whether or not a periodic variation behavior of the plurality of speech features is detected. The embodiments of synthetic speech detection result in security enhancement of the computer system employing speaker verification.
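
The decision rule in the abstract above (flag a signal as synthetic when its per-segment features vary periodically) can be sketched in Python. The abstract does not say how periodicity is detected; the normalized autocorrelation test, the `min_lag` and `threshold` values, and the function names below are assumptions for illustration only, and the input is a single hypothetical feature track with one value per segment.

```python
import math

def normalized_autocorr(track, lag):
    """Autocorrelation of a mean-removed track at `lag`, normalized by its energy."""
    n = len(track)
    mean = sum(track) / n
    centered = [v - mean for v in track]
    energy = sum(v * v for v in centered)
    if energy == 0:
        return 0.0  # a constant track has no variation to correlate
    return sum(centered[i] * centered[i + lag] for i in range(n - lag)) / energy

def has_periodic_variation(track, min_lag=2, threshold=0.5):
    """Assumed test: some lag between min_lag and half the track length correlates strongly."""
    max_lag = len(track) // 2
    return any(normalized_autocorr(track, lag) > threshold
               for lag in range(min_lag, max_lag + 1))

def classify(track):
    """Label a feature track as 'synthetic' or 'natural' using the assumed test."""
    return "synthetic" if has_periodic_variation(track) else "natural"

# Hypothetical feature tracks, one value per speech segment.
periodic_track = [math.sin(2 * math.pi * i / 8) for i in range(64)]
impulse_track = [0.0] * 63 + [1.0]

label_periodic = classify(periodic_track)   # "synthetic"
label_impulse = classify(impulse_track)     # "natural"
```

A plain autocorrelation test also responds to slow trends, so a practical detector would probably detrend each track first and combine evidence across several features; this sketch leaves both out.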

6. Invention application

US20150066512A1 Method and Apparatus for Detecting Synthesized Speech [In force]
Publication number: US20150066512A1
Publication date: 2015-03-05
Application number: US14012081
Filing date: 2013-08-28
Applicant: NUANCE COMMUNICATIONS, INC.
Inventors: Zvi Kons, Hagai Aronowitz, Slava Shechtman
IPC classification: G10L17/22
CPC classification: G10L17/22, G10L17/26, G10L25/51
Abstract: Computer systems employing speaker verification as a security approach to prevent un-authorized access by intruders may be tricked by a synthetic speech with voice characteristics similar to those of an authorized user of the computer system. According to at least one example embodiment, a method and corresponding apparatus for detecting a synthetic speech signal include extracting a plurality of speech features from multiple segments of the speech signal; analyzing the plurality of speech features to determine whether the plurality of speech features exhibit periodic variation behavior; and determining whether the speech signal is a synthetic speech signal or a natural speech signal based on whether or not a periodic variation behavior of the plurality of speech features is detected. The embodiments of synthetic speech detection result in security enhancement of the computer system employing speaker verification.
