专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明申请

US20130231927A1 Formant Based Speech Reconstruction from Noisy Signals 审中-公开
公开(公告)号：US20130231927A1
公开(公告)日：2013-09-05
申请号：US13590005
申请日：2012-08-20
申请人： PIERRE ZAKARAUSKAS , ALEXANDER ESCOTT , CLARENCE S.H. CHU , SHAWN E. STEVENSON
发明人： PIERRE ZAKARAUSKAS , ALEXANDER ESCOTT , CLARENCE S.H. CHU , SHAWN E. STEVENSON
IPC分类号： G10L21/00
CPC分类号： G10L19/012 , G10L19/0017 , G10L21/02 , G10L25/15 , G10L25/75 , G10L2019/0007 , H04R25/00
摘要： Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or the like. In particular, in some implementations, systems, methods and devices are operable to generate a machine readable formant based codebook. In some implementations, the method includes determining whether or not a candidate codebook tuple includes a sufficient amount of new information to warrant either adding the candidate codebook tuple to the codebook or using at least a portion of the candidate codebook tuple to update an existing codebook tuple. Additionally and/or alternatively, in some implementations systems, methods and devices are operable to reconstruct a target voice signal by detecting formants in an audible signal, using the detected formants to select codebook tuples, and using the formant information in the selected codebook tuples to reconstruct the target voice signal.

2. 发明授权

US09020818B2 Format based speech reconstruction from noisy signals 有权
标题翻译：基于噪声信号的基于格式的语音重建
公开(公告)号：US09020818B2
公开(公告)日：2015-04-28
申请号：US13589977
申请日：2012-08-20
申请人： Pierre Zakarauskas , Alexander Escott , Clarence S. H. Chu , Shawn E. Stevenson
发明人： Pierre Zakarauskas , Alexander Escott , Clarence S. H. Chu , Shawn E. Stevenson
IPC分类号： G10L15/00 , G10L15/14 , G10L15/26 , G10L21/00 , G10L21/02 , H04R25/00 , G10L25/15 , G10L25/75
CPC分类号： G10L19/012 , G10L19/0017 , G10L21/02 , G10L25/15 , G10L25/75 , G10L2019/0007 , H04R25/00
摘要： Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or the like. In particular, in some implementations, systems, methods and devices are operable to generate a machine readable formant based codebook. In some implementations, the method includes determining whether or not a candidate codebook tuple includes a sufficient amount of new information to warrant either adding the candidate codebook tuple to the codebook or using at least a portion of the candidate codebook tuple to update an existing codebook tuple. Additionally and/or alternatively, in some implementations systems, methods and devices are operable to reconstruct a target voice signal by detecting formants in an audible signal, using the detected formants to select codebook tuples, and using the formant information in the selected codebook tuples to reconstruct the target voice signal.
摘要翻译：本文描述的系统，方法和装置的实现使得能够增强包括在由助听器装置等接收的噪声可听信号中的目标语音信号的可懂度。特别地，在一些实现中，系统，方法和设备可操作以生成基于机器可读共享器的码本。在一些实现中，该方法包括确定候选码本元组是否包括足够量的新信息以保证将候选码本元组添加到码本，或者使用候选码本元组的至少一部分来更新现有码本元组。附加地和/或替代地，在一些实现中，系统，方法和设备可操作以通过检测可听信号中的共振峰来重建目标语音信号，使用检测到的共振峰来选择码本元组，并且使用所选码本元组中的共振峰信息重建目标语音信号。

3. 发明申请

US20130231932A1 Voice Activity Detection and Pitch Estimation 审中-公开
标题翻译：语音活动检测和音调估计
公开(公告)号：US20130231932A1
公开(公告)日：2013-09-05
申请号：US13590022
申请日：2012-08-20
申请人： Pierre Zakarauskas , Alexander Escott , Clarence S.H. Chu , Shawn E. Stevenson
发明人： Pierre Zakarauskas , Alexander Escott , Clarence S.H. Chu , Shawn E. Stevenson
IPC分类号： G10L15/00
CPC分类号： G10L25/78 , G10L25/18 , G10L25/90 , G10L25/93
摘要： Implementations include systems, methods and/or devices operable to detect voice activity in an audible signal by detecting glottal pulses. The dominant frequency of a series of glottal pulses is perceived as the intonation pattern or melody of natural speech, which is also referred to as the pitch. However, as noted above, spoken communication typically occurs in the presence of noise and/or other interference. In turn, the undulation of voiced speech is masked in some portions of the frequency spectrum associated with human speech by the noise and/or other interference. In some implementations, detection of voice activity is facilitated by dividing the frequency spectrum associated with human speech into multiple sub-bands in order to identify glottal pulses that dominate the noise and/or other inference in particular sub-bands. Additionally and/or alternatively, in some implementations the analysis is furthered to provide a pitch estimate of the detected voice activity.
摘要翻译：实现包括可操作以通过检测声门脉冲来检测可听信号中的语音活动的系统，方法和/或设备。一系列声门脉冲的主频被视为自然语音的语调模式或旋律，也称为音调。然而，如上所述，语音通信通常在存在噪声和/或其他干扰的情况下发生。反过来，通过噪声和/或其他干扰，有声语音的波动在与人类语音相关联的频谱的某些部分被屏蔽。在一些实现中，通过将与人类语音相关联的频谱划分成多个子带来便于语音活动的检测，以便识别主导噪声和/或特别是子带的其他推断的声门脉冲。另外和/或替代地，在一些实现中，进一步分析以提供检测到的语音活动的音高估计。

4. 发明授权

US09015044B2 Formant based speech reconstruction from noisy signals 有权
标题翻译：从噪声信号中进行基于共振的语音重建
公开(公告)号：US09015044B2
公开(公告)日：2015-04-21
申请号：US13590005
申请日：2012-08-20
申请人： Pierre Zakarauskas , Alexander Escott , Clarence S. H. Chu , Shawn E. Stevenson
发明人： Pierre Zakarauskas , Alexander Escott , Clarence S. H. Chu , Shawn E. Stevenson
IPC分类号： G10L15/00 , G10L15/14 , G10L15/26 , G10L21/00 , G10L21/02 , H04R25/00 , G10L25/15 , G10L25/75
CPC分类号： G10L19/012 , G10L19/0017 , G10L21/02 , G10L25/15 , G10L25/75 , G10L2019/0007 , H04R25/00
摘要： Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or the like. In particular, in some implementations, systems, methods and devices are operable to generate a machine readable formant based codebook. In some implementations, the method includes determining whether or not a candidate codebook tuple includes a sufficient amount of new information to warrant either adding the candidate codebook tuple to the codebook or using at least a portion of the candidate codebook tuple to update an existing codebook tuple. Additionally and/or alternatively, in some implementations systems, methods and devices are operable to reconstruct a target voice signal by detecting formants in an audible signal, using the detected formants to select codebook tuples, and using the formant information in the selected codebook tuples to reconstruct the target voice signal.
摘要翻译：本文描述的系统，方法和装置的实现使得能够增强包括在由助听器装置等接收的噪声可听信号中的目标语音信号的可懂度。特别地，在一些实现中，系统，方法和设备可操作以生成基于机器可读共享器的码本。在一些实现中，该方法包括确定候选码本元组是否包括足够量的新信息以保证将候选码本元组添加到码本，或者使用候选码本元组的至少一部分来更新现有码本元组。附加地和/或替代地，在一些实现中，系统，方法和设备可操作以通过检测可听信号中的共振峰来重建目标语音信号，使用检测到的共振峰来选择码本元组，并且使用所选码本元组中的共振峰信息重建目标语音信号。

5. 发明申请

US20130231923A1 Voice Signal Enhancement 审中-公开
标题翻译：语音信号增强
公开(公告)号：US20130231923A1
公开(公告)日：2013-09-05
申请号：US13589954
申请日：2012-08-20
申请人： Pierre Zakarauskas , Alexander Escott , Clarence S.H. Chu , Shawn E. Stevenson
发明人： Pierre Zakarauskas , Alexander Escott , Clarence S.H. Chu , Shawn E. Stevenson
IPC分类号： G10L21/02
CPC分类号： G10L21/0324 , G10L21/0208 , G10L21/0308 , G10L21/0364 , G10L2021/02082
摘要： Implementations include systems, methods and/or devices operable to enhance the intelligibility of a target speech signal by targeted voice model based processing of a noisy audible signal. In some implementations, an amplitude-independent voice proximity function voice model is used to attenuate signal components of a noisy audible signal that are unlikely to be associated with the target speech signal and/or accentuate the target speech signal. In some implementations, the target speech signal is identified as a near-field signal, which is detected by identifying a prominent train of glottal pulses in the noisy audible signal. Subsequently, in some implementations systems, methods and/or devices perform a form of computational auditory scene analysis by converting the noisy audible signal into a set of narrowband time-frequency units, and selectively accentuating the time-frequency units associated with the target speech signal and deemphasizing others using information derived from the identification of the glottal pulse train.
摘要翻译：实施方式包括可操作以通过基于目标语音模型处理噪声可听信号来增强目标语音信号的可懂度的系统，方法和/或设备。在一些实现中，使用幅度无关的语音接近功能语音模型来衰减不可能与目标语音信号相关联的噪声可听信号的信号分量和/或加强目标语音信号。在一些实现中，目标语音信号被识别为近场信号，其通过在噪声可听信号中识别突出的声门脉冲列来检测。随后，在一些实现中，系统，方法和/或设备通过将噪声可听信号转换成一组窄带时频单元来执行计算听觉场景分析的形式，并且选择性地加强与目标语音信号相关联的时间 - 频率单位并且使用从声门脉冲序列的识别得到的信息来强调他人。

6. 发明授权

US09437213B2 Voice signal enhancement 有权
标题翻译：语音信号增强
公开(公告)号：US09437213B2
公开(公告)日：2016-09-06
申请号：US13589954
申请日：2012-08-20
申请人： Pierre Zakarauskas , Alexander Escott , Clarence S. H. Chu , Shawn E. Stevenson
发明人： Pierre Zakarauskas , Alexander Escott , Clarence S. H. Chu , Shawn E. Stevenson
IPC分类号： G10L19/14 , G10L21/0324 , G10L21/0208 , G10L21/0308 , G10L21/0364
CPC分类号： G10L21/0324 , G10L21/0208 , G10L21/0308 , G10L21/0364 , G10L2021/02082
摘要： Implementations include systems, methods and/or devices operable to enhance the intelligibility of a target speech signal by targeted voice model based processing of a noisy audible signal. In some implementations, an amplitude-independent voice proximity function voice model is used to attenuate signal components of a noisy audible signal that are unlikely to be associated with the target speech signal and/or accentuate the target speech signal. In some implementations, the target speech signal is identified as a near-field signal, which is detected by identifying a prominent train of glottal pulses in the noisy audible signal. Subsequently, in some implementations systems, methods and/or devices perform a form of computational auditory scene analysis by converting the noisy audible signal into a set of narrowband time-frequency units, and selectively accentuating the time-frequency units associated with the target speech signal and deemphasizing others using information derived from the identification of the glottal pulse train.
摘要翻译：实施方式包括可操作以通过基于目标语音模型处理噪声可听信号来增强目标语音信号的可懂度的系统，方法和/或设备。在一些实现中，使用幅度无关的语音接近功能语音模型来衰减不可能与目标语音信号相关联的噪声可听信号的信号分量和/或加强目标语音信号。在一些实现中，目标语音信号被识别为近场信号，其通过在噪声可听信号中识别突出的声门脉冲列来检测。随后，在一些实现中，系统，方法和/或设备通过将噪声可听信号转换成一组窄带时频单元来执行计算听觉场景分析的形式，并且选择性地加强与目标语音信号相关联的时间 - 频率单位并且使用从声门脉冲序列的识别得到的信息来强调他人。

7. 发明授权

US09384759B2 Voice activity detection and pitch estimation 有权
标题翻译：语音活动检测和音调估计
公开(公告)号：US09384759B2
公开(公告)日：2016-07-05
申请号：US13590022
申请日：2012-08-20
申请人： Pierre Zakarauskas , Alexander Escott , Clarence S. H. Chu , Shawn E. Stevenson
发明人： Pierre Zakarauskas , Alexander Escott , Clarence S. H. Chu , Shawn E. Stevenson
IPC分类号： G10L21/00 , G10L25/00 , G10L25/93 , G10L15/00 , G10L15/20 , G10L25/78 , G10L25/90 , G10L25/18
CPC分类号： G10L25/78 , G10L25/18 , G10L25/90 , G10L25/93
摘要： Implementations include systems, methods and/or devices operable to detect voice activity in an audible signal by detecting glottal pulses. The dominant frequency of a series of glottal pulses is perceived as the intonation pattern or melody of natural speech, which is also referred to as the pitch. However, as noted above, spoken communication typically occurs in the presence of noise and/or other interference. In turn, the undulation of voiced speech is masked in some portions of the frequency spectrum associated with human speech by the noise and/or other interference. In some implementations, detection of voice activity is facilitated by dividing the frequency spectrum associated with human speech into multiple sub-bands in order to identify glottal pulses that dominate the noise and/or other inference in particular sub-bands. Additionally and/or alternatively, in some implementations the analysis is furthered to provide a pitch estimate of the detected voice activity.
摘要翻译：实现包括可操作以通过检测声门脉冲来检测可听信号中的语音活动的系统，方法和/或设备。一系列声门脉冲的主频被视为自然语音的语调模式或旋律，也称为音调。然而，如上所述，语音通信通常在存在噪声和/或其他干扰的情况下发生。反过来，通过噪声和/或其他干扰，有声语音的波动在与人类语音相关联的频谱的某些部分被屏蔽。在一些实现中，通过将与人类语音相关联的频谱划分成多个子带来便于语音活动的检测，以便识别支配噪声和/或特别是子带中的其它推断的声门脉冲。另外和/或替代地，在一些实现中，进一步分析以提供检测到的语音活动的音高估计。

8. 发明申请

US20130231924A1 Format Based Speech Reconstruction from Noisy Signals 审中-公开
标题翻译：嘈杂信号基于格式的语音重构
公开(公告)号：US20130231924A1
公开(公告)日：2013-09-05
申请号：US13589977
申请日：2012-08-20
申请人： Pierre Zakarauskas , Alexander Escott , Clarence S.H. Chu , Shawn E. Stevenson
发明人： Pierre Zakarauskas , Alexander Escott , Clarence S.H. Chu , Shawn E. Stevenson
IPC分类号： G10L11/04
CPC分类号： G10L19/012 , G10L19/0017 , G10L21/02 , G10L25/15 , G10L25/75 , G10L2019/0007 , H04R25/00
摘要： Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or the like. In particular, in some implementations, systems, methods and devices are operable to generate a machine readable formant based codebook. In some implementations, the method includes determining whether or not a candidate codebook tuple includes a sufficient amount of new information to warrant either adding the candidate codebook tuple to the codebook or using at least a portion of the candidate codebook tuple to update an existing codebook tuple. Additionally and/or alternatively, in some implementations systems, methods and devices are operable to reconstruct a target voice signal by detecting formants in an audible signal, using the detected formants to select codebook tuples, and using the formant information in the selected codebook tuples to reconstruct the target voice signal.
摘要翻译：本文描述的系统，方法和装置的实现使得能够增强包括在由助听器装置等接收的噪声可听信号中的目标语音信号的可懂度。特别地，在一些实现中，系统，方法和设备可操作以生成基于机器可读共享器的码本。在一些实现中，该方法包括确定候选码本元组是否包括足够量的新信息以保证将候选码本元组添加到码本，或者使用候选码本元组的至少一部分来更新现有码本元组。附加地和/或替代地，在一些实现中，系统，方法和设备可操作以通过检测可听信号中的共振峰来重建目标语音信号，使用检测到的共振峰来选择码本元组，并且使用所选码本元组中的共振峰信息重建目标语音信号。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式