会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • Format based speech reconstruction from noisy signals
    • 基于噪声信号的基于格式的语音重建
    • US09020818B2
    • 2015-04-28
    • US13589977
    • 2012-08-20
    • Pierre ZakarauskasAlexander EscottClarence S. H. ChuShawn E. Stevenson
    • Pierre ZakarauskasAlexander EscottClarence S. H. ChuShawn E. Stevenson
    • G10L15/00G10L15/14G10L15/26G10L21/00G10L21/02H04R25/00G10L25/15G10L25/75
    • G10L19/012G10L19/0017G10L21/02G10L25/15G10L25/75G10L2019/0007H04R25/00
    • Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or the like. In particular, in some implementations, systems, methods and devices are operable to generate a machine readable formant based codebook. In some implementations, the method includes determining whether or not a candidate codebook tuple includes a sufficient amount of new information to warrant either adding the candidate codebook tuple to the codebook or using at least a portion of the candidate codebook tuple to update an existing codebook tuple. Additionally and/or alternatively, in some implementations systems, methods and devices are operable to reconstruct a target voice signal by detecting formants in an audible signal, using the detected formants to select codebook tuples, and using the formant information in the selected codebook tuples to reconstruct the target voice signal.
    • 本文描述的系统,方法和装置的实现使得能够增强包括在由助听器装置等接收的噪声可听信号中的目标语音信号的可懂度。 特别地,在一些实现中,系统,方法和设备可操作以生成基于机器可读共享器的码本。 在一些实现中,该方法包括确定候选码本元组是否包括足够量的新信息以保证将候选码本元组添加到码本,或者使用候选码本元组的至少一部分来更新现有码本元组 。 附加地和/或替代地,在一些实现中,系统,方法和设备可操作以通过检测可听信号中的共振峰来重建目标语音信号,使用检测到的共振峰来选择码本元组,并且使用所选码本元组中的共振峰信息 重建目标语音信号。
    • 3. 发明申请
    • Voice Activity Detection and Pitch Estimation
    • 语音活动检测和音调估计
    • US20130231932A1
    • 2013-09-05
    • US13590022
    • 2012-08-20
    • Pierre ZakarauskasAlexander EscottClarence S.H. ChuShawn E. Stevenson
    • Pierre ZakarauskasAlexander EscottClarence S.H. ChuShawn E. Stevenson
    • G10L15/00
    • G10L25/78G10L25/18G10L25/90G10L25/93
    • Implementations include systems, methods and/or devices operable to detect voice activity in an audible signal by detecting glottal pulses. The dominant frequency of a series of glottal pulses is perceived as the intonation pattern or melody of natural speech, which is also referred to as the pitch. However, as noted above, spoken communication typically occurs in the presence of noise and/or other interference. In turn, the undulation of voiced speech is masked in some portions of the frequency spectrum associated with human speech by the noise and/or other interference. In some implementations, detection of voice activity is facilitated by dividing the frequency spectrum associated with human speech into multiple sub-bands in order to identify glottal pulses that dominate the noise and/or other inference in particular sub-bands. Additionally and/or alternatively, in some implementations the analysis is furthered to provide a pitch estimate of the detected voice activity.
    • 实现包括可操作以通过检测声门脉冲来检测可听信号中的语音活动的系统,方法和/或设备。 一系列声门脉冲的主频被视为自然语音的语调模式或旋律,也称为音调。 然而,如上所述,语音通信通常在存在噪声和/或其他干扰的情况下发生。 反过来,通过噪声和/或其他干扰,有声语音的波动在与人类语音相关联的频谱的某些部分被屏蔽。 在一些实现中,通过将与人类语音相关联的频谱划分成多个子带来便于语音活动的检测,以便识别主导噪声和/或特别是子带的其他推断的声门脉冲。 另外和/或替代地,在一些实现中,进一步分析以提供检测到的语音活动的音高估计。
    • 4. 发明授权
    • Formant based speech reconstruction from noisy signals
    • 从噪声信号中进行基于共振的语音重建
    • US09015044B2
    • 2015-04-21
    • US13590005
    • 2012-08-20
    • Pierre ZakarauskasAlexander EscottClarence S. H. ChuShawn E. Stevenson
    • Pierre ZakarauskasAlexander EscottClarence S. H. ChuShawn E. Stevenson
    • G10L15/00G10L15/14G10L15/26G10L21/00G10L21/02H04R25/00G10L25/15G10L25/75
    • G10L19/012G10L19/0017G10L21/02G10L25/15G10L25/75G10L2019/0007H04R25/00
    • Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or the like. In particular, in some implementations, systems, methods and devices are operable to generate a machine readable formant based codebook. In some implementations, the method includes determining whether or not a candidate codebook tuple includes a sufficient amount of new information to warrant either adding the candidate codebook tuple to the codebook or using at least a portion of the candidate codebook tuple to update an existing codebook tuple. Additionally and/or alternatively, in some implementations systems, methods and devices are operable to reconstruct a target voice signal by detecting formants in an audible signal, using the detected formants to select codebook tuples, and using the formant information in the selected codebook tuples to reconstruct the target voice signal.
    • 本文描述的系统,方法和装置的实现使得能够增强包括在由助听器装置等接收的噪声可听信号中的目标语音信号的可懂度。 特别地,在一些实现中,系统,方法和设备可操作以生成基于机器可读共享器的码本。 在一些实现中,该方法包括确定候选码本元组是否包括足够量的新信息以保证将候选码本元组添加到码本,或者使用候选码本元组的至少一部分来更新现有码本元组 。 附加地和/或替代地,在一些实现中,系统,方法和设备可操作以通过检测可听信号中的共振峰来重建目标语音信号,使用检测到的共振峰来选择码本元组,并且使用所选码本元组中的共振峰信息 重建目标语音信号。
    • 5. 发明申请
    • Voice Signal Enhancement
    • 语音信号增强
    • US20130231923A1
    • 2013-09-05
    • US13589954
    • 2012-08-20
    • Pierre ZakarauskasAlexander EscottClarence S.H. ChuShawn E. Stevenson
    • Pierre ZakarauskasAlexander EscottClarence S.H. ChuShawn E. Stevenson
    • G10L21/02
    • G10L21/0324G10L21/0208G10L21/0308G10L21/0364G10L2021/02082
    • Implementations include systems, methods and/or devices operable to enhance the intelligibility of a target speech signal by targeted voice model based processing of a noisy audible signal. In some implementations, an amplitude-independent voice proximity function voice model is used to attenuate signal components of a noisy audible signal that are unlikely to be associated with the target speech signal and/or accentuate the target speech signal. In some implementations, the target speech signal is identified as a near-field signal, which is detected by identifying a prominent train of glottal pulses in the noisy audible signal. Subsequently, in some implementations systems, methods and/or devices perform a form of computational auditory scene analysis by converting the noisy audible signal into a set of narrowband time-frequency units, and selectively accentuating the time-frequency units associated with the target speech signal and deemphasizing others using information derived from the identification of the glottal pulse train.
    • 实施方式包括可操作以通过基于目标语音模型处理噪声可听信号来增强目标语音信号的可懂度的系统,方法和/或设备。 在一些实现中,使用幅度无关的语音接近功能语音模型来衰减不可能与目标语音信号相关联的噪声可听信号的信号分量和/或加强目标语音信号。 在一些实现中,目标语音信号被识别为近场信号,其通过在噪声可听信号中识别突出的声门脉冲列来检测。 随后,在一些实现中,系统,方法和/或设备通过将噪声可听信号转换成一组窄带时频单元来执行计算听觉场景分析的形式,并且选择性地加强与目标语音信号相关联的时间 - 频率单位 并且使用从声门脉冲序列的识别得到的信息来强调他人。
    • 6. 发明授权
    • Voice signal enhancement
    • 语音信号增强
    • US09437213B2
    • 2016-09-06
    • US13589954
    • 2012-08-20
    • Pierre ZakarauskasAlexander EscottClarence S. H. ChuShawn E. Stevenson
    • Pierre ZakarauskasAlexander EscottClarence S. H. ChuShawn E. Stevenson
    • G10L19/14G10L21/0324G10L21/0208G10L21/0308G10L21/0364
    • G10L21/0324G10L21/0208G10L21/0308G10L21/0364G10L2021/02082
    • Implementations include systems, methods and/or devices operable to enhance the intelligibility of a target speech signal by targeted voice model based processing of a noisy audible signal. In some implementations, an amplitude-independent voice proximity function voice model is used to attenuate signal components of a noisy audible signal that are unlikely to be associated with the target speech signal and/or accentuate the target speech signal. In some implementations, the target speech signal is identified as a near-field signal, which is detected by identifying a prominent train of glottal pulses in the noisy audible signal. Subsequently, in some implementations systems, methods and/or devices perform a form of computational auditory scene analysis by converting the noisy audible signal into a set of narrowband time-frequency units, and selectively accentuating the time-frequency units associated with the target speech signal and deemphasizing others using information derived from the identification of the glottal pulse train.
    • 实施方式包括可操作以通过基于目标语音模型处理噪声可听信号来增强目标语音信号的可懂度的系统,方法和/或设备。 在一些实现中,使用幅度无关的语音接近功能语音模型来衰减不可能与目标语音信号相关联的噪声可听信号的信号分量和/或加强目标语音信号。 在一些实现中,目标语音信号被识别为近场信号,其通过在噪声可听信号中识别突出的声门脉冲列来检测。 随后,在一些实现中,系统,方法和/或设备通过将噪声可听信号转换成一组窄带时频单元来执行计算听觉场景分析的形式,并且选择性地加强与目标语音信号相关联的时间 - 频率单位 并且使用从声门脉冲序列的识别得到的信息来强调他人。
    • 7. 发明授权
    • Voice activity detection and pitch estimation
    • 语音活动检测和音调估计
    • US09384759B2
    • 2016-07-05
    • US13590022
    • 2012-08-20
    • Pierre ZakarauskasAlexander EscottClarence S. H. ChuShawn E. Stevenson
    • Pierre ZakarauskasAlexander EscottClarence S. H. ChuShawn E. Stevenson
    • G10L21/00G10L25/00G10L25/93G10L15/00G10L15/20G10L25/78G10L25/90G10L25/18
    • G10L25/78G10L25/18G10L25/90G10L25/93
    • Implementations include systems, methods and/or devices operable to detect voice activity in an audible signal by detecting glottal pulses. The dominant frequency of a series of glottal pulses is perceived as the intonation pattern or melody of natural speech, which is also referred to as the pitch. However, as noted above, spoken communication typically occurs in the presence of noise and/or other interference. In turn, the undulation of voiced speech is masked in some portions of the frequency spectrum associated with human speech by the noise and/or other interference. In some implementations, detection of voice activity is facilitated by dividing the frequency spectrum associated with human speech into multiple sub-bands in order to identify glottal pulses that dominate the noise and/or other inference in particular sub-bands. Additionally and/or alternatively, in some implementations the analysis is furthered to provide a pitch estimate of the detected voice activity.
    • 实现包括可操作以通过检测声门脉冲来检测可听信号中的语音活动的系统,方法和/或设备。 一系列声门脉冲的主频被视为自然语音的语调模式或旋律,也称为音调。 然而,如上所述,语音通信通常在存在噪声和/或其他干扰的情况下发生。 反过来,通过噪声和/或其他干扰,有声语音的波动在与人类语音相关联的频谱的某些部分被屏蔽。 在一些实现中,通过将与人类语音相关联的频谱划分成多个子带来便于语音活动的检测,以便识别支配噪声和/或特别是子带中的其它推断的声门脉冲。 另外和/或替代地,在一些实现中,进一步分析以提供检测到的语音活动的音高估计。
    • 8. 发明申请
    • Format Based Speech Reconstruction from Noisy Signals
    • 嘈杂信号基于格式的语音重构
    • US20130231924A1
    • 2013-09-05
    • US13589977
    • 2012-08-20
    • Pierre ZakarauskasAlexander EscottClarence S.H. ChuShawn E. Stevenson
    • Pierre ZakarauskasAlexander EscottClarence S.H. ChuShawn E. Stevenson
    • G10L11/04
    • G10L19/012G10L19/0017G10L21/02G10L25/15G10L25/75G10L2019/0007H04R25/00
    • Implementations of systems, method and devices described herein enable enhancing the intelligibility of a target voice signal included in a noisy audible signal received by a hearing aid device or the like. In particular, in some implementations, systems, methods and devices are operable to generate a machine readable formant based codebook. In some implementations, the method includes determining whether or not a candidate codebook tuple includes a sufficient amount of new information to warrant either adding the candidate codebook tuple to the codebook or using at least a portion of the candidate codebook tuple to update an existing codebook tuple. Additionally and/or alternatively, in some implementations systems, methods and devices are operable to reconstruct a target voice signal by detecting formants in an audible signal, using the detected formants to select codebook tuples, and using the formant information in the selected codebook tuples to reconstruct the target voice signal.
    • 本文描述的系统,方法和装置的实现使得能够增强包括在由助听器装置等接收的噪声可听信号中的目标语音信号的可懂度。 特别地,在一些实现中,系统,方法和设备可操作以生成基于机器可读共享器的码本。 在一些实现中,该方法包括确定候选码本元组是否包括足够量的新信息以保证将候选码本元组添加到码本,或者使用候选码本元组的至少一部分来更新现有码本元组 。 附加地和/或替代地,在一些实现中,系统,方法和设备可操作以通过检测可听信号中的共振峰来重建目标语音信号,使用检测到的共振峰来选择码本元组,并且使用所选码本元组中的共振峰信息 重建目标语音信号。