会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 5. 发明申请
    • Method and apparatus for continuous valued vocal tract resonance tracking using piecewise linear approximations
    • 使用分段线性近似的连续值声道共振跟踪的方法和装置
    • US20050114134A1
    • 2005-05-26
    • US10723995
    • 2003-11-26
    • Li DengHagai AttiasAlejandro AceroLeo Lee
    • Li DengHagai AttiasAlejandro AceroLeo Lee
    • G10L15/10G10L11/00G10L15/02G10L15/14G10L15/28G10L19/06
    • G10L25/48G10L25/15
    • A method and apparatus tracks vocal tract resonance components, including both frequencies and bandwidths, in a speech signal. The components are tracked by defining a state equation that is linear with respect to a past vocal tract resonance vector and that predicts a current vocal tract resonance vector. An observation equation is also defined that is linear with respect to a current vocal tract resonance vector and that predicts at least one component of an observation vector. The state equation, the observation equation, and a sequence of observation vectors are used to identify a sequence of vocal tract resonance vectors using Kalman filter algorithm. Under one embodiment, the observation equation is defined based on a piecewise linear approximation to a non-linear function. The parameters of the linear approximation are selected based on pre-defined regions, which are determined from a crude estimate of a vocal tract resonance vector.
    • 一种方法和装置在语音信号中跟踪声道共振分量,包括频率和频带两者。 通过定义相对于过去声道共振矢量线性的状态方程并且预测当前声道共振矢量来跟踪组件。 还定义了相对于当前声道共振矢量是线性的并且预测观察矢量的至少一个分量的观察方程。 状态方程,观察方程和观察矢量序列用于使用卡尔曼滤波算法识别声道共振矢量序列。 在一个实施例中,基于对非线性函数的分段线性近似来定义观察方程。 基于由声道共振矢量的粗略估计确定的预定义区域来选择线性近似的参数。
    • 6. 发明授权
    • Method of speech recognition using multimodal variational inference with switching state space models
    • 使用多模变分推理与开关状态空间模型的语音识别方法
    • US07480615B2
    • 2009-01-20
    • US10760937
    • 2004-01-20
    • Hagai AttiasLi DengLeo Lee
    • Hagai AttiasLi DengLeo Lee
    • G06F17/20G10L15/14G10L15/00G10L15/28G05B15/00
    • G10L15/14G10L2015/0638
    • A method of efficiently setting posterior probability parameters for a switching state space model begins by defining a window containing at least two but fewer than all of the frames. A separate posterior probability parameter is determined for each frame in the window. The window is then shifted sequentially from left to right in time so that it includes one or more subsequent frames in the sequence of frames. A separate posterior probability parameter is then determined for each frame in the shifted window. This method closely approximates a more rigorous solution but saves computational cost by two to three orders of magnitude. Further, a method of determining the optimal discrete state sequence in the switching state space model is invented that directly exploits the observation vector on a frame-by-frame basis and operates from left to right in time.
    • 开关状态空间模型的有效设置后验概率参数的方法是通过定义包含至少两个但少于所有帧的窗口来开始的。 为窗口中的每个帧确定单独的后验概率参数。 然后,窗口在时间上从左到右依次移位,使得它包括帧序列中的一个或多个后续帧。 然后,在移位的窗口中为每个帧确定单独的后验概率参数。 这种方法非常接近于更严格的解决方案,但可将计算成本节省2到3个数量级。 此外,发明了一种确定开关状态空间模型中的最佳离散状态序列的方法,其直接利用逐帧的观测向量并且在时间上从左到右进行操作。
    • 7. 发明授权
    • Microphone array signal enhancement using mixture models
    • 使用混合模型的麦克风阵列信号增强
    • US07103541B2
    • 2006-09-05
    • US10183267
    • 2002-06-27
    • Hagai AttiasLi Deng
    • Hagai AttiasLi Deng
    • G10L21/02
    • G10L21/02G10L2021/02161
    • A system and method facilitating signal enhancement utilizing mixture models is provided. The invention includes a signal enhancement adaptive system having a speech model, a noise model and a plurality of adaptive filter parameters. The signal enhancement adaptive system employs probabilistic modeling to perform signal enhancement of a plurality of windowed frequency transformed input signals received, for example, for an array of microphones. The signal enhancement adaptive system incorporates information about the statistical structure of speech signals. The signal enhancement adaptive system can be embedded in an overall enhancement system which also includes components of signal windowing and frequency transformation.
    • 提供了利用混合模型促进信号增强的系统和方法。 本发明包括具有语音模型,噪声模型和多个自适应滤波器参数的信号增强自适应系统。 信号增强自适应系统使用概率建模来执行例如为麦克风阵列接收的多个窗口频率变换的输入信号的信号增强。 信号增强自适应系统包括关于语音信号的统计结构的信息。 信号增强自适应系统可以嵌入在整体增强系统中,该系统还包括信号窗口和频率变换的组件。
    • 8. 发明申请
    • Method of speech recognition using multimodal variational inference with switching state space models
    • 使用多模变分推理与开关状态空间模型的语音识别方法
    • US20050159951A1
    • 2005-07-21
    • US10760937
    • 2004-01-20
    • Hagai AttiasLi DengLeo Lee
    • Hagai AttiasLi DengLeo Lee
    • G10L15/06G06F7/00G06F17/10G10L15/10G10L15/12G10L15/14G10L15/00
    • G10L15/14G10L2015/0638
    • A method of efficiently setting posterior probability parameters for a switching state space model begins by defining a window containing at least two but fewer than all of the frames. A separate posterior probability parameter is determined for each frame in the window. The window is then shifted sequentially from left to right in time so that it includes one or more subsequent frames in the sequence of frames. A separate posterior probability parameter is then determined for each frame in the shifted window. This method closely approximates a more rigorous solution but saves computational cost by two to three orders of magnitude. Further, a method of determining the optimal discrete state sequence in the switching state space model is invented that directly exploits the observation vector on a frame-by-frame basis and operates from left to right in time.
    • 开关状态空间模型的有效设置后验概率参数的方法是通过定义包含至少两个但少于所有帧的窗口来开始的。 为窗口中的每个帧确定单独的后验概率参数。 然后,窗口在时间上从左到右依次移位,使得它包括帧序列中的一个或多个后续帧。 然后,在移位的窗口中为每个帧确定单独的后验概率参数。 这种方法非常接近于更严格的解决方案,但可将计算成本节省2到3个数量级。 此外,发明了一种确定开关状态空间模型中的最佳离散状态序列的方法,其直接利用逐帧的观测向量并且在时间上从左到右进行操作。