会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 8. 发明申请
    • SPEECH RECEIVING DEVICE AND VISEME EXTRACTION METHOD AND APPARATUS
    • 语音接收设备和可视化提取方法和设备
    • WO2005093714A1
    • 2005-10-06
    • PCT/US2005/005476
    • 2005-02-22
    • MOTOROLA, INC.BUHRKE, Eric, R.
    • BUHRKE, Eric, R.
    • G10L15/08
    • G10L21/10G10L2021/105
    • A technique for extracting visemes includes receiving successive frames of digitized analog speech information obtained from the speech signal at a fixed rate (210), filtering each of the successive frames of digitized analog speech information to synchronously generate time domain frame classification vectors at the fixed rate (215, 220, 225, 230, 235, 240), and analyzing each of the time domain classification vectors (250) to synchronously generate a set of visemes corresponding to each of the successive frames of digitized speech information at the fixed rate. Each of the time domain frame classification vectors is derived from one of the successive frames of digitized analog speech information. N multi-taper discrete prolate spheroid sequence basis (MTDPSSB) functions (220) that are factors of a Fredholm integral of the first kind may be used for the filtering, and the analyzing may use a spatial classification function (250). The latency is less than 100 milliseconds.
    • 一种用于提取视力的技术包括以固定速率(210)接收从语音信号获得的数字化模拟语音信息的连续帧,对数字化模拟语音信息的每个连续帧进行滤波,以同步地以固定速率生成时域帧分类向量 (215,220,225,230,235,240),并且分析每个时域分类向量(250),以固定速率同步地生成与数字化语音信息的每个连续帧相对应的一组视差。 每个时域帧分类向量从数字化模拟语音信息的连续帧之一导出。 作为第一类Fredholm积分因子的N多锥度离散长椭球体序列(MTDPSSB)函数(220)可以用于滤波,分析可以使用空间分类函数(250)。 延迟小于100毫秒。