会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明申请
    • SPEECH DATA RETRIEVAL APPARATUS, SPEECH DATA RETRIEVAL METHOD, SPEECH DATA RETRIEVAL PROGRAM AND COMPUTER USABLE MEDIUM HAVING COMPUTER READABLE SPEECH DATA RETRIEVAL PROGRAM EMBODIED THEREIN
    • 语音数据检索装置,语音数据检索方法,语音数据检索程序和计算机可读介质具有计算机可读的语音数据检索程序
    • WO2008130018A1
    • 2008-10-30
    • PCT/JP2008/057554
    • 2008-04-11
    • MASSACHUSETTS INSTITUTE OF TECHNOLOGYNIPPON TELEGRAPH AND TELEPHONE CORPORATIONHORI, TakaakiHETHERINGTON, I.LeeHAZEN, Timothy, J.GLASS, James, R.
    • HORI, TakaakiHETHERINGTON, I.LeeHAZEN, Timothy, J.GLASS, James, R.
    • G06F17/30
    • G06F17/3074G06F17/30746
    • A speech data retrieval apparatus (10) includes a speech database (1), a speech recognition unit (2), a confusion network creation unit (3), an inverted index table creation unit (4), a query input unit (6), a query conversion unit (7) and a label string check unit (8). The speech recognition unit (2) reads speech data from the speech database (1), carries out a speech recognition process with respect to the read speech data, and outputs a result of speech recognition process as a lattice in which a phoneme, a syllable, or a word is a base unit. The confusion network creation unit (3) creates a confusion network based on the output lattice and outputs the result of speech recognition process as the confusion network. The inverted index table creation unit (4) creates an inverted index table based on the output confusion network. The query input unit (6) receives a query input by a user, carries out a speech recognition process with respect to the received query, and outputs a result of speech recognition process as a character string. The query conversion unit (7) converts the output character string into a label string in which a phoneme, a syllable, or a word is a base unit. The label string check unit (8) checks the label string against the inverted index table and retrieves speech data which is included in both of the label string and the speech database (1).
    • 语音数据检索装置(10)包括语音数据库(1),语音识别单元(2),混淆网络创建单元(3),反向索引表创建单元(4),查询输入单元(6) ,查询转换单元(7)和标签字符串检查单元(8)。 语音识别单元(2)从语音数据库(1)读取语音数据,对读取的语音数据执行语音识别处理,并将语音识别处理的结果输出为格子,其中音素,音节 ,或一个单词是基本单位。 混淆网络创建单元(3)基于输出格点创建混淆网络,并输出语音识别处理结果作为混淆网络。 反转索引表创建单元(4)基于输出混淆网络创建反向索引表。 查询输入单元(6)接收用户输入的查询,对接收到的查询执行语音识别处理,并输出语音识别处理结果作为字符串。 查询转换单元(7)将输出字符串转换为其中音素,音节或单词是基本单元的标签串。 标签字符串检查单元(8)根据反向索引表检查标签串,并检索包括在标签串和语音数据库(1)中的语音数据。