发明专利
JP2013171243A Speech recognition accuracy estimating device, speech recognition precision estimating method and program
有权
基本信息:
- 专利标题: Speech recognition accuracy estimating device, speech recognition precision estimating method and program
- 专利标题(中):语音识别精度估计装置,语音识别精度估计方法和程序
- 申请号:JP2012036447 申请日:2012-02-22
- 公开(公告)号:JP2013171243A 公开(公告)日:2013-09-02
- 发明人: OGAWA ATSUNORI , HORI TAKAAKI , NAKAMURA ATSUSHI
-
申请人:
Nippon Telegr & Teleph Corp
, 日本電信電話株式会社 -
专利权人:
Nippon Telegr & Teleph Corp
,日本電信電話株式会社 -
当前专利权人:
Nippon Telegr & Teleph Corp
,日本電信電話株式会社 - 优先权: JP2012036447 2012-02-22
- 主分类号: G10L15/01
- IPC分类号: G10L15/01 ; G10L15/18
摘要:
PROBLEM TO BE SOLVED: To provide a speech recognition accuracy estimating device capable of calculating recognition accuracy as a detailed numerical value after estimating the number of correct answers/substitution errors/insertion errors/deletion errors.SOLUTION: The speech recognition accuracy estimating device comprises: a speech recognition part for creating a word confusion network representing probability that any recognition result word exists for each segment from an inputted speech and probability (existence probability of ε) that any recognition result word does not exist; a word alignment network acquisition part for acquiring a word alignment network representing existence probability of a word with maximum existence probability as correct answer probability, total existence probability of words without maximum existence probability except ε as substitution error probability and the existence probability of ε as insertion error probability when ε has not the maximum existence probability in any segment, and representing total existence probability of the words without maximum existence probability as deletion error probability when ε has the maximum existence probability in any segment; and a random recognition accuracy calculating part for calculating speech recognition accuracy.
摘要(中):
要解决的问题:提供一种在估计正确答案/替换错误/插入错误/删除错误的数量之后能够计算识别精度作为详细数值的语音识别精度估计装置。解决方案:语音识别精度估计装置包括: 语音识别部分,用于创建表示任何识别结果字从输入的语音存在于每个段的概率和概率(存在概率)的单词混淆网络,所述识别结果字不存在任何识别结果字; 用于获取表示具有最大存在概率的单词的存在概率的单词对齐网络作为正确答案概率的单词对齐网络获取部分,除“&egr”之外的没有最大存在概率的单词的总存在概率; 作为替代误差概率和&egr的存在概率; 作为插入误差概率 在任何段中没有最大存在概率,并且表示没有最大存在概率的单词的总存在概率作为&egr的删除错误概率; 在任何段中都具有最大存在概率; 以及用于计算语音识别精度的随机识别精度计算部分。
公开/授权文献:
- JP5679345B2 音声認識精度推定装置、音声認識精度推定方法、プログラム 公开/授权日:2015-03-04
信息查询:
EspacenetIPC结构图谱:
G | 物理 |
--G10 | 乐器;声学 |
----G10L | 语言分析或合成;语言识别 |
------G10L15/00 | 语音识别 |
--------G10L15/01 | .语音识别系统的评估或评价 |