会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • AUTOMATIC TEXT CORRECTION
    • 自动文本校正
    • WO2006035402A1
    • 2006-04-06
    • PCT/IB2005/053193
    • 2005-09-28
    • KONINKLIJKE PHILIPS ELECTRONICS N.V.PHILIPS INTELLECTUAL PROPERTY & STANDARDS GMBHPETERS, JochenMATUSOV, Evgeny
    • PETERS, JochenMATUSOV, Evgeny
    • G06F17/22G10L15/26
    • G06F17/273G06F17/2282G10L15/26
    • The present invention provides a method of generating text transformation rules for speech to text transcription systems. The text transformation rules are generated by means of comparing an erroneous text generated by a speech to text transcription system with a correct reference text. Comparison of erroneous and reference text allows to derive a set of text transformation rules that are evaluated by means of a strict application to the training text and successive comparison with the reference text. Evaluation of text transformation rules provides a sufficient approach to determine which of the automatically generated text transformation rules provide an enhancement or degradation of the erroneous text. In this way only those text transformation rules of the set of text transformation rules are selected that guarantee an enhancement of the erroneous text. In this way systematic errors of an automatic speech recognition or natural language process system can be effectively compensated.
    • 本发明提供了一种生成用于语音到文本转录系统的文本转换规则的方法。 通过将语音产生的错误文本与文本转录系统与正确的参考文本进行比较来产生文本转换规则。 错误和参考文本的比较允许导出一组文本转换规则,通过对训练文本的严格应用和与参考文本的连续比较来评估。 文本转换规则的评估提供了一种足够的方法来确定哪些自动生成的文本转换规则提供错误文本的增强或降级。 以这种方式,仅选择文本转换规则集合中的那些文本转换规则,以保证错误文本的增强。 以这种方式,可以有效地补偿自动语音识别或自然语言处理系统的系统误差。
    • 3. 发明申请
    • TOPIC SPECIFIC MODELS FOR TEXT FORMATTING AND SPEECH RECOGNITION
    • 用于文本格式和语音识别的主题特定模型
    • WO2005050621A2
    • 2005-06-02
    • PCT/IB2004/052403
    • 2004-11-12
    • PHILIPS INTELLECTUAL PROPERTY & STANDARDS GMBHKONINKLIJKE PHILIPS ELECTRONICS N. V.PETERS, JochenMATUSOV, EvgenyMEYER, CarstenKLAKOW, Dietrich
    • PETERS, JochenMATUSOV, EvgenyMEYER, CarstenKLAKOW, Dietrich
    • G10L15/22
    • G10L15/183G06F17/211G06F17/2715G10L15/32
    • The present invention relates to a method, a computer system and a computer program product for speech recognition and/or text formatting by making use of topic specific statistical models. A text document which may be obtained from a first speech recognition pass is subject to segmentation and to an assignment of topic specific models for each obtained section. Each model of the set of models provides statistic information about language model probabilities, about text processing or formatting rules, as e.g. the interpretation of commands for punctuation, formatting, text highlighting or of ambiguous text portions requiring specific formatting, as well as a specific vocabulary being characteristic for each section of the recognized text. Furthermore, other properties of a speech recognition and/or formatting system (such as e.g. settings for the speaking rate) may be encoded in the statistical models. The models themselves are generated on the basis of annotated training data and/or by manual coding. Based on the assignment of models to sections of text an improved speech recognition and/or text formatting procedure is performed.
    • 本发明涉及一种通过利用专题统计模型进行语音识别和/或文本格式化的方法,计算机系统和计算机程序产品。 可以从第一语音识别通过获得的文本文档被分割并分配给每个获得的部分的主题特定模型的分配。 模型集合中的每个模型提供关于语言模型概率,关于文本处理或格式化规则的统计信息,例如。 用于标点符号,格式化,文本突出显示的命令的解释或需要特定格式化的不明确的文本部分以及对于识别的文本的每个部分特有的特定词汇表的解释。 此外,可以在统计模型中编码语音识别和/或格式化系统的其他属性(例如用于说话率的设置)。 模型本身是根据注释的训练数据和/或手动编码生成的。 基于将模型分配给文本部分,执行改进的语音识别和/或文本格式化过程。