会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • Content-based audio playback emphasis
    • 基于内容的音频播放强调
    • US07844464B2
    • 2010-11-30
    • US11187119
    • 2005-07-22
    • Kjell SchubertJuergen FritschMichael FinkeDetlef Koll
    • Kjell SchubertJuergen FritschMichael FinkeDetlef Koll
    • G10L21/00
    • G10L15/26G06F17/273G06F17/2785G10L15/1807G10L15/22G10L15/265G10L21/04
    • Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelihood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.
    • 公开了用于促进校对口头音频流的草稿的过程的技术。 一般来说,通过播放对应的口语音频流,强调音频流中与那些高度相关或可能被错误地转录的那些区域,来校对草稿。 例如,区域可能会被强调为比相关程度低且可能被正确转录的地区的播放速度更慢。 强调音频流中最重要的那些区域是正确转录的,那些最有可能被错误转录的区域增加了校对者准确地纠正这些区域中的任何错误的可能性,从而提高了抄本的整体准确性。
    • 3. 发明授权
    • Content-based audio playback emphasis
    • 基于内容的音频播放强调
    • US08768706B2
    • 2014-07-01
    • US12859883
    • 2010-08-20
    • Kjell SchubertJuergen FritschMichael FinkeDetlef Koll
    • Kjell SchubertJuergen FritschMichael FinkeDetlef Koll
    • G10L21/00
    • G10L15/26G06F17/273G06F17/2785G10L15/1807G10L15/22G10L15/265G10L21/04
    • Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelihood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.
    • 公开了用于促进校对口头音频流的草稿的过程的技术。 一般来说,通过播放对应的口语音频流,强调音频流中与那些高度相关或可能被错误地转录的那些区域,来校对草稿。 例如,区域可能会被强调为比相关程度低且可能被正确转录的地区的播放速度更慢。 强调音频流中最重要的那些区域是正确转录的,那些最有可能被错误转录的区域增加了校对者准确地纠正这些区域中的任何错误的可能性,从而提高了抄本的整体准确性。
    • 7. 发明授权
    • Document transcription system training
    • 文件转录系统培训
    • US08335688B2
    • 2012-12-18
    • US10922513
    • 2004-08-20
    • Girija YegnanarayananMichael FinkeJuergen FritschDetlef KollMonika Woszczyna
    • Girija YegnanarayananMichael FinkeJuergen FritschDetlef KollMonika Woszczyna
    • G10L15/26G10L15/18
    • G10L15/063G10L15/193G10L15/26
    • A system is provided for training an acoustic model for use in speech recognition. In particular, such a system may be used to perform training based on a spoken audio stream and a non-literal transcript of the spoken audio stream. Such a system may identify text in the non-literal transcript which represents concepts having multiple spoken forms. The system may attempt to identify the actual spoken form in the audio stream which produced the corresponding text in the non-literal transcript, and thereby produce a revised transcript which more accurately represents the spoken audio stream. The revised, and more accurate, transcript may be used to train the acoustic model, thereby producing a better acoustic model than that which would be produced using conventional techniques, which perform training based directly on the original non-literal transcript.
    • 提供用于训练用于语音识别的声学模型的系统。 特别地,这样的系统可以用于基于口语音频流和口头音频流的非文字转录来执行训练。 这样的系统可以识别表示具有多个口头形式的概念的非文字记录中的文本。 该系统可以尝试在音频流中识别在非文字转录中产生相应文本的音频流中的实际语音形式,从而产生更准确地表示语音音频流的经修改的脚本。 修改和更准确的誊本可用于训练声学模型,从而产生比使用直接基于原始非文字誊本进行训练的常规技术产生的更好的声学模型。
    • 10. 发明授权
    • Automated extraction of semantic content and generation of a structured document from speech
    • 自动提取语义内容,并从语音生成结构化文档
    • US07584103B2
    • 2009-09-01
    • US10923517
    • 2004-08-20
    • Juergen FritschMichael FinkeDetlef KollMonika WoszczynaGirija Yegnanarayanan
    • Juergen FritschMichael FinkeDetlef KollMonika WoszczynaGirija Yegnanarayanan
    • G10L15/18
    • G10L15/1815G16H15/00
    • Techniques are disclosed for automatically generating structured documents based on speech, including identification of relevant concepts and their interpretation. In one embodiment, a structured document generator uses an integrated process to generate a structured textual document (such as a structured textual medical report) based on a spoken audio stream. The spoken audio stream may be recognized using a language model which includes a plurality of sub-models arranged in a hierarchical structure. Each of the sub-models may correspond to a concept that is expected to appear in the spoken audio stream. Different portions of the spoken audio stream may be recognized using different sub-models. The resulting structured textual document may have a hierarchical structure that corresponds to the hierarchical structure of the language sub-models that were used to generate the structured textual document.
    • 公开了基于语音自动生成结构化文档的技术,包括识别相关概念及其解释。 在一个实施例中,结构化文档生成器使用集成过程来基于口头音频流来生成结构化文本文档(诸如结构化文本医疗报告)。 可以使用包括以分层结构布置的多个子模型的语言模型来识别口语音频流。 每个子模型可以对应于期望出现在口头音频流中的概念。 可以使用不同的子模型来识别口语音频流的不同部分。 所得到的结构化文本文档可以具有对应于用于生成结构化文本文档的语言子模型的分层结构的层次结构。