会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 3. 发明公开
    • TONE FEATURES FOR SPEECH RECOGNITION
    • Tonale的特征的语音识别
    • EP1145225A1
    • 2001-10-17
    • EP00987248.2
    • 2000-11-10
    • Koninklijke Philips Electronics N.V.
    • HUANG, Chang-HanSEIDE, Frank
    • G10L15/18
    • G10L15/1807G10L25/15G10L2025/935
    • Robust acoustic tone features are achieved first by the introduction of on-line, look-ahead trace back of the fundamental frequency (F0) contour with adaptive pruning, this fundamental frequency serves as the signal preprocessing front-end. The F0 contour is subsequently decomposed into lexical tone effect, phrase intonation effect, and random effect by means of time-variant, weighted moving average (MA) filter in conjunction with weighted (placing more emphasis on vowels) least squares of the F0 contour. The intonation effect is removed by subtraction of the F0 contour under superposition assumption. The acoustic tone features are defined as two parts. First, is the coefficients of the second order weighted regression of the de-intonation of the F0 contour over neighbouring frames. The second part deals with the degree of the periodicity of the signal, which are the coefficients of the second order regression of the auto-correlation. These weights of the second order weighted regression of the de-intonation of the F0 contour are designed to emphasize/de-emphasize the voiced/unvoiced segments of the pitch contour in order to preserve the voiced pitch contour for the semi-voiced consonants.
    • 5. 发明公开
    • Spoken dialog system using prominence
    • Sprachdialogsystem mit Anwendung der Prominenz
    • EP2645364A1
    • 2013-10-02
    • EP12162032.2
    • 2012-03-29
    • Honda Research Institute Europe GmbH
    • Heckmann, Martin
    • G10L15/22G10L15/18G10L13/08
    • G10L15/22G10L13/04G10L13/08G10L15/1807G10L25/48
    • The invention presents a method for analyzing speech in a spoken dialog system, comprising the steps of: accepting an utterance by at least one means for accepting acoustical signals, in particular a microphone, analyzing the utterance and obtaining prosodic cues from the utterance using at least one processing engine, wherein the utterance is evaluated based on the prosodic cues to determine a prominence of parts of the utterance, and wherein the utterance is analyzed to detect either at least one marker feature, e.g. a negative statement, a segment with a very high prominence or both, indicative of the utterance containing at least one part to replace at least one part in a previous utterance, the part to be replaced in the previous utterance being determined based on the prominence determined for the parts of the previous utterance and the replacement parts being determined based on the prominence of the parts in the utterance, and wherein the previous utterance is evaluated with the replacement part(s).
    • 本发明提出了一种用于分析口语对话系统中的语音的方法,包括以下步骤:通过至少一种用于接收声信号的装置,特别是麦克风接受话语,分析话语并使用至少一个语音来从话语中获得韵律线索 一个处理引擎,其中基于韵律提示来评估话语以确定话语的部分的突出性,并且其中分析话语以检测至少一个标记特征,例如 一个消极的陈述,一个具有非常高突出性的部分或两者,表示包含至少一个部分的话语以替代先前的发音中的至少一个部分,在先前的发音中要替换的部分是基于确定的突出性来确定的 对于先前发音的部分,并且基于话语中的部分的突出性来确定替换部分,并且其中用替换部分评估先前的发音。
    • 7. 发明公开
    • CONTENT-BASED AUDIO PLAYBACK EMPHASIS
    • INHALTBASIERTE AUDIOWIEDERGABEBETONUNG
    • EP1908055A4
    • 2008-11-26
    • EP06786330
    • 2006-07-06
    • MULTIMODAL TECHNOLOGIES INC
    • SCHUBERT KJELLFRITSCH JUERGENFINKE MICHAELKOLL DETLEF
    • G10L15/22
    • G10L15/26G06F17/273G06F17/2785G10L15/1807G10L15/22G10L15/265G10L21/04
    • Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelyhood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.
    • 公开了用于促进对口头音频流的草稿转录本进行校对的过程的技术。 通常,通过播放相应的口头音频流,强调音频流中与高度相关或可能已被错误转录的那些区域相关的音频流,便于对草稿抄本进行校对。 例如,可以通过比相关度低且可能被正确转录的区域更慢地播放区域来强调区域。 强调正确转录最重要的那些音频流区域和那些最有可能被错误转录的区域会增加校对者准确纠正这些区域中任何错误的可能性,从而提高整个转录本的准确性。
    • 8. 发明公开
    • Speech recognition using prosody
    • Pros。。
    • EP1927979A1
    • 2008-06-04
    • EP07254504.9
    • 2007-11-19
    • Sony Corporation
    • Yamada, Keiichi
    • G10L15/18G10L11/04
    • G10L15/1807G10L25/15G10L25/90
    • Disclosed herein is a voice processing apparatus for recognizing an input voice on the basis of a prosody characteristic of said voice, said voice processing apparatus including: voice acquisition means for acquiring said input voice; acoustic analysis means for finding a relative pitch change on the basis of a frequency-direction difference between a first frequency characteristic seen at each frame time of said input voice acquired by said voice acquisition means and a second frequency characteristic determined in advance; and prosody recognition means for carrying out a prosody recognition process on the basis of said relative pitch change found by said acoustic analysis means in order to produce a result of said prosody recognition process.
    • 本发明公开了一种用于基于所述语音的韵律特性识别输入语音的语音处理装置,所述语音处理装置包括:语音获取装置,用于获取所述输入的语音; 声学分析装置,用于根据由所述语音获取装置获取的所述输入声音的每个帧时间看到的第一频率特性与预先确定的第二频率特性之间的频率方向差找到相对音调变化; 以及韵律识别装置,用于基于由所述声学分析装置发现的所述相对间距变化来执行韵律识别过程,以产生所述韵律识别过程的结果。