Quick Patent Search: fast search of global patents, free commercial patent database - IPRDB

1. Published application

EP3396668A1 CLASSIFYING SPEECH UTTERANCES [In force, assigned]
Publication No.: EP3396668A1
Publication date: 2018-10-31
Application No.: EP18153091.6
Filing date: 2009-06-17
Applicant: Voicesense Ltd.
Inventors: Degani, Yoav; Zamir, Yishai
IPC classification: G10L17/26, G10L15/18, G10L15/06
CPC classification: G10L15/06, G10L15/1807, G10L17/26
Abstract: A computer implemented method of analyzing speech utterances of a speaker in a given situation and context and determining behavioral, psychological and speech style characteristics of the speaker in the given situation, said computer implemented method comprising: creating a speech parameters reference database for classifying speech utterances according to various behavioral, psychological and speech style characteristics; obtaining speech utterances of a speaker in a specific situation and context; deriving a plurality of secondary speech parameters from said primary parameters; calculating a subset of speech parameters, parameters combinations and parameters' values representative of situational behavioral, psychological and speech style characteristics, from said secondary parameters in the speech utterance; determining and scoring the situational behavioral, psychological and speech style characteristics in the speech utterance by comparing the calculated subset of speech parameters, parameters combinations and parameters' values with the pre-defined reference database of speech parameters.

2. Published application

EP3394803A1 ESCALATION TO A HUMAN OPERATOR [Invalid]
Publication No.: EP3394803A1
Publication date: 2018-10-31
Application No.: EP17745543.3
Filing date: 2017-06-13
Applicant: Google LLC
Inventors: SEGALIS, Eyal; WALEVSKI, Daniel; LEVIATHAN, Yaniv
IPC classification: G06Q10/06, G06Q10/10
CPC classification: H04M3/4936, G06F17/27, G06F17/2705, G06F17/2881, G06N99/005, G06Q10/06, G06Q10/10, G10L13/08, G10L15/005, G10L15/1807, G10L15/1815, G10L15/222, G10L25/63, G10L2015/227, H04M3/42042, H04M3/42093, H04M3/493, H04M3/58, H04M3/60, H04M2242/18
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, relating to synthetic call status updates. In some implementations, a method includes determining, by a task manager module, that a triggering event has occurred to provide a current status of a user call request. The method may then determine, by the task manager module, the current status of the user call request. A representation of the current status of the user call request is generated. Then, the generated representation of the current status of the user call request is provided to the user.

3. Published application

EP1145225A1 TONE FEATURES FOR SPEECH RECOGNITION [Under examination, published]
Title translation: Tonal features for speech recognition
Publication No.: EP1145225A1
Publication date: 2001-10-17
Application No.: EP00987248.2
Filing date: 2000-11-10
Applicant: Koninklijke Philips Electronics N.V.
Inventors: HUANG, Chang-Han; SEIDE, Frank
IPC classification: G10L15/18
CPC classification: G10L15/1807, G10L25/15, G10L2025/935
Abstract: Robust acoustic tone features are achieved first by the introduction of on-line, look-ahead trace back of the fundamental frequency (F0) contour with adaptive pruning; this fundamental frequency serves as the signal preprocessing front-end. The F0 contour is subsequently decomposed into lexical tone effect, phrase intonation effect, and random effect by means of a time-variant, weighted moving average (MA) filter in conjunction with weighted (placing more emphasis on vowels) least squares of the F0 contour. The intonation effect is removed by subtraction from the F0 contour under a superposition assumption. The acoustic tone features are defined in two parts. The first part consists of the coefficients of the second order weighted regression of the de-intonation of the F0 contour over neighbouring frames. The second part deals with the degree of periodicity of the signal, given by the coefficients of the second order regression of the auto-correlation. The weights of the second order weighted regression of the de-intonation of the F0 contour are designed to emphasize/de-emphasize the voiced/unvoiced segments of the pitch contour in order to preserve the voiced pitch contour for the semi-voiced consonants.
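
The de-intonation step described in this abstract can be illustrated with a minimal sketch. It assumes a fixed window instead of the time-variant filter named in the abstract, and it omits the weighted least-squares fit; the function names, the `half_window` parameter and the per-frame `weights` list are all hypothetical and are not taken from the patent.

```python
def weighted_moving_average(f0, weights, half_window):
    """Estimate a slowly varying trend of an F0 contour.

    f0          -- list of F0 values, one per frame
    weights     -- list of per-frame weights (assumption: higher for vowels,
                   zero for unvoiced frames)
    half_window -- number of neighbouring frames used on each side
    """
    trend = []
    for t in range(len(f0)):
        lo = max(0, t - half_window)
        hi = min(len(f0), t + half_window + 1)
        total_weight = sum(weights[lo:hi])
        if total_weight == 0:
            # No weighted frame in the window: fall back to the raw value.
            trend.append(f0[t])
        else:
            weighted_sum = sum(w * x for w, x in zip(weights[lo:hi], f0[lo:hi]))
            trend.append(weighted_sum / total_weight)
    return trend


def remove_intonation(f0, weights, half_window=2):
    """Subtract the estimated trend from the contour (superposition assumption)."""
    trend = weighted_moving_average(f0, weights, half_window)
    return [x - m for x, m in zip(f0, trend)]
```

For example, `remove_intonation([100.0, 110.0, 120.0], [1, 1, 1], half_window=1)` subtracts a local average from each frame, leaving the residual from which tone features could then be computed.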

4. Published application

EP0749109A2 Speech recognition for tonal languages [Lapsed]
Title translation: Spracherkennung
Publication No.: EP0749109A2
Publication date: 1996-12-18
Application No.: EP96850108.0
Filing date: 1996-06-04
Applicant: TELIA AB
Inventors: Lyberg, Bertil
IPC classification: G10L5/06
CPC classification: G10L15/1807
Abstract: The present invention relates to a method and device for speech-to-text conversion. From a given speech the fundamental tone is extracted. A model of the speech is further created from the speech. In the model a duration reproduction in words and sentences is obtained. The duration reproduction is compared with a segment duration in the speech. From the comparison, information is obtained that determines which type of accent exists, whereupon a text with sentence accent information is produced.

5. Published application

EP2645364A1 Spoken dialog system using prominence [Under examination, published]
Title translation: Sprachdialogsystem mit Anwendung der Prominenz
Publication No.: EP2645364A1
Publication date: 2013-10-02
Application No.: EP12162032.2
Filing date: 2012-03-29
Applicant: Honda Research Institute Europe GmbH
Inventors: Heckmann, Martin
IPC classification: G10L15/22, G10L15/18, G10L13/08
CPC classification: G10L15/22, G10L13/04, G10L13/08, G10L15/1807, G10L25/48
Abstract: The invention presents a method for analyzing speech in a spoken dialog system, comprising the steps of: accepting an utterance by at least one means for accepting acoustical signals, in particular a microphone, analyzing the utterance and obtaining prosodic cues from the utterance using at least one processing engine, wherein the utterance is evaluated based on the prosodic cues to determine a prominence of parts of the utterance, and wherein the utterance is analyzed to detect either at least one marker feature, e.g. a negative statement, a segment with a very high prominence or both, indicative of the utterance containing at least one part to replace at least one part in a previous utterance, the part to be replaced in the previous utterance being determined based on the prominence determined for the parts of the previous utterance and the replacement parts being determined based on the prominence of the parts in the utterance, and wherein the previous utterance is evaluated with the replacement part(s).

6. Published application

EP2188729A1 SYSTEM-EFFECTED TEXT ANNOTATION FOR EXPRESSIVE PROSODY IN SPEECH SYNTHESIS AND RECOGNITION [Under examination, published]
Title translation: SYSTEMBEWIRKTE text annotation FOR AUSDRUCKSPROSODIE speech synthesis and recognition
Publication No.: EP2188729A1
Publication date: 2010-05-26
Application No.: EP08797487.9
Filing date: 2008-08-08
Applicant: Lessac Technologies, Inc.
Inventors: NITISAROJ, Rattima; MARPLE, Gary; CHANDRA, Nishant
IPC classification: G06F15/00
CPC classification: G10L13/10, G10L15/1807
Abstract: The inventive system can automatically annotate the relationship of text and acoustic units for the purposes of: (a) predicting how the text is to be pronounced as expressively synthesized speech, and (b) improving the proportion of expressively uttered speech as correctly identified text representing the speaker's message. The system can automatically annotate text corpora for relationships of uttered speech for a particular speaking style and for acoustic units in terms of context and content of the text to the utterances. The inventive system can use kinesthetically defined expressive speech production phonetics that are recognizable and controllable according to kinesensic feedback principles. In speech synthesis embodiments of the invention, the text annotations can specify how the text is to be expressively pronounced as synthesized speech. Also, acoustically-identifying features for dialects or mispronunciations can be identified to expressively synthesize alternative dialects or stylistic mispronunciations for a speaker from a given text.

7. Published application

EP1908055A4 CONTENT-BASED AUDIO PLAYBACK EMPHASIS [In force]
Title translation: INHALTBASIERTE AUDIOWIEDERGABEBETONUNG
Publication No.: EP1908055A4
Publication date: 2008-11-26
Application No.: EP06786330
Filing date: 2006-07-06
Applicant: MULTIMODAL TECHNOLOGIES INC
Inventors: SCHUBERT KJELL; FRITSCH JUERGEN; FINKE MICHAEL; KOLL DETLEF
IPC classification: G10L15/22
CPC classification: G10L15/26, G06F17/273, G06F17/2785, G10L15/1807, G10L15/22, G10L15/265, G10L21/04
Abstract: Techniques are disclosed for facilitating the process of proofreading draft transcripts of spoken audio streams. In general, proofreading of a draft transcript is facilitated by playing back the corresponding spoken audio stream with an emphasis on those regions in the audio stream that are highly relevant or likely to have been transcribed incorrectly. Regions may be emphasized by, for example, playing them back more slowly than regions that are of low relevance and likely to have been transcribed correctly. Emphasizing those regions of the audio stream that are most important to transcribe correctly and those regions that are most likely to have been transcribed incorrectly increases the likelihood that the proofreader will accurately correct any errors in those regions, thereby improving the overall accuracy of the transcript.
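
The emphasis rule in this abstract (slower playback for regions that are highly relevant or likely mistranscribed) might be sketched as follows. The `Region` type, its `relevance` and `confidence` scores, the thresholds and the rate values are all hypothetical; the abstract does not specify how relevance or transcription correctness is measured.

```python
from dataclasses import dataclass


@dataclass
class Region:
    text: str          # draft transcript text for this audio region
    relevance: float   # assumed score in [0, 1]; higher means more important
    confidence: float  # assumed recognizer confidence in [0, 1]


def playback_rate(region, slow=0.75, normal=1.0,
                  relevance_threshold=0.7, confidence_threshold=0.5):
    """Return a playback speed factor for one region.

    A region is emphasized (played more slowly) when it is highly relevant
    or when its transcription is likely to be wrong.
    """
    highly_relevant = region.relevance >= relevance_threshold
    likely_wrong = region.confidence < confidence_threshold
    return slow if (highly_relevant or likely_wrong) else normal


def playback_plan(regions):
    """Pair each region's text with its playback speed factor."""
    return [(r.text, playback_rate(r)) for r in regions]
```

A caller would pass the plan to whatever audio player it uses; only the selection of the speed factor is shown here.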

8. Published application

EP1927979A1 Speech recognition using prosody [In force]
Title translation: Pros...
Publication No.: EP1927979A1
Publication date: 2008-06-04
Application No.: EP07254504.9
Filing date: 2007-11-19
Applicant: Sony Corporation
Inventors: Yamada, Keiichi
IPC classification: G10L15/18, G10L11/04
CPC classification: G10L15/1807, G10L25/15, G10L25/90
Abstract: Disclosed herein is a voice processing apparatus for recognizing an input voice on the basis of a prosody characteristic of said voice, said voice processing apparatus including: voice acquisition means for acquiring said input voice; acoustic analysis means for finding a relative pitch change on the basis of a frequency-direction difference between a first frequency characteristic seen at each frame time of said input voice acquired by said voice acquisition means and a second frequency characteristic determined in advance; and prosody recognition means for carrying out a prosody recognition process on the basis of said relative pitch change found by said acoustic analysis means in order to produce a result of said prosody recognition process.
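
One way to read the "frequency-direction difference" in this abstract is as the shift along a log-frequency axis that best aligns a frame's spectrum with a predetermined template. The sketch below takes that reading as an assumption; the cross-correlation search, the function names and the `bins_per_octave` parameter are illustrative and are not taken from the patent.

```python
def best_shift(frame, template, max_shift):
    """Return the bin shift of `frame` relative to `template` that gives the
    highest correlation. Both inputs are magnitude spectra sampled on the
    same log-frequency axis (assumption)."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_shift, max_shift + 1):
        score = 0.0
        for i, value in enumerate(template):
            j = i + lag
            if 0 <= j < len(frame):
                score += value * frame[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def relative_pitch_change(frame, template, max_shift=12, bins_per_octave=12):
    """Express the best shift in octaves; positive means the frame is higher."""
    return best_shift(frame, template, max_shift) / bins_per_octave
```

Because the shift is measured against a fixed template, no absolute pitch estimate is needed for the frame; only the relative change is obtained.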

9. Published application

EP1422692A2 Automatic insertion of non-verbalized punctuation in speech recognition [Under examination, published]
Title translation: Automatisches Einfügen von nichtausgesprochenen Satzzeichen in der Spracherkennung
Publication No.: EP1422692A2
Publication date: 2004-05-26
Application No.: EP03257354.5
Filing date: 2003-11-21
Applicant: ScanSoft, Inc.
Inventors: Divay, Olivier; Watson, Jonathan; Gold, Allan; Van Even, Stijn
IPC classification: G10L15/22
CPC classification: G10L15/1807, G06F17/2725, G10L15/22
Abstract: Recognizing punctuation in computer-implemented speech recognition includes performing speech recognition on an utterance to produce a recognition result for the utterance. A non-verbalized punctuation mark is identified in a recognition result and the recognition result is formatted based on the identification.

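10. Granted patent

EP0838805B1 Speech recognition apparatus using pitch intensity information [Lapsed]
Title translation: Speech using fundamental-frequency intensity data
Publication No.: EP0838805B1
Publication date: 2003-03-26
Application No.: EP97118746.3
Filing date: 1997-10-28
Applicant: NEC CORPORATION
Inventors: Takagi, Keizaburo
IPC classification: G10L15/18
CPC classification: G10L15/1807, G10L25/78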
