专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US06358054B1 Method and apparatus for teaching prosodic features of speech 失效
标题翻译：教学韵律特征的方法与装置
公开(公告)号：US06358054B1
公开(公告)日：2002-03-19
申请号：US09587799
申请日：2000-06-06
申请人： Martin Rothenberg
发明人： Martin Rothenberg
IPC分类号： G09B1904
CPC分类号： G09B19/04
摘要： A system and apparatus for teaching prosodic features of speech senses and extracts prosodic or suprasegmental variables of a user's speech segment. Prosodic features of speech include pitch and loudness variations, as opposed to articulatory or sequential features of speech which are the primary determinants of phoneme variations. Once prosodic variables have been extracted from a speech segment, the variables are used to modulate a quasiperiodic waveform such as a sinusoid, a pulse-train, or a synthesized vowel-like waveform, or the parameters can be used to modulate a random-noise-like waveform. A modulated waveform can be played acoustically, and the user can hear the variation of the prosodic parameters without interference from the articulatory parameters of a complete waveform. This auditory feedback can be combined with visual feedback of the speech segment to teach proper prosodic speech formation. Auditory feedback for teaching prosodic features can also be formed without a modulation process by removing articulatory information from a speech segment, and non-acoustic measures sensors of prosodic feature, such as an electroglottograph, can also be employed.
摘要翻译：一种用于教授语音感觉的韵律特征并提取用户语音段的韵律或超节段变量的系统和装置。语音的韵律特征包括音调和响度变化，而不是作为音素变化的主要决定因素的语音的发音或连续特征。一旦从语音片段中提取韵律变量，就可以使用这些变量来调制诸如正弦波，脉冲串或合成元音波形之类的准周期波形，或者这些参数可用于调制随机噪声样波形可以在声学上播放调制波形，并且用户可以听到韵律参数的变化而不受来自完整波形的发音参数的干扰。这种听觉反馈可以与语音段的视觉反馈相结合，以教导适当的韵律语音形成。还可以通过从语音段去除发音信息而形成用于教导韵律特征的听觉反馈，而不需要调制过程，并且还可以采用非声学测量韵律特征的传感器，例如电泳图。

2. 发明授权

US06290504B1 Method and apparatus for reporting progress of a subject using audio/visual adaptive training stimulii 有权
标题翻译：使用音频/视觉适应训练刺激报告主体进展的方法和装置
公开(公告)号：US06290504B1
公开(公告)日：2001-09-18
申请号：US09415885
申请日：1999-10-08
申请人： Angela Jane Benitz , Elizabeth H. Budra , William M. Jenkins , John J. Montgomery
发明人： Angela Jane Benitz , Elizabeth H. Budra , William M. Jenkins , John J. Montgomery
IPC分类号： G09B1904
CPC分类号： G09B5/04 , G06Q20/204 , G09B5/065 , G09B5/14 , G09B19/04 , G09B21/00 , G09B21/009
摘要： An apparatus and method on a computing device for training of auditory and graphical discrimination in humans is provided. The method and apparatus provides a number of stimulus sets, each stimulus set having a number of different phonemes. Speech processing is used to provide multiple levels of emphasis and or stretching for enhancing a subject's ability to discriminate between similarly sounding phonemes. The processing is applied to phonemes and presented to the human as a trial. As a subject correctly identifies phonemes in the stimulus sets, the amount of processing applied to the phonemes is reduced, ultimately to the level of normal speech. A performance feedback mechanism is provided to allow the human to obtain a summary of his/her success over the stimulus sets, at the different processing levels. More detailed feedback is also provided indicating specific processing levels achieved for each of the stimulus sets. Selection buttons are provided on a graphical interface to allow the human to hear a stimulus set at his beginning processing level, and at his currently obtained processing level.
摘要翻译：提供了一种用于训练人体中听觉和图形辨别的计算设备上的装置和方法。该方法和装置提供多个刺激组，每个刺激组具有多个不同的音素。语音处理用于提供多个重点和/或拉伸程度，以增强受试者辨别相似的声音音素的能力。该处理被应用于音素并作为试验呈递给人类。作为主体正确地识别刺激组中的音素，施加到音素的处理量减少，最终降低到正常语音的水平。提供了一种绩效反馈机制，以允许人们在不同的处理级别获得他/她在刺激组上的成功的总结。还提供了更详细的反馈意见，指出针对每个刺激组实现的具体处理水平。在图形界面上提供选择按钮，以允许人们在他的开始处理级别和他当前获得的处理级别上听到一个刺激设置。

3. 发明授权

US06358055B1 Method and apparatus for teaching prosodic features of speech 失效
公开(公告)号：US06358055B1
公开(公告)日：2002-03-19
申请号：US09587800
申请日：2000-06-06
申请人： Martin Rothenberg
发明人： Martin Rothenberg
IPC分类号： G09B1904
CPC分类号： G09B19/04
摘要： A system and apparatus for teaching prosodic features of speech senses and extracts prosodic or suprasegmental variables of a user's speech segment. Prosodic features of speech include pitch and loudness variations, as opposed to articulatory or sequential features of speech which are the primary determinants of phoneme variations. Once prosodic variables have been extracted from a speech segment, the variables are used to modulate a quasiperiodic waveform such as a sinusoid, a pulse-train, or a synthesized vowel-like waveform, or the parameters can be used to modulate a random-noise-like waveform. A modulated waveform can be played acoustically, and the user can hear the variation of the prosodic parameters without interference from the articulatory parameters of a complete waveform. This auditory feedback can be combined with visual feedback of the speech segment to teach proper prosodic speech formation. Auditory feedback for teaching prosodic features can also be formed without a modulation process by removing articulatory information from a speech segment, and non-acoustic measures sensors of prosodic feature, such as an electroglottograph, can also be employed.

4. 发明授权

US06273726B1 Method of associating oral utterances meaningfully with word symbols seriatim in an audio-visual work and apparatus for linear and interactive application 失效
标题翻译：将口头发音有意义地与视听作品中的字符串相关联的方法和用于线性和交互应用的装置
公开(公告)号：US06273726B1
公开(公告)日：2001-08-14
申请号：US09570237
申请日：2000-05-12
申请人： William E. Kirksey , Kyle S. Morris
发明人： William E. Kirksey , Kyle S. Morris
IPC分类号： G09B1904
CPC分类号： G09F27/00 , G09B5/065 , G09B19/04
摘要： An audio-visual work and method of its creation which work has writings placed on the pictures of the work so that as each word or other utterance is heard a writing to be associated with the hearing is coordinated with seeing of the writing such that the future presentation of either the utterance or the writing shall evoke the other in the mind of the original viewer-listener. Each word will when appropriate appear in a legible perspective adjacent to the mouth of the utterer. The work can be displayed linearly or under computer control of the viewer/listener along with additional educational materials.
摘要翻译：一种视听工作及其创作方法，其工作对作品的图片进行了编写，以便随着每一个单词或其他话语被听到与书面相关的写作与听力相关联，与书面的看法协调一致，使未来演讲中的演讲或写作应引起原始观众收看者心目中的另一人。每个字都会在适当的时候出现在与发音口相邻的清晰视角中。工作可以线性显示，也可以在观众/收听者的计算机控制下，以及其他教育材料。

5. 发明授权

US06224384B1 Method and apparatus for training of auditory/visual discrimination using target and distractor phonemes/graphemes 有权
标题翻译：使用目标和干扰素音素/字形来训练听觉/视觉辨别的方法和装置
公开(公告)号：US06224384B1
公开(公告)日：2001-05-01
申请号：US09604443
申请日：2000-06-27
申请人： William M. Jenkins , Michael M. Merzenich , Steven L. Miller , Bret E. Peterson , Paula Tallal
发明人： William M. Jenkins , Michael M. Merzenich , Steven L. Miller , Bret E. Peterson , Paula Tallal
IPC分类号： G09B1904
CPC分类号： G09B5/06 , G09B5/04 , G09B5/065 , G09B5/14 , G09B19/04 , G09B21/00 , G09B21/009
摘要： An apparatus and method for training of auditory and graphical discrimination in humans is provided. The method and apparatus provides a number of stimulus sets, each stimulus set having a target phoneme, and associated grapheme, and a number of distractor phonemes, and associated graphemes. Upon initiation of a trial, a target phoneme is presented to a subject. A stimulus stream is then prepared that consists of a random sequence of distractor phonemes. Located within the sequence of distractor phonemes is the target phoneme. The stimulus sequence is presented to the subject for identification of the target phoneme within the sequence. Speech processing is used to provide multiple levels of emphasis for enhancing a subject's ability to discriminate between similarly sounding phonemes. The processing is applied to the presentation of the target phoneme and the stimulus stream. As a subject correctly identifies target phonemes within stimulus streams, across all provided stimulus sets, the amount of processing applied to the phonemes is reduced, ultimately to the level of normal speech.
摘要翻译：提供了一种用于训练人类听觉和图形辨别的装置和方法。该方法和装置提供多个刺激组，每个刺激组具有目标音素，以及相关联的图形，以及多个牵引器音素和相关联的图形。开始试用后，将目标音素呈现给受试者。然后制备由随机序列的干扰素音素组成的刺激流。位于干扰素音素序列中的是目标音素。刺激序列呈现给受试者以鉴定序列内的目标音素。语音处理用于提供多个重点，以增强被摄体辨别相似的声音音素的能力。该处理被应用于目标音素和刺激流的呈现。作为对象正确地识别刺激流中的目标音素，在所有提供的刺激组中，应用于音素的处理量被减少，最终降低到正常语音的水平。

6. 发明授权

US06592375B2 Method and system for producing engine sounds of a simulated vehicle 有权
标题翻译：用于生产模拟车辆的发动机声音的方法和系统
公开(公告)号：US06592375B2
公开(公告)日：2003-07-15
申请号：US09780249
申请日：2001-02-09
申请人： Michael L. Henry , Mark L. Gruber , Peter W. Mokris
发明人： Michael L. Henry , Mark L. Gruber , Peter W. Mokris
IPC分类号： G09B1904
CPC分类号： G09B9/04
摘要： A vehicle simulation system includes a sound resonant chamber assembly for simulating audio sounds representative of the sounds produced during the operation of the simulated vehicle, comprising a speaker and a sound resonant tube attached to the speaker for enhancing and directing the audio sounds.
摘要翻译：车辆模拟系统包括用于模拟表示在模拟车辆的操作期间产生的声音的音频声音的声音谐振室组件，包括扬声器和连接到扬声器的声音谐振管，用于增强和引导音频声音。

7. 发明授权

US06296489B1 System for sound file recording, analysis, and archiving via the internet for language training and other applications 有权
标题翻译：通过互联网进行语音培训和其他应用程序的声音文件记录，分析和存档系统
公开(公告)号：US06296489B1
公开(公告)日：2001-10-02
申请号：US09339462
申请日：1999-06-23
申请人： Laurie J. Blass , Pamela H. Elder
发明人： Laurie J. Blass , Pamela H. Elder
IPC分类号： G09B1904
CPC分类号： G09B19/04 , G09B19/08
摘要： The invention is a system for sound file recording, comparison, and archiving for network-based language and communications training, or other applications. The invention allows capture of multimedia data from a user, and allows the user to play back his or her self-created sound inputs and to view various comparisons of his or her sound inputs with model sounds. The invention displays a waveform or spectrogram of a model sound superimposed over a waveform (or spectrogram) of the user's sound input. It can display a failure/success indication for the user's sound input relative to a predetermined standard. Further, the invention allows a user to archive sound files for subsequent review and analysis.
摘要翻译：本发明是用于基于网络的语言和通信培训或其他应用的声音文件记录，比较和归档的系统。本发明允许从用户捕获多媒体数据，并且允许用户播放他或她自己创建的声音输入并且查看他或她的声音输入与模型声音的各种比较。本发明显示叠加在用户声音输入的波形（或频谱图）上的模型声音的波形或频谱图。它可以显示相对于预定标准的用户声音输入的故障/成功指示。此外，本发明允许用户归档声音文件以用于随后的审查和分析。

8. 发明授权

US06250928B1 Talking facial display method and apparatus 有权
标题翻译：谈话面部显示方法和装置
公开(公告)号：US06250928B1
公开(公告)日：2001-06-26
申请号：US09223858
申请日：1998-12-31
申请人： Tomaso A. Poggio , Antoine F. Ezzat
发明人： Tomaso A. Poggio , Antoine F. Ezzat
IPC分类号： G09B1904
CPC分类号： G09B19/04
摘要： A method and apparatus of converting input text into an audio-visual speech stream resulting in a talking face image enunciating the text. This method of converting input text into an audio-visual speech stream comprises the steps of: recording a visual corpus of a human-subject, building a viseme interpolation database, and synchronizing the talking face image with the text stream. In a preferred embodiment, viseme transitions are automatically calculated using optical flow methods, and morphing techniques are employed to result in smooth viseme transitions. The viseme transitions are concatenated together and synchronized with the phonemes according to the timing information. The audio-visual speech stream is then displayed in real time, thereby displaying a photo-realistic talking face.
摘要翻译：一种将输入文本转换成视听语音流的方法和装置，导致说出文本的说话面部图像。这种将输入文本转换成视听语音流的方法包括以下步骤：记录人类对象的视觉语料库，构建视觉插值数据库，以及将谈话的脸部图像与文本流同步。在优选的实施方案中，使用光学流动方法自动计算视标跃迁，并且采用变形技术以产生平滑的视觉转换。视觉转换被连接在一起，并根据时间信息与音素同步。然后，实时地显示视听语音流，从而显示照片般逼真的通话面孔。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式