    • 1. Granted invention patent
    • Method and apparatus for teaching prosodic features of speech
    • US06358054B1
    • 2002-03-19
    • US09587799
    • 2000-06-06
    • Martin Rothenberg
    • Martin Rothenberg
    • G09B1904
    • G09B19/04
    • A system and apparatus for teaching prosodic features of speech senses and extracts prosodic or suprasegmental variables of a user's speech segment. Prosodic features of speech include pitch and loudness variations, as opposed to articulatory or sequential features of speech which are the primary determinants of phoneme variations. Once prosodic variables have been extracted from a speech segment, the variables are used to modulate a quasiperiodic waveform such as a sinusoid, a pulse-train, or a synthesized vowel-like waveform, or the parameters can be used to modulate a random-noise-like waveform. A modulated waveform can be played acoustically, and the user can hear the variation of the prosodic parameters without interference from the articulatory parameters of a complete waveform. This auditory feedback can be combined with visual feedback of the speech segment to teach proper prosodic speech formation. Auditory feedback for teaching prosodic features can also be formed without a modulation process by removing articulatory information from a speech segment, and non-acoustic measures sensors of prosodic feature, such as an electroglottograph, can also be employed.
    • 2. Granted invention patent
    • Method and apparatus for reporting progress of a subject using audio/visual adaptive training stimulii
    • US06290504B1
    • 2001-09-18
    • US09415885
    • 1999-10-08
    • Angela Jane Benitz; Elizabeth H. Budra; William M. Jenkins; John J. Montgomery
    • Angela Jane Benitz; Elizabeth H. Budra; William M. Jenkins; John J. Montgomery
    • G09B1904
    • G09B5/04; G06Q20/204; G09B5/065; G09B5/14; G09B19/04; G09B21/00; G09B21/009
    • An apparatus and method on a computing device for training of auditory and graphical discrimination in humans is provided. The method and apparatus provides a number of stimulus sets, each stimulus set having a number of different phonemes. Speech processing is used to provide multiple levels of emphasis and or stretching for enhancing a subject's ability to discriminate between similarly sounding phonemes. The processing is applied to phonemes and presented to the human as a trial. As a subject correctly identifies phonemes in the stimulus sets, the amount of processing applied to the phonemes is reduced, ultimately to the level of normal speech. A performance feedback mechanism is provided to allow the human to obtain a summary of his/her success over the stimulus sets, at the different processing levels. More detailed feedback is also provided indicating specific processing levels achieved for each of the stimulus sets. Selection buttons are provided on a graphical interface to allow the human to hear a stimulus set at his beginning processing level, and at his currently obtained processing level.
    • 3. Granted invention patent
    • Method and apparatus for teaching prosodic features of speech
    • US06358055B1
    • 2002-03-19
    • US09587800
    • 2000-06-06
    • Martin Rothenberg
    • Martin Rothenberg
    • G09B1904
    • G09B19/04
    • A system and apparatus for teaching prosodic features of speech senses and extracts prosodic or suprasegmental variables of a user's speech segment. Prosodic features of speech include pitch and loudness variations, as opposed to articulatory or sequential features of speech which are the primary determinants of phoneme variations. Once prosodic variables have been extracted from a speech segment, the variables are used to modulate a quasiperiodic waveform such as a sinusoid, a pulse-train, or a synthesized vowel-like waveform, or the parameters can be used to modulate a random-noise-like waveform. A modulated waveform can be played acoustically, and the user can hear the variation of the prosodic parameters without interference from the articulatory parameters of a complete waveform. This auditory feedback can be combined with visual feedback of the speech segment to teach proper prosodic speech formation. Auditory feedback for teaching prosodic features can also be formed without a modulation process by removing articulatory information from a speech segment, and non-acoustic measures sensors of prosodic feature, such as an electroglottograph, can also be employed.
    • 5. Granted invention patent
    • Method and apparatus for training of auditory/visual discrimination using target and distractor phonemes/graphemes
    • US06224384B1
    • 2001-05-01
    • US09604443
    • 2000-06-27
    • William M. Jenkins; Michael M. Merzenich; Steven L. Miller; Bret E. Peterson; Paula Tallal
    • William M. Jenkins; Michael M. Merzenich; Steven L. Miller; Bret E. Peterson; Paula Tallal
    • G09B1904
    • G09B5/06; G09B5/04; G09B5/065; G09B5/14; G09B19/04; G09B21/00; G09B21/009
    • An apparatus and method for training of auditory and graphical discrimination in humans is provided. The method and apparatus provides a number of stimulus sets, each stimulus set having a target phoneme, and associated grapheme, and a number of distractor phonemes, and associated graphemes. Upon initiation of a trial, a target phoneme is presented to a subject. A stimulus stream is then prepared that consists of a random sequence of distractor phonemes. Located within the sequence of distractor phonemes is the target phoneme. The stimulus sequence is presented to the subject for identification of the target phoneme within the sequence. Speech processing is used to provide multiple levels of emphasis for enhancing a subject's ability to discriminate between similarly sounding phonemes. The processing is applied to the presentation of the target phoneme and the stimulus stream. As a subject correctly identifies target phonemes within stimulus streams, across all provided stimulus sets, the amount of processing applied to the phonemes is reduced, ultimately to the level of normal speech.
    • 7. Granted invention patent
    • System for sound file recording, analysis, and archiving via the internet for language training and other applications
    • US06296489B1
    • 2001-10-02
    • US09339462
    • 1999-06-23
    • Laurie J. Blass; Pamela H. Elder
    • Laurie J. Blass; Pamela H. Elder
    • G09B1904
    • G09B19/04; G09B19/08
    • The invention is a system for sound file recording, comparison, and archiving for network-based language and communications training, or other applications. The invention allows capture of multimedia data from a user, and allows the user to play back his or her self-created sound inputs and to view various comparisons of his or her sound inputs with model sounds. The invention displays a waveform or spectrogram of a model sound superimposed over a waveform (or spectrogram) of the user's sound input. It can display a failure/success indication for the user's sound input relative to a predetermined standard. Further, the invention allows a user to archive sound files for subsequent review and analysis.
    • 8. Granted invention patent
    • Talking facial display method and apparatus
    • US06250928B1
    • 2001-06-26
    • US09223858
    • 1998-12-31
    • Tomaso A. Poggio; Antoine F. Ezzat
    • Tomaso A. Poggio; Antoine F. Ezzat
    • G09B1904
    • G09B19/04
    • A method and apparatus of converting input text into an audio-visual speech stream resulting in a talking face image enunciating the text. This method of converting input text into an audio-visual speech stream comprises the steps of: recording a visual corpus of a human-subject, building a viseme interpolation database, and synchronizing the talking face image with the text stream. In a preferred embodiment, viseme transitions are automatically calculated using optical flow methods, and morphing techniques are employed to result in smooth viseme transitions. The viseme transitions are concatenated together and synchronized with the phonemes according to the timing information. The audio-visual speech stream is then displayed in real time, thereby displaying a photo-realistic talking face.