Patent Quick Search: search global patents quickly, free commercial patent database (IPRDB)

1. Granted patent

US5930336A Voice dialing server for branch exchange telephone systems
Status: Expired
Publication No.: US5930336A
Publication date: 1999-07-27
Application No.: US723914
Filing date: 1996-09-30
Applicant: Jean-Claude Junqua, Philippe R. Morin, Ted H. Applebaum
Inventor: Jean-Claude Junqua, Philippe R. Morin, Ted H. Applebaum
IPC classification: G10L15/02, G10L15/10, H04M3/42, H04M3/493, H04Q3/58, H04Q3/62, H04M1/64
CPC classification: H04M3/4931, G10L15/02, G10L15/10, H04M3/42204, H04M3/42314, H04Q3/627, G10L2015/228, H04M2201/40
Abstract: The voice dialing server plugs into one or more unused extensions of a branch exchange system to provide each of the users on the system with voice dialing services. To use the system, a user simply dials the extension to which the server is attached. The server then prompts the user to supply the name of a party to be called. The name is then looked up in a telephone number dictionary unique to that user. The system then places the telephone call by sending commands to the branch exchange system that simulate the operations a user would perform to connect to an outside line or inside extension and then place the call. The server incorporates a speech processing module having a multistage word recognizer that represents speech in terms of high phoneme similarity values. This representation is highly compact, allowing the word recognizer to perform the recognizer and fine match stages with far less processor overhead than frame-by-frame speech recognizers.

2. Granted patent

US5892813A Multimodal voice dialing digital key telephone with dialog manager
Status: Expired
Publication No.: US5892813A
Publication date: 1999-04-06
Application No.: US723913
Filing date: 1996-09-30
Applicant: Philippe R. Morin, Ted H. Applebaum, Jean-Claude Junqua
Inventor: Philippe R. Morin, Ted H. Applebaum, Jean-Claude Junqua
IPC classification: G10L15/00, G10L13/00, G10L15/18, G10L15/28, H04M1/00, H04M1/247, H04M1/253, H04M1/27, H04M3/42
CPC classification: H04M1/247, H04M1/271
Abstract: The multimodal telephone prompts the user using both a visual display and synthesized voice. It receives user input via keypad and programmable soft keys associated with the display, and also through user-spoken commands. The voice module includes a two stage speech recognizer that models speech in terms of high similarity values. A dialog manager associated with the voice module maintains the visual and verbal systems in synchronism with one another. The dialog manager administers a state machine that records the dialog context. The dialog context is used to ensure that the appropriate visual prompts are displayed, showing what commands are possible at any given point in the dialog. The speech recognizer also uses the dialog context to select the recognized word candidate that is appropriate to the current context.
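The context-dependent behaviour described in this abstract can be illustrated with a small state machine. This is only a sketch under stated assumptions: the states, commands, transitions, and the `DialogManager` class are hypothetical names invented for the example and do not come from the patent.

```python
# Hypothetical dialog states and the spoken commands allowed in each.
ALLOWED = {
    "idle": {"call", "redial"},
    "confirm": {"yes", "no"},
}

# Hypothetical transitions: (state, command) -> next state.
TRANSITIONS = {
    ("idle", "call"): "confirm",
    ("idle", "redial"): "confirm",
    ("confirm", "yes"): "idle",
    ("confirm", "no"): "idle",
}


class DialogManager:
    """Keeps the dialog context and uses it for prompts and recognition."""

    def __init__(self):
        self.state = "idle"

    def visual_prompt(self):
        """Commands to show on the display for the current context."""
        return sorted(ALLOWED[self.state])

    def select(self, candidates):
        """Pick the best-scoring candidate that is valid in this context.

        candidates: list of (word, score) pairs, higher score is better.
        Returns the chosen word, or None if no candidate fits the context.
        """
        valid = [(w, s) for w, s in candidates if w in ALLOWED[self.state]]
        if not valid:
            return None
        word = max(valid, key=lambda ws: ws[1])[0]
        self.state = TRANSITIONS[(self.state, word)]
        return word
```

In this sketch a higher-scoring but out-of-context word (for example "no" while idle) is ignored in favour of a lower-scoring word that the current state permits, and the same state drives the list of commands shown on the display.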

3. Granted patent

US06463413B1 Speech recognition training for small hardware devices
Status: In force
Publication No.: US06463413B1
Publication date: 2002-10-08
Application No.: US09295276
Filing date: 1999-04-20
Applicant: Ted H. Applebaum, Jean-Claude Junqua
Inventor: Ted H. Applebaum, Jean-Claude Junqua
IPC classification: G10L15/14
CPC classification: G10L15/30, G10L15/06, G10L15/187, G10L15/22, G10L2015/0638
Abstract: A distributed speech processing system for constructing speech recognition reference models that are to be used by a speech recognizer in a small hardware device, such as a personal digital assistant or cellular telephone. The speech processing system includes a speech recognizer residing on a first computing device and a speech model server residing on a second computing device. The speech recognizer receives speech training data and processes it into an intermediate representation of the speech training data. The intermediate representation is then communicated to the speech model server. The speech model server generates a speech reference model by using the intermediate representation of the speech training data and then communicates the speech reference model back to the first computing device for storage in a lexicon associated with the speech recognizer.

4. Granted patent

US06996527B2 Linear discriminant based sound class similarities with unit value normalization
Status: Expired
Publication No.: US06996527B2
Publication date: 2006-02-07
Application No.: US09915717
Filing date: 2001-07-26
Applicant: Robert C. Boman, Philippe R. Morin, Ted H. Applebaum
Inventor: Robert C. Boman, Philippe R. Morin, Ted H. Applebaum
IPC classification: G10L15/08
CPC classification: G10L15/02, G10L15/10
Abstract: A common requirement in automatic speech recognition is to recognize a set of words for any speaker without training the system for each new speaker. A speech recognition system is provided utilizing linear discriminant based phonetic similarities with inter-phonetic unit value normalization. Linear discriminant analysis is utilized using training data with both in-class and out-class sample training utterances for generating linear discriminant vectors for each of the phonetic units. The dot product of each linear discriminant vector and the time spectral pattern vectors generated from the input speech are computed. The resultant raw similarity vectors are then normalized utilizing normalization look-up tables for providing similarity vectors which are utilized by a word matcher for word recognition.
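The two scoring steps named in this abstract (a dot product per phonetic unit, then normalization through a look-up table) can be sketched as follows. The abstract does not say how the tables are laid out, so the step-function form used here, along with the function names and data shapes, is an assumption made purely for illustration.

```python
from bisect import bisect_right


def raw_similarities(discriminants, pattern):
    """Dot product of each unit's discriminant vector with the pattern vector.

    discriminants: dict mapping a phonetic unit to its discriminant vector.
    pattern: time-spectral pattern vector of the same length.
    """
    return {
        unit: sum(w * x for w, x in zip(vec, pattern))
        for unit, vec in discriminants.items()
    }


def normalize(raw, tables):
    """Map each raw score through that unit's look-up table.

    Assumed table layout: tables[unit] = (thresholds, values), where
    thresholds is sorted and values has one more entry than thresholds,
    so the table acts as a step function over the raw score.
    """
    normalized = {}
    for unit, score in raw.items():
        thresholds, values = tables[unit]
        normalized[unit] = values[bisect_right(thresholds, score)]
    return normalized
```

Giving each unit its own table is what puts scores from different phonetic units on a comparable scale before a word matcher consumes them.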

5. Granted patent

US06230129B1 Segment-based similarity method for low complexity speech recognizer
Status: In force
Publication No.: US06230129B1
Publication date: 2001-05-08
Application No.: US09199721
Filing date: 1998-11-25
Applicant: Philippe R. Morin, Ted H. Applebaum
Inventor: Philippe R. Morin, Ted H. Applebaum
IPC classification: G10L15/02
CPC classification: G10L15/10, G10L2015/025
Abstract: A digital word prototype is constructed using one or more speech utterances for a given spoken word or phrase. First, a phone model is used to derive phoneme similarity time series for each of a plurality of phonemes which represent the degree of similarity between the speech utterance and a set of standard phonemes contained in the phone model. Next, the phoneme similarity data is normalized in relation to a non-speech part of the input speech signal. The normalized phoneme similarity data is divided into segments, such that the sum of all normalized phoneme similarity values in a segment is equal for each segment. Next, a word model is constructed from the phoneme similarity data. To do so, within each segment, a summation value is determined by summing over speech frames each of the normalized phoneme similarity values associated with a particular phoneme. In this way, the word model is represented by a vector of summation values that compactly correlate to the normalized phoneme similarity data. Lastly, the results of the individually processed utterances for a given spoken word (i.e., the individual word models) are combined to produce a digital word prototype that electronically represents the given spoken word.
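The segmentation and summation steps of this abstract can be sketched roughly as below. The sketch assumes the similarity values are already normalized and non-negative, and it assigns each frame to a segment by the midpoint of its share of the cumulative similarity mass; that assignment rule and the function name `segment_word_model` are assumptions, not details taken from the patent. Because frames are discrete, the per-segment totals are only approximately equal in general.

```python
def segment_word_model(frames, num_segments):
    """Build a word model of per-segment, per-phoneme summation values.

    frames: list of frames, each a list of non-negative normalized
            phoneme similarity values (one value per phoneme).
    Returns num_segments vectors; entry [s][p] is the sum of phoneme p's
    similarity over the frames assigned to segment s.
    """
    num_phonemes = len(frames[0])
    total = sum(sum(frame) for frame in frames)
    model = [[0.0] * num_phonemes for _ in range(num_segments)]

    cumulative = 0.0
    for frame in frames:
        mass = sum(frame)
        # Place the frame by the midpoint of its slice of the total mass,
        # so each segment collects roughly total / num_segments.
        midpoint = cumulative + mass / 2.0
        if total > 0:
            seg = min(int(midpoint / total * num_segments), num_segments - 1)
        else:
            seg = 0
        for p, value in enumerate(frame):
            model[seg][p] += value
        cumulative += mass
    return model
```

The resulting model has a fixed size (segments times phonemes) regardless of utterance length, which is what makes it compact compared with keeping every frame.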

6. Granted patent

US5825977A Word hypothesizer based on reliably detected phoneme similarity regions
Status: Expired
Publication No.: US5825977A
Publication date: 1998-10-20
Application No.: US526718
Filing date: 1995-09-08
Applicant: Philippe R. Morin, Ted H. Applebaum
Inventor: Philippe R. Morin, Ted H. Applebaum
IPC classification: G10L15/02, G10L15/04, G10L15/10, G10L15/12, G10L5/06, G10L9/00
CPC classification: G10L15/04, G10L15/10, G10L15/12
Abstract: The word hypothesizer reduces the search space for more computationally expensive word recognizers. Each periodic interval of input speech is represented as a vector of phoneme similarity values from which the high similarity regions are selected and parameterized. The hypothesizer computes alignment parameters for each of a plurality of previously stored word prototypes, vis-a-vis the high similarity regions of the input speech utterance. Those word prototypes having the highest recognition scores are selected as word candidates for the fine match recognizer.
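The first step in this abstract, selecting and parameterizing high similarity regions, might look like the sketch below for a single phoneme's similarity time series. The abstract does not specify the selection rule; a fixed threshold is assumed here, and the parameter names (`left`, `right`, `peak_frame`, `peak_height`) are illustrative only. The alignment and scoring stages are not shown.

```python
def high_similarity_regions(series, threshold):
    """Find contiguous runs of frames at or above a threshold.

    series: list of similarity values for one phoneme, one per frame.
    Returns one dict per region with its frame span and peak.
    """
    regions = []
    start = None
    # A trailing sentinel below any threshold closes a region that
    # runs to the last frame.
    for i, value in enumerate(series + [float("-inf")]):
        if value >= threshold and start is None:
            start = i
        elif value < threshold and start is not None:
            window = series[start:i]
            peak = max(window)
            regions.append({
                "left": start,
                "right": i - 1,
                "peak_frame": start + window.index(peak),
                "peak_height": peak,
            })
            start = None
    return regions
```

Reducing each phoneme track to a handful of regions like these is what lets a later stage compare an utterance against stored word prototypes without touching every frame.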

7. Granted patent

US5822728A Multistage word recognizer based on reliably detected phoneme similarity regions
Status: Expired
Publication No.: US5822728A
Publication date: 1998-10-13
Application No.: US526746
Filing date: 1995-09-08
Applicant: Ted H. Applebaum, Philippe R. Morin
Inventor: Ted H. Applebaum, Philippe R. Morin
IPC classification: G10L15/02, G10L15/08, G10L7/08
CPC classification: G10L15/08, G10L15/02
Abstract: The multistage word recognizer uses a word reference representation based on reliably detected peaks of phoneme similarity values. The word reference representation captures the basic features of the words by targets that describe the location and shape of stable peaks of phoneme similarity values. The first stage of the word hypothesizer represents each reference word with statistical information on the number of high similarity regions over a predefined number of time intervals. The second stage represents each word by a prototype that consists of a series of phoneme targets and global statistics, namely the average word duration and average match rate. These represent the degree of fit of the word prototype to its training data. Word recognition scores generated in the two stages are converted to dimensionless normalized values and combined by averaging for use in selecting the most probable word candidates.
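The final step of this abstract, converting the two stages' scores to dimensionless values and averaging them, can be sketched as follows. The abstract does not state which normalization is used; a z-score over the candidate set is assumed here as one common way to remove units, and the function names are hypothetical.

```python
from statistics import mean, pstdev


def zscores(scores):
    """Convert a {word: score} dict to dimensionless z-scores."""
    values = list(scores.values())
    mu, sigma = mean(values), pstdev(values)
    return {w: (s - mu) / sigma if sigma else 0.0 for w, s in scores.items()}


def combine(stage1, stage2):
    """Average the normalized scores of the two stages for each word."""
    z1, z2 = zscores(stage1), zscores(stage2)
    return {w: (z1[w] + z2[w]) / 2.0 for w in stage1}


def best_candidates(stage1, stage2, n=1):
    """Return the n words with the highest combined score."""
    combined = combine(stage1, stage2)
    return sorted(combined, key=combined.get, reverse=True)[:n]
```

Normalizing before averaging matters because the two stages score on different scales; without it, whichever stage produces larger numbers would dominate the combination.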

8. Granted patent

US5684925A Speech representation by feature-based word prototypes comprising phoneme targets having reliable high similarity
Status: Expired
Publication No.: US5684925A
Publication date: 1997-11-04
Application No.: US526719
Filing date: 1995-09-08
Applicant: Philippe R. Morin, Ted H. Applebaum
Inventor: Philippe R. Morin, Ted H. Applebaum
IPC classification: G10L15/06, G10L5/06
CPC classification: G10L15/063
Abstract: Digitized speech utterances are converted into phoneme similarity data and regions of high similarity are then extracted and used in forming the word prototype. By alignment across speakers, unreliable high phoneme similarity regions are eliminated. Word prototype targets are then constructed comprising the following parameters: the phoneme symbol, the average peak height of the phoneme similarity score, the average peak location and the left and right frame locations. For each target a statistical weight is assigned representing the percentage of occurrences the particular high similarity region occurred across all speakers. The word prototype is feature-based allowing a robust speech representation to be constructed without the need for frame-by-frame analysis.
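The target parameters listed in this abstract can be sketched as a small aggregation over regions that have already been aligned across speakers. The alignment itself is not shown. The `Region` class, the grouping key, the `min_weight` cutoff for discarding unreliable regions, and the choice to average the left and right frame locations are all assumptions made for the example.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Region:
    """One high similarity region observed in one speaker's utterance."""
    phoneme: str
    peak_height: float
    peak_frame: float
    left_frame: float
    right_frame: float


def build_targets(aligned, num_speakers, min_weight=0.5):
    """Turn aligned regions into weighted word prototype targets.

    aligned: dict mapping a hypothetical alignment key to the list of
             regions (at most one per speaker) aligned to that key.
    Targets seen in fewer than min_weight of the speakers are dropped
    as unreliable.
    """
    targets = []
    for regions in aligned.values():
        weight = len(regions) / num_speakers
        if weight < min_weight:
            continue
        targets.append({
            "phoneme": regions[0].phoneme,
            "peak_height": mean(r.peak_height for r in regions),
            "peak_frame": mean(r.peak_frame for r in regions),
            "left_frame": mean(r.left_frame for r in regions),
            "right_frame": mean(r.right_frame for r in regions),
            "weight": weight,
        })
    return targets
```

Each target is a few numbers per stable peak, so a word prototype built this way stays small and does not require frame-by-frame comparison at recognition time.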

9. Patent application

US20140153747A1 SYSTEM AND METHOD FOR PAIRING A COMMAND DEVICE INCORPORATING A MICROPHONE TO A REMOTELY CONTROLLED MEDICAL SYSTEM
Status: In force
Publication No.: US20140153747A1
Publication date: 2014-06-05
Application No.: US13693801
Filing date: 2012-12-04
Applicant: Matteo Contolini, Ted H. Applebaum
Inventor: Matteo Contolini, Ted H. Applebaum
IPC classification: H04R3/00
CPC classification: H04R3/00, A61B17/00, A61B2017/00203, G06F19/00, G06F19/34, G06F19/3418, G10L2015/223, G16H40/63
Abstract: The system includes a remotely controlled medical system having a controller in communication with a command device incorporating a microphone. The system further includes a medical device operable by the controller and a sound generator coupled to the controller. The command device incorporating a microphone is paired to the controller for operating the medical device by detecting the sound. The command device incorporating a microphone transmits a signal in response to the sound to the controller to verify the pairing of the command device incorporating a microphone to the controller such that the controller will only operate the medical device in response to a command issued near the sound generator.

10. Granted patent

US06845358B2 Prosody template matching for text-to-speech systems
Status: In force
Publication No.: US06845358B2
Publication date: 2005-01-18
Application No.: US09755699
Filing date: 2001-01-05
Applicant: Nicholas Kibre, Ted H. Applebaum
Inventor: Nicholas Kibre, Ted H. Applebaum
IPC classification: G10L13/04, G10L13/08, G06F17/21, G10L13/06
CPC classification: G10L13/10
Abstract: A prosody matching template in the form of a tree structure stores indices which point to lookup table and template information prescribing pitch and duration values that are used to add inflection to the output of a text-to-speech synthesizer. The lookup module employs a search algorithm that explores each branch of the tree, assigning penalty scores based on whether the syllable represented by a node of the tree does or does not match the corresponding syllable of the target word. The path with the lowest penalty score is selected as the index into the prosody template table. The system will add nodes by cloning existing nodes in cases where it is not possible to find a one-to-one match between the number of syllables in the target word and the number of nodes in the tree.
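The penalty-scored tree search in this abstract can be sketched as a simple recursive walk. Everything concrete here is an assumption: the tree is a nested dict whose edges are labelled with a syllable feature (stressed "S" or unstressed "U"), each mismatch costs a penalty of 1, and leaves carry a hypothetical `index` into the template table. The node-cloning step for words whose syllable count does not fit the tree is not shown.

```python
def search(node, syllables, depth=0, penalty=0):
    """Explore every branch and return (penalty, template_index) of the best path.

    node: dict with optional "children" (label -> child node) and, at a
          leaf, an "index" into the prosody template table.
    syllables: list of labels describing the target word's syllables.
    """
    if depth == len(syllables):
        if "index" in node:
            return (penalty, node["index"])
        return (float("inf"), None)

    best = (float("inf"), None)
    for label, child in node.get("children", {}).items():
        # A mismatching syllable adds to the path's penalty.
        cost = 0 if label == syllables[depth] else 1
        candidate = search(child, syllables, depth + 1, penalty + cost)
        if candidate[0] < best[0]:
            best = candidate
    return best


# Hypothetical tree covering three two-syllable stress patterns.
TREE = {
    "children": {
        "S": {"children": {"U": {"index": 0}, "S": {"index": 1}}},
        "U": {"children": {"S": {"index": 2}}},
    }
}
```

With this tree, a target pattern that has no exact path (for example two unstressed syllables) still resolves to the template whose path accumulates the fewest mismatches; ties go to the branch visited first.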
