专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

71. 发明授权

US5182773A Speaker-independent label coding apparatus 失效
标题翻译：扬声器独立标签编码设备
公开(公告)号：US5182773A
公开(公告)日：1993-01-26
申请号：US673810
申请日：1991-03-22
申请人： Lalit R. Bahl , Michael A. Picheny , David Nahamoo , Peter V. de Souza
发明人： Lalit R. Bahl , Michael A. Picheny , David Nahamoo , Peter V. de Souza
IPC分类号： G10L19/00 , G10L15/02 , G10L19/02 , H03M7/30
CPC分类号： H03M7/3082 , G10L19/038
摘要： The present invention is related to speech recognition and particularly to a new type of vector quantizer and a new vector quantization technique in which the error rate of associating a sound with an incoming speech signal is drastically reduced. To achieve this end, the present invention technique groups the feature vectors in a space into different prototypes at least two of which represent a class of sound. Each of the prototypes may in turn have a number of subclasses or partitions. Each of the prototypes and their subclasses may be assigned respective identifying values. To identify an incoming speech feature vector, at least one of the feature values of the incoming feature vector is compared with the different values of the respective prototypes, or the subclasses of the prototypes. The class of sound whose group of prototypes, or at least one of the prototypes, whose combined value most closely matches the value of the feature value of the feature vector is deemed to be the class corresponding to the feature vector. The feature vector is then labeled with the identifier associated with that class.

72. 发明授权

US08930182B2 Voice transformation with encoded information 有权
标题翻译：具有编码信息的语音变换
公开(公告)号：US08930182B2
公开(公告)日：2015-01-06
申请号：US13049924
申请日：2011-03-17
申请人： Shay Ben-David , Ron Hoory , Zvi Kons , David Nahamoo
发明人： Shay Ben-David , Ron Hoory , Zvi Kons , David Nahamoo
IPC分类号： G10L21/00 , G10L25/90 , G10L25/93 , G10L21/003 , G10L19/018
CPC分类号： G10L21/003 , G10L19/018
摘要： Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.
摘要翻译：提供语音转换的方法，系统和计算机程序产品。该方法包括使用变换参数来变换源语言，以及使用隐写术对输入语音中的变换参数对信息进行编码，其中可以使用输出语音和关于变换参数的信息来重构源语音。还提供了一种用于重建语音变换的方法，包括：接收语音转换系统的输出语音，其中输出语音是使用隐写术编码关于变换参数的信息的变换语音; 提取变换参数信息; 并执行输出语音的逆变换以获得原始源语音的近似。

73. 发明授权

US08082153B2 Conversational computing via conversational virtual machine 有权
标题翻译：通过对话虚拟机进行会话计算
公开(公告)号：US08082153B2
公开(公告)日：2011-12-20
申请号：US12544473
申请日：2009-08-20
申请人： Daniel Coffman , Liam D. Comerford , Steven DeGennaro , Edward A. Epstein , Ponani Gopalakrishnan , Stephane H. Maes , David Nahamoo
发明人： Daniel Coffman , Liam D. Comerford , Steven DeGennaro , Edward A. Epstein , Ponani Gopalakrishnan , Stephane H. Maes , David Nahamoo
IPC分类号： G10L15/28 , G06F3/16
CPC分类号： H04M3/50 , G06F17/30899 , G10L15/22 , G10L15/285 , G10L2015/228 , H04L67/02 , H04M1/72561 , H04M3/42204 , H04M3/44 , H04M3/493 , H04M3/4931 , H04M3/4936 , H04M3/4938 , H04M7/00 , H04M2201/40 , H04M2201/60 , H04M2203/355 , H04M2250/74
摘要： A method for conversational computing includes executing code embodying a conversational virtual machine, registering a plurality of input/output resources with a conversational kernel, providing an interface between a plurality of active applications and the conversational kernel processing input/output data, receiving input queries and input events of a multi-modal dialog across a plurality of user interface modalities of the plurality of active applications, generating output messages and output events of the multi-modal dialog in connection with the plurality of active applications, managing, by the conversational kernel, a context stack associated with the plurality of active applications and the multi-modal dialog to transform the input queries into application calls for the plurality of active applications and convert the output messages into speech, wherein the context stack accumulates a context of each of the plurality of active applications.
摘要翻译：一种用于对话计算的方法包括执行体现对话虚拟机的代码，用对话内核注册多个输入/输出资源，提供多个活动应用与对话内核处理输入/输出数据之间的接口，接收输入查询和通过多个活动应用程序的多个用户界面模式输入多模态对话的事件，生成与多个活动应用相关联的多模式对话的输出消息和输出事件，由对话内核管理，与所述多个活动应用相关联的上下文栈以及将所述输入查询转换为所述多个活动应用的应用调用并将所述输出消息转换为语音的所述多模态对话，其中，所述上下文堆栈累积所述多个活动应用中的每一个的上下文的活跃应用。

74. 发明授权

US07714912B2 Intelligent mirror 有权
标题翻译：智能镜
公开(公告)号：US07714912B2
公开(公告)日：2010-05-11
申请号：US11626406
申请日：2007-01-24
申请人： Alexander Faisman , Genady Grabarnik , David Nahamoo , Apostol Ivanov Natsev , Ganesh N. Ramaswamy
发明人： Alexander Faisman , Genady Grabarnik , David Nahamoo , Apostol Ivanov Natsev , Ganesh N. Ramaswamy
IPC分类号： H04N5/262 , G09B25/00 , G06F17/30 , G09G5/00
CPC分类号： H04N7/181 , A45D42/08 , G06T11/00 , H04N2005/2726
摘要： An intelligent imaging system, includes an image generator that projects multiple angle views of a user, a plurality of cameras for capturing a plurality of images of the user, an image processing unit, a style advisor, and a control mechanism.
摘要翻译：一种智能成像系统，包括投影用户的多个角度视图的图像生成器，用于捕获用户的多个图像的多个照相机，图像处理单元，风格顾问和控制机构。

75. 发明申请

US20080174682A1 INTELLIGENT MIRROR 有权
标题翻译：智能镜
公开(公告)号：US20080174682A1
公开(公告)日：2008-07-24
申请号：US11626406
申请日：2007-01-24
申请人： Alexander Faisman , Genady Grabarnik , David Nahamoo , Apostol Ivanov Natsev , Ganesh N. Ramaswamy
发明人： Alexander Faisman , Genady Grabarnik , David Nahamoo , Apostol Ivanov Natsev , Ganesh N. Ramaswamy
IPC分类号： H04N7/00
CPC分类号： H04N7/181 , A45D42/08 , G06T11/00 , H04N2005/2726
摘要： An intelligent imaging system, includes an image generator that projects multiple angle views of a user, a plurality of cameras for capturing a plurality of images of the user, an image processing unit, a style advisor, and a control mechanism.
摘要翻译：一种智能成像系统，包括投影用户的多个角度视图的图像生成器，用于捕获用户的多个图像的多个照相机，图像处理单元，风格顾问和控制机构。

76. 发明授权

US5680509A Method and apparatus for estimating phone class probabilities a-posteriori using a decision tree 失效
标题翻译：用于使用决策树估计电话类概率的方法和装置
公开(公告)号：US5680509A
公开(公告)日：1997-10-21
申请号：US312584
申请日：1994-09-27
申请人： Ponani S. Gopalakrishnan , David Nahamoo , Mukund Padmanabhan , Michael Alan Picheny
发明人： Ponani S. Gopalakrishnan , David Nahamoo , Mukund Padmanabhan , Michael Alan Picheny
IPC分类号： G10L15/06 , G10L15/08 , G10L5/06
CPC分类号： G10L15/063 , G10L15/08
摘要： A method and apparatus for estimating the probability of phones, a-posteriori, in the context of not only the acoustic feature at that time, but also the acoustic features in the vicinity of the current time, and its use in cutting down the search-space in a speech recognition system. The method constructs and uses a decision tree, with the predictors of the decision tree being the vector-quantized acoustic feature vectors at the current time, and in the vicinity of the current time. The process starts with an enumeration of all (predictor, class) events in the training data at the root node, and successively partitions the data at a node according to the most informative split at that node. An iterative algorithm is used to design the binary partitioning. After the construction of the tree is completed, the probability distribution of the predicted class is stored at all of its terminal leaves. The decision tree is used during the decoding process by tracing a path down to one of its leaves, based on the answers to binary questions about the vector-quantized acoustic feature vector at the current time and its vicinity.
摘要翻译：在不仅在当时的声学特征以及当前时间附近的声学特征的上下文中估计电话的概率的方法和装置，以及其用于减少搜索 - 语音识别系统中的空间。该方法构造并使用决策树，其中决策树的预测变量是当前时间和当前时间附近的矢量量化的声学特征向量。该过程从在根节点的训练数据中的所有（预测器，类）事件的枚举开始，并且根据该节点处的最多信息拆分在节点处依次划分数据。迭代算法用于设计二进制分区。树完成后，预测类的概率分布存储在其所有终端叶上。基于对当前时间及其附近的向量量化声学特征向量的二进制问题的答案，在解码过程中使用决策树通过跟踪到其叶子之一的路径。

77. 发明授权

US5544261A Automatic handwriting recognition using both static and dynamic parameters 失效
公开(公告)号：US5544261A
公开(公告)日：1996-08-06
申请号：US450556
申请日：1995-05-25
申请人： Jerome R. Bellegarda , David Nahamoo , Krishna S. Nathan
发明人： Jerome R. Bellegarda , David Nahamoo , Krishna S. Nathan
IPC分类号： G06K9/46 , G06K9/03 , G06K9/22 , G06K9/62 , G06K9/68 , G06K9/00
CPC分类号： G06K9/6293 , G06K9/00416 , G06K9/00429
摘要： Methods and apparatus are disclosed for recognizing handwritten characters in response to an input signal from a handwriting transducer. A feature extraction and reduction procedure is disclosed that relies on static or shape information, wherein the temporal order in which points are captured by an electronic tablet may be disregarded. A method of the invention generates and processes the tablet data with three independent sets of feature vectors which encode the shape information of the input character information. These feature vectors include horizontal (x-axis) and vertical (y-axis) slices of a bit-mapped image of the input character data, and an additional feature vector to encode an absolute y-axis displacement from a baseline of the bit-mapped image. It is shown that the recognition errors that result from the spatial or static processing are quite different from those resulting from temporal or dynamic processing. Furthermore, it is shown that these differences complement one another. As a result, a combination of these two sources of feature vector information provides a substantial reduction in an overall recognition error rate. Methods to combine probability scores from dynamic and the static character models are also disclosed.

78. 发明授权

US5539839A Automatic handwriting recognition using both static and dynamic parameters 失效
公开(公告)号：US5539839A
公开(公告)日：1996-07-23
申请号：US450558
申请日：1995-05-25
申请人： Jerome R. Bellegarda , David Nahamoo , Krishna S. Nathan
发明人： Jerome R. Bellegarda , David Nahamoo , Krishna S. Nathan
IPC分类号： G06K9/46 , G06K9/03 , G06K9/22 , G06K9/62 , G06K9/68 , G06K9/00
CPC分类号： G06K9/6293 , G06K9/00416 , G06K9/00429
摘要： Methods and apparatus are disclosed for recognizing handwritten characters in response to an input signal from a handwriting transducer. A feature extraction and reduction procedure is disclosed that relies on static or shape information, wherein the temporal order in which points are captured by an electronic tablet may be disregarded. A method of the invention generates and processes the tablet data with three independent sets of feature vectors which encode the shape information of the input character information. These feature vectors include horizontal (x-axis) and vertical (y-axis) slices of a bit-mapped image of the input character data, and an additional feature vector to encode an absolute y-axis displacement from a baseline of the bit-mapped image. It is shown that the recognition errors that result from the spatial or static processing are quite different from those resulting from temporal or dynamic processing. Furthermore, it is shown that these differences complement one another. As a result, a combination of these two sources of feature vector information provides a substantial reduction in an overall recognition error rate. Methods to combine probability scores from dynamic and the static character models are also disclosed.

79. 发明授权

US5280562A Speech coding apparatus with single-dimension acoustic prototypes for a speech recognizer 失效
标题翻译：具有用于语音识别器的单维声学原型的语音编码装置
公开(公告)号：US5280562A
公开(公告)日：1994-01-18
申请号：US770495
申请日：1991-10-03
申请人： Lalit R. Bahl , Jerome R. Bellegarda , Edward A. Epstein , John M. Lucassen , David Nahamoo , Michael A. Picheny
发明人： Lalit R. Bahl , Jerome R. Bellegarda , Edward A. Epstein , John M. Lucassen , David Nahamoo , Michael A. Picheny
IPC分类号： G10L19/00 , G10L15/02 , G10L19/02 , H03M7/30 , G10L9/02
CPC分类号： G10L19/038 , H03M7/3082
摘要： In speech recognition and speech coding, the values of at least two features of an utterance are measured during a series of time intervals to produce a series of feature vector signals. A plurality of single-dimension prototype vector signals having only one parameter value are stored. At least two single-dimension prototype vector signals having parameter values representing first feature values, and at least two other single-dimension prototype vector signals have parameter values representing second feature values. A plurality of compound-dimension prototype vector signals have unique identification values and comprise one first-dimension and one second-dimension prototype vector signal. At least two compound-dimension prototype vector signals comprise the same first-dimension prototype vector signal. The feature values of each feature vector signal are compared to the parameter values of the compound-dimension prototype vector signals to obtain prototype match scores. The identification values of the compound-dimension prototype vector signals having the best prototype match scores for the feature vectors signals are output as a sequence of coded representations of an utterance to be recognized. A match score, comprising an estimate of the closeness of a match between a speech unit and the sequence of coded representations of the utterance, is generated for each of a plurality of speech units. At least one speech subunit, of one or more best candidate speech units having the best match scores, is displayed.
摘要翻译：在语音识别和语音编码中，在一系列时间间隔期间测量话音的至少两个特征的值，以产生一系列特征向量信号。存储仅具有一个参数值的多个单维原型矢量信号。具有表示第一特征值的参数值和至少两个其它单维原型矢量信号的至少两个单维原型矢量信号具有表示第二特征值的参数值。多个复合尺寸原型矢量信号具有唯一的识别值，并且包括一个第一维和一个第二维原型矢量信号。至少两个复合维度原型矢量信号包括相同的第一维原型矢量信号。将每个特征向量信号的特征值与化合物维度原型矢量信号的参数值进行比较，以获得原型匹配分数。具有特征矢量信号的具有最佳原型匹配分数的复合维度原型矢量信号的识别值被输出为将被识别的话语的编码表示的序列。针对多个语音单元中的每一个生成包括语音单元与语音编码表示序列之间的匹配的接近度的估计的匹配分数。显示具有最佳匹配分数的一个或多个最佳候选语音单元的至少一个语音子单元。

80. 发明授权

US5263117A Method and apparatus for finding the best splits in a decision tree for a language model for a speech recognizer 失效
标题翻译：在用于语音识别器的语言模型的决策树中找到最佳分割的方法和装置
公开(公告)号：US5263117A
公开(公告)日：1993-11-16
申请号：US427420
申请日：1989-10-26
申请人： Arthur J. Nadas , David Nahamoo
发明人： Arthur J. Nadas , David Nahamoo
IPC分类号： G10L11/00 , G06T7/00 , G10L15/10 , G10L15/18 , G10L9/02
CPC分类号： G10L15/197
摘要： A method and apparatus for finding the best or near best binary classification of a set of observed events, according to a predictor feature X so as to minimize the uncertainty in the value of a category feature Y. Each feature has three or more possible values. First, the predictor feature value and the category feature value of each event is measured. The events are then split, arbitrarily, into two sets of predictor feature values. From the two sets of predictor feature values, an optimum pair of sets of category feature values is found having the lowest uncertainty in the value of the predictor feature. From the two optimum sets of category feature values, an optimum pair of sets is found having the lowest uncertainty in the value of the category feature. An event is then classified according to whether its predictor feature value is a member of a set of optimal predictor feature values.

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式