专利快速检索-快速检索全球专利，免费商用专利数据库-IPRDB

1. 发明授权

US06721698B1 Speech recognition from overlapping frequency bands with output data reduction 失效
标题翻译：具有输出数据缩减的重叠频带的语音识别
公开(公告)号：US06721698B1
公开(公告)日：2004-04-13
申请号：US09698773
申请日：2000-10-27
申请人： Ramalingam Hariharan , Juha Häkkinen , Imre Kiss , Jilei Tian , Olli Viikki
发明人： Ramalingam Hariharan , Juha Häkkinen , Imre Kiss , Jilei Tian , Olli Viikki
IPC分类号： G10L1902
CPC分类号： G10L15/02
摘要： A speech recognition feature extractor includes a time-to-frequency domain transformer for generating spectral values in the frequency domain from a speech signal; a partitioning means for generating a first set and an additional set of spectral values in the frequency domain; a first feature generator for generating a first group of speech features using the first set of spectral values; a additional feature generator for generating an additional group of speech features using the additional set of spectral values; the feature generators arranged to operate in parallel, an assembler for assembling an output set of speech features from at least one speech feature from the first group of speech features and at least one speech feature from the additional group of speech features, and an anti-aliasing and sampling rate reduction block, where the first and the additional set of spectral values comprise at least one common spectral value.
摘要翻译：语音识别特征提取器包括用于从语音信号产生频域中的频谱值的时间 - 频域变换器;用于在频域中产生第一组和附加频谱值集合的分割装置;第一特征生成器用于使用所述第一组频谱值来生成第一组语音特征;附加特征生成器，用于使用附加的频谱值集合生成附加语音特征组; 布置成并行操作的特征发生器，用于从来自第一语音特征组的至少一个语音特征和来自附加语音特征组的至少一个语音特征组合语音特征的输出集合的汇编器，混叠和采样率降低块，其中第一和附加频谱值集合包括至少一个公共频谱值。

2. 发明申请

US20080154600A1 System, Method, Apparatus and Computer Program Product for Providing Dynamic Vocabulary Prediction for Speech Recognition 审中-公开
标题翻译：系统，方法，设备和计算机程序产品为语音识别提供动态词汇预测
公开(公告)号：US20080154600A1
公开(公告)日：2008-06-26
申请号：US11614159
申请日：2006-12-21
申请人： Jilei Tian , Jussi Leppanen , Imre Kiss
发明人： Jilei Tian , Jussi Leppanen , Imre Kiss
IPC分类号： G10L15/00
CPC分类号： G10L15/083
摘要： An apparatus for providing dynamic vocabulary prediction for setting up a speech recognition network of resource constrained portable devices may include a recognition network element. The recognition network element may be configured to determine a confidence measure for each candidate recognized word for a current word to be recognized. The recognition network element may also be configured to select a subset of candidate recognized words as selected candidate words based on the confidence measure of each one of the candidate recognized words, and determine a recognition network for a next word to be recognized, the recognition network including likely follower words for each of the selected candidate words, e.g. using language model and highly frequently used words.
摘要翻译：用于提供用于建立资源约束的便携式设备的语音识别网络的动态词汇预测的装置可以包括识别网络元件。识别网络元件可以被配置为为要识别的当前字确定每个候选识别词的置信度量。识别网元还可以被配置为基于每个候选识别字的置信度来选择候选识别字的子集作为所选择的候选字，并且确定要被识别的下一个字的识别网络，识别网络包括每个所选候选词的可能的跟随词，例如使用语言模型和高度常用的单词。

3. 发明申请

US20060235685A1 Framework for voice conversion 审中-公开
标题翻译：语音转换框架
公开(公告)号：US20060235685A1
公开(公告)日：2006-10-19
申请号：US11107344
申请日：2005-04-15
申请人： Jani Nurminen , Jilei Tian , Imre Kiss
发明人： Jani Nurminen , Jilei Tian , Imre Kiss
IPC分类号： G10L15/26
CPC分类号： G10L13/033 , G10L19/0018 , G10L2021/0135
摘要： This invention relates to a framework for converting a source speech signal associated with a source voice into a target speech signal that is a representation of the source speech signal associated with a target voice. The source speech signal is encoded into samples of encoding parameters, wherein the encoding comprises the step of segmenting the source speech signal into segments based on characteristics of the source speech signal. The samples of the encoding parameters, or a converted representation of the samples of the encoding parameters are then decoded to obtain the target speech signal. Therein, in the encoding, the decoding or in a separate step, samples of parameters related to the source speech signal are converted into samples of parameters related to the target speech signal. Therein, at least one of the encoding and the converting depends on the segments of the source speech signal.
摘要翻译：本发明涉及一种用于将与源语音相关联的源语音信号转换成作为与目标语音相关联的源语音信号的表示的目标语音信号的框架。源语音信号被编码为编码参数的采样，其中编码包括基于源语音信号的特性将源语音信号分割成段的步骤。然后对编码参数的样本或编码参数的样本的转换表示进行解码以获得目标语音信号。其中，在编码，解码或单独的步骤中，与源语音信号相关的参数样本被转换成与目标语音信号相关的参数的采样。其中，编码和转换中的至少一个取决于源语音信号的段。

4. 发明授权

US09978365B2 Method and system for providing a voice interface 有权
公开(公告)号：US09978365B2
公开(公告)日：2018-05-22
申请号：US12263012
申请日：2008-10-31
申请人： Mark R. Adler , Imre Kiss , Joseph H. Polifroni , Tao Wu
发明人： Mark R. Adler , Imre Kiss , Joseph H. Polifroni , Tao Wu
IPC分类号： G06F3/16 , G10L15/22 , G10L13/027 , G10L15/18
CPC分类号： G10L15/22 , G06F3/167 , G10L13/027 , G10L15/1822 , G10L2015/228
摘要： A classifier voice interface of a user terminal may receive a query, may parse the query to identify an attribute, and may process the query to select a first domain-specific voice interface of a plurality of domain-specific voice interfaces based on the attribute, wherein each of the domain-specific voice interfaces comprises specialized information to process queries of different types. The classifier voice interface may further instruct the first domain-specific voice interface to process the query.

5. 发明申请

US20060293889A1 Error correction for speech recognition systems 审中-公开
标题翻译：语音识别系统的纠错
公开(公告)号：US20060293889A1
公开(公告)日：2006-12-28
申请号：US11169277
申请日：2005-06-27
申请人： Imre Kiss , Jussi Leppanen
发明人： Imre Kiss , Jussi Leppanen
IPC分类号： G10L15/26
CPC分类号： G10L15/22 , G10L2015/0631
摘要： Words in a sequence of words that is obtained from speech recognition of an input speech sequence are presented to a user, and at least one of the words in the sequence of words is replaced, in case it has been selected by a user for correction. Words with a low recognition confidence value are emphasized; alternative word candidates for the at least one selected word are ordered according to an ordering criterion; after replacing a word, an order of alternative word candidates for neighboring words in the sequence is updated; the replacement word is derived from a spoken representation of the at least one selected word by speech recognition with a limited vocabulary; and the word that replaces the at least one selected word is derived from a spoken and spelled representation of the at least one selected word.
摘要翻译：在输入语音序列的语音识别中获得的单词序列中的单词被呈现给用户，并且在由用户选择用于校正的情况下，替换单词序列中的单词中的至少一个。强调具有低识别置信度值的词语; 根据排序标准对至少一个所选择的单词的替代单词候选进行排序; 在替换单词之后，更新序列中相邻单词的替代单词候选的顺序; 所述替换单词通过具有有限词汇的语音识别从所述至少一个所选择的单词的口语表示中导出; 并且替换所述至少一个所选择的单词的单词从所述至少一个所选择的单词的口语和拼写表示中得出。

6. 发明授权

US06175641B1 Detector for recognizing the living character of a finger in a fingerprint recognizing apparatus 失效
标题翻译：用于在指纹识别装置中识别手指的生命特征的检测器
公开(公告)号：US06175641B1
公开(公告)日：2001-01-16
申请号：US09051154
申请日：1998-04-02
申请人： P{acute over (e)}ter Kall{acute over (o)} , Imre Kiss , Andr{acute over (a)}s Podmaniczky , J{acute over (a)}nos T{acute over (a)}losi
发明人： P{acute over (e)}ter Kall{acute over (o)} , Imre Kiss , Andr{acute over (a)}s Podmaniczky , J{acute over (a)}nos T{acute over (a)}losi
IPC分类号： G06K920
CPC分类号： G06K9/00013 , A61B5/1172 , G06K9/0012
摘要： Detector for recognizing the living character of a finger which is arranged in a fingerprint recognizing apparatus and the detector is in contact with a print area (2) of the living finger constituting a print forming element (1) and the apparatus comprises for the examination of the print area (2) a print detector (5) which has a print imaging surface (4) partially covered by the print area (2). The detector comprises an electrode system (3) made of an electrically conductive material and sensing the presence of the print forming element (1), and an electrical evaluation unit coupled through electrical contacts (10) to the electrode system (3), the unit senses the change in state in the electrode system (3) caused by the proximity of the print forming element (1). The electrode system (3) is arranged on a portion of the print detector (5) covered by the print area (2) and it is coupled to the print imaging surface (4).
摘要翻译：用于识别布置在指纹识别装置中的手指的生命特征的检测器，并且检测器与构成打印形成元件（1）的生命的打印区域（2）接触，并且该装置包括用于检查打印区域（2）具有由打印区域（2）部分地覆盖的打印成像表面（4）的打印检测器（5）。检测器包括由导电材料制成并感测印刷形成元件（1）的存在的电极系统（3），以及通过电触点（10）耦合到电极系统（3）的电学评估单元，该单元感测由印刷形成元件（1）的靠近引起的电极系统（3）中的状态变化。电极系统（3）布置在由打印区域（2）覆盖的打印检测器（5）的一部分上，并且其连接到打印成像表面（4）。

7. 发明授权

US08355913B2 Speech recognition with adjustable timeout period 失效
标题翻译：具有可调节超时时间的语音识别
公开(公告)号：US08355913B2
公开(公告)日：2013-01-15
申请号：US11556227
申请日：2006-11-03
申请人： Imre Kiss
发明人： Imre Kiss
IPC分类号： G10L15/22 , G10L15/26
CPC分类号： G10L15/26
摘要： Input of dictated information in an information processing apparatus is controlled. Utterances of speech are detected and interpreted as words. Word by word confirmation of the interpreted words is detected, the confirmation being associated with an adjustable timeout period. The timeout period may be adjusted according to a number of different measures, including an average time needed for confirmation, an average success rate of dictation, by the pace of dictation as performed by a user, and by a user's history based on statistics from previously performed dictation procedures.
摘要翻译：控制信息处理装置中的规定信息的输入。语言的语言被检测和解释为单词。检测到解码字的单词确认，该确认与可调整的超时周期相关联。可以根据多个不同的措施来调整超时时间，包括用于确认的平均时间，听写的平均成功率，用户执行的听写速度，以及基于之前的统计的用户历史执行听写程序。

8. 发明申请

US20100114944A1 METHOD AND SYSTEM FOR PROVIDING A VOICE INTERFACE 有权
标题翻译：提供语音接口的方法和系统
公开(公告)号：US20100114944A1
公开(公告)日：2010-05-06
申请号：US12263012
申请日：2008-10-31
申请人： Mark R. Adler , Imre Kiss , Joseph H. Polifroni , Tao Wu
发明人： Mark R. Adler , Imre Kiss , Joseph H. Polifroni , Tao Wu
IPC分类号： G06F17/30
CPC分类号： G10L15/22 , G06F3/167 , G10L13/027 , G10L15/1822 , G10L2015/228
摘要： Methods and systems for providing a voice interface are disclosed. A classifier voice interface of a user terminal may receive a query, may parse the query to identify an attribute, and may process the query to select a first domain-specific voice interface of a plurality of domain-specific voice interface based on the attribute, wherein each of the domain-specific voice interface comprises specialized information to process queries of different types. The classifier voice interface may further instruct the first domain-specific voice interface to process the query.
摘要翻译：公开了用于提供语音接口的方法和系统。用户终端的分类器语音接口可以接收查询，可以解析查询以识别属性，并且可以基于该属性处理查询以选择多个域专用语音接口的第一特定于语言的语音接口，其中每个域专用语音接口包括用于处理不同类型的查询的专用信息。分类器语音接口可以进一步指示第一域专用语音接口来处理查询。

9. 发明申请

US20070004462A1 Mobile communication terminal 失效
标题翻译：移动通信终端
公开(公告)号：US20070004462A1
公开(公告)日：2007-01-04
申请号：US11170784
申请日：2005-06-29
申请人： Paul Lafata , Akseli Anttila , Harri Wikberg , Kirsi Karimaki , Imre Kiss
发明人： Paul Lafata , Akseli Anttila , Harri Wikberg , Kirsi Karimaki , Imre Kiss
IPC分类号： H04B1/38 , H04M1/00
CPC分类号： H04M1/72525 , H04M1/72563
摘要： A mobile communication apparatus capable of presenting themes, a telecommunication system comprising a such apparatus, and a corresponding method are disclosed. The apparatus comprises a processor arranged to generate an audio signal in response to a set theme. The audio signal comprises a speech signal, wherein speech of the speech signal have voice characteristics which depend on the theme. Alternatively, processor is arranged to set a theme in response to contact information stored in a contact information database of the appatatus and associated with actions performed by the apparatus.
摘要翻译：公开了能够呈现主题的移动通信设备，包括这种设备的电信系统和相应的方法。该装置包括被配置为响应于设定的主题产生音频信号的处理器。音频信号包括语音信号，其中语音信号的语音具有取决于主题的语音特征。或者，处理器被布置成响应于存储在应用程序的联系人信息数据库中并与该设备执行的动作相关联的联系人信息来设置主题。

10. 发明授权

US5764347A Optical imaging system 失效
标题翻译：光学成像系统
公开(公告)号：US5764347A
公开(公告)日：1998-06-09
申请号：US765944
申请日：1997-01-13
申请人： Andras Podmaniczky , Peter Kallo , Janos Talosi , Imre Kiss
发明人： Andras Podmaniczky , Peter Kallo , Janos Talosi , Imre Kiss
IPC分类号： G02B17/08 , G02B5/04 , G02B27/18 , G06K9/00 , G06T1/00 , G06K9/74
CPC分类号： G06K9/00046 , G02B5/04
摘要： Optical imaging system between an object plane (2.2) of a total reflexion prism (2) and an image plane, mainly for a fingerprint reading apparatus, that comprises an optics (3) for imaging the object plane to the image plane, and an electronic image detector (4) in the image plane. The optics defines an optical axis (3.0) and input and output pupils, respectively. The total reflexion prism (2) is arranged in front of the input pupil of the optics (3). The prism has a first surface receiving light for illuminating the object plane through the interior of the prism and a further surface through which light reflected from the object plane passes towards the optics. The object plane closes an angle with the optical axis, which is preferably between 45.degree. and 65.degree. if the refraction index of the prism is between 1.5 and 1.85. The object plane (2.2) of the total reflexion prism (2) is offset relative to the optical axis (3.0) in normal direction and the image detector (4) is also offset in normal direction relative to the optical axis (3.0) to an extent which corresponds to the location of the image of said object plane.
摘要翻译： PCT No.PCT / HU95 / 00030 Sec。 371日期1997年1月13日 102（e）日期1997年1月13日PCT Filed June 26，1995 PCT Pub。公开号WO96 / 02896 日期1996年2月1日在全反射棱镜（2）的物平面（2.2）和主要用于指纹读取装置的图像平面之间的光学成像系统包括用于将物平面成像到图像的光学元件（3）平面和图像平面中的电子图像检测器（4）。光学器件分别定义光轴（3.0）和输入和输出光瞳。全反射棱镜（2）布置在光学器件（3）的输入光瞳前面。棱镜具有第一表面，其接收用于照射通过棱镜内部的物体平面的光，以及另外的表面，通过该表面从物体平面反射的光通过光学器件。物平面与光轴成一个角度，如果棱镜的折射率在1.5和1.85之间，则其优选在45°和65°之间。全反射棱镜（2）的物平面（2.2）相对于光轴（3.0）在正常方向上偏移，图像检测器（4）也相对于光轴（3.0）在法线方向偏移到对应于所述物体平面的图像的位置的程度。

你已经成功收藏专利！

检索式保存成功!

IPRDB

热门服务

关于我们

友情链接

联系方式