    • 1. Granted invention patent
    • Speech recognition from overlapping frequency bands with output data reduction
    • US06721698B1
    • 2004-04-13
    • US09698773
    • 2000-10-27
    • Ramalingam Hariharan; Juha Häkkinen; Imre Kiss; Jilei Tian; Olli Viikki
    • G10L19/02
    • G10L15/02
    • A speech recognition feature extractor includes: a time-to-frequency domain transformer for generating spectral values in the frequency domain from a speech signal; a partitioning means for generating a first set and an additional set of spectral values in the frequency domain; a first feature generator for generating a first group of speech features using the first set of spectral values; an additional feature generator for generating an additional group of speech features using the additional set of spectral values, the feature generators being arranged to operate in parallel; an assembler for assembling an output set of speech features from at least one speech feature from the first group of speech features and at least one speech feature from the additional group of speech features; and an anti-aliasing and sampling rate reduction block. The first and the additional sets of spectral values comprise at least one common spectral value.
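The partition-and-assemble idea in the abstract above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names, the naive DFT, the choice of log energy and spectral centroid as features, and the single split point. The two feature generators run one after the other rather than in parallel, and the anti-aliasing and sampling rate reduction block is omitted. It is not the patented implementation.

```python
import cmath
import math


def dft_magnitudes(frame):
    """Naive DFT magnitudes for the non-negative frequency bins of one frame."""
    n = len(frame)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)))
        for k in range(n // 2 + 1)
    ]


def band_features(spectral_values):
    """Hypothetical per-band features: log energy and spectral centroid."""
    energy = sum(v * v for v in spectral_values)
    total = sum(spectral_values)
    centroid = sum(i * v for i, v in enumerate(spectral_values)) / total if total else 0.0
    return [math.log(energy + 1e-12), centroid]


def extract_features(frame, split=None):
    """Split the spectrum into two overlapping sets and assemble their features."""
    spectrum = dft_magnitudes(frame)
    if split is None:
        split = len(spectrum) // 2
    first_set = spectrum[: split + 1]  # bins 0..split
    additional_set = spectrum[split:]  # bins split..end, so bin `split` is shared
    first_group = band_features(first_set)
    additional_group = band_features(additional_set)
    # Assemble the output from features of both groups.
    return first_group + additional_group


# A 16-sample sine with two cycles per frame puts its energy in the lower band.
frame = [math.sin(2 * math.pi * 2 * t / 16) for t in range(16)]
features = extract_features(frame)
```

Because both sets include the bin at `split`, they satisfy the "at least one common spectral value" condition in this toy setting.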
    • 3. Invention patent application
    • Framework for voice conversion
    • US20060235685A1
    • 2006-10-19
    • US11107344
    • 2005-04-15
    • Jani Nurminen; Jilei Tian; Imre Kiss
    • G10L15/26
    • G10L13/033; G10L19/0018; G10L2021/0135
    • This invention relates to a framework for converting a source speech signal associated with a source voice into a target speech signal that is a representation of the source speech signal associated with a target voice. The source speech signal is encoded into samples of encoding parameters, wherein the encoding comprises the step of segmenting the source speech signal into segments based on characteristics of the source speech signal. The samples of the encoding parameters, or a converted representation of the samples of the encoding parameters are then decoded to obtain the target speech signal. Therein, in the encoding, the decoding or in a separate step, samples of parameters related to the source speech signal are converted into samples of parameters related to the target speech signal. Therein, at least one of the encoding and the converting depends on the segments of the source speech signal.
    • 5. Invention patent application
    • Error correction for speech recognition systems
    • US20060293889A1
    • 2006-12-28
    • US11169277
    • 2005-06-27
    • Imre Kiss; Jussi Leppanen
    • G10L15/26
    • G10L15/22; G10L2015/0631
    • Words in a sequence of words that is obtained from speech recognition of an input speech sequence are presented to a user, and at least one of the words in the sequence of words is replaced, in case it has been selected by a user for correction. Words with a low recognition confidence value are emphasized; alternative word candidates for the at least one selected word are ordered according to an ordering criterion; after replacing a word, an order of alternative word candidates for neighboring words in the sequence is updated; the replacement word is derived from a spoken representation of the at least one selected word by speech recognition with a limited vocabulary; and the word that replaces the at least one selected word is derived from a spoken and spelled representation of the at least one selected word.
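The correction flow in the abstract above (emphasize low-confidence words, order alternative candidates, replace a selected word) can be sketched roughly as follows. The function names, the 0.6 threshold, the asterisk markup, and the use of a descending score as the ordering criterion are all assumptions for illustration. The abstract does not specify them, and the step that updates candidate order for neighboring words is not modeled.

```python
def emphasize_low_confidence(words, threshold=0.6):
    """Wrap words whose confidence is below the threshold in asterisks."""
    return " ".join(f"*{w}*" if c < threshold else w for w, c in words)


def order_candidates(candidates):
    """Order alternative candidates by descending score (one possible criterion)."""
    return [w for w, _ in sorted(candidates, key=lambda pair: pair[1], reverse=True)]


def replace_word(words, index, replacement):
    """Return a new sequence with the selected word replaced and marked as confirmed."""
    updated = list(words)
    updated[index] = (replacement, 1.0)
    return updated


# Recognized words paired with hypothetical confidence values.
recognized = [("send", 0.9), ("male", 0.4), ("now", 0.8)]
shown = emphasize_low_confidence(recognized)
alternatives = order_candidates([("mail", 0.7), ("mall", 0.2), ("male", 0.4)])
corrected = replace_word(recognized, 1, alternatives[0])
```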
    • 7. Granted invention patent
    • Speech recognition with adjustable timeout period
    • US08355913B2
    • 2013-01-15
    • US11556227
    • 2006-11-03
    • Imre Kiss
    • G10L15/22; G10L15/26
    • G10L15/26
    • Input of dictated information in an information processing apparatus is controlled. Utterances of speech are detected and interpreted as words. Word-by-word confirmation of the interpreted words is detected, the confirmation being associated with an adjustable timeout period. The timeout period may be adjusted according to a number of different measures, including an average time needed for confirmation, an average success rate of dictation, the pace of dictation as performed by a user, and a user's history based on statistics from previously performed dictation procedures.
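One of the adjustment measures named in the abstract above, the average time needed for confirmation, can be sketched as follows. The class name, the 1.5 margin, and the clamping bounds are hypothetical choices for illustration. The abstract does not give a formula, so this is only one plausible reading.

```python
class AdaptiveTimeout:
    """Keeps a confirmation timeout near a multiple of the average confirmation time."""

    def __init__(self, initial=2.0, margin=1.5, minimum=0.5, maximum=5.0):
        self.timeout = initial  # seconds
        self.margin = margin    # assumed safety factor over the average
        self.minimum = minimum
        self.maximum = maximum
        self.confirmation_times = []

    def record_confirmation(self, seconds):
        """Record how long the user took to confirm a word and update the timeout."""
        self.confirmation_times.append(seconds)
        average = sum(self.confirmation_times) / len(self.confirmation_times)
        self.timeout = min(self.maximum, max(self.minimum, average * self.margin))
        return self.timeout


timer = AdaptiveTimeout()
timer.record_confirmation(1.0)  # a fast user shortens the timeout
timer.record_confirmation(3.0)  # a slower confirmation lengthens it again
```

The other measures in the abstract (success rate, dictation pace, user history) could feed the same update step, but they are left out here to keep the sketch small.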
    • 10. Granted invention patent
    • Optical imaging system
    • US5764347A
    • 1998-06-09
    • US765944
    • 1997-01-13
    • Andras Podmaniczky; Peter Kallo; Janos Talosi; Imre Kiss
    • G02B17/08; G02B5/04; G02B27/18; G06K9/00; G06T1/00; G06K9/74
    • G06K9/00046; G02B5/04
    • Optical imaging system between an object plane (2.2) of a total reflexion prism (2) and an image plane, mainly for a fingerprint reading apparatus, that comprises an optics (3) for imaging the object plane to the image plane, and an electronic image detector (4) in the image plane. The optics defines an optical axis (3.0) and input and output pupils, respectively. The total reflexion prism (2) is arranged in front of the input pupil of the optics (3). The prism has a first surface receiving light for illuminating the object plane through the interior of the prism and a further surface through which light reflected from the object plane passes towards the optics. The object plane closes an angle with the optical axis, which is preferably between 45° and 65° if the refraction index of the prism is between 1.5 and 1.85. The object plane (2.2) of the total reflexion prism (2) is offset relative to the optical axis (3.0) in normal direction and the image detector (4) is also offset in normal direction relative to the optical axis (3.0) to an extent which corresponds to the location of the image of said object plane.
    • PCT No. PCT/HU95/00030; Sec. 371 date: Jan. 13, 1997; Sec. 102(e) date: Jan. 13, 1997; PCT filed: June 26, 1995; PCT Pub. No. WO96/02896; PCT Pub. date: Feb. 1, 1996.
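As background for the total reflexion prism in the abstract above, the critical angle for total internal reflection at a prism-to-air boundary follows from Snell's law. The sketch below evaluates it at the two ends of the refractive index range the abstract mentions (1.5 and 1.85). It assumes air (index 1.0) outside the prism, and the function name is hypothetical. It is not the patent's own derivation of the preferred object plane angle.

```python
import math


def critical_angle_degrees(prism_index, outside_index=1.0):
    """Critical angle (from the surface normal) for total internal reflection."""
    if prism_index <= outside_index:
        raise ValueError("total internal reflection needs a denser prism medium")
    return math.degrees(math.asin(outside_index / prism_index))


# Ends of the refractive index range quoted in the abstract.
low_index_angle = critical_angle_degrees(1.5)
high_index_angle = critical_angle_degrees(1.85)
```

Light striking the object plane from inside the prism at more than this angle from the normal is totally reflected, and a higher refractive index gives a smaller critical angle.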