会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 2. 发明授权
    • Method for goal-oriented speech translation in hand-held devices using meaning extraction and dialogue
    • 使用意义提取和对话的手持设备中面向目标的语音翻译方法
    • US06233561B1
    • 2001-05-15
    • US09290628
    • 1999-04-12
    • Jean-Claude JunquaRoland KuhnMatteo ContoliniMurat KaraormanKen FieldMichael GallerYi Zhao
    • Jean-Claude JunquaRoland KuhnMatteo ContoliniMurat KaraormanKen FieldMichael GallerYi Zhao
    • G10L1522
    • G10L15/1822G10L15/1815
    • A computer-implemented method and apparatus is provided for processing a spoken request from a user. A speech recognizer converts the spoken request into a digital format. A frame data structure associates semantic components of the digitized spoken request with predetermined slots. The slots are indicative of data which are used to achieve a predetermined goal. A speech understanding module which is connected to the speech recognizer and to the frame data structure determines semantic components of the spoken request. The slots are populated based upon the determined semantic components. A dialog manager which is connected to the speech understanding module may determine at least one slot which is unpopulated based upon the determined semantic components and in a preferred embodiment may provide confirmation of the populated slots. A computer generated-request is formulated in order for the user to provide data related to the unpopulated slot. The method and apparatus are well-suited (but not limited) to use in a hand-held speech translation device.
    • 提供了一种用于处理来自用户的口头请求的计算机实现的方法和装置。 语音识别器将口头请求转换为数字格式。 帧数据结构将数字化语音请求的语义分量与预定时隙相关联。 这些时隙指示用于实现预定目标的数据。 连接到语音识别器和帧数据结构的语音理解模块确定语音请求的语义分量。 基于确定的语义分量来填充时隙。 连接到语音理解模块的对话管理器可以基于所确定的语义组件来确定未填充的至少一个时隙,并且在优选实施例中可以提供填充时隙的确认。 制定计算机生成请求以便用户提供与未填充槽相关的数据。 该方法和装置非常适合(但不限于)在手持语音翻译装置中使用。
    • 3. 发明授权
    • Method for generating spelling-to-pronunciation decision tree
    • 拼写到发音决策树的方法
    • US06230131B1
    • 2001-05-08
    • US09069308
    • 1998-04-29
    • Roland KuhnJean-Claude JunquaMatteo Contolini
    • Roland KuhnJean-Claude JunquaMatteo Contolini
    • G10L1308
    • G10L13/08
    • Decision trees are used to store a series of yes-no questions that can be used to convert spelled-word letter sequences into pronunciations. Letter-only trees, having internal nodes populated with questions about letters in the input sequence, generate one or more pronunciations based on probability data stored in the leaf nodes of the tree. The pronunciations may then be improved by processing them using mixed trees which are populated with questions about letters in the sequence and also questions about phonemes associated with those letters. The mixed tree screens out pronunciations that would not occur in natural speech, thereby greatly improving the results of the letter-to-pronunciation transformation.
    • 决策树用于存储可用于将拼写字母序列转换为发音的一系列“是”的问题。 仅有信息树,内部节点填充有关输入序列中的字母的问题,根据存储在树的叶节点中的概率数据生成一个或多个发音。 然后可以通过使用填充有序列中的字母的问题的混合树以及与这些字母相关的音素的问题来处理它们来发音。 混合树屏蔽了自然语言中不会发生的发音,从而大大提高了字母到发音转换的结果。
    • 6. 发明授权
    • Method and apparatus for improved speech recognition with supplementary information
    • 用于通过补充信息改进语音识别的方法和装置
    • US06983244B2
    • 2006-01-03
    • US10652146
    • 2003-08-29
    • Jean-Claude JunquaRoland KuhnMatteo ContoliniRathinavelu Chengalvarayan
    • Jean-Claude JunquaRoland KuhnMatteo ContoliniRathinavelu Chengalvarayan
    • G10L15/22
    • H04M1/271G10L15/08G10L15/10G10L15/22
    • A method for improving recognition results of a speech recognizer uses supplementary information to confirm recognition results. A user inputs speech to a speech recognizer. The speech recognizer resides on a mobile device or on a server at a remote location. The speech recognizer determines a recognition result based on the input speech. A confidence measure is calculated for the recognition result. If the confidence measure is below a threshold, the user is prompted for supplementary data. The supplementary data is determined dynamically based on ambiguities between the input speech and the recognition result, wherein the supplementary data will distinguish the input speech over potential incorrect results. The supplementary data may be a subset of alphanumeric characters that comprise the input speech, or other data associated with a desired result, such as an area code or location. The user may provide the supplementary data verbally, or manually using a keypad, touchpad, touchscreen, or stylus pen.
    • 用于改善语音识别器的识别结果的方法使用补充信息来确认识别结果。 用户向语音识别器输入语音。 语音识别器驻留在移动设备或远程位置的服务器上。 语音识别器基于输入语音来确定识别结果。 计算识别结果的置信度量。 如果置信度量值低于阈值,则会提示用户提供补充数据。 基于输入语音和识别结果之间的模糊度来动态地确定补充数据,其中补充数据将通过潜在的不正确结果区分输入语音。 补充数据可以是组成输入语音的字母数字字符的子集,或与期望结果相关联的其他数据,例如区域代码或位置。 用户可以口头提供补充数据,或者使用键盘,触摸板,触摸屏或触控笔手动提供补充数据。
    • 7. 发明申请
    • Method and apparatus for improved speech recognition with supplementary information
    • 用于通过补充信息改进语音识别的方法和装置
    • US20050049860A1
    • 2005-03-03
    • US10652146
    • 2003-08-29
    • Jean-Claude JunquaRoland KuhnMatteo ContoliniRathinavelu Chengalvarayan
    • Jean-Claude JunquaRoland KuhnMatteo ContoliniRathinavelu Chengalvarayan
    • G10L15/08G10L15/10G10L15/22H04M1/27G10L15/00
    • H04M1/271G10L15/08G10L15/10G10L15/22
    • A method for improving recognition results of a speech recognizer uses supplementary information to confirm recognition results. A user inputs speech to a speech recognizer. The speech recognizer resides on a mobile device or on a server at a remote location. The speech recognizer determines a recognition result based on the input speech. A confidence measure is calculated for the recognition result. If the confidence measure is below a threshold, the user is prompted for supplementary data. The supplementary data is determined dynamically based on ambiguities between the input speech and the recognition result, wherein the supplementary data will distinguish the input speech over potential incorrect results. The supplementary data may be a subset of alphanumeric characters that comprise the input speech, or other data associated with a desired result, such as an area code or location. The user may provide the supplementary data verbally, or manually using a keypad, touchpad, touchscreen, or stylus pen.
    • 用于改善语音识别器的识别结果的方法使用补充信息来确认识别结果。 用户向语音识别器输入语音。 语音识别器驻留在移动设备或远程位置的服务器上。 语音识别器基于输入语音来确定识别结果。 计算识别结果的置信度量。 如果置信度量值低于阈值,则会提示用户提供补充数据。 基于输入语音和识别结果之间的模糊度来动态地确定补充数据,其中补充数据将通过潜在的不正确结果区分输入语音。 补充数据可以是组成输入语音的字母数字字符的子集,或与期望结果相关联的其他数据,例如区域代码或位置。 用户可以口头提供补充数据,或者使用键盘,触摸板,触摸屏或触控笔手动提供补充数据。
    • 8. 发明授权
    • Context-dependent acoustic models for medium and large vocabulary speech recognition with eigenvoice training
    • 用于具有本征语音训练的中大词汇语音识别的背景相关声学模型
    • US06571208B1
    • 2003-05-27
    • US09450392
    • 1999-11-29
    • Roland KuhnJean-Claude JunquaMatteo Contolini
    • Roland KuhnJean-Claude JunquaMatteo Contolini
    • G01L1700
    • G10L15/07
    • A reduced dimensionality eigenvoice analytical technique is used during training to develop context-dependent acoustic models for allophones. The eigenvoice technique is also used during run time upon the speech of a new speaker. The technique removes individual speaker idiosyncrasies, to produce more universally applicable and robust allophone models. In one embodiment the eigenvoice technique is used to identify the centroid of each speaker, which may then be “subtracted out” of the recognition equation. In another embodiment maximum likelihood estimation techniques are used to develop common decision tree frameworks that may be shared across all speakers when constructing the eigenvoice representation of speaker space.
    • 在训练期间使用减小的维度本征语音分析技术来开发用于异音素的上下文相关的声学模型。 特定语音技术在运行时也用于新演讲者的演讲。 该技术可以消除单个扬声器的特性,从而产生更普遍适用和强大的异音模型。 在一个实施例中,本征语音技术用于识别每个说话者的质心,然后可以将其“减去”识别方程。 在另一个实施例中,最大似然估计技术用于开发在构建扬声器空间的本征声表示时可以在所有扬声器之间共享的共同决策树框架。
    • 9. 发明授权
    • Method and system for automatically determining phonetic transcriptions associated with spelled words
    • 用于自动确定与拼写单词相关的语音转录的方法和系统
    • US06233553B1
    • 2001-05-15
    • US09148912
    • 1998-09-04
    • Matteo ContoliniJean-Claude JunquaRoland Kuhn
    • Matteo ContoliniJean-Claude JunquaRoland Kuhn
    • G10L1904
    • G10L15/065G10L2015/086
    • New entries are added to the lexicon by entering them as spelled words. A transcription generator, such as a decision-tree-based phoneme or morpheme transcription generator, converts each spelled word into a set of n-best transcriptions or sequences. Meanwhile, user input or automatically generated speech corresponding to the spelled word is processed by an automatic speech recognizer and the recognizer rescores the transcriptions or sequences produced by the transcription generator. One or more of the highest scored (highest confidence) transcriptions may be added to the lexicon to update it. If desired, the spelled word-pronunciation pairs generated by the system can be used to retrain the transcription generator, making the system adaptive or self-learning.
    • 通过输入新词条作为拼写单词添加到词典中。 转录生成器,例如基于决策树的音素或语素转录发生器,将每个拼写单词转换为一组n个最佳转录或序列。 同时,由自动语音识别器处理对应于拼写字的用户输入或自动产生的语音,并且识别器重新分配由转录发生器产生的转录或序列。 可以将一个或多个最高得分(最高置信度)转录添加到词典中进行更新。 如果需要,系统产生的拼写字 - 发音对可用于重新训练转录发生器,使系统自适应或自学习。