会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Translation system and method in which words are translated by a
specialized dictionary and then a general dictionary
    • 翻译系统和方法,其中单词由专门词典翻译,然后通用词典翻译
    • US6085162A
    • 2000-07-04
    • US733808
    • 1996-10-18
    • Julius Cherny
    • Julius Cherny
    • G10L15/18G06F17/27G06F17/28G10L13/00G10L15/00G10L9/06
    • G06F17/2872G06F17/2735G06F17/2765G06F17/2818G06F17/289
    • Methods and apparatus for performing translation between different language are provided. The present invention includes a translation system that performs translation having increased accuracy by providing a three-dimensional topical dual-language database. The topical database includes a set of source-to-target language translations for each topic that the database is being used for. In one embodiment, a user first selects the topic of conversation, then words spoken into a telephone are translated and produced as synthesized voice signals from another telephone so that a near real-time conversation may be had between two people speaking different languages. An additional feature of the present invention is the addition of a computer terminal that displays the input and output phrases so that either user may edit the input phrases, or indicate that the translation was ambiguous and request a rephrasing of the material.
    • 提供用于执行不同语言之间的翻译的方法和装置。 本发明包括通过提供三维局部双语言数据库来提高精度的翻译的翻译系统。 主题数据库包括数据库正在使用的每个主题的一组源到目标语言翻译。 在一个实施例中,用户首先选择对话主题,然后将来自电话的话转换成来自另一个电话的合成语音信号,从而可以在两个不同语言的人之间进行接近实时的对话。 本发明的附加特征是添加了显示输入和输出短语的计算机终端,使得任一用户可以编辑输入短语,或指示翻译是不明确的,并请求重新表述材料。
    • 2. 发明授权
    • Cartridge-based, interactive speech recognition method with a response
creation capability
    • 基于墨盒的交互式语音识别方法,具有响应创造能力
    • US5946658A
    • 1999-08-31
    • US165512
    • 1998-10-02
    • Yasunaga MiyazawaMitsuhiro InazumiHiroshi HasegawaIsao EdatsuneOsamu Urano
    • Yasunaga MiyazawaMitsuhiro InazumiHiroshi HasegawaIsao EdatsuneOsamu Urano
    • G10L15/00G10L15/06G10L15/26G10L9/06G10L5/02
    • G10L15/26G10L2015/0638G10L2015/088
    • A technique for improving speech recognition in low-cost, speech interactive devices. This technique calls for selectively implementing a speaker-specific word enrollment and detection unit in parallel with a word detection unit to permit comprehension of spoken commands or messages when no recognizable words are found. Preferably, specific speaker detection will be based on the speaker's own personal list of words or expression. Other facets include complementing non-specific pre-registered word characteristic information with individual, speaker-specific verbal characteristics to improve recognition in cases where the speaker has unusual speech mannerisms or accent and response alteration in which speaker-specification registration functions are leveraged to provide access and permit changes to a predefined responses table according to user needs and tastes. Also disclosed is the externalization and modularization of non-specific speaker recognition, action and response information to enhance adaptability of the speech recognizer without sacrificing product cost competitiveness or overall device responsiveness.
    • 一种用于在低成本语音交互设备中改善语音识别的技术。 该技术要求与字检测单元并行地选择性地实现与扬声器特定的单词注册和检测单元,以便在找不到可识别的单词时允许理解口语命令或消息。 优选地,具体的说话者检测将基于说话者自己的单词或表达的个人列表。 其他方面包括补充非特定的预先登记的单词特征信息,具有单独的具有说话者的语言特征,以在讲话者具有不寻常的语音方式或重音和响应改变的情况下改善识别,其中利用说话者说明书注册功能来提供访问 并允许根据用户需求和口味对预定义的响应表进行更改。 还公开了非特定说话人识别,动作和响应信息的外部化和模块化,以增强语音识别器的适应性,而不牺牲产品成本竞争力或整体设备响应性。
    • 5. 发明授权
    • Speech supported navigation of a pointer in a graphical user interface
    • 语音支持图形用户界面中指针的导航
    • US5920841A
    • 1999-07-06
    • US882667
    • 1997-06-25
    • Claus SchottmullerViktor Schwab
    • Claus SchottmullerViktor Schwab
    • G06F3/02G06F3/033G06F3/041G06F3/048G06F3/16G10L9/06
    • G06F3/16G06F3/04812
    • A method and an apparatus for speech controlled navigation of a pointer in a graphical user interface. Previous methods use speech commands like arrow keys of a keyboard and lack user friendly interfaces. The method and apparatus therefore provides a space of discrete position states (quantization) for the pointer which can be navigated only via those discrete positions by means of speech command input. The granularity of the discrete position states can be adapted to the respective application window and the position states can be represented by a system of coordinates where the speech command input is based on absolute or relative coordinates. Advantageously a copy image of the graphical user interface of operation can be provided in front of or beside the actual user interface and a proxy pointer device is displayed on the copy image. In one embodiment, only the copy image comprises the discrete position states, and the speech input commands are only transferred to the copy image. Navigation of the proxy pointer device within the copy is transferred and converted into commands within the actual user interface. By this method, an operation event effected by a manipulation of the proxy pointer effects a corresponding event at the user interface.
    • 一种用于图形用户界面中指针的语音控制导航的方法和装置。 以前的方法使用诸如键盘的箭头键的语音命令,并且缺少用户友好的接口。 因此,该方法和装置为指针提供离散位置状态(量化)的空间,该空间只能通过语音命令输入通过这些离散位置导航。 离散位置状态的粒度可以适应于相应的应用窗口,位置状态可以由坐标系统来表示,其中语音命令输入基于绝对坐标或相对坐标。 有利地,可以在实际用户界面的前面或旁边提供操作的图形用户界面的复制图像,并且在复制图像上显示代理指针装置。 在一个实施例中,仅复印图像包括离散位置状态,并且语音输入命令仅被传送到复制图像。 复制内的代理指针设备的导航被传送并转换成实际用户界面内的命令。 通过该方法,通过代理指示器的操纵而实现的操作事件在用户界面处产生相应的事件。
    • 6. 发明授权
    • Speaker independent speech recognition method utilizing multiple
training iterations
    • 使用多次训练迭代的扬声器独立语音识别方法
    • US5806034A
    • 1998-09-08
    • US510321
    • 1995-08-02
    • Joe A. NaylorWilliam Y. HuangLawrence G. Bahler
    • Joe A. NaylorWilliam Y. HuangLawrence G. Bahler
    • G10L15/14G10L9/06
    • G10L15/144
    • A method for recognizing spoken utterances of a speaker is disclosed, the method comprising the steps of providing a database of labeled speech data; providing a prototype of a Hidden Markov Model (HMM) definition to define the characteristics of the HMM; and parameterizing speech utterances according to one of linear prediction parameters or Mel-scale filter bank parameters. The method further includes selecting a frame period for accommodating the parameters and generating HMMs and decoding to specified speech utterances by causing the user to utter predefined training speech utterances for each HMM. The method then statistically computes the generated HMMs with the prototype HMM to provide a set of fully trained HMMs for each utterance indicative of the speaker. The trained HMMs are used for recognizing a speaker by computing Laplacian distances via distance table lookup for utterances of the speaker during the selected frame period; and iteratively decoding node transitions corresponding to the spoken utterances during the selected frame period to determine which predefined utterance is present.
    • 公开了一种用于识别扬声器的讲话语音的方法,所述方法包括以下步骤:提供标记语音数据的数据库; 提供隐马尔可夫模型(HMM)定义的原型来定义HMM的特征; 并根据线性预测参数或Mel-scale滤波器组参数之一参数化语音话语。 该方法还包括通过使用户对每个HMM发出预定义的训练语音话语来选择用于容纳参数的帧周期和生成HMM并解码为指定的语音话语。 该方法然后用原型HMM统计计算所生成的HMM,以便为指示说话者的每个话语提供一组经过充分训练的HMM。 所训练的HMM用于通过在所选择的帧周期期间通过对说话者的话语的距离表查找来计算拉普拉斯算子来识别扬声器; 并且在所选择的帧周期期间迭代地解码对应于所说话语音的节点转换,以确定哪个预定义的话语存在。
    • 7. 发明授权
    • Spectrogram-feature-based speech syllable and word recognition using
syllabic language dictionary
    • 基于谱图特征的语音音节和使用音节语言字典的单词识别
    • US5675705A
    • 1997-10-07
    • US475767
    • 1995-06-07
    • Tara Chand Singhal
    • Tara Chand Singhal
    • G10L15/02G10L7/08G10L9/06
    • G10L15/02G10L25/18
    • A speech recognizing device performing speech syllable recognition and language word identification. The speech syllable recognition is performed on an ensemble composed of nearly one thousand syllables formed by the human vocal system, which allows for variations caused by language dialects and speech accents. For syllable recognition, the nearly one thousand speech syllables, using a spectrogram-feature-based approach, are parsed in a hierarchical structure based on the region of the vocal system from where the syllable emanated from, root syllable from that vocal region, vowel-caused variation of the root syllable, and syllable duration. The syllable's coded representation includes sub-codes for each of the levels of this hierarchical structure. For identification, speech words composed of sequences of coded syllables are mapped to known language words and their grammatical attribute, using a syllabic dictionary where the same words spoken differently map to a known language word.
    • 执行语音音节识别和语言词识别的语音识别装置。 语音音节识别是由由人类声乐系统形成的近千个音节组成的整体进行的,这种语音允许由语言方言和语音口音引起的变化。 为了音节识别,使用基于谱图特征的方法将近一千个语音音节以基于声音系统的区域的分层结构进行解析,其中音节从该声部发出,根音从该声部区域,元音 - 导致根音节的变化和音节的持续时间。 音节的编码表示包括该层次结构的每个级别的子代码。 为了识别,由编码音节序列组成的语音词被映射到已知语言单词及其语法属性,使用音节相同的单词不同地映射到已知语言单词的音节词典。
    • 9. 发明授权
    • Pattern recognition scheme using probabilistic models based on mixtures
distribution of discrete distribution
    • 基于离散分布的混合分布的概率模型的模式识别方案
    • US6064958A
    • 2000-05-16
    • US934376
    • 1997-09-19
    • Satoshi TakahashiShigeki Sagayama
    • Satoshi TakahashiShigeki Sagayama
    • G06K9/62G10L15/14G10L9/06
    • G10L15/144G06K9/6297
    • A pattern recognition scheme using probabilistic models that are capable of reducing a calculation cost for the output probability while improving a recognition performance even when a number of mixture component distributions of respective states is small, by arranging distributions with low calculation cost and high expressive power as the mixture component distribution. In this pattern recognition scheme, a probability of each probabilistic model expressing features of each recognition category with respect to each input feature vector derived from each input signal is calculated, where the probabilistic model represents a feature parameter subspace in which feature vectors of each recognition category exist and the feature parameter subspace is expressed by using mixture distributions of one-dimensional discrete distributions with arbitrary distribution shapes which are arranged in respective dimensions. Then, a recognition category expressed by a probabilistic model with a highest probability among a plurality of probabilistic models is outputted as a recognition result.
    • 使用概率模型的模式识别方案,即使当各种状态的混合分量分布的数量少时,也能够通过以低的计算成本和高的表现力排列分布来提高识别性能,从而降低输出概率的计算成本, 混合物成分分布。 在该模式识别方案中,计算表示从每个输入信号导出的每个输入特征矢量的每个识别类别的特征的每个概率模型的概率,其中概率模型表示其中每个识别类别的特征向量的特征参数子空间 存在,并且通过使用以相应尺寸排列的具有任意分布形状的一维离散分布的混合分布来表示特征参数子空间。 然后,输出由多个概率模型中具有最高概率的概率模型表示的识别类别作为识别结果。