会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Speech coding apparatus with single-dimension acoustic prototypes for a
speech recognizer
    • 具有用于语音识别器的单维声学原型的语音编码装置
    • US5280562A
    • 1994-01-18
    • US770495
    • 1991-10-03
    • Lalit R. BahlJerome R. BellegardaEdward A. EpsteinJohn M. LucassenDavid NahamooMichael A. Picheny
    • Lalit R. BahlJerome R. BellegardaEdward A. EpsteinJohn M. LucassenDavid NahamooMichael A. Picheny
    • G10L19/00G10L15/02G10L19/02H03M7/30G10L9/02
    • G10L19/038H03M7/3082
    • In speech recognition and speech coding, the values of at least two features of an utterance are measured during a series of time intervals to produce a series of feature vector signals. A plurality of single-dimension prototype vector signals having only one parameter value are stored. At least two single-dimension prototype vector signals having parameter values representing first feature values, and at least two other single-dimension prototype vector signals have parameter values representing second feature values. A plurality of compound-dimension prototype vector signals have unique identification values and comprise one first-dimension and one second-dimension prototype vector signal. At least two compound-dimension prototype vector signals comprise the same first-dimension prototype vector signal. The feature values of each feature vector signal are compared to the parameter values of the compound-dimension prototype vector signals to obtain prototype match scores. The identification values of the compound-dimension prototype vector signals having the best prototype match scores for the feature vectors signals are output as a sequence of coded representations of an utterance to be recognized. A match score, comprising an estimate of the closeness of a match between a speech unit and the sequence of coded representations of the utterance, is generated for each of a plurality of speech units. At least one speech subunit, of one or more best candidate speech units having the best match scores, is displayed.
    • 在语音识别和语音编码中,在一系列时间间隔期间测量话音的至少两个特征的值,以产生一系列特征向量信号。 存储仅具有一个参数值的多个单维原型矢量信号。 具有表示第一特征值的参数值和至少两个其它单维原型矢量信号的至少两个单维原型矢量信号具有表示第二特征值的参数值。 多个复合尺寸原型矢量信号具有唯一的识别值,并且包括一个第一维和一个第二维原型矢量信号。 至少两个复合维度原型矢量信号包括相同的第一维原型矢量信号。 将每个特征向量信号的特征值与化合物维度原型矢量信号的参数值进行比较,以获得原型匹配分数。 具有特征矢量信号的具有最佳原型匹配分数的复合维度原型矢量信号的识别值被输出为将被识别的话语的编码表示的序列。 针对多个语音单元中的每一个生成包括语音单元与语音编码表示序列之间的匹配的接近度的估计的匹配分数。 显示具有最佳匹配分数的一个或多个最佳候选语音单元的至少一个语音子单元。
    • 3. 发明授权
    • MVC (model-view-controller) based multi-modal authoring tool and development environment
    • 基于MVC(模型视图 - 控制器)的多模式创作工具和开发环境
    • US06996800B2
    • 2006-02-07
    • US10007037
    • 2001-12-04
    • John M. LucassenStephane H. Maes
    • John M. LucassenStephane H. Maes
    • G06F9/44
    • G06F8/38
    • Application development tools and method for building multi-channel, multi-device and multi-modal applications, and in particular, to systems and methods for developing applications whereby a user can interact in parallel with the same information via a multiplicity of channels and user interfaces, while a unified, synchronized views of the information are presented across the various channels or devices deployed by the user to interact with the information. In a preferred embodiment, application frameworks and development tools are preferably based on a MVC (Model-View-Controller) design paradigm that is adapted to provide synchronized multi-modal interactions. Multi-channel authoring can be developed using a similar methodology.
    • 用于构建多通道,多设备和多模式应用的应用程序开发工具和方法,特别是用于开发应用程序的系统和方法,由此用户可以通过多个通道和用户界面与相同的信息并行交互 而信息的统一的,同步的视图则呈现在用户部署的各种渠道或设备上,以与信息进行交互。 在优选实施例中,应用框架和开发工具优选地基于适于提供同步多模态交互的MVC(模型 - 视图 - 控制器)设计范例。 可以使用类似的方法开发多渠道创作。
    • 4. 发明授权
    • Method and system for reducing perplexity in speech recognition via
caller identification
    • 通过呼叫者识别减少语音识别困惑的方法和系统
    • US5802251A
    • 1998-09-01
    • US523755
    • 1995-09-05
    • Paul S. CohenJohn M. LucassenElton B. Sherwin, Jr.Jorge L. Vizcaino
    • Paul S. CohenJohn M. LucassenElton B. Sherwin, Jr.Jorge L. Vizcaino
    • G10L15/10G10L15/00G10L15/06G10L15/20G10L17/00H04Q7/38G10L3/00
    • G10L15/065G10L15/06G10L15/20G10L17/00
    • A method and system are disclosed for reducing perplexity in a speech recognition system within a telephonic network based upon determined caller identity. In a speech recognition system which processes input frames of speech against stored templates representing speech, a core library of speech templates is created and stored representing a basic vocabulary of speech. Multiple caller-specific libraries of speech templates are also created and stored, each library containing speech templates which represent a specialized vocabulary and pronunciations for a specific geographic location and a particular individual. Additionally, the caller-specific libraries of speech templates are preferably processed to reflect the reduced bandwidth, transmission channel variations and other signal variations introduced into the system via a telephonic network. The identification of a caller is determined upon connection to the network via standard caller identification circuitry and upon detection of a spoken utterance, that utterance is processed against the core library, if the caller's identity cannot be determined, or against a particular caller-specific library, if the caller's identity can be determined, thereby greatly enhancing the efficiency and accuracy of speech recognition by the system.
    • 公开了一种基于确定的呼叫者身份来减少电话网络内的语音识别系统中的困惑的方法和系统。 在针对表示语音的存储模板处理输入语音帧的语音识别系统中,创建并存储代表语音的基本词汇表的语音模板的核心库。 还创建并存储多个特定于语音模板的调用者库,每个库包含表示特定地理位置和特定个人的专门词汇和发音的语音模板。 此外,优选地处理呼叫者特定的语音模板库以反映通过电话网络引入到系统中的减少的带宽,传输信道变化​​和其他信号变化。 通过标准呼叫者识别电路连接到网络并且在检测到说话话语之后确定呼叫者的识别,如果呼叫者的身份不能被确定,或针对特定的呼叫者特定的库 如果可以确定呼叫者的身份,从而大大提高系统语音识别的效率和准确性。
    • 5. 发明授权
    • MVC (Model-View-Controller) based multi-modal authoring tool and development environment
    • 基于MVC(Model-View-Controller)的多模式创作工具和开发环境
    • US07900186B2
    • 2011-03-01
    • US11190572
    • 2005-07-27
    • John M. LucassenStephane H. Maes
    • John M. LucassenStephane H. Maes
    • G06F9/44
    • G06F8/38
    • Application development tools and method for building multi-channel, multi-device and multi-modal applications, and in particular, to systems and methods for developing applications whereby a user can interact in parallel with the same information via a multiplicity of channels and user interfaces, while a unified, synchronized views of the information are presented across the various channels or devices deployed by the user to interact with the information. In a preferred embodiment, application frameworks and development tools are preferably based on a MVC (Model-View-Controller) design paradigm that is adapted to provide synchronized multi-modal interactions. Multi-channel authoring can be developed using a similar methodology.
    • 用于构建多通道,多设备和多模式应用的应用程序开发工具和方法,特别是用于开发应用程序的系统和方法,由此用户可以通过多个通道和用户界面与相同的信息并行交互 而信息的统一的,同步的视图则呈现在用户部署的各种渠道或设备上,以与信息进行交互。 在优选实施例中,应用框架和开发工具优选地基于适于提供同步多模态交互的MVC(模型 - 视图 - 控制器)设计范例。 可以使用类似的方法开发多渠道创作。
    • 7. 发明授权
    • Method and system for location-specific speech recognition
    • 位置特定语音识别的方法和系统
    • US5524169A
    • 1996-06-04
    • US175701
    • 1993-12-30
    • Paul S. CohenJohn M. LucassenRoger M. MillerElton B. Sherwin, Jr.
    • Paul S. CohenJohn M. LucassenRoger M. MillerElton B. Sherwin, Jr.
    • G10L15/10G10L15/00G10L15/26G10L15/28H04Q7/34G10L5/06
    • G10L15/26G10L15/28G10L2015/226G10L2015/228
    • A method and system for reducing perplexity in a speech recognition system based upon determined geographic location. In a mobile speech recognition system which processes input frames of speech against stored templates representing speech, a core library of speech templates is created and stored representing a basic vocabulary of speech. Multiple location-specific libraries of speech templates are also created and stored, each library containing speech templates representing a specialized vocabulary for a specific geographic location. The geographic location of the mobile speech recognition system is then periodically determined utilizing a cellular telephone system, a geopositioning satellite system or other similar systems and a particular one of the location-specific libraries of speech templates is identified for the current location of the system. Input frames of speech are then processed against the combination of the core library and the particular location-specific library to greatly enhance the accuracy and efficiency of speech recognition by the system. Each location-specific library preferably includes speech templates representative of location place names, proper names, and business establishments within a specific geographic location.
    • 一种用于基于确定的地理位置减少语音识别系统中的困惑的方法和系统。 在针对表示语音的存储模板处理输入语音帧的移动语音识别系统中,创建并存储代表语音的基本词汇表的语音模板的核心库。 还创建和存储多个位置特定的语音模板库,每个库包含表示特定地理位置的专门词汇表的语音模板。 然后,利用蜂窝电话系统,地理定位卫星系统或其他类似系统周期性地确定移动语音识别系统的地理位置,并为该系统的当前位置识别特定位置的语音模板库。 然后根据核心库和特定位置特定库的组合对输入的语音帧进行处理,以大大增强系统语音识别的准确性和效率。 每个位置特定的图书馆优选地包括表示特定地理位置内的位置地名,专有名称和商业场所的语音模板。