会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 72. 发明授权
    • Voice transformation with encoded information
    • 具有编码信息的语音变换
    • US08930182B2
    • 2015-01-06
    • US13049924
    • 2011-03-17
    • Shay Ben-DavidRon HooryZvi KonsDavid Nahamoo
    • Shay Ben-DavidRon HooryZvi KonsDavid Nahamoo
    • G10L21/00G10L25/90G10L25/93G10L21/003G10L19/018
    • G10L21/003G10L19/018
    • Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.
    • 提供语音转换的方法,系统和计算机程序产品。 该方法包括使用变换参数来变换源语言,以及使用隐写术对输入语音中的变换参数对信息进行编码,其中可以使用输出语音和关于变换参数的信息来重构源语音。 还提供了一种用于重建语音变换的方法,包括:接收语音转换系统的输出语音,其中输出语音是使用隐写术编码关于变换参数的信息的变换语音; 提取变换参数信息; 并执行输出语音的逆变换以获得原始源语音的近似。
    • 76. 发明授权
    • Method and apparatus for estimating phone class probabilities
a-posteriori using a decision tree
    • 用于使用决策树估计电话类概率的方法和装置
    • US5680509A
    • 1997-10-21
    • US312584
    • 1994-09-27
    • Ponani S. GopalakrishnanDavid NahamooMukund PadmanabhanMichael Alan Picheny
    • Ponani S. GopalakrishnanDavid NahamooMukund PadmanabhanMichael Alan Picheny
    • G10L15/06G10L15/08G10L5/06
    • G10L15/063G10L15/08
    • A method and apparatus for estimating the probability of phones, a-posteriori, in the context of not only the acoustic feature at that time, but also the acoustic features in the vicinity of the current time, and its use in cutting down the search-space in a speech recognition system. The method constructs and uses a decision tree, with the predictors of the decision tree being the vector-quantized acoustic feature vectors at the current time, and in the vicinity of the current time. The process starts with an enumeration of all (predictor, class) events in the training data at the root node, and successively partitions the data at a node according to the most informative split at that node. An iterative algorithm is used to design the binary partitioning. After the construction of the tree is completed, the probability distribution of the predicted class is stored at all of its terminal leaves. The decision tree is used during the decoding process by tracing a path down to one of its leaves, based on the answers to binary questions about the vector-quantized acoustic feature vector at the current time and its vicinity.
    • 在不仅在当时的声学特征以及当前时间附近的声学特征的上下文中估计电话的概率的方法和装置,以及其用于减少搜索 - 语音识别系统中的空间。 该方法构造并使用决策树,其中决策树的预测变量是当前时间和当前时间附近的矢量量化的声学特征向量。 该过程从在根节点的训练数据中的所有(预测器,类)事件的枚举开始,并且根据该节点处的最多信息拆分在节点处依次划分数据。 迭代算法用于设计二进制分区。 树完成后,预测类的概率分布存储在其所有终端叶上。 基于对当前时间及其附近的向量量化声学特征向量的二进制问题的答案,在解码过程中使用决策树通过跟踪到其叶子之一的路径。
    • 79. 发明授权
    • Speech coding apparatus with single-dimension acoustic prototypes for a
speech recognizer
    • 具有用于语音识别器的单维声学原型的语音编码装置
    • US5280562A
    • 1994-01-18
    • US770495
    • 1991-10-03
    • Lalit R. BahlJerome R. BellegardaEdward A. EpsteinJohn M. LucassenDavid NahamooMichael A. Picheny
    • Lalit R. BahlJerome R. BellegardaEdward A. EpsteinJohn M. LucassenDavid NahamooMichael A. Picheny
    • G10L19/00G10L15/02G10L19/02H03M7/30G10L9/02
    • G10L19/038H03M7/3082
    • In speech recognition and speech coding, the values of at least two features of an utterance are measured during a series of time intervals to produce a series of feature vector signals. A plurality of single-dimension prototype vector signals having only one parameter value are stored. At least two single-dimension prototype vector signals having parameter values representing first feature values, and at least two other single-dimension prototype vector signals have parameter values representing second feature values. A plurality of compound-dimension prototype vector signals have unique identification values and comprise one first-dimension and one second-dimension prototype vector signal. At least two compound-dimension prototype vector signals comprise the same first-dimension prototype vector signal. The feature values of each feature vector signal are compared to the parameter values of the compound-dimension prototype vector signals to obtain prototype match scores. The identification values of the compound-dimension prototype vector signals having the best prototype match scores for the feature vectors signals are output as a sequence of coded representations of an utterance to be recognized. A match score, comprising an estimate of the closeness of a match between a speech unit and the sequence of coded representations of the utterance, is generated for each of a plurality of speech units. At least one speech subunit, of one or more best candidate speech units having the best match scores, is displayed.
    • 在语音识别和语音编码中,在一系列时间间隔期间测量话音的至少两个特征的值,以产生一系列特征向量信号。 存储仅具有一个参数值的多个单维原型矢量信号。 具有表示第一特征值的参数值和至少两个其它单维原型矢量信号的至少两个单维原型矢量信号具有表示第二特征值的参数值。 多个复合尺寸原型矢量信号具有唯一的识别值,并且包括一个第一维和一个第二维原型矢量信号。 至少两个复合维度原型矢量信号包括相同的第一维原型矢量信号。 将每个特征向量信号的特征值与化合物维度原型矢量信号的参数值进行比较,以获得原型匹配分数。 具有特征矢量信号的具有最佳原型匹配分数的复合维度原型矢量信号的识别值被输出为将被识别的话语的编码表示的序列。 针对多个语音单元中的每一个生成包括语音单元与语音编码表示序列之间的匹配的接近度的估计的匹配分数。 显示具有最佳匹配分数的一个或多个最佳候选语音单元的至少一个语音子单元。