会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 6. 发明授权
    • Compression and decompression of data vectors
    • 数据向量的压缩和解压缩
    • US08510105B2
    • 2013-08-13
    • US11256667
    • 2005-10-21
    • Jani K. Nurminen
    • Jani K. Nurminen
    • G10L19/00G10L19/12
    • H03M7/40G10L2019/0005
    • For an enhanced sequential compression of data vectors in a respective compression pass, a current data vector is mapped to at least one current code vector of at least one codebook in at least one quantization stage. The at least one codebook is reordered taking account of at least one intermediate result from the current compression pass and at least one intermediate result from a preceding compression pass. At least one codebook index that is associated in the at least one reordered codebook to the at least one current code vector is then provided for further use. For a decompression of compressed data vectors represented by such codebook indices, at least one codebook index is mapped to at least one code vector of at least one equally reordered codebook.
    • 为了在相应的压缩过程中对数据向量进行增强的顺序压缩,将当前数据矢量映射到至少一个量化级中的至少一个码本的至少一个当前码矢量。 考虑到来自当前压缩遍的至少一个中间结果和来自前一压缩遍的至少一个中间结果,重新排序至少一个码本。 然后提供至少一个与至少一个重排序码本相关联的至少一个当前码矢量的码本索引用于进一步使用。 对于由这样的码本索引表示的压缩数据向量的解压缩,至少一个码本索引被映射到至少一个同样重排的码本的至少一个码矢量。
    • 7. 发明授权
    • Method, apparatus and computer program product for providing voice conversion using temporal dynamic features
    • 用于使用时间动态特征提供语音转换的方法,装置和计算机程序产品
    • US07848924B2
    • 2010-12-07
    • US11788263
    • 2007-04-17
    • Jani K. NurminenVictor PopaJilei Tian
    • Jani K. NurminenVictor PopaJilei Tian
    • G10L21/00
    • G10L13/033
    • An apparatus for providing voice conversion using temporal dynamic features includes a feature extractor and a transformation element. The feature extractor may be configured to extract dynamic feature vectors from source speech. The transformation element may be in communication with the feature extractor and configured to apply a first conversion function to a signal including the extracted dynamic feature vectors to produce converted dynamic feature vectors. The first conversion function may have been trained using at least dynamic feature data associated with training source speech and training target speech. The transformation element may be further configured to produce converted speech based on an output of applying the first conversion function.
    • 用于使用时间动态特征提供语音转换的装置包括特征提取器和变换元件。 特征提取器可以被配置为从源语音提取动态特征向量。 变换元件可以与特征提取器通信并且被配置为将第一转换函数应用于包括所提取的动态特征向量的信号以产生转换的动态特征向量。 可以使用至少与训练源语音和训练目标语音相关联的动态特征数据来训练第一转换功能。 转换元件还可以被配置为基于应用第一转换函数的输出来产生转换的语音。
    • 8. 发明授权
    • Optimization of text-based training set selection for language processing modules
    • 优化语言处理模块的基于文本的训练集选择
    • US07831549B2
    • 2010-11-09
    • US10944517
    • 2004-09-17
    • Jian TileiJani K. Nurminen
    • Jian TileiJani K. Nurminen
    • G07F7/00
    • G10L15/063G10L13/08
    • A device and a method provide for selection of a database from a corpus using an, optimization function. The method includes defining a size of a database, calculating a distance using a distance function for each pair in a set of pairs, and executing an optimization function using the distance to select each entry saved in the database until the number of saved entries equals the size of the database. Each pair in the set of pairs includes either two entries selected from a corpus or one entry selected from a set of previously selected entries and another entry selected from a set of a remaining portion of the corpus. The distance function may be a Levenshtein distance function or a generalized Levenshtein distance function.
    • 设备和方法使用优化功能提供从语料库中选择数据库。 该方法包括定义数据库的大小,使用一组对中的每对的距离函数计算距离,以及使用距离来执行优化功能,以选择保存在数据库中的每个条目,直到保存的条目数等于 数据库的大小。 该组对中的每一对包括从语料库中选择的两个条目或从一组先前选择的条目中选择的一个条目以及从语料库的剩余部分的集合中选择的另一条目。 距离函数可以是Levenshtein距离函数或广义Levenshtein距离函数。
    • 10. 发明申请
    • Prosody Conversion
    • 韵律转换
    • US20080082333A1
    • 2008-04-03
    • US11536701
    • 2006-09-29
    • Jani K. NurminenElina Helander
    • Jani K. NurminenElina Helander
    • G10L17/00
    • G10L21/00G10L13/04G10L2021/0135
    • A contour for a syllable (or other speech segment) in a voice undergoing conversion is transformed. The transform of that contour is then used to identify one or more source syllable transforms in a codebook. Information regarding the context and/or linguistic features of the contour being converted can also be compared to similar information in the codebook when identifying an appropriate source transform. Once a codebook source transform is selected, an inverse transformation is performed on a corresponding codebook target transform to yield an output contour. The corresponding codebook target transform represents a target voice version of the same syllable represented by the selected codebook source transform. The output contour may be further processed to improve conversion quality.
    • 正在进行转换的语音中的音节(或其他语音段)的轮廓被转换。 然后,该轮廓的变换用于识别码本中的一个或多个源音节转换。 关于正在转换的轮廓的上下文和/或语言特征的信息也可以在识别适当的源变换时与码本中的类似信息进行比较。 一旦选择了码本源变换,就对相应的码本目标变换执行逆变换以产生输出轮廓。 相应的码本目标变换表示由所选码本源变换表示的相同音节的目标语音版本。 可以进一步处理输出轮廓以提高转换质量。