会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Method and apparatus for speech encoding and decoding by sinusoidal analysis and waveform encoding with phase reproducibility
    • 通过正弦分析和具有相位再现性的波形编码进行语音编码和解码的方法和装置
    • US07454330B1
    • 2008-11-18
    • US08736546
    • 1996-10-24
    • Masayuki NishiguchiKazuyuki IijimaJun MatsumotoShiro Omori
    • Masayuki NishiguchiKazuyuki IijimaJun MatsumotoShiro Omori
    • G10L19/14
    • G10L19/0212G10L19/02G10L19/04G10L19/06G10L19/12G10L25/27G10L25/93
    • A speech encoding method and apparatus in which an input speech signal is divided in terms of blocks or frames as encoding units and encoded in terms of the encoding units, whereby explosive and fricative consonants can be impeccably reproduced, while there is an attenuation of the occurrence of foreign sounds being generated at a transient portion between voiced (V) and unvoiced (UV) portions, so that the speech with high clarity devoid of “stuffed” feeling may be produced. The encoding apparatus includes a first encoding unit for finding residuals of linear predictive coding (LPC) of an input speech signal for performing harmonic coding and a second encoding unit for encoding the input speech signal by waveform coding. The first encoding unit and the second encoding unit are used for encoding a voiced (V) portion and an unvoiced (UV) portion of the input signal, respectively. Code excited linear prediction (CELP) encoding employing vector quantization by a closed loop search of an optimum vector using an analysis-by-synthesis method is used for the second encoding unit. A corresponding decoding method and apparatus is also provided.
    • 一种语音编码方法和装置,其中输入语音信号以块或帧为单位编码,并以编码单位编码,由此可以无可挑剔地复制爆炸和摩擦辅音,同时存在衰减的发生 在V(V)和无声(UV)部分之间的瞬态部分产生外来声音,从而可能产生具有高“透明度”感的语音。 编码装置包括:第一编码单元,用于求出用于执行谐波编码的输入语音信号的线性预测编码(LPC)的残差;以及第二编码单元,用于通过波形编码对输入的语音信号进行编码。 第一编码单元和第二编码单元分别用于对输入信号的有声(V)部分和无声(UV)部分进行编码。 第二编码单元使用通过使用合成分析法的最佳向量的闭环搜索采用矢量量化的码激励线性预测(CELP)编码。 还提供了相应的解码方法和装置。
    • 3. 发明授权
    • Voice encoding method and apparatus using modified discrete cosine
transform
    • 使用修正离散余弦变换的语音编码方法和装置
    • US5819212A
    • 1998-10-06
    • US736507
    • 1996-10-24
    • Jun MatsumotoShiro OmoriMasayuki NishiguchiKazuyuki Iijima
    • Jun MatsumotoShiro OmoriMasayuki NishiguchiKazuyuki Iijima
    • G10L19/02G10L19/04G10L19/07G10L9/00
    • G10L19/0212G10L19/0208G10L19/04G10L19/07
    • A method and apparatus for encoding an input signal, such as a broad-range speech signal, in which a number of decoding operations with different bit rates are enabled for assuring a high encoding bit rate and for minimizing deterioration of the reproduced sound even with a low bit rate. The signal encoding method includes a band-splitting step for splitting an input signal into a number of bands and a step of encoding signals of the bands in a different manner depending on signal characteristics of the bands. Specifically, a low-range side signal is taken out by a low-pass filter from an input signal entering a terminal, and analyzed for Linear Predictive coding by an Linear Predictive coding analysis quantization unit. After finding the Linear Predictive coding residuals, as short-term prediction residuals by an Linear Predictive coding inverted filter, the pitch is found by a pitch analysis circuit. Then, pitch residuals are found by long-term prediction by a pitch inverted filter. The pitch residuals are processed with modified discrete cosine transform by a modified discrete cosine transform (MDCT) circuit and vector-quantized by a vector-quantization circuit. The resulting quantization indices are transmitted along with the pitch lag and the pitch gain. The linear spectral pairs linear spectral pairs are also sent as parameter representing LPC coefficients.
    • 一种用于编码诸如宽范围语音信号的输入信号的方法和装置,其中能够使用不同比特率的多个解码操作用于确保高编码比特率,并且即使使用 低比特率。 信号编码方法包括用于将输入信号分割成多个频带的频带分解步骤和根据频带的信号特性以不同方式编码频带的信号的步骤。 具体地,通过低通滤波器从进入终端的输入信号中取出低范围侧信号,并通过线性预测编码分析量化单元分析线性预测编码。 在找到线性预测编码残差之后,通过线性预测编码反相滤波器作为短期预测残差,音调由音调分析电路找到。 然后,通过音调反向滤波器的长期预测来发现音调残差。 用经修正的离散余弦变换(MDCT)电路,用修正离散余弦变换处理音调残差,并由矢量量化电路进行矢量量化。 产生的量化索引与音调滞后和音调增益一起发送。 线性谱对线性谱对也作为表示LPC系数的参数发送。
    • 6. 发明授权
    • Speech decoding method and apparatus
    • 语音解码方法及装置
    • US5752222A
    • 1998-05-12
    • US736342
    • 1996-10-23
    • Masayuki NishiguchiKazuyuki IijimaJun MatsumotoShiro Omori
    • Masayuki NishiguchiKazuyuki IijimaJun MatsumotoShiro Omori
    • G10L19/00G10L19/14H03M7/30G10L3/02G10L9/00
    • G10L19/26
    • A speech decoding method and apparatus for decoding encoded speech signals and subsequently post-filtering the decoded signals, wherein the filter coefficient of a spectral shaping filter in a post-filter fed with an encoded and subsequently decoded speech signal is updated with a sub-frame period, while the gain of a gain adjustment circuit for correcting gain changes caused by the spectral shaping is updated with a frame period that is eight times as long as the sub-frame period. This achieves switching of the filter coefficient so as to be changed smoothly with a higher follow-up speed, while suppressing level changes otherwise caused by frequent gain switching. The result is improved characteristics of a post-filter used for spectral shaping of a decoded signal supplied from the signal decoder and more effective post-filter processing.
    • 一种用于对编码的语音信号进行解码并随后对解码的信号进行后置滤波的语音解码方法和装置,其中,用编码和随后解码的语音信号馈送的后置滤波器中的频谱整形滤波器的滤波器系数用子帧 而用于校正由频谱整形引起的增益变化的增益调整电路的增益是以子帧周期的8倍的帧周期来更新的。 这实现了滤波器系数的切换以便以更高的跟随速度平滑地改变,同时抑制另外由频繁增益切换引起的电平变化。 结果是用于从信号解码器提供的解码信号的频谱整形的后置滤波器的改进的特性以及更有效的后置滤波处理。
    • 7. 发明授权
    • Voiced/unvoiced decision using a plurality of sigmoid-transformed
parameters for speech coding
    • 使用多个S形变换参数进行语音编码的发声/清音决定
    • US06023671A
    • 2000-02-08
    • US833970
    • 1997-04-11
    • Kazuyuki IijimaMasayuki NishiguchiJun MatsumotoShiro Omori
    • Kazuyuki IijimaMasayuki NishiguchiJun MatsumotoShiro Omori
    • G10L11/00G10L11/02G10L11/06G10L15/02G10L19/00H03M7/30G10L9/00
    • G10L25/93
    • A method and apparatus for voiced/unvoiced decision for judging whether an input speech signal is voiced or unvoiced. The input parameters for performing the voiced/unvoiced (V/UV) decision are comprehensively judged in order to enable high-precision V/UV decision by a simplified algorithm. Parameters for the voiced/unvoiced (V/UV) decision include the frame-averaged energy of the input speech signal lev, the normalized autocorrelation peak value r0r, the spectral similarity degree pos, the number of zero crossings nZero, and the pitch lag pch. If these parameters are denoted by x, these parameters are converted by function calculation circuits using a sigmoid function g(x) represented byg(x)=A/(1+exp (-(x-b)/a))where A, a, and b are constants differing with each input parameter. Using the parameters converted by this sigmoid function g(x), the voiced/unvoiced decision is made a V/UV decision circuit.
    • 用于用于判断输入语音信号是有声还是无声的有声/无声决定的方法和装置。 综合判断用于执行有声/无声(V / UV)判定的输入参数,以便通过简化算法实现高精度V / UV判定。 有声/无声(V / UV)决定的参数包括输入语音信号lev的帧平均能量,归一化自相关峰值r0r,频谱相似度pos,过零次数nZero和音调滞后pch 。 如果这些参数由x表示,这些参数由函数计算电路使用由g(x)= A /(1 + exp( - (xb)/ a))表示的S形函数g(x)转换,其中A,a, b是与每个输入参数不同的常数。 使用由该S形函数g(x)转换的参数,将有声/无声决定作为V / UV判定电路。
    • 8. 发明授权
    • Method and apparatus for decoding and changing the pitch of an encoded
speech signal
    • 用于对编码语音信号进行解码和改变音调的方法和装置
    • US5873059A
    • 1999-02-16
    • US736989
    • 1996-10-25
    • Kazuyuki IijimaMasayuki NishiguchiJun MatsumotoShiro Omori
    • Kazuyuki IijimaMasayuki NishiguchiJun MatsumotoShiro Omori
    • G10L11/06G10L13/00G10L13/02G10L19/04G10L21/04H03M7/30G10L9/00
    • G10L13/033G10L21/01G10L19/087
    • A method and apparatus for reproducing speech signals at a controlled speed and for synthesizing speech includes a dividing unit that divides the input speech into time segments and an encoding unit that discriminates whether each of the speech segments is voiced or unvoiced. Based on the results of the discrimination, the encoding unit performs sinusoidal synthesis and encoding for voiced segments and vector quantization by closed-loop search for an optimum vector using an analysis-by-synthesis method for unvoiced segments in order to find encoded parameters. A period modification unit modifies the length of time associated with each signal segment and calculates a set of modified encoded parameters. In the speech synthesizing unit, encoded speech signal data is output from the encoding unit and pitch data and amplitude data specifying the spectral envelope are sent via a data conversion unit to a waveform synthesis unit, where the number of amplitude data points of the spectral envelope is changed without changing the shape of the spectral envelope, so that the pitch of the signal may be varied without changing its phoneme. A waveform synthesis unit synthesizes the speech waveform based on the converted spectral envelope data and pitch data.
    • 用于以受控速度再现语音信号并用于合成语音的方法和装置包括将输入语音划分成时间段的分割单元和鉴别每个语音段是有声还是无声的编码单元。 基于鉴别的结果,编码单元通过使用用于清音段的合成分析方法对最佳向量进行闭环搜索,对浊音段和矢量量化进行正弦合成和编码,以便找到编码参数。 周期修改单元修改与每个信号段相关联的时间长度,并计算一组经修改的编码参数。 在语音合成单元中,编码语音信号数据从编码单元输出,音调数据和指定频谱包络的​​振幅数据经由数据转换单元发送到波形合成单元,其中频谱包络的​​振幅数据点的数量 在不改变频谱包络的​​形状的情况下改变,使得信号的音调可以改变而不改变其音素。 波形合成单元基于转换的频谱包络数据和音调数据来合成语音波形。