会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明申请
    • TIME-DOMAIN TIME/PITCH SCALING OF SPEECH OR AUDIO SIGNALS, WITH TRANSIENT HANDLING
    • 语音时钟/音频信号的时域/时间调整,瞬时处理
    • WO1998020482A1
    • 1998-05-14
    • PCT/US1997020310
    • 1997-11-06
    • CREATIVE TECHNOLOGY LTD.LAROCHE, Jean
    • CREATIVE TECHNOLOGY LTD.
    • G10L03/02
    • G10L21/01
    • Method and apparatus for time-scaling and/or pitch shifting by discarding and/or repeating segments of a signal (302). The signal (302) is stored as a series of samples on a memory (124) where it is readable by one or more read pointers (304, 306). Periodicity of segments of the signal (302) is determined by evaluating normalized cross-correlation over a range of possible periods. Transients are detected by monitoring changes in rms signal value. To achieve time compression or time stretching, a segment is skipped/discarded whenever a maximum time-discrepancy between the current output and an ideal output is reached or a high periodicity is detected, a jump of the optimal length would not make this time discrepancy too high, and no transient is present in the segment to be skipped/discarded.
    • 用于通过丢弃和/或重复信号的段来对时间缩放和/或音调移位的方法和装置(302)。 信号(302)作为一系列样本存储在存储器(124)上,其中它可被一个或多个读指针(304,306)读取。 信号(302)的段的周期性通过在可能的周期的范围内评估归一化的互相关来确定。 通过监测有效值信号值的变化检测瞬态。 为了实现时间压缩或时间延伸,每当达到当前输出和理想输出之间的最大时间差异或检测到高周期性时,跳过/丢弃一个段,最佳长度的跳跃也不会使此时间差异 高,并且在要跳过/丢弃的段中不存在瞬态。
    • 3. 发明申请
    • MULTI-STAGE SPEECH CODER WITH TRANSFORM CODING OF PREDICTION RESIDUAL SIGNALS WITH QUANTIZATION BY AUDITORY MODELS
    • 具有通过审计模型量化的预测残留信号变换编码的多级语音编码器
    • WO1997031367A1
    • 1997-08-28
    • PCT/US1997002898
    • 1997-02-26
    • AT & T CORP.CHEN, Juin-Hwey
    • AT & T CORP.
    • G10L03/02
    • G10L19/002G10L19/0212G10L19/12
    • A speech compression system called "Transform Predictive Coding" or TPC, provides encoding for 7 kHz band speech at 16 kHz sampling at a target bit-rate of 16 or 32 kb/s one or two bits per sample. The system uses short and long term prediction to remove redundancy. The prediction residual is transformed and coded in the frequency domain as shown on the figure by (110) after accepting time domain data from (60) and parameter input from (100), which corrects the spectrum for auditory perception. The TPC coder uses only open-loop quantization as shown by (70) and therefore has low complexity. The speech quality is transparent at 32 kb/s, is very good at 24 kb/s, and is acceptable at 16 kb/s.
    • 称为“变换预测编码”或TPC的语音压缩系统以16kHz采样的16kHz或32kb / s的目标比特率提供7kHz频带语音的编码,每个样本一个或两个比特。 该系统使用短期和长期预测来消除冗余。 在接收来自(60)的时域数据和(100)的参数输入后,预测残差在频域中如图所示变换和编码,如图110所示,其校正了听觉感知的频谱。 TPC编码器仅使用如(70)所示的开环量化,因此具有低复杂度。 语音质量在32kb / s下是透明的,在24kb / s下是非常好的,并且在16kb / s下是可接受的。
    • 6. 发明申请
    • METHOD AND APPARATUS FOR MITIGATING AUDIO DEGRADATION IN A COMMUNICATION SYSTEM
    • 在通信系统中减轻音频降级的方法和装置
    • WO1995022817A1
    • 1995-08-24
    • PCT/US1994014751
    • 1994-12-22
    • MOTOROLA INC.
    • MOTOROLA INC.KOTZIN, Michael, D.
    • G10L03/02
    • G10L19/005G10L19/18
    • Audio degradation is minimized in scenarios where tandem coding occurs. One such scenario is in the environment of voice mail service. Characteristics of an audio information signal are determined, and the signal is classified (303) as to whether further coding (306) should be performed and, if so, which rate/type of coding should be performed. Characteristics of the audio signal which are determined are, inter alia, quality characteristics, rate of previous coding, type of previous coding and the source of previous coding of the audio information signal. The source of previous coding determined may further include, inter alia, an analog network, a digital network, a PSTN or a wireless communication system. Based on this information, the voice mail service will either choose not to further code the audio information signal or code the audio information signal with the best coding algorithm available.
    • 在发生串联编码的情况下,音频降级最小化。 一种这样的情况是在语音邮件服务的环境中。 确定音频信息信号的特征,并且对于是否应当执行进一步的编码(306)并且如果是的话应该执行哪个速率/编码类型来对信号进行分类(303)。 确定的音频信号的特征尤其包括质量特性,先前编码的速率,先前编码的类型和音频信息信号的先前编码的来源。 确定的先前编码的来源还可以包括模拟网络,数字网络,PSTN或无线通信系统。 基于该信息,语音邮件服务将选择不进一步编码音频信息信号或使用可用的最佳编码算法对音频信息信号进行编码。
    • 7. 发明申请
    • DEVICE FOR PROCESSING A SOUND SIGNAL AND APPARATUS COMPRISING SUCH A DEVICE
    • 用于处理声音信号的装置和包含这种装置的装置
    • WO1995014297A1
    • 1995-05-26
    • PCT/FR1993001134
    • 1993-11-18
    • LEFEVRE, FrankGUILLARM, Gilles
    • G10L03/02
    • G10L21/04G09B19/04G09B21/009G10L2021/065
    • Device for processing a sound signal (10) comprising an event detector, capable of recognizing different sound phenomena and comprising a time structure analyzer (12) of each event for distinguishing continuous-type event sequences from explosion-type event sequences, a signal energy analyzer (11) analysing the signal for each event and a frequential structure analyzer (13) for distinguishing sequences in which the event has a harmonic structure composed of formants from random structure-type sequences. The device also comprises an event modifier (21) comprising a modification unit (21c) for modifying the time structure, a modification unit (21a) for altering the energy of at least one of the events and a modification unit (21b) for modifying the frequential structure of at least one of the events.
    • 用于处理包括事件检测器的声音信号(10)的装置,其能够识别不同的声音现象,并且包括用于区分连续型事件序列与爆炸型事件序列的每个事件的时间结构分析器(12),信号能量分析器 (11)分析每个事件的信号,以及频率结构分析器(13),用于区分事件具有来自随机结构类型序列的共振峰组成的谐波结构的序列。 该装置还包括一个事件修改器(21),包括用于修改时间结构的修改单元(21c),用于改变至少一个事件的能量的修改单元(21a)和修改单元 至少有一个事件的频率结构。