    • 5. Granted invention patent
    • Computer-based method and apparatus for classifying statement types based on intonation analysis
    • US5995924A
    • 1999-11-30
    • US83449
    • 1998-05-22
    • Alvin Mark Terry
    • Alvin Mark Terry
    • G10L11/04; G10L15/18; G10L7/00
    • G10L15/1807; G10L25/24; G10L25/30; G10L25/90
    • A computer-based method and apparatus for classifying statement types using intonation analysis. The method and apparatus identify a user's potential query when the user responds to information during dialog with an automated dialog system. Pitch information is extracted, via a cepstrum, from the speech signal. In one embodiment, the pitch intonation is processed to form a smoothed pitch or intonation contour. Then the smoothed pitch contour is processed by a set of shape detectors and this output, together with statistical information, is sent to a rule-based algorithm which attempts to classify the statement type. In another embodiment, the smoothed pitch contour is processed by a pattern recognition system such as a neural network trained with a back-propagation learning algorithm.
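The pipeline in the abstract above (cepstral pitch extraction, contour smoothing, then a rule-based decision) can be illustrated with a minimal Python sketch. This is not the patented implementation: the function names `cepstral_pitch` and `classify_contour`, the 60 to 400 Hz search range, the 3-point median smoother, and the "rising tail means question" rule with its 10% threshold are all assumptions chosen for illustration. The single rise test is a crude stand-in for the abstract's set of shape detectors and statistical features.

```python
import cmath
import math
from statistics import mean, median


def cepstral_pitch(frame, fs, fmin=60.0, fmax=400.0):
    """Estimate the pitch (Hz) of one frame from the peak of its real cepstrum."""
    n = len(frame)
    # Periodic Hamming window to limit spectral leakage.
    windowed = [s * (0.54 - 0.46 * math.cos(2 * math.pi * i / n))
                for i, s in enumerate(frame)]
    # Naive O(n^2) DFT keeps the sketch dependency-free; use an FFT in practice.
    tw = [cmath.exp(-2j * math.pi * k / n) for k in range(n)]
    log_mag = [
        math.log(abs(sum(windowed[i] * tw[(k * i) % n] for i in range(n))) + 1e-12)
        for k in range(n)
    ]

    # Real cepstrum = inverse DFT of the log magnitude spectrum.
    # Only the lags (quefrencies) inside the pitch search range are evaluated.
    def cepstrum_at(q):
        return sum(log_mag[k] * tw[(k * q) % n].real for k in range(n)) / n

    lo, hi = int(fs / fmax), int(fs / fmin)
    best_lag = max(range(lo, hi + 1), key=cepstrum_at)
    return fs / best_lag


def classify_contour(pitch_hz, tail=5, rise_threshold=1.10):
    """Hypothetical rule: a clearly rising final stretch suggests a question."""
    # 3-point median filter as a simple contour smoother (assumption).
    smoothed = [median(pitch_hz[max(0, i - 1): i + 2]) for i in range(len(pitch_hz))]
    body, end = smoothed[:-tail], smoothed[-tail:]
    return "question" if mean(end) > rise_threshold * mean(body) else "statement"
```

As a usage example, an impulse train with a 32-sample period sampled at 8 kHz, `cepstral_pitch([1.0 if i % 32 == 0 else 0.0 for i in range(512)], 8000)`, should come out near 250 Hz, and a per-frame pitch track that is flat and then climbs at the end is labelled `"question"` by `classify_contour`.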
    • 6. Granted invention patent
    • Method and apparatus for enhancement of telephonic speech signals
    • US5737719A
    • 1998-04-07
    • US574527
    • 1995-12-19
    • Alvin Mark Terry
    • Alvin Mark Terry
    • G10L21/00; G10L21/02; G10L3/02
    • G10L21/0364; G10L2021/065
    • A method and apparatus for enhancing the intelligibility of a telephonic speech signal within the available bandwidth and intensity limits of a telephone communication network. The method combines enhancement of both the formant ratio and the consonant/vowel energy ratio to realize a speech signal more intelligible to a hearing impaired user. The invention uses an auditory model of the human ear. A speech signal is put through a filter bank designed to simulate the cochlear filter shapes and filter spacing of a healthy cochlea. The energy output from each of a plurality of filters is computed and used to form an auditory spectrum. The peaks associated with strong first and second formants are identified, and the second formant is enhanced relative to the first formant by attenuating the first formant. Also, consonants in the speech signal are identified as having an energy level below a threshold associated with vowels, but above the threshold associated with silent regions. Consonant regions are amplified. The net effect is to provide more energy in regions of the second formant and the consonants to enhance the intelligibility of the speech signal.
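The consonant-amplification step in the abstract above (frames whose energy lies below the vowel threshold but above the silence threshold are boosted) can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the patented method: the names `label_frame` and `boost_consonants`, the fixed non-overlapping frames, the threshold values, and the gain of 2.0 are all hypothetical, and the cochlear filter bank and formant-ratio enhancement are omitted entirely.

```python
def label_frame(energy, silence_thr, vowel_thr):
    """Label a frame by mean energy using two thresholds (values are assumptions)."""
    if energy <= silence_thr:
        return "silence"
    if energy < vowel_thr:
        return "consonant"
    return "vowel"


def boost_consonants(signal, frame_len, silence_thr, vowel_thr, gain=2.0):
    """Return a copy of `signal` with consonant-labelled frames amplified by `gain`."""
    out = list(signal)
    # Non-overlapping frames; trailing samples that do not fill a frame are left as-is.
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        energy = sum(s * s for s in frame) / frame_len  # mean energy per sample
        if label_frame(energy, silence_thr, vowel_thr) == "consonant":
            for i in range(start, start + frame_len):
                out[i] = signal[i] * gain
    return out
```

For example, with `frame_len=4`, `silence_thr=1e-4`, and `vowel_thr=0.25`, a signal made of four samples at 0.001, four at 0.1, and four at 1.0 has only its middle frame amplified. A real system would also need to keep the boosted signal within the intensity limits of the telephone network that the abstract mentions.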