会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 5. 发明授权
    • Identifying far-end sound
    • 识别远端声音
    • US08219387B2
    • 2012-07-10
    • US11953764
    • 2007-12-10
    • Ross CutlerXinding SunSenthil Velayutham
    • Ross CutlerXinding SunSenthil Velayutham
    • G06F15/00G10L11/00G10L19/12G10L21/02G10L17/00
    • G06K9/6293G10L2021/02082G10L2021/02166
    • Frames containing audio data may be received, the audio data having been derived from a microphone array, at least some of the frames containing residual acoustic echo after having acoustic echo partially removed therefrom. Probability distribution functions are determined from the frames of audio data. A probability distribution function comprises likelihoods that respective directions are directions of sources of sounds. An active speaker may be identified in frames of video data based on the video data and based on audio information derived from the audio data, where use of the audio information as a basis for identifying the active speaker is controlled by determining whether the probability distribution functions indicate that corresponding audio data includes residual acoustic echo.
    • 可以接收包含音频数据的帧,音频数据已经从麦克风阵列导出,至少一些帧在具有从其中部分地去除声学回声之后包含残余声学回声。 概率分布函数由音频数据的帧确定。 概率分布函数包括各个方向是声源的方向的似然性。 可以基于视频数据在视频数据的帧中基于从音频数据导出的音频信息来识别有源扬声器,其中通过确定概率分布函数是否控制通过音频信息作为用于识别有源说话者的基础的使用 指示对应的音频数据包括残余声学回声。
    • 6. 发明授权
    • High-quality gradient-corrected linear interpolation for demosaicing of color images
    • 高质量梯度校正线性插值,用于彩色图像的去马赛克
    • US07502505B2
    • 2009-03-10
    • US10801450
    • 2004-03-15
    • Henrique S. MalvarLi-wei HeRoss Cutler
    • Henrique S. MalvarLi-wei HeRoss Cutler
    • G06K9/00G06K9/32
    • G06T3/4015
    • A gradient-corrected linear interpolation method and system for the demosaicing of color images. The method and system compute an interpolation using some a current technique (preferably a bilinear interpolation technique to reduce computational complexity), compute a correction term (such as a gradient of a desired color at a given pixel), and linearly combine the interpolation and the correction term to produce a corrected, high-quality interpolation of a missing color value at a pixel. The correction term may be a gradient correction term computed from the current color of the current pixel. This gradient is directly used to affect and correct the estimated color value produced by the prior art interpolation technique. The gradient-corrected linear interpolation method and system may also apply a gradient-correction gain to the gradient correction term. This gradient-correction gain affects the amount of gradient correction that is applied to the interpolation.
    • 用于彩色图像去马赛克的渐变校正线性插值方法和系统。 该方法和系统使用一些当前技术(优选双线性插值技术来减少计算复杂度)来计算插值,计算校正项(例如给定像素处的期望颜色的梯度),并且将内插和 校正项,以产生在像素处缺失颜色值的校正的高质量插值。 校正项可以是从当前像素的当前颜色计算的梯度校正项。 该梯度直接用于影响和校正由现有技术插值技术产生的估计颜色值。 梯度校正线性插值方法和系统还可以对梯度校正项应用梯度校正增益。 该梯度校正增益影响应用于插值的梯度校正量。
    • 7. 发明授权
    • System and method for audio/video speaker detection
    • 用于音频/视频扬声器检测的系统和方法
    • US07343289B2
    • 2008-03-11
    • US10606061
    • 2003-06-25
    • Ross CutlerAshish Kapoor
    • Ross CutlerAshish Kapoor
    • G10L13/00
    • G10L15/25G10L25/30G10L25/78
    • A system and method for detecting speech utilizing audio and video inputs. In one aspect, the invention collects audio data generated from a microphone device. In another aspect, the invention collects video data and processes the data to determine a mouth location for a given speaker. The audio and video are inputted into a time-delay neural network that processes the data to determine which target is speaking. The neural network processing is based upon a correlation to detected mouth movement from the video data and audio sounds detected by the microphone.
    • 一种利用音频和视频输入来检测语音的系统和方法。 一方面,本发明收集从麦克风装置产生的音频数据。 在另一方面,本发明收集视频数据并处理数据以确定给定说话者的嘴部位置。 音频和视频被输入到时间延迟神经网络中,处理数据以确定哪个目标在说话。 神经网络处理基于与从视频数据检测到的嘴部移动和由麦克风检测到的音频声音的相关性。