    • 1. Granted invention patent
    • Identifying far-end sound
    • US08219387B2
    • 2012-07-10
    • US11953764
    • 2007-12-10
    • Ross Cutler; Xinding Sun; Senthil Velayutham
    • Ross Cutler; Xinding Sun; Senthil Velayutham
    • G06F15/00; G10L11/00; G10L19/12; G10L21/02; G10L17/00
    • G06K9/6293; G10L2021/02082; G10L2021/02166
    • Frames containing audio data may be received, the audio data having been derived from a microphone array, at least some of the frames containing residual acoustic echo after having acoustic echo partially removed therefrom. Probability distribution functions are determined from the frames of audio data. A probability distribution function comprises likelihoods that respective directions are directions of sources of sounds. An active speaker may be identified in frames of video data based on the video data and based on audio information derived from the audio data, where use of the audio information as a basis for identifying the active speaker is controlled by determining whether the probability distribution functions indicate that corresponding audio data includes residual acoustic echo.
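The gating idea in the abstract above can be illustrated with a minimal sketch. The abstract does not say how a probability distribution function reveals residual echo, so the rule used here (the peak direction falls near a known loudspeaker direction) is an assumption, as are all names and parameters (`looks_like_residual_echo`, `loudspeaker_bin`, `tolerance_bins`, `audio_weight`).

```python
from typing import Sequence


def peak_direction(pdf: Sequence[float]) -> int:
    """Return the index of the most likely direction bin."""
    return max(range(len(pdf)), key=lambda i: pdf[i])


def looks_like_residual_echo(pdf: Sequence[float],
                             loudspeaker_bin: int,
                             tolerance_bins: int = 1) -> bool:
    """Assumed rule: the frame is echo-dominated when the PDF peak lies
    within `tolerance_bins` of the loudspeaker direction (circular bins)."""
    n = len(pdf)
    d = abs(peak_direction(pdf) - loudspeaker_bin)
    return min(d, n - d) <= tolerance_bins


def pick_active_speaker(video_scores: Sequence[float],
                        pdf: Sequence[float],
                        loudspeaker_bin: int,
                        audio_weight: float = 0.5) -> int:
    """Combine video scores with the audio PDF, but drop the audio term
    when the PDF suggests the frame contains residual echo."""
    use_audio = not looks_like_residual_echo(pdf, loudspeaker_bin)
    combined = [
        v + (audio_weight * p if use_audio else 0.0)
        for v, p in zip(video_scores, pdf)
    ]
    return max(range(len(combined)), key=lambda i: combined[i])
```

With eight direction bins, a PDF peaking at bin 3 pulls the decision toward bin 3 when the loudspeaker sits elsewhere, and is ignored when the loudspeaker itself is at bin 3, so the choice falls back to the video scores alone.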
    • 2. Invention patent application
    • Identifying far-end sound
    • US20090150149A1
    • 2009-06-11
    • US11953764
    • 2007-12-10
    • Ross Cutler; Xinding Sun; Senthil Velayutham
    • Ross Cutler; Xinding Sun; Senthil Velayutham
    • G10L17/00
    • G06K9/6293; G10L2021/02082; G10L2021/02166
    • Frames containing audio data may be received, the audio data having been derived from a microphone array, at least some of the frames containing residual acoustic echo after having acoustic echo partially removed therefrom. Probability distribution functions are determined from the frames of audio data. A probability distribution function comprises likelihoods that respective directions are directions of sources of sounds. An active speaker may be identified in frames of video data based on the video data and based on audio information derived from the audio data, where use of the audio information as a basis for identifying the active speaker is controlled by determining whether the probability distribution functions indicate that corresponding audio data includes residual acoustic echo.
    • 6. Invention patent application
    • Dynamic Switching of Microphone Inputs for Identification of a Direction of a Source of Speech Sounds
    • US20100092007A1
    • 2010-04-15
    • US12251525
    • 2008-10-15
    • Xinding Sun
    • Xinding Sun
    • H04R3/00
    • H04R3/005; G10L25/00; G10L2021/02166; H04N7/147; H04N7/15
    • This disclosure describes techniques of automatically identifying a direction of a speech source relative to an array of directional microphones using audio streams from some or all of the directional microphones. Whether the direction of the speech source is identified using audio streams from some of the directional microphones or from all of the directional microphones depends on whether using audio streams from a subgroup of the directional microphones or using audio streams from all of the directional microphones is more likely to correctly identify the direction of the speech source. Switching between using audio streams from some of the directional microphones and using audio streams from all of the directional microphones may occur automatically to best identify the direction of the speech source. A display screen at a remote venue may then display images having angles of view that are centered generally in the direction of the speech source.
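The switching behaviour in the abstract above might look roughly like the sketch below. The abstract does not state how the system judges which option is "more likely to correctly identify the direction", so this sketch assumes a caller-supplied estimator that returns a direction together with a confidence value. The names `choose_direction`, `subgroup`, and `estimate` are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Stream = Sequence[float]
Estimate = Tuple[float, float]  # (direction in degrees, confidence)


def choose_direction(streams: Sequence[Stream],
                     subgroup: Sequence[int],
                     estimate: Callable[[List[Stream]], Estimate]
                     ) -> Tuple[float, str]:
    """Estimate the speech direction twice, once from all microphones and
    once from a subgroup, and keep whichever estimate reports the higher
    confidence (an assumed stand-in for "more likely to be correct")."""
    all_dir, all_conf = estimate(list(streams))
    sub_dir, sub_conf = estimate([streams[i] for i in subgroup])
    if sub_conf > all_conf:
        return sub_dir, "subgroup"
    return all_dir, "all"
```

Calling this once per audio frame gives automatic switching: the returned label changes whenever the more confident source of the estimate changes. Ties go to the full array here, which is an arbitrary choice.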
    • 7. Granted invention patent
    • Digital video processing method and apparatus thereof
    • US07656951B2
    • 2010-02-02
    • US10633617
    • 2003-08-05
    • Hyun-doo Shin; Yang-lim Choi; B. S. Manjunath; Xinding Sun
    • Hyun-doo Shin; Yang-lim Choi; B. S. Manjunath; Xinding Sun
    • H04N7/12; H04B1/66
    • G06F17/30784; H04N5/147; H04N19/48
    • A digital video processing method and an apparatus thereof are provided. The method for processing digital images received in the form of compressed video streams comprises the step of determining a region intensity histogram (RIH) based on information on motion compensation of inter frames. The RIH information is obtained based on the motion compensation values of inter frames, and the RIH information is a good indicator of motion information of a video scene. Also, since the RIH information is quite a good indicator of intensity of the video scene, video streams having similar intensities can be effectively searched by searching for similar video scenes based on the RIH information obtained by the digital video processing method.
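The abstract above does not define how the region intensity histogram is computed, so the following is only an illustrative sketch under stated assumptions: each frame is split into regions, each region's intensity is taken as the mean magnitude of its motion vectors, intensities are quantized into a few levels, and two scenes are compared by the L1 distance between their histograms. The function names, `num_levels`, and `max_magnitude` are all hypothetical.

```python
import math
from typing import List, Sequence, Tuple

MotionVector = Tuple[float, float]  # (dx, dy) from inter-frame motion compensation


def region_intensity_histogram(region_vectors: Sequence[Sequence[MotionVector]],
                               num_levels: int = 4,
                               max_magnitude: float = 16.0) -> List[float]:
    """Assumed definition: count regions per quantized mean motion
    magnitude, then normalize the counts so they sum to 1."""
    hist = [0] * num_levels
    for vectors in region_vectors:
        if vectors:
            mean_mag = sum(math.hypot(dx, dy) for dx, dy in vectors) / len(vectors)
        else:
            mean_mag = 0.0  # a region with no motion vectors counts as still
        level = min(int(mean_mag / max_magnitude * num_levels), num_levels - 1)
        hist[level] += 1
    total = sum(hist)
    return [c / total for c in hist] if total else [0.0] * num_levels


def histogram_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """L1 distance between two histograms; smaller means more similar."""
    return sum(abs(x - y) for x, y in zip(a, b))
```

A search over stored scenes would then rank candidates by `histogram_distance` to the query histogram. Because the motion vectors come straight from the compressed stream, a scheme like this would not need to decode frames fully.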