    • 4. Granted invention patent
    • Remote audio device management system
    • US08126155B2
    • 2012-02-28
    • US10612429
    • 2003-07-02
    • Qiong Liu, Donald G. Kimber, Jonathan T. Foote, Chunyuan Liao, John E. Adcock
    • Qiong Liu, Donald G. Kimber, Jonathan T. Foote, Chunyuan Liao, John E. Adcock
    • H04R29/00; H04N7/18; H04N5/232
    • H04H60/04
    • An audio device management system (ADMS) manages remote audio devices via user selections in video links. The system enhances audio acquisition quality by receiving and processing human suggestions, forming customized two-way audio links according to user requests, and learning audio pickup strategies and camera management strategies from user operations. The ADMS control interface for a remote user provides a multi-window GUI that provides an overview window and selection display window. The ADMS provides users with more flexibility to enhance audio signals according to their needs and makes it more convenient to form customized two-way audio links without requiring users to remember a list of phone numbers. The ADMS also automatically manages available microphones for audio pickup based on microphone sound quality and the system's past experience when users monitor a structured audio environment without explicitly expressing their attentions in the video window.
    • 5. Granted invention patent
    • Telepresence system and method for video teleconferencing
    • US07154526B2
    • 2006-12-26
    • US10617549
    • 2003-07-11
    • Jonathan T. Foote, John Adcock, Qiong Liu, Timothy E. Black
    • Jonathan T. Foote, John Adcock, Qiong Liu, Timothy E. Black
    • H04N7/14
    • H04N7/142
    • A system in accordance with one embodiment of the present invention comprises a device for facilitating video communication between a remote participant and another location. The device can comprise a screen adapted to display the remote participant, the screen having a posture adapted to be controlled by the remote participant. A camera can be mounted adjacent to the screen, and can allow the subject to view a selected conference participant or a desired location such that when the camera is trained on the selected participant or desired location a gaze of the remote participant displayed by the screen appears substantially directed at the selected participant or desired location.
    • 6. Granted invention patent
    • Systems and methods for the automatic segmentation and clustering of ordered information
    • US06915009B2
    • 2005-07-05
    • US09947385
    • 2001-09-07
    • Jonathan T. Foote, Matthew Cooper
    • Jonathan T. Foote, Matthew Cooper
    • H04N5/91; G06K9/62; G06K9/34
    • G06K9/00711; G06K9/00718; G06K9/00765; G06K9/6218
    • Techniques for segmenting ordered information such as audio, video and text are provided by windowing and parameterizing an ordered information stream and storing the parameterized and windowed information in a two-dimensional representation such as a matrix. The similarity between the parameter vectors is determined and an orthogonal matrix decomposition such as singular value decomposition is applied to the similarity matrix. The singular values or eigenvalues of the resulting decomposition indicate major components or segments of the ordered information. The boundaries of the major components may be determined using the determined singular vectors to provide, for example, smart cut-and-paste of ordered information in which boundaries are automatically identified by the singular vectors; automatic categorization and retrieval of ordered information; and automatic summarization of ordered information.
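The pipeline in the abstract above (window and parameterize, build a similarity matrix, decompose it, read boundaries off the leading vectors) can be sketched roughly as follows. This is an illustrative sketch, not the patented method: the cosine similarity measure, the toy two-dimensional feature vectors, and every function name are assumptions. Power iteration with deflation stands in for a library SVD so the example needs only the standard library. It is valid here because a cosine similarity matrix is symmetric positive semidefinite, so its singular vectors coincide with its eigenvectors.

```python
import math


def cosine(u, v):
    """Cosine similarity between two parameter vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def similarity_matrix(vectors):
    """Pairwise similarity of every window against every other window."""
    return [[cosine(u, v) for v in vectors] for u in vectors]


def leading_eigenpair(matrix, iterations=500):
    """Power iteration: dominant eigenvalue and unit eigenvector."""
    n = len(matrix)
    vec = [float(i + 1) for i in range(n)]  # fixed start keeps runs repeatable
    value = 0.0
    for _ in range(iterations):
        product = [sum(matrix[i][j] * vec[j] for j in range(n)) for i in range(n)]
        value = math.sqrt(sum(x * x for x in product))
        if value == 0.0:
            break
        vec = [x / value for x in product]
    return value, vec


def deflate(matrix, value, vec):
    """Remove an eigen-component so power iteration finds the next one."""
    n = len(matrix)
    return [[matrix[i][j] - value * vec[i] * vec[j] for j in range(n)]
            for i in range(n)]


def segment_boundaries(vectors):
    """Indices where the second leading vector changes sign."""
    sim = similarity_matrix(vectors)
    value1, vec1 = leading_eigenpair(sim)
    _, vec2 = leading_eigenpair(deflate(sim, value1, vec1))
    return [i + 1 for i in range(len(vec2) - 1) if vec2[i] * vec2[i + 1] < 0]


# Hypothetical parameter vectors for six windows: three of one kind of
# content followed by three of another.
windows = [[1.0, 0.1], [0.9, 0.1], [1.0, 0.2],
           [0.1, 1.0], [0.2, 0.9], [0.1, 1.1]]
boundaries = segment_boundaries(windows)
```

With these toy windows the sign change falls between the third and fourth window, so `boundaries` marks index 3 as the start of the second segment. Real streams would need a proper feature extractor and more than one sign-change test to handle several segments.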
    • 7. Granted invention patent
    • Systems and methods for the automatic extraction of audio excerpts
    • US07260439B2
    • 2007-08-21
    • US09985073
    • 2001-11-01
    • Jonathan T. Foote, Matthew L. Cooper, Lynn D. Wilcox
    • Jonathan T. Foote, Matthew L. Cooper, Lynn D. Wilcox
    • G06F17/00
    • G11B27/28
    • A method of extracting audio excerpts comprises: segmenting audio data into a plurality of audio data segments; setting a fitness criteria for the plurality of audio data segments; analyzing the plurality of audio data segments based on the fitness criteria; and selecting one of the plurality of audio data segments that satisfies the fitness criteria. In various exemplary embodiments, the method of extracting audio excerpts further comprises associating the selected one of the plurality of audio data segments with video data. In such embodiments, associating the selected one of the plurality of audio data segments with video data may comprise associating the selected one of the plurality of audio data segments with a keyframe.
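The four steps named in the abstract above (segment, set a fitness criterion, analyze, select) might look like this in miniature. The abstract does not say what the fitness criterion is, so this sketch assumes one: a minimum mean signal energy, with the most energetic passing segment selected. The fixed segment length, the threshold, and all function names are hypothetical.

```python
def segment_audio(samples, segment_length):
    """Split a sample sequence into consecutive fixed-length segments."""
    return [samples[i:i + segment_length]
            for i in range(0, len(samples), segment_length)]


def mean_energy(segment):
    """Average squared amplitude of one segment."""
    return sum(s * s for s in segment) / len(segment)


def select_excerpt(samples, segment_length, min_energy):
    """Return (index, segment) of the best segment meeting the criterion.

    Assumed criterion: mean energy of at least min_energy. Returns None
    when no segment qualifies.
    """
    segments = segment_audio(samples, segment_length)
    scored = [(mean_energy(seg), index) for index, seg in enumerate(segments)]
    passing = [(energy, index) for energy, index in scored if energy >= min_energy]
    if not passing:
        return None
    _, best_index = max(passing)
    return best_index, segments[best_index]


# Hypothetical audio: a quiet stretch, a loud stretch, another quiet stretch.
audio = [0.0, 0.1, 0.0, -0.1,
         0.5, -0.6, 0.7, -0.5,
         0.2, -0.2, 0.1, 0.0]
excerpt = select_excerpt(audio, segment_length=4, min_energy=0.05)
```

Here `excerpt` holds index 1 and the loud middle segment. Associating the chosen segment with a video keyframe, as the abstract mentions, could be done by mapping the segment's start time to the nearest keyframe timestamp.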
    • 9. Granted invention patent
    • Methods and apparatuses for segmenting an audio-visual recording using image similarity searching and audio speaker recognition
    • US06404925B1
    • 2002-06-11
    • US09266561
    • 1999-03-11
    • Jonathan T. Foote, Lynn Wilcox
    • Jonathan T. Foote, Lynn Wilcox
    • G06K9/62
    • G06K9/00758; G06F17/30746; G06F17/30787; G10L17/00; G11B27/28; Y10S707/99931; Y10S707/99933
    • Methods for segmenting audio-video recordings of meetings containing slide presentations by one or more speakers are described. These segments serve as indexes into the recorded meeting. If an agenda is provided for the meeting, these segments can be labeled using information from the agenda. The system automatically detects intervals of video that correspond to presentation slides. Under the assumption that only one person is speaking during an interval when slides are displayed in the video, possible speaker intervals are extracted from the audio soundtrack by finding these regions. Since the same speaker may talk across multiple slide intervals, the acoustic data from these intervals is clustered to yield an estimate of the number of distinct speakers and their order. Merged clustered audio intervals corresponding to a single speaker are then used as training data for a speaker segmentation system. Using speaker identification techniques, the full video is then segmented into individual presentations based on the extent of each presenter's speech. The speaker identification system optionally includes the construction of a hidden Markov model trained on the audio data from each slide interval. A Viterbi assignment then segments the audio according to speaker.
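The final step in the abstract above, a Viterbi assignment over a hidden Markov model that labels each stretch of audio with a speaker, can be illustrated with a small sketch. Everything concrete here is an assumption made for illustration: two speakers "A" and "B", acoustic features quantized to the symbols "lo" and "hi", and hand-picked probabilities. In the described system the model would be trained on the clustered audio from the slide intervals.

```python
import math


def viterbi(observations, states, log_start, log_trans, log_emit):
    """Most likely state sequence for the observations, in log space."""
    # best[t][s]: best log-probability of any path ending in state s at frame t
    best = [{s: log_start[s] + log_emit[s][observations[0]] for s in states}]
    back = [{}]
    for obs in observations[1:]:
        scores, pointers = {}, {}
        for s in states:
            prev = max(states, key=lambda p: best[-1][p] + log_trans[p][s])
            scores[s] = best[-1][prev] + log_trans[prev][s] + log_emit[s][obs]
            pointers[s] = prev
        best.append(scores)
        back.append(pointers)
    # Trace back from the best final state.
    state = max(states, key=lambda s: best[-1][s])
    path = [state]
    for pointers in reversed(back[1:]):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


def to_segments(path):
    """Collapse a per-frame labeling into (speaker, start, end) runs."""
    segments = []
    start = 0
    for i in range(1, len(path) + 1):
        if i == len(path) or path[i] != path[start]:
            segments.append((path[start], start, i))
            start = i
    return segments


def logs(table):
    """Convert a dict of probabilities to log-probabilities."""
    return {key: math.log(p) for key, p in table.items()}


# Hypothetical two-speaker model; speakers rarely change between frames.
speakers = ["A", "B"]
log_start = logs({"A": 0.5, "B": 0.5})
log_trans = {"A": logs({"A": 0.9, "B": 0.1}), "B": logs({"A": 0.1, "B": 0.9})}
log_emit = {"A": logs({"lo": 0.8, "hi": 0.2}), "B": logs({"lo": 0.2, "hi": 0.8})}

frames = ["lo", "lo", "hi", "lo", "lo", "hi", "hi", "hi", "hi"]
labels = viterbi(frames, speakers, log_start, log_trans, log_emit)
speaker_segments = to_segments(labels)
```

The strong self-transition probability keeps the single stray "hi" frame assigned to speaker A, while the sustained run of "hi" frames at the end is assigned to speaker B, giving two segments. That smoothing behavior is the reason to use a sequence model rather than classifying each frame independently.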
    • 10. Granted invention patent
    • System and method for detecting and ranking images in order of usefulness based on vignette score
    • US07492921B2
    • 2009-02-17
    • US11032576
    • 2005-01-10
    • Jonathan T. Foote
    • Jonathan T. Foote
    • G06K9/00; G06K9/62
    • G06F17/30247
    • A system and method for detecting useful images and for ranking images in order of usefulness based on a vignette score describing how closely each one resembles a “vignette,” or a central object or image surrounded by a featureless or deemphasized background. Several methods for determining an image's vignette score are disclosed as examples. Variance ratio analysis entails calculation of the ratio of variance between the edge region of the image and the entire image. Statistical model analysis entails developing a statistical classifier capable of determining a statistical model of each image class based on pre-entered training data. Spatial frequency analysis involves estimating the energy at different spatial frequencies in the central and edge regions and in the image as a whole. A vignette score is calculated as the ratio of mid-frequency energies in the edge region to the mid-frequency energies of the entire image.
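Of the three scoring methods in the abstract above, variance ratio analysis is the simplest to illustrate: divide the pixel variance of the edge region by the pixel variance of the whole image. The sketch below makes several assumptions the abstract does not state: grayscale images as nested lists, an edge region defined as a border of fixed width, a lower ratio meaning a more vignette-like image, and a fallback score of 1.0 for a completely flat image.

```python
from statistics import pvariance


def vignette_variance_ratio(image, border=1):
    """Edge-region variance divided by whole-image variance.

    Assumption: a low ratio suggests a featureless border around a
    detailed center, i.e. a more vignette-like image.
    """
    height, width = len(image), len(image[0])
    all_pixels = [pixel for row in image for pixel in row]
    edge_pixels = [
        image[r][c]
        for r in range(height)
        for c in range(width)
        if r < border or r >= height - border
        or c < border or c >= width - border
    ]
    total_variance = pvariance(all_pixels)
    if total_variance == 0:
        return 1.0  # assumed: a flat image is treated as not vignette-like
    return pvariance(edge_pixels) / total_variance


def rank_by_vignette_score(images):
    """Image names ordered from most to least vignette-like."""
    return sorted(images, key=lambda name: vignette_variance_ratio(images[name]))


# Hypothetical 4x4 grayscale images.
vignette = [[0, 0, 0, 0],
            [0, 10, 200, 0],
            [0, 50, 120, 0],
            [0, 0, 0, 0]]
busy = [[0, 255, 0, 255],
        [255, 0, 255, 0],
        [0, 255, 0, 255],
        [255, 0, 255, 0]]
ranking = rank_by_vignette_score({"busy": busy, "vignette": vignette})
```

The image with a uniform border and a detailed center ranks first, and the checkerboard, whose border is as busy as its interior, ranks last. The spatial frequency variant in the abstract would replace raw variance with mid-frequency energy in the same ratio.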