    • 43. Invention patent application
    • Multimedia Integration Description Scheme, Method and System For MPEG-7
    • US20060167876A1
    • 2006-07-27
    • US11278671
    • 2006-04-04
    • Ana Benitez, Shih-Fu Chang, Qian Huang, Seungyup Paek, Atul Puri
    • Ana Benitez, Shih-Fu Chang, Qian Huang, Seungyup Paek, Atul Puri
    • G06F7/00, G06F17/30, G06F17/21
    • G06F17/30044, G06F17/3002, G06F17/30038, G06F17/30858, G06F17/30914, G06F17/30958, Y10S707/99945, Y10S707/99948
    • The invention provides a system and method for integrating multimedia descriptions in a way that allows humans, software components or devices to easily identify, represent, manage, retrieve, and categorize the multimedia content. In this manner, a user who may be interested in locating a specific piece of multimedia content from a database, Internet, or broadcast media, for example, may search for and find the multimedia content. In this regard, the invention provides a system and method that receives multimedia content and separates the multimedia content into separate components which are assigned to multimedia categories, such as image, video, audio, synthetic and text. Within each of the multimedia categories, the multimedia content is classified and descriptions of the multimedia content are generated. The descriptions are then formatted, integrated, using a multimedia integration description scheme, and the multimedia integration description is generated for the multimedia content. The multimedia description is then stored into a database. As a result, a user may query a search engine which then retrieves the multimedia content from the database whose integration description matches the query criteria specified by the user. The search engine can then provide the user a useful search result based on the multimedia integration description.
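The flow in the abstract above (assign components to categories, describe them, integrate the descriptions, store, then query) can be sketched roughly as follows. This is only an illustrative sketch, not the patented scheme: the `IntegrationDescription` class, the `integrate` and `search` helpers, the free-text descriptions, and the keyword matching are all assumptions made for the example. Only the five category names come from the abstract.

```python
from dataclasses import dataclass, field

# Category names taken from the abstract; everything else is hypothetical.
CATEGORIES = ("image", "video", "audio", "synthetic", "text")


@dataclass
class IntegrationDescription:
    content_id: str
    # Maps a category name to the list of descriptions generated for it.
    descriptions: dict = field(default_factory=dict)


def integrate(content_id, components):
    """Group (category, description) pairs into one integrated description."""
    integrated = IntegrationDescription(content_id)
    for category, description in components:
        if category not in CATEGORIES:
            raise ValueError(f"unknown category: {category}")
        integrated.descriptions.setdefault(category, []).append(description)
    return integrated


def search(database, category, keyword):
    """Return ids of content whose descriptions in `category` contain keyword."""
    keyword = keyword.lower()
    return [
        d.content_id
        for d in database
        if any(keyword in text.lower() for text in d.descriptions.get(category, []))
    ]


# The "database" here is just an in-memory list of integrated descriptions.
database = [
    integrate("clip-1", [("video", "news anchor in studio"), ("audio", "speech, English")]),
    integrate("clip-2", [("image", "mountain landscape"), ("text", "caption: Alps")]),
]
```

Calling `search(database, "video", "anchor")` would then return the ids of the stored items whose video description mentions "anchor". A real system would presumably use structured descriptors rather than substring matching.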
    • 44. Granted invention patent
    • Synthetic audiovisual description scheme, method and system for MPEG-7
    • US06593936B1
    • 2003-07-15
    • US09495171
    • 2000-02-01
    • Qian Huang, Joern Ostermann, Atul Puri, Raj Kumar Rajendran
    • Qian Huang, Joern Ostermann, Atul Puri, Raj Kumar Rajendran
    • G06T15/00
    • G06F17/30017, G06F17/30858, G06F17/30864, H04N1/00209
    • A method and system for description of synthetic audiovisual content makes it easier for humans, software components or devices to identify, manage, categorize, search, browse and retrieve such content. For instance, a user may wish to search for specific synthetic audiovisual objects in digital libraries, Internet web sites or broadcast media; such a search is enabled by the invention. Key characteristics of synthetic audiovisual content itself such as the underlying 2d or 3d models and parameters for animation of these models are used to describe it. More precisely, to represent features of synthetic audiovisual content, depending on the description scheme to be used, a number of descriptors are selected and assigned values. The description scheme instantiated with descriptor values is used to generate the description, which is then stored for actual use during query/search. Typically, a user, to search for a needed synthetic audiovisual content initiates a query that is passed on to a search engine that then retrieves the candidate content from one or more databases whose description closely matches the query criteria specified by the user.
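The retrieval step in the abstract above, where a description made of descriptor values is compared against a query and the closest candidates are returned, might look something like the sketch below. The descriptor names (`model_dimensions`, `animation_parameter_count`), the Euclidean distance, and the `retrieve` helper are assumptions chosen for illustration. The abstract does not specify how closeness is measured.

```python
import math


def distance(query, description):
    """Euclidean distance over the descriptors both sides define."""
    shared = query.keys() & description.keys()
    if not shared:
        return math.inf  # nothing to compare, so rank this item last
    return math.sqrt(sum((query[k] - description[k]) ** 2 for k in shared))


def retrieve(query, database, top_k=1):
    """Return the ids of the top_k descriptions closest to the query."""
    ranked = sorted(database, key=lambda item: distance(query, item[1]))
    return [content_id for content_id, _ in ranked[:top_k]]


# Hypothetical descriptions: a description scheme instantiated with values.
database = [
    ("face-2d", {"model_dimensions": 2, "animation_parameter_count": 10}),
    ("face-3d", {"model_dimensions": 3, "animation_parameter_count": 68}),
]

query = {"model_dimensions": 3, "animation_parameter_count": 60}
best = retrieve(query, database)
```

In practice the descriptors would likely need per-descriptor scaling or weighting, since raw values on very different ranges dominate an unweighted distance.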
    • 45. Granted invention patent
    • Extracting textual information from a video sequence
    • US06587586B1
    • 2003-07-01
    • US08999903
    • 1997-06-12
    • Yuntao Cui, Qian Huang
    • Yuntao Cui, Qian Huang
    • G06K9/34
    • G06K9/325, G06K9/6297, G06K2209/15
    • A method for extracting an image representing textual information from a video sequence includes the following steps. First, receiving a sequence of video frames, each including an image of textual information. Then, locating the textual information in each frame of the video sequence to form a stack of text arrays, each array containing data representing substantially only the textual information. Finally, extracting a single textual image array representing the image of the textual information from the stack of text arrays. Apparatus for extracting an image representing textual information from a video sequence includes a source of a video sequence having a plurality of frames, each containing an image of the textual information; and a processor, coupled to the video sequence source, responsive to all of the plurality of frames, for generating a single array representing an image of the textual information.
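The last step in the abstract above reduces a stack of per-frame text arrays to a single text image. The abstract does not say how the stack is combined, so the sketch below assumes a pixel-wise median, which is one common way to suppress noise that appears in only a few frames. The `fuse_text_stack` name and the nested-list image representation are likewise assumptions.

```python
from statistics import median


def fuse_text_stack(stack):
    """Combine equally sized 2-D arrays into one by a pixel-wise median.

    Assumption: the median stands in for whatever extraction rule the
    patent actually uses; it is not taken from the abstract.
    """
    if not stack:
        raise ValueError("stack must contain at least one array")
    rows, cols = len(stack[0]), len(stack[0][0])
    for array in stack:
        if len(array) != rows or any(len(row) != cols for row in array):
            raise ValueError("all arrays in the stack must have the same shape")
    return [
        [median(array[r][c] for array in stack) for c in range(cols)]
        for r in range(rows)
    ]


# Three aligned 2x2 text regions; the third frame has two corrupted pixels.
stack = [
    [[0, 255], [255, 0]],
    [[0, 255], [255, 0]],
    [[255, 255], [0, 0]],
]
fused = fuse_text_stack(stack)
```

The sketch also assumes the text regions have already been located and aligned across frames, which is the job of the locating step described in the abstract.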
    • 46. Granted invention patent
    • Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data
    • US06405166B1
    • 2002-06-11
    • US09976023
    • 2001-10-15
    • Qian Huang, Ivan Magrin-Chagnolleau, Sarangarajan Parthasarathy, Aaron Edward Rosenberg
    • Qian Huang, Ivan Magrin-Chagnolleau, Sarangarajan Parthasarathy, Aaron Edward Rosenberg
    • G10L17/00
    • G10L17/00
    • A multimedia search apparatus and method for searching multimedia content using speaker detection to segment the multimedia content. The multimedia search apparatus receives a search request from a user device. The search request identifies the target speaker for which the search is to be conducted. Based on the search request, the multimedia search apparatus retrieves multimedia content from a multimedia database. The multimedia search apparatus retrieves models, such as Gaussian Mixture Models (GMMs), from a model storage device, corresponding to the target speaker and background data. Based on the retrieved models, the multimedia search device searches the multimedia data of the multimedia content and segments the multimedia data. The segments are identified by calculating an average normalized score for a block of frames of the multimedia data and determining if the average normalized score for the block of frames exceeds one or more predetermined thresholds.
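The block-level decision in the abstract above (average a normalized score over a block of frames, then compare it with a threshold) can be illustrated with a small sketch. Several things here are assumptions rather than details from the patent: the normalized score is taken to be the log-likelihood ratio between the target-speaker GMM and the background GMM, each frame is a single scalar feature, blocks do not overlap, and only one threshold is used. The function names are hypothetical.

```python
import math


def gmm_log_likelihood(x, gmm):
    """Log-likelihood of scalar x under a 1-D GMM given as (weight, mean, variance)."""
    density = sum(
        w * math.exp(-((x - m) ** 2) / (2 * v)) / math.sqrt(2 * math.pi * v)
        for w, m, v in gmm
    )
    return math.log(density)


def block_scores(frames, target, background, block_size):
    """Average per-frame log-likelihood ratio over non-overlapping blocks."""
    scores = []
    for start in range(0, len(frames) - block_size + 1, block_size):
        block = frames[start:start + block_size]
        total = sum(
            gmm_log_likelihood(x, target) - gmm_log_likelihood(x, background)
            for x in block
        )
        scores.append(total / len(block))
    return scores


def detect(frames, target, background, block_size, threshold):
    """Flag each block whose average normalized score exceeds the threshold."""
    return [s > threshold
            for s in block_scores(frames, target, background, block_size)]


# Toy single-component models: target centred at 5.0, background at 0.0.
target = [(1.0, 5.0, 1.0)]
background = [(1.0, 0.0, 1.0)]
frames = [5.1, 4.9, 5.0, 0.1, -0.1, 0.0]
flags = detect(frames, target, background, block_size=3, threshold=0.0)
```

Here the first block of frames lies near the target model and the second near the background model, so only the first block is flagged. Real speaker detection would use multi-dimensional acoustic features and GMMs with many components.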