    • 1. Granted patent
    • Automatic training of character templates using a transcription and a two-dimensional image source model
    • US5689620A
    • 1997-11-18
    • US431223
    • 1995-04-28
    • Gary E. Kopec; Philip Andrew Chou; Leslie T. Niles
    • G06K9/66; G06K9/62; G06T1/40; G06K9/00
    • G06K9/6256
    • A technique for automatically training a set of character templates using unsegmented training samples uses as input a two-dimensional (2D) image of characters, called glyphs, as the source of training samples, a transcription associated with the 2D image as a source of labels for the glyph samples, and an explicit, formal 2D image source model that models as a grammar the structural and functional features of a set of 2D images that may be used as the source of training data. The input transcription may be a literal transcription associated with the 2D input image, or it may be nonliteral, for example containing logical structure tags for document formatting, such as found in markup languages. The technique uses spatial positioning information about the 2D image modeled by the 2D image source model and uses labels in the transcription to determine labeled glyph positions in the 2D image that identify locations of glyph samples. The character templates are produced using the input 2D image and the labeled glyph positions without assigning pixels to glyph samples prior to training. In one implementation, the 2D image source model is a regular grammar having the form of a finite state transition network, and the transcription is also represented as a finite state network. The two networks are merged to produce a transcription-image network, which is used to decode the input 2D image to produce labeled glyph positions that identify training data samples in the 2D image. In one implementation of the template construction process, a pixel scoring technique is used to produce character templates contemporaneously from blocks of training data samples aligned at glyph positions.
    • 2. Granted patent
    • Method for aligning a text image to a transcription of the image
    • US5689585A
    • 1997-11-18
    • US431004
    • 1995-04-28
    • Dan S. Bloomberg; Leslie T. Niles; Gary E. Kopec; Philip Andrew Chou
    • G06K9/20; G06K9/72
    • G06K9/00469; G06K9/72; G06K2209/01
    • A method for establishing a relationship between a text image and a transcription associated with the text image uses conventional image processing techniques to identify one or more geometric attributes, or image parameters, of each of a sequence of regions of the text image. The transcription labels in the transcription are analyzed to determine a comparable set of parameters in transcription label sequence. A matching operation then matches the respective parameters of the two sequences to identify image regions that match with transcription regions. The result is an output data structure that minimally identifies image locations of interest to a subsequent operation that processes the text image. The output data structure may also pair each of the image locations of interest to a transcription location, in effect producing a set of labeled image locations. In one embodiment, the sequence of locations of words and their observed lengths in the text image are determined. The transcription is analyzed to identify words, and transcription word lengths are computed using an estimated image character width of glyphs in the text image. The sequence of observed image word lengths is then matched to the sequence of computed transcription word lengths using a dynamic programming algorithm that finds a best path through a two-dimensional lattice of nodes and transitions between nodes, where the transitions represent pairs of sequences of zero or more word lengths. An output data structure contains entries, each of which pairs a transcription word with a matching image word location.
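The word-length matching described in the abstract of entry 2 can be illustrated with a small dynamic-programming sketch. It is a simplification under stated assumptions, not the patented method: each lattice transition pairs at most one image word with at most one transcription word (the abstract allows pairs of sequences of zero or more word lengths), and the function name `align_word_lengths`, the absolute-difference cost, and `skip_penalty` are all hypothetical choices made for illustration.

```python
from typing import List, Optional, Tuple


def align_word_lengths(
    image_lengths: List[float],
    transcription_words: List[str],
    char_width: float,
    skip_penalty: float = 50.0,
) -> List[Tuple[Optional[int], Optional[int]]]:
    """Pair image word indices with transcription word indices.

    Returns (image_index, transcription_index) pairs; None marks an item
    that was skipped on one side. All costs are illustrative assumptions.
    """
    # Estimated pixel length of each transcription word.
    est = [len(w) * char_width for w in transcription_words]
    n, m = len(image_lengths), len(est)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back: List[List[Optional[str]]] = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0

    # Fill the 2D lattice; each node keeps its cheapest incoming transition.
    for i in range(n + 1):
        for j in range(m + 1):
            if i > 0 and j > 0:
                c = cost[i - 1][j - 1] + abs(image_lengths[i - 1] - est[j - 1])
                if c < cost[i][j]:
                    cost[i][j], back[i][j] = c, "match"
            if i > 0:
                c = cost[i - 1][j] + skip_penalty
                if c < cost[i][j]:
                    cost[i][j], back[i][j] = c, "skip_image"
            if j > 0:
                c = cost[i][j - 1] + skip_penalty
                if c < cost[i][j]:
                    cost[i][j], back[i][j] = c, "skip_text"

    # Trace the best path back from the final node.
    pairs: List[Tuple[Optional[int], Optional[int]]] = []
    i, j = n, m
    while i > 0 or j > 0:
        move = back[i][j]
        if move == "match":
            pairs.append((i - 1, j - 1))
            i, j = i - 1, j - 1
        elif move == "skip_image":
            pairs.append((i - 1, None))
            i -= 1
        else:
            pairs.append((None, j - 1))
            j -= 1
    pairs.reverse()
    return pairs


if __name__ == "__main__":
    # Four image regions (the third is a small noise blob), three words.
    print(align_word_lengths([30, 50, 8, 30], ["the", "quick", "fox"], 10.0))
```

In this toy input the 8-pixel region is cheaper to skip than to match, so the three transcription words pair with image regions 0, 1, and 3.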
    • 3. Granted patent
    • Unsupervised training of character templates using unsegmented samples
    • US5956419A
    • 1999-09-21
    • US430635
    • 1995-04-28
    • Gary E. Kopec; Philip Andrew Chou
    • G06K9/62; G06K9/68
    • G06K9/68; G06K9/6256
    • A method for operating a machine to perform unsupervised training of a set of character templates uses as the source of training samples an image source of character images, called glyphs, that need not be manually or automatically segmented or isolated prior to training. A recognition operation performed on the image source of character images produces a labeled glyph position data structure that includes, for each glyph in the image source, a glyph image position in the image source associating an estimated image location of the glyph in the image source with a character label paired with the glyph image position that indicates the character in the character set being trained. The labeled glyph position data and the image source are then used to determine sample image regions in the image source; each sample image region is large enough to contain at least a single glyph but need not be restricted in size to only contain a single glyph. The template construction process using unsegmented samples is mathematically modeled as an optimization problem that optimizes a function that represents the set of character templates being trained as an ideal image to be reconstructed to match the input image. The method produces all of the character templates substantially contemporaneously by using a novel pixel scoring technique that implements an approximation of a maximum likelihood criterion subject to a constraint on the templates produced which holds that foreground pixels in adjacently positioned character images have substantially nonoverlapping foreground pixels. The character templates produced may be binary templates or arrays of probability values.
    • 4. Granted patent
    • Method of producing character templates using unsegmented samples
    • US5706364A
    • 1998-01-06
    • US431714
    • 1995-04-28
    • Gary E. Kopec; Philip Andrew Chou
    • G06K9/62; G06R9/62
    • G06K9/6255
    • A method for producing, or training, a set of character templates uses as the source of training samples an image source of character images, called glyphs, that are not previously segmented or isolated for training. Also used is a labeled glyph position data structure that includes, for each glyph in the image source, a glyph image position in the image source associating an image location of the glyph with a character label paired with the glyph image position that indicates the character in the character set being trained. The labeled glyph position data is used to identify a collection of glyph sample image regions in the image source for each character in the character set; each glyph sample image region is large enough to contain a glyph and typically contains adjacent glyphs for other characters. The invention mathematically characterizes the template construction problem using unsegmented samples as an optimization problem that optimizes a function that represents the set of character templates being trained as an ideal image to be reconstructed to match the input image. The method produces all of the character templates contemporaneously by using a novel pixel scoring technique that implements an approximation of a maximum likelihood criterion subject to a constraint on the templates produced which holds that foreground pixels in adjacently positioned character images have substantially nonoverlapping foreground pixels. The character templates produced may be binary templates or arrays of pixel color probability values, and may also have substantially disjoint supports, such that adjacently imaged templates have substantially no overlapping foreground pixels.
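To make the idea of building a template from unsegmented sample regions in entry 4 more concrete, here is a rough sketch. It is a deliberately simplified stand-in: it takes a per-pixel vote across sample windows aligned at labeled glyph positions and thresholds the result. It does not implement the maximum-likelihood pixel scoring or the nonoverlap constraint the abstract describes, and it handles one character at a time rather than all templates contemporaneously. The name `estimate_template`, the window size, and the 0.5 threshold are assumptions.

```python
from typing import List, Tuple

Image = List[List[int]]  # rows of 0/1 pixels, 1 = foreground


def estimate_template(
    image: Image,
    origins: List[Tuple[int, int]],
    height: int,
    width: int,
    threshold: float = 0.5,
) -> Tuple[List[List[float]], Image]:
    """Average sample windows anchored at labeled glyph positions.

    origins holds the (row, col) of each window's top-left corner for one
    character. Windows may include pixels of neighbouring glyphs; pixels
    that appear in only a minority of samples are voted out.
    Returns (probability array, binary template).
    """
    counts = [[0] * width for _ in range(height)]
    for r0, c0 in origins:
        for dr in range(height):
            for dc in range(width):
                r, c = r0 + dr, c0 + dc
                # Ignore window pixels that fall outside the page image.
                if 0 <= r < len(image) and 0 <= c < len(image[r]):
                    counts[dr][dc] += image[r][c]
    n = max(len(origins), 1)
    probs = [[cnt / n for cnt in row] for row in counts]
    binary = [[1 if p > threshold else 0 for p in row] for row in probs]
    return probs, binary


if __name__ == "__main__":
    page = [
        [0, 1, 0, 1, 0, 1, 0, 0],
        [0, 1, 0, 0, 0, 1, 1, 0],
        [0, 1, 0, 0, 0, 1, 0, 0],
    ]
    # Two samples of the same character, windows of 3x3 pixels.
    probs, template = estimate_template(page, [(0, 0), (0, 4)], 3, 3)
    for row in template:
        print(row)
```

The stray foreground pixel that appears in only one of the two windows ends up with probability 0.5 and is excluded from the binary template, which loosely mirrors how unsegmented samples can tolerate intrusions from adjacent glyphs.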
    • 5. Patent application
    • Recovering dis-occluded areas using temporal information integration
    • US20130294710A1
    • 2013-11-07
    • US13463934
    • 2012-05-04
    • Philip Andrew Chou; Cha Zhang; Zhengyou Zhang; Shujie Liu
    • G06K9/32
    • G06K9/32; G06T7/593
    • A temporal information integration dis-occlusion system and method for using historical data to reconstruct a virtual view containing an occluded area. Embodiments of the system and method use temporal information of the scene captured previously to obtain a total history. This total history is warped onto information captured by a camera at a current time in order to help reconstruct the dis-occluded areas. The historical data (or frames) from the total history match only a portion of the frames contained in the captured information. This warping yields warped history information. Warping is performed by using one of two embodiments to match points in an estimation of the current information to points in the captured information. Next, regions of current information are split using a classifier. The warped history information and the captured information then are merged to obtain an estimate for the current information and the reconstructed virtual view.
    • 6. Granted patent
    • VoIP contextual information processing
    • US08228824B2
    • 2012-07-24
    • US11398817
    • 2006-04-06
    • David Milstein; David Howell; Kuansan Wang; Linda Criddle; Michael D Malueg; Philip Andrew Chou; Scott Forbes; Timothy M Moore
    • H04Q11/00
    • H04L29/06027
    • A method and system for processing contextual information relating to an exchange of a conversation over a communication channel is provided. Several users, and/or service providers are allowed to specify a set of rules relating to a conversation channel. Contextual information, including information relating to the specified set of rules and conditions of the users, is exchanged among the users and/or several service providers when one user requests to initiate a communication channel. The received contextual information is processed to extract a set of rules and the current conditions of users. If the current conditions of the users satisfy the set of rules, a corresponding communication channel is established among the users. However, additional contextual information may be received and processed whenever there is a change in the contextual information during the conversation. Appropriate actions to the existing communication channel may be determined based on the changes.
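The rule check described in the abstract of entry 6 (establish a channel only if the users' current conditions satisfy the specified rules, then re-evaluate when contextual information changes) can be sketched as below. Everything here is hypothetical: the representation of rules as predicates over a condition dictionary, the function names, and the action labels `"establish"`, `"terminate"`, and `"no_change"` are assumptions, since the abstract does not say what the "appropriate actions" are.

```python
from typing import Callable, Dict, List

Conditions = Dict[str, object]
Rule = Callable[[Conditions], bool]


def should_establish_channel(rules: List[Rule], conditions: Conditions) -> bool:
    """Return True only when every rule accepts the current conditions."""
    return all(rule(conditions) for rule in rules)


def react_to_change(
    rules: List[Rule], conditions: Conditions, channel_open: bool
) -> str:
    """Pick an (assumed) action after the contextual information changes."""
    ok = should_establish_channel(rules, conditions)
    if ok and not channel_open:
        return "establish"
    if not ok and channel_open:
        return "terminate"
    return "no_change"


if __name__ == "__main__":
    # Hypothetical rules: callee must be available, and during office hours.
    rules: List[Rule] = [
        lambda c: c.get("callee_status") == "available",
        lambda c: 9 <= int(c.get("hour", 0)) < 18,
    ]
    print(react_to_change(rules, {"callee_status": "available", "hour": 10}, False))
    print(react_to_change(rules, {"callee_status": "busy", "hour": 10}, True))
```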
    • 7. Patent application
    • Speech separation with microphone arrays
    • US20090214052A1
    • 2009-08-27
    • US12035439
    • 2008-02-22
    • Zicheng Liu; Philip Andrew Chou; Jacek Dmochowski
    • H04R3/00
    • H04R27/00; G10L21/0272
    • A system that facilitates blind source separation in a distributed microphone meeting environment for improved teleconferencing. Input sensor (e.g., microphone) signals are transformed to the frequency-domain and independent component analysis is applied to compute estimates of frequency-domain processing matrices. Modified permutations of the processing matrices are obtained based upon a maximum magnitude based de-permutation scheme. Estimates of the plurality of source signals are provided based upon the modified frequency-domain processing matrices and input sensor signals. Optionally, segments during which the set of active sources is a subset of the set of all sources can be exploited to compute more accurate estimates of frequency-domain mixing matrices. Source activity detection can be applied to determine which speaker(s), if any, are active. Thereafter, a least squares post-processing of the frequency-domain independent components analysis outputs can be employed to adjust the estimates of the source signals based on source inactivity.
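Frequency-domain independent component analysis leaves the ordering of separated sources ambiguous in each frequency bin, which is why the abstract of entry 7 mentions a de-permutation step. The abstract does not spell out the "maximum magnitude based" scheme, so the sketch below shows only one plausible reading, stated as an assumption: for each bin, reorder the rows of the processing matrix so that the sum of the magnitudes on the diagonal is as large as possible. The function names are hypothetical, and the exhaustive search is only practical for a handful of sources.

```python
from itertools import permutations
from typing import List

Matrix = List[List[complex]]


def depermute(w: Matrix) -> Matrix:
    """Reorder rows of one square per-bin matrix (assumed criterion).

    Chooses the row order that maximises the sum of |diagonal| entries,
    so that output k is tied to the sensor it responds to most strongly.
    """
    n = len(w)
    best = max(
        permutations(range(n)),
        key=lambda p: sum(abs(w[p[k]][k]) for k in range(n)),
    )
    return [w[best[k]] for k in range(n)]


def depermute_all_bins(bins: List[Matrix]) -> List[Matrix]:
    """Apply the same reordering rule independently to every frequency bin."""
    return [depermute(w) for w in bins]


if __name__ == "__main__":
    # Two bins of a 2x2 processing matrix; the second bin has swapped rows.
    bin_a: Matrix = [[0.9, 0.1], [0.2, 0.8 + 0.1j]]
    bin_b: Matrix = [[0.1, 0.9], [0.8 + 0.1j, 0.2]]
    for fixed in depermute_all_bins([bin_a, bin_b]):
        print(fixed)
```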
    • 10. Patent application
    • Multimode authentication using VOIP
    • US20070283142A1
    • 2007-12-06
    • US11447470
    • 2006-06-05
    • David Milstein; Philip Andrew Chou; Scott C. Forbes; Timothy M. Moore
    • H04L9/00
    • H04L63/08; H04L65/1069
    • Generally described, multimode authentication over a VoIP communication channel is provided. A calling client and a called client may be authenticated for a communication channel establishment. When a calling client requests a call connection with a called client, the calling client is authenticated for the communication channel, based on exchanged contextual information between the calling client and the called client. Likewise, the called client is authenticated for the communication channel by the calling client. Upon authentication, a communication channel is established, over which the calling client and the called client are allowed to exchange more contextual and voice/multimedia information. During a conversation, when a secured service is desired by any of the clients, a series of authentication processes can be performed to grant access to the secured service over the communication channel without loss of the communication channel connection.