会员体验
专利管家(专利管理)
工作空间(专利管理)
风险监控(情报监控)
数据分析(专利分析)
侵权分析(诉讼无效)
联系我们
交流群
官方交流:
QQ群: 891211   
微信请扫码    >>>
现在联系顾问~
热词
    • 1. 发明授权
    • Automatic extraction of character ground truth data from images
    • 从图像中自动提取字符地面真值数据
    • US08755595B1
    • 2014-06-17
    • US13186173
    • 2011-07-19
    • Alessandro BissaccoKrishnendu Chaudhury
    • Alessandro BissaccoKrishnendu Chaudhury
    • G06K9/62
    • G06K9/6256
    • Embodiments for automatic extraction of character ground truth data from images are disclosed. A transcription may be rendered in a plurality of fonts and orientations to obtain a set of candidate word templates with associated character bounding boxes. A word template may be selected from the set of candidate word templates, wherein the selected word template corresponds to a word patch from an image. The character bounding boxes, of the selected word template, may be evaluated in a plurality of orientations about each respective character from the word patch to obtain a set of candidate character templates. For each respective character from the word patch, a character template may be selected from the set of candidate character templates, wherein each selected character template corresponds to the respective character from the word patch. Character ground truth data comprising the selected character templates oriented to correspond to the word patch, may be provided as training data for recognizing the characters of the word patch from the image.
    • 公开了从图像自动提取字符地面真值数据的实施例。 可以以多个字体和取向呈现转录,以获得具有相关联的字符边界框的一组候选词模板。 可以从候选词模板集中选择单词模板,其中所选择的单词模板对应于来自图像的单词补丁。 所选择的单词模板的字符边界框可以从关于每个相应字符的多个方向从单词补丁中求出,以获得一组候选字符模板。 对于来自单词补丁的每个相应字符,可以从候选字符模板集合中选择字符模板,其中每个所选字符模板对应于来自字补丁的相应字符。 可以将包括定向为对应于单词补丁的所选择的字符模板的角色地面真实数据提供为用于从图像识别单词补丁的字符的训练数据。
    • 2. 发明申请
    • OPTICAL CHARACTER RECOGNITION BY ITERATIVE RE-SEGMENTATION OF TEXT IMAGES USING HIGH-LEVEL CUES
    • 通过使用高级别的文本图像迭代重新分类来进行光学字符识别
    • US20150055866A1
    • 2015-02-26
    • US13480728
    • 2012-05-25
    • Mark Joseph CumminsAlessandro Bissacco
    • Mark Joseph CumminsAlessandro Bissacco
    • G06K9/34
    • G06K9/344G06K9/723G06K2209/01
    • Disclosed techniques include receiving an electronic image containing depictions of characters, segmenting at least some of the depictions of characters using a first segmentation technique to produce a first segmented portion, and performing a first character recognition on the first segmented portion to determine a first sequence of characters. The techniques also include determining, based on the performing the first character recognition, that the first sequence of characters does not match the depictions of characters. The techniques further include segmenting at least some of the depictions of characters using a second segmentation technique, based on the determining, to produce a second segmented portion, and performing a second character recognition on at least a portion of the second segmented portion to produce a second sequence of characters. The techniques also include outputting a third sequence of characters based on at least part of the second sequence of characters.
    • 所公开的技术包括接收包含字符描述的电子图像,使用第一分割技术分割至少一些字符描绘以产生第一分段部分,以及在第一分段部分上执行第一字符识别以确定第一序列 人物。 这些技术还包括基于执行第一字符识别确定第一字符序列与字符的描绘不匹配。 所述技术还包括基于确定产生第二分割部分并且在第二分割部分的至少一部分上执行第二字符识别来使用第二分割技术来分割字符的至少一些描绘,以产生 第二个字符序列。 这些技术还包括基于第二个字符序列的至少一部分来输出第三个字符序列。
    • 3. 发明授权
    • Text recognition for textually sparse images
    • 文本稀疏图像的文本识别
    • US08718365B1
    • 2014-05-06
    • US12608877
    • 2009-10-29
    • Alessandro BissaccoHartmut Neven
    • Alessandro BissaccoHartmut Neven
    • G06K9/34G06K9/32
    • G06K9/34G06K9/32G06K9/3208G06K9/3233G06K9/342G06K9/6857G06K2209/01
    • A text recognition server is configured to recognize text in a sparse text image. Specifically, given an image, the server specifies a plurality of “patches” (blocks of pixels within the image). The system applies a text detection algorithm to the patches to determine a number of the patches that contain text. This application of the text detection algorithm is used both to estimate the orientation of the image and to determine whether the image is textually sparse or textually dense. If the image is determined to be textually sparse, textual patches are identified and grouped into text regions, each of which is then separately processed by an OCR algorithm, and the recognized text for each region is combined into a result for the image as a whole.
    • 文本识别服务器被配置为识别稀疏文本图像中的文本。 具体地,给定图像,服务器指定多个“补丁”(图像内的像素块)。 系统将文本检测算法应用于修补程序,以确定包含文本的多个修补程序。 文本检测算法的这种应用被用于估计图像的取向并确定图像是文本稀疏的还是文本密集的。 如果图像被确定为文本上稀疏的,则文本补丁被识别并分组成文本区域,然后每个文本区域被OCR算法分开处理,并且将每个区域的识别文本合并为整个图像的结果 。
    • 4. 发明授权
    • Detecting humans via their pose
    • 通过他们的姿势来检测人类
    • US07519201B2
    • 2009-04-14
    • US11553388
    • 2006-10-26
    • Ming-Hsuan YangAlessandro Bissacco
    • Ming-Hsuan YangAlessandro Bissacco
    • G06K9/00G06K9/62
    • G06K9/4647G06K9/00369
    • A method and system efficiently and accurately detects humans in a test image and classifies their pose. In a training stage, a probabilistic model is derived in an unsupervised or semi-supervised manner such that at least some poses are not manually labeled. The model provides two sets of model parameters to describe the statistics of images containing humans and images of background scenes. In a testing stage, the probabilistic model is used to determine if a human is present in the image, and classify the human's pose based on the poses in the training images. A solution is efficiently provided to both human detection and pose classification by using the same probabilistic model to solve the problems.
    • 一种方法和系统有效和准确地检测测试图像中的人类并对其姿态进行分类。 在训练阶段,以无监督或半监督的方式导出概率模型,使得至少一些姿势不是手动标记的。 该模型提供两组模型参数来描述包含人类和背景场景图像的图像的统计。 在测试阶段,概率模型用于确定人物是否存在于图像中,并且基于训练图像中的姿态对人的姿势进行分类。 通过使用相同的概率模型来解决问题,有效地提供了人类检测和姿态分类的解决方案。
    • 7. 发明授权
    • Systems and methods for selecting interest point descriptors for object recognition
    • 用于选择对象识别的兴趣点描述符的系统和方法
    • US08086616B1
    • 2011-12-27
    • US12404857
    • 2009-03-16
    • Alessandro BissaccoUlrich BuddemeierHartmut Neven
    • Alessandro BissaccoUlrich BuddemeierHartmut Neven
    • G06F7/00
    • G06K9/623
    • Systems and methods for selecting interest point descriptors for object recognition. In an embodiment, the present invention estimates performance of local descriptors by (1) receiving a local descriptor relating to an object in a first image; (2) identifying one or more nearest neighbor descriptors relating to one or more images different from the first image, the nearest neighbor descriptors comprising nearest neighbors of the local descriptor; (3) calculating a quality score for the local descriptor based on the number of nearest neighbor descriptors that relate to images showing the object; and (4) determining, on the basis of the quality score, if the local descriptor is effective in identifying the object.
    • 用于选择对象识别的兴趣点描述符的系统和方法。 在一个实施例中,本发明通过(1)接收与第一图像中的对象有关的局部描述符来估计本地描述符的性能; (2)识别与不同于第一图像的一个或多个图像相关的一个或多个最近邻描述符,最近邻描述符包括本地描述符的最近邻; (3)基于与显示对象的图像相关的最近邻描述符的数量来计算本地描述符的质量得分; 以及(4)基于质量分数来确定本地描述符是否有效地识别对象。
    • 8. 发明授权
    • Fast human pose estimation using appearance and motion via multi-dimensional boosting regression
    • 通过多维加速回归,使用外观和运动的快速人体姿态估计
    • US07778446B2
    • 2010-08-17
    • US11950662
    • 2007-12-05
    • Ming-Hsuan YangAlessandro Bissacco
    • Ming-Hsuan YangAlessandro Bissacco
    • G06K9/00
    • G06K9/00342G06K9/6257
    • Methods and systems are described for three-dimensional pose estimation. A training module determines a mapping function between a training image sequence and pose representations of a subject in the training image sequence. The training image sequence is represented by a set of appearance and motion patches. A set of filters are applied to the appearance and motion patches to extract features of the training images. Based on the extracted features, the training module learns a multidimensional mapping function that maps the motion and appearance patches to the pose representations of the subject. A testing module outputs a fast human pose estimation by applying the learned mapping function to a test image sequence.
    • 描述了用于三维姿态估计的方法和系统。 训练模块确定训练图像序列和训练图像序列中的对象的姿态表示之间的映射函数。 训练图像序列由一组外观和运动补丁表示。 将一组滤镜应用于外观和运动补片以提取训练图像的特征。 基于提取的特征,训练模块学习将运动和外观补片映射到对象的姿态表示的多维映射函数。 测试模块通过将学习的映射函数应用于测试图像序列来输出快速人体姿态估计。
    • 9. 发明授权
    • System and method of determining building numbers
    • 确定建筑物数量的系统和方法
    • US08787673B2
    • 2014-07-22
    • US13181081
    • 2011-07-12
    • Bo WuAlessandro BissaccoRaymond W. SmithKong man CheungAndrea FromeShlomo Urbach
    • Bo WuAlessandro BissaccoRaymond W. SmithKong man CheungAndrea FromeShlomo Urbach
    • G06K9/18
    • G06K9/3258G06K9/00G06K9/18G06K2209/01G06Q50/10
    • A system and method is provided for automatically recognizing building numbers in street level images. In one aspect, a processor selects a street level image that is likely to be near an address of interest. The processor identifies those portions of the image that are visually similar to street numbers, and then extracts the numeric values of the characters displayed in such portions. If an extracted value corresponds with the building number of the address of interest such as being substantially equal to the address of interest, the extracted value and the image portion are displayed to a human operator. The human operator confirms, by looking at the image portion, whether the image portion appears to be a building number that matches the extracted value. If so, the processor stores a value that associates that building number with the street level image.
    • 提供了一种用于自动识别街道图像中的建筑物编号的系统和方法。 在一个方面,处理器选择可能靠近感兴趣的地址的街道级图像。 处理器识别图像中与街道号码视觉相似的那些部分,然后提取在这些部分中显示的字符的数值。 如果提取的值对应于感兴趣的地址的建筑物号码,例如基本上等于感兴趣的地址,则提取的值和图像部分被显示给人类操作者。 人类操作者通过观察图像部分来确认图像部分是否看起来是与提取的值相匹配的建筑物号码。 如果是这样,处理器存储将建筑物号码与街道图像相关联的值。
    • 10. 发明申请
    • DETECTING HUMANS VIA THEIR POSE
    • 通过他们的检测人类
    • US20070098254A1
    • 2007-05-03
    • US11553388
    • 2006-10-26
    • Ming-Hsuan YangAlessandro Bissacco
    • Ming-Hsuan YangAlessandro Bissacco
    • G06K9/62G06K9/00
    • G06K9/4647G06K9/00369
    • A method and system efficiently and accurately detects humans in a test image and classifies their pose. In a training stage, a probabilistic model is derived in an unsupervised or semi-supervised manner such that at least some poses are not manually labeled. The model provides two sets of model parameters to describe the statistics of images containing humans and images of background scenes. In a testing stage, the probabilistic model is used to determine if a human is present in the image, and classify the human's pose based on the poses in the training images. A solution is efficiently provided to both human detection and pose classification by using the same probabilistic model to solve the problems.
    • 一种方法和系统有效和准确地检测测试图像中的人类并对其姿态进行分类。 在训练阶段,以无监督或半监督的方式导出概率模型,使得至少一些姿势不是手动标记的。 该模型提供两组模型参数来描述包含人类和背景场景图像的图像的统计。 在测试阶段,概率模型用于确定人物是否存在于图像中,并且基于训练图像中的姿态对人的姿势进行分类。 通过使用相同的概率模型来解决问题,有效地提供了人类检测和姿态分类的解决方案。