Quick Patent Search: fast search of global patents, free commercial patent database - IPRDB

1. Granted invention patent

US08385643B2 Determination of inputted image to be document or non-document [In force]
Publication (announcement) No.: US08385643B2
Publication (announcement) date: 2013-02-26
Application No.: US12353440
Filing date: 2009-01-14
Applicants: Jilin Li, Zhi-Gang Fan, Yadong Wu, Bo Wu, Ning Le
Inventors: Jilin Li, Zhi-Gang Fan, Yadong Wu, Bo Wu, Ning Le
IPC classification: G06K9/00, G06K9/54, G06F17/00
CPC classification: G06K9/00463, G06K9/00456, G06K9/38, G06K9/4647, G06K2209/01
Abstract: A preprocessing section binarizes input image data and calculates a total black pixel ratio. A feature extracting section detects connected components included in the binary image data and detects circumscribing bounding boxes of the connected components. Predetermined connected components are removed from all of the connected components based on the sizes of the detected circumscribing bounding boxes and bounding box black pixel ratios. By using the connected components that remain after removing the unnecessary connected components, a histogram is generated by specifying the sizes of the circumscribing bounding boxes as classes and numbers of the connected components as the frequencies of occurrence. A determining section determines whether the input image data is document image data or non-document image data based on information related to the generated histogram and the total black pixel ratio.
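The feature-extraction steps in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the threshold of 128, the 4-connectivity, the filtering limits (`max_side`, `min_fill`) and the choice of the longer box side as the histogram class are all assumptions made for illustration, and the final document/non-document decision is left out because the abstract does not state its rule.

```python
from collections import Counter, deque

def binarize(gray, threshold=128):
    """Map grayscale rows to 1 (black) or 0 (white); the threshold is an assumed value."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]

def black_ratio(binary):
    """Total black pixel ratio: black pixels divided by all pixels."""
    total = sum(len(row) for row in binary)
    return sum(map(sum, binary)) / total

def bounding_boxes(binary):
    """Return (width, height, black_count) for each 4-connected black component."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    boxes = []
    for y in range(h):
        for x in range(w):
            if not binary[y][x] or seen[y][x]:
                continue
            seen[y][x] = True
            queue = deque([(y, x)])
            top = bottom = y
            left = right = x
            count = 0
            while queue:  # breadth-first flood fill over one component
                cy, cx = queue.popleft()
                count += 1
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for ny, nx in ((cy + 1, cx), (cy - 1, cx), (cy, cx + 1), (cy, cx - 1)):
                    if 0 <= ny < h and 0 <= nx < w and binary[ny][nx] and not seen[ny][nx]:
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((right - left + 1, bottom - top + 1, count))
    return boxes

def size_histogram(boxes, max_side=50, min_fill=0.1):
    """Drop oversized or sparse boxes (assumed rule), then count boxes per size class."""
    kept = [(bw, bh) for bw, bh, n in boxes
            if max(bw, bh) <= max_side and n / (bw * bh) >= min_fill]
    return Counter(max(bw, bh) for bw, bh in kept)

# Tiny hypothetical image: one 2x2 blob and one isolated pixel.
gray = [
    [255, 0, 0, 255, 255],
    [255, 0, 0, 255, 0],
    [255, 255, 255, 255, 255],
]
binary = binarize(gray)
print(black_ratio(binary))
print(size_histogram(bounding_boxes(binary)))
```

A determining step would then compare features of this histogram and the total black pixel ratio against criteria that the abstract leaves unspecified.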

2. Granted invention patent

US08208765B2 Search and retrieval of documents indexed by optical character recognition [In force]
Publication (announcement) No.: US08208765B2
Publication (announcement) date: 2012-06-26
Application No.: US11972446
Filing date: 2008-01-10
Applicants: Bo Wu, Jianjun Dou, Ning Le, Yadong Wu, Jing Jia
Inventors: Bo Wu, Jianjun Dou, Ning Le, Yadong Wu, Jing Jia
IPC classification: G06K9/00
CPC classification: G06F17/30253, G06K9/723, G06K2209/01, G06K2209/011
Abstract: An image of a character string composed of M pieces of characters is clipped from a document image, and the image is divided into separate characters. Image features of each character image are extracted. Based on the image features, N (N>1, integer) pieces of character images in descending order of degree of similarity are selected as candidate characters, from a character image feature dictionary which stores the image features of character image in units of character, and a first index matrix of M×N cells is prepared. A candidate character string composed of a plurality of candidate characters constituting a first column of the first index matrix, is subjected to a lexical analysis according to a language model, and whereby a second index matrix having a character string which makes sense is prepared. In the language model, statistics are taken and then, the lexical analysis is performed.
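As a rough illustration of the M×N index matrix described here, the following Python sketch ranks dictionary characters by feature distance and then adjusts the first column with a lexicon. The two-dimensional feature vectors, the Euclidean distance as the similarity measure, and the plain word list standing in for the statistical language model are all assumptions; the abstract does not specify any of them.

```python
import math
from itertools import product

def top_candidates(feature, dictionary, n):
    """Return the n dictionary characters closest to `feature`, most similar first."""
    ranked = sorted(dictionary, key=lambda ch: math.dist(feature, dictionary[ch]))
    return ranked[:n]

def build_index_matrix(char_features, dictionary, n):
    """First index matrix: one row per clipped character, one column per rank (M x N)."""
    return [top_candidates(f, dictionary, n) for f in char_features]

def apply_lexicon(matrix, lexicon):
    """Second index matrix: promote the lowest-rank-sum string found in `lexicon`
    to the first column. A word list is a stand-in for the language model."""
    best = None
    for ranks in product(range(len(matrix[0])), repeat=len(matrix)):
        word = "".join(row[r] for row, r in zip(matrix, ranks))
        if word in lexicon and (best is None or sum(ranks) < sum(best)):
            best = ranks
    if best is None:
        return [list(row) for row in matrix]
    return [[row[r]] + [c for i, c in enumerate(row) if i != r]
            for row, r in zip(matrix, best)]

# Hypothetical feature dictionary and three clipped character images (M = 3, N = 2).
dictionary = {"c": (0.0, 0.0), "e": (0.1, 0.0), "a": (1.0, 0.0),
              "t": (0.0, 1.0), "l": (0.1, 1.0)}
features = [(0.06, 0.0), (0.9, 0.1), (0.04, 1.0)]

first = build_index_matrix(features, dictionary, n=2)
second = apply_lexicon(first, {"cat"})
print(first)
print(second)
```

Enumerating every candidate combination is exponential in M, so this only works for short strings; it is chosen here for clarity, not efficiency.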

3. Granted invention patent

US08200012B2 Image determination apparatus, image search apparatus and computer readable recording medium storing an image search program [In force]
Publication (announcement) No.: US08200012B2
Publication (announcement) date: 2012-06-12
Application No.: US12393772
Filing date: 2009-02-26
Applicants: Jilin Li, Zhi-Gang Fan, Yadong Wu, Bo Wu
Inventors: Jilin Li, Zhi-Gang Fan, Yadong Wu, Bo Wu
IPC classification: G06K9/34
CPC classification: G06K9/54, G06K9/346, G06K9/522
Abstract: A preprocessing section binarizes input image data and calculates a total black pixel ratio. A feature extracting section detects connected components contained in the binarized image data and detects circumscribing bounding boxes that circumscribe these connected components, respectively. Based on sizes of the circumscribing bounding boxes detected and numbers of black pixels contained therein, predetermined connected components are removed. A determining section generates an edge map by using the residual connected components, and performs two-dimensional fast Fourier transform thereon to generate spectral data. The determining section performs two-dimensional fast Fourier transform on template images to generate spectral data. The determining section determines, based on these pieces of spectral data, whether or not a circular shape is contained in the input image data.

4. Granted invention patent

US08027550B2 Image-document retrieving apparatus, method of retrieving image document, program, and recording medium [In force]
Publication (announcement) No.: US08027550B2
Publication (announcement) date: 2011-09-27
Application No.: US11998793
Filing date: 2007-11-30
Applicants: Mang Chen, Bo Wu, Yadong Wu, Chen Xu
Inventors: Mang Chen, Bo Wu, Yadong Wu, Chen Xu
IPC classification: G06K9/54, G06F17/00
CPC classification: G06K9/4604, G06K9/00456, G06K2209/01
Abstract: Feature vectors used in discrimination of images include information on feature blocks of images in an image-document retrieving apparatus of the present invention. Text areas of a page image document are combined to form rectangular images. On the basis of information on the rectangular images that are extracted, a geometric structure of the page is analyzed, the page image document is divided into plural blocks, and then a plurality of feature blocks describing features of the page document image are selected from the plural blocks. The feature vectors are constituted of information on the feature blocks thus selected. This makes it possible to provide an image-document retrieving apparatus and a method of retrieving image documents, by which retrieval of image documents containing mainly text and a graphic is improvable in accuracy.

5. Invention patent application

US20090028435A1 CHARACTER IMAGE EXTRACTING APPARATUS AND CHARACTER IMAGE EXTRACTING METHOD [In force]
Publication (announcement) No.: US20090028435A1
Publication (announcement) date: 2009-01-29
Application No.: US11963613
Filing date: 2007-12-21
Applicants: Bo Wu, Jianjun Dou, Ning Le, Yadong Wu, Jing Jia
Inventors: Bo Wu, Jianjun Dou, Ning Le, Yadong Wu, Jing Jia
IPC classification: G06K9/46
CPC classification: G06K9/34, G06K9/342, G06K9/348, G06K2209/01
Abstract: In an extracting step, the extracting portion obtains a linked component composed of a plurality of mutually linking pixels from a character string region composed of a plurality of characters, and extracts section elements from the character string region, the section elements each being surrounded by a circumscribing figure circumscribing to the linked component. In the first altering step, the first altering portion combines section elements at least having a mutually overlapping part among the extracted section elements so as to prepare a new section element. In the first selecting step, the first selecting portion determines a reference size in advance and selects section elements having a size greater than the reference size, from among the section elements altered in the first altering step.
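The first altering step and the first selecting step can be sketched in Python as below. The sketch assumes that the circumscribing figures are axis-aligned rectangles given as (left, top, right, bottom) with inclusive pixel coordinates, and that "size" means the longer side; neither detail is stated in the abstract.

```python
def overlaps(a, b):
    """True if two inclusive (left, top, right, bottom) boxes share at least one pixel."""
    return a[0] <= b[2] and b[0] <= a[2] and a[1] <= b[3] and b[1] <= a[3]

def merge_overlapping(boxes):
    """Repeatedly fuse any overlapping pair into its common circumscribing box."""
    boxes = list(boxes)
    merged = True
    while merged:
        merged = False
        for i in range(len(boxes)):
            for j in range(i + 1, len(boxes)):
                if overlaps(boxes[i], boxes[j]):
                    a, b = boxes[i], boxes[j]
                    boxes[i] = (min(a[0], b[0]), min(a[1], b[1]),
                                max(a[2], b[2]), max(a[3], b[3]))
                    del boxes[j]
                    merged = True
                    break
            if merged:
                break  # restart the scan, since the fused box may overlap others
    return boxes

def select_large(boxes, reference):
    """Keep boxes whose longer side exceeds the reference size (assumed size measure)."""
    return [b for b in boxes
            if max(b[2] - b[0] + 1, b[3] - b[1] + 1) > reference]

# Hypothetical section elements: two overlapping parts of one character,
# a small mark, and a separate character.
elements = [(0, 0, 4, 9), (3, 2, 8, 9), (12, 0, 13, 1), (20, 0, 28, 9)]
altered = merge_overlapping(elements)
selected = select_large(altered, reference=5)
print(altered)
print(selected)
```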

6. Invention patent application

US20080244378A1 Information processing device, information processing system, information processing method, program, and storage medium [Under examination, published]
Publication (announcement) No.: US20080244378A1
Publication (announcement) date: 2008-10-02
Application No.: US12002671
Filing date: 2007-12-18
Applicants: Mang Chen, Bo Wu, Yadong Wu, Chen Xu, Ning Le
Inventors: Mang Chen, Bo Wu, Yadong Wu, Chen Xu, Ning Le
IPC classification: G06F17/21
CPC classification: G06K9/033, G06K9/00456
Abstract: An information processing device includes: a feature extracting section for extracting, as format information, a format feature of a process-target document from image data of the process-target document, on which filling-in spaces of plural items are printed; a document recognizing section for comparing the format information of the process-target document with registered format information stored in a storage device, and specifying a registered document that corresponds to the process-target document, the registered format information regarding format features of registered documents; a data acquiring section for converting characters in the image data of the process-target document into text data; and a distributing section for grouping the image data and text data of the characters into plural groups according to a separation rule that is set for the registered document, the characters being written in the fill-in spaces of the items of the process-target document, and for transmitting the different groups to different external devices. With this, information such as personal information to be protected can be processed, preventing an operator dealing with the information from obtaining the whole information.
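The distributing section's grouping step can be pictured with a short Python sketch. The shapes used here are hypothetical: `fields` maps an item name to its recognized text, and `rule` maps an item name to the identifier of the external device that should receive it. The abstract also groups the image data of each item, which this sketch omits.

```python
def distribute(fields, rule):
    """Split recognized form items into per-device groups according to a separation rule."""
    groups = {}
    for item, text in fields.items():
        groups.setdefault(rule[item], {})[item] = text
    return groups

# Hypothetical form content and separation rule for one registered document type.
fields = {"name": "A. Example", "birth_date": "1980-01-01", "account": "12345"}
rule = {"name": "device_a", "birth_date": "device_b", "account": "device_b"}

groups = distribute(fields, rule)
for device, payload in groups.items():
    print(device, payload)
```

Because no single group holds every item, an operator at one device sees only part of the form, which is the protective effect the abstract describes.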

7. Granted invention patent

US08255215B2 Method and apparatus for locating speech keyword and speech recognition system [In force]
Publication (announcement) No.: US08255215B2
Publication (announcement) date: 2012-08-28
Application No.: US12443063
Filing date: 2007-09-27
Applicants: Fengqin Li, Yadong Wu, Qinqtao Yang, Chen Chen
Inventors: Fengqin Li, Yadong Wu, Qinqtao Yang, Chen Chen
IPC classification: G10L15/26, G10L19/14, G10L15/04
CPC classification: G10L15/02, G10L15/10, G10L15/142, G10L2015/025, G10L2015/088
Abstract: It is an object of the present invention to provide a method and apparatus for locating a keyword of a speech and a speech recognition system. The method includes the steps of: by extracting feature parameters from frames constituting the recognition target speech, forming a feature parameter vector sequence that represents the recognition target speech; by normalizing of the feature parameter vector sequence with use of a codebook containing a plurality of codebook vectors, obtaining a feature trace of the recognition target speech in a vector space; and specifying the position of a keyword by matching prestored keyword template traces with the feature trace. According to the present invention, a keyword template trace and a feature space trace of a recognition target speech are drawn in accordance with an identical codebook. This causes resampling to be unnecessary in performing linear movement matching of speech wave frames having similar phonological feature structures. This makes it possible to improve the speed of location and recognition while ensuring the precision of recognition.
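One way to picture the codebook-based trace and the sliding match is the Python sketch below. It rests on assumptions the abstract does not confirm: "normalizing" is read as nearest-codeword quantization followed by collapsing consecutive repeats, frames are two-dimensional toy vectors, and the matching cost is a simple count of mismatched codewords.

```python
import math

def quantize(frames, codebook):
    """Replace each frame vector by the index of its nearest codebook vector."""
    return [min(range(len(codebook)), key=lambda k: math.dist(f, codebook[k]))
            for f in frames]

def feature_trace(frames, codebook):
    """Quantize, then collapse runs of the same codeword so duration is normalized away."""
    trace = []
    for idx in quantize(frames, codebook):
        if not trace or trace[-1] != idx:
            trace.append(idx)
    return trace

def locate(trace, template):
    """Slide the template along the trace; return (position, mismatch count) of the best fit."""
    best_pos, best_cost = None, math.inf
    for pos in range(len(trace) - len(template) + 1):
        cost = sum(a != b for a, b in zip(trace[pos:], template))
        if cost < best_cost:
            best_pos, best_cost = pos, cost
    return best_pos, best_cost

# Hypothetical shared codebook, target speech frames, and keyword frames.
codebook = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
speech = [(0.1, 0.0), (0.0, 0.1), (0.9, 0.1), (1.0, 0.0),
          (0.9, 0.9), (0.1, 0.9), (0.0, 1.0)]
keyword = [(1.0, 0.1), (1.0, 0.9), (0.95, 1.0)]

speech_trace = feature_trace(speech, codebook)
template_trace = feature_trace(keyword, codebook)
print(speech_trace, template_trace, locate(speech_trace, template_trace))
```

Since both traces are built from the same codebook, they can be compared position by position without resampling either one, which mirrors the benefit the abstract claims.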

8. Granted invention patent

US08160402B2 Document image processing apparatus [In force]
Publication (announcement) No.: US08160402B2
Publication (announcement) date: 2012-04-17
Application No.: US11972477
Filing date: 2008-01-10
Applicants: Bo Wu, Jianjun Dou, Ning Le, Yadong Wu, Jing Jia
Inventors: Bo Wu, Jianjun Dou, Ning Le, Yadong Wu, Jing Jia
IPC classification: G06K9/03, G06K9/18
CPC classification: G06F17/30253, G06K9/723, G06K2209/01, G06K2209/011
Abstract: An image of a character string composed of M pieces of characters is clipped from a document image, and the image is divided character by character, and image features of each character image are extracted. On the basis of the image features, N (N>1, integer) pieces of character images in descending order of degree of similarity are selected as candidate characters from a character image feature dictionary which stores the image features of character image in units of character, and the first index matrix of M×N cells is prepared. A candidate character string composed of a plurality of candidate characters constituting the first column of the first index matrix, is subjected to a lexical analysis according to a predetermined language model, whereby a second index matrix adjusted into a character string which makes sense is prepared to be utilized for searching.

9. Invention patent application

US20120014601A1 HANDWRITING RECOGNITION METHOD AND DEVICE [Under examination, published]
Publication (announcement) No.: US20120014601A1
Publication (announcement) date: 2012-01-19
Application No.: US13258084
Filing date: 2010-06-23
Applicants: Shuhong Jiang, Bo Wu, Yadong Wu, Wei Miao, Ailong Li
Inventors: Shuhong Jiang, Bo Wu, Yadong Wu, Wei Miao, Ailong Li
IPC classification: G06K9/34
CPC classification: G06K9/00416, G06F1/1626, G06F1/1643, G06F1/169, G06F3/04883, G06K9/00422
Abstract: A handwriting recognition method and a handwriting recognition device are provided to recognize a character sequence continuously inputted by a user for convenience. The present method comprises steps of calculating various features of the inputted character sequence which include single character recognition accuracy features and space geometry features of different stroke combinations in the inputted character sequence, calculating segmentation reliabilities of respective stroke combinations in different segmented patterns by using a probabilistic model in which coefficients of the probabilistic model are estimated by a parameter estimation method through sample trainings, recognizing characters in different writing patterns by using a multiple-template matching method when performing single character recognition of the stroke combinations, searching for the best segmentation path and conducting post-processing to optimize the recognition results. The present method and device have advantages of simple structure, low hardware requirement, fast recognition speed and high recognition accuracy and can be implemented in an embedded system.
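The search for the best segmentation path can be sketched as a dynamic program over stroke boundaries. In this Python sketch the segmentation reliabilities are supplied as a hypothetical dictionary, where `reliability[(i, j)]` is the assumed probability that strokes i through j-1 form one character; how the patent's probabilistic model actually produces these values is not covered, and the limit of three strokes per group is an illustrative assumption.

```python
import math

def best_segmentation(n_strokes, reliability, max_group=3):
    """Return the list of (start, end) stroke groups with the highest product of
    reliabilities, or None if no complete segmentation exists."""
    # best[end] = (best log-score for strokes 0..end-1, start of the last group)
    best = [(-math.inf, None)] * (n_strokes + 1)
    best[0] = (0.0, None)
    for end in range(1, n_strokes + 1):
        for start in range(max(0, end - max_group), end):
            p = reliability.get((start, end), 0.0)
            if p > 0 and best[start][0] > -math.inf:
                score = best[start][0] + math.log(p)
                if score > best[end][0]:
                    best[end] = (score, start)
    # Walk back from the last stroke to recover the chosen groups.
    groups, end = [], n_strokes
    while end > 0:
        start = best[end][1]
        if start is None:
            return None
        groups.append((start, end))
        end = start
    return groups[::-1]

# Hypothetical reliabilities for four strokes.
reliability = {
    (0, 1): 0.3, (1, 2): 0.4, (0, 2): 0.9,
    (2, 3): 0.5, (3, 4): 0.5, (2, 4): 0.8, (1, 4): 0.2,
}
path = best_segmentation(4, reliability)
print(path)
```

Summing log-probabilities instead of multiplying probabilities avoids numeric underflow on long inputs while selecting the same path.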

10. Granted invention patent

US08295600B2 Image document processing device, image document processing method, program, and storage medium [In force]
Publication (announcement) No.: US08295600B2
Publication (announcement) date: 2012-10-23
Application No.: US11952823
Filing date: 2007-12-07
Applicants: Bo Wu, Jianjun Dou, Ning Le, Yadong Wu, Jing Jia
Inventors: Bo Wu, Jianjun Dou, Ning Le, Yadong Wu, Jing Jia
IPC classification: G06K9/34, G06K9/72, G06K9/54, G06K9/60
CPC classification: G06K9/4671, G06K9/481, G06K2209/01
Abstract: An image document processing device extracts a character sequence image having M number of characters in an image document, divides the image into individual character images, extracts features of the individual character images, and based on the features, selects N (N is an integer more than 1) character images in the order of degree of matching from a font-feature dictionary for storing features of all character images according to fonts, and generates an M×N index matrix for the extracted character sequence. In searching, the device searches an index-information storage section with respect to each search character included in a search keyword in an input search expression, and extracts an image document including an index matrix including the search keyword. This provides an image document processing device and an image document processing method each allowing indexing not requiring user's operation and each allowing highly precise searching without OCR recognition.
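The search side of this scheme can be illustrated with a small Python sketch. It assumes each stored index matrix is a list of M rows holding N candidate characters in rank order, that a keyword matches when its characters appear in consecutive rows at any rank, and that documents are ordered by the smallest rank sum; the storage layout and scoring are illustrative, not taken from the patent.

```python
def find_keyword(matrix, keyword):
    """Return (row_offset, rank_sum) for each run of consecutive rows whose
    candidate lists contain the keyword's characters in order."""
    hits = []
    for start in range(len(matrix) - len(keyword) + 1):
        ranks = []
        for row, ch in zip(matrix[start:], keyword):
            if ch not in row:
                break
            ranks.append(row.index(ch))
        else:  # every keyword character was found in its row
            hits.append((start, sum(ranks)))
    return hits

def search(index, keyword):
    """Return the ids of documents containing the keyword, best rank sum first."""
    results = []
    for doc_id, matrices in index.items():
        scores = [rank for m in matrices for _, rank in find_keyword(m, keyword)]
        if scores:
            results.append((min(scores), doc_id))
    return [doc_id for _, doc_id in sorted(results)]

# Hypothetical index store: document id -> list of index matrices (N = 2).
index = {
    "doc1": [[["p", "b"], ["a", "o"], ["t", "f"], ["e", "c"], ["n", "m"], ["t", "l"]]],
    "doc2": [[["b", "p"], ["o", "a"], ["t", "l"]]],
}
print(search(index, "pat"))
print(search(index, "bot"))
print(search(index, "cat"))
```

Matching against all N candidates rather than only the top one is what lets a search succeed even when the best-ranked candidate for a character is wrong.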
