The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 12, 2017

Filed:

Jun. 05, 2014
Applicant:

Xerox Corporation, Norwalk, CT (US);

Inventors:

Sudhagar Subbaian, Tamilnadu, IN;

Sainarayanan Gopalakrishnan, Chennai, IN;

Xing Li, Webster, NY (US);

Clara Cuciurean-Zapan, Fairport, NY (US);

Assignee:

Xerox Corporation, Norwalk, CT (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/34 (2006.01); G06K 9/62 (2006.01); G06K 9/00 (2006.01); G06K 9/46 (2006.01);
U.S. Cl.
CPC ...
G06K 9/6267 (2013.01); G06K 9/00456 (2013.01); G06K 9/4638 (2013.01);
Abstract

A method and system for segmenting text from non-text portions of a digital image using the size, solidity, and run length characteristics of connected components within the image data. For a connected component comprising a rectangular group of pixels enclosing a set of connected pixels having the same binary state, the size characteristic may be based on a ratio of height to width of the connected component and the total number of pixels within the connected component, the solidity characteristic may be based on a ratio of pixels within the connected component to a total number of pixels within a convex hull of the set of connected pixels, and the run length characteristic may be based on a number of transitions within the connected component.


Find Patent Forward Citations

Loading…