The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Dec. 28, 1999
Filed:
Nov. 28, 1995
James V Mahoney, San Francisco, CA (US);
Xerox Corporation, Stamford, CT (US);
Abstract
The present invention is a method for analyzing image data, and more particularly for analyzing of image data representing images containing text to partition the image into running and non-running text regions and to further classify the non-running text regions therein. The present invention utilizes characteristics of running text regions to identify such regions and to subsequently group all non-running text regions into related groups prior to the classification of the non-running text regions. Classification of the non-running text regions is accomplished by analyzing whether the non-running text regions exhibit pronounced horizontal and/or vertical alignment of the text blocks therein. Once the analysis is complete, alignment information is used to determine the number of 'rows' and 'columns' so as to classify the non-running text region as text, a horizontal sequence, a vertical sequence, or a table.