The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 05, 2025

Filed:

Sep. 01, 2022
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

Shangbang Long, Sunnyvale, CA (US);

Siyang Qin, Danville, CA (US);

Dmitry Panteleev, Princeton, NJ (US);

Alessandro Bissacco, Los Angeles, CA (US);

Yasuhisa Fujii, Sunnyvale, CA (US);

Michail Raptis, Venice, CA (US);

Assignee:

Google LLC, Mountain View, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 20/62 (2022.01); G06V 10/82 (2022.01); G06V 30/14 (2022.01); G06V 30/414 (2022.01);
U.S. Cl.
CPC ...
G06V 20/63 (2022.01); G06V 10/82 (2022.01); G06V 30/1448 (2022.01); G06V 30/414 (2022.01);
Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for jointly performing text detection and layout analysis. In one aspect, a method comprises processing the image and a set of object queries to generate an encoded representation of the image and an encoded representation of the set of object queries; processing the encoded representation of the image and the encoded representation of the set of object queries to generate a set of text detection masks; processing the encoded representation of the set of object queries to generate layout relevance measures; processing the encoded representation of the set of object queries to generate textness scores for the text detection masks; generating a text detection output that defines respective areas of the image that include text items; and generating a layout analysis output that defines clusters of respective areas of the image identified by the text detection masks.


Find Patent Forward Citations

Loading…