The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Apr. 19, 2022

Filed:

Mar. 30, 2020
Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

Ron Litman, Tel-Aviv, IL;

Oron Anschel, Haifa, IL;

Shahar Tsiper, Haifa, IL;

Roee Litman, Petach-Tikva, IL;

Shai Mazor, Binyamina, IL;

Jonathan Wu, Seattle, WA (US);

Raghavan Manmatha, San Francisco, CA (US);

Assignee:

Amazon Technologies, Inc., Seattle, WA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/62 (2006.01); G06K 9/42 (2006.01);
U.S. Cl.
CPC ...
G06K 9/6256 (2013.01); G06K 9/42 (2013.01); G06K 9/6228 (2013.01); G06K 9/6261 (2013.01); G06K 9/6267 (2013.01); G06K 2209/01 (2013.01);
Abstract

Techniques for recognizing text in an image are described. An exemplary method may include receiving a request to recognize text in an image; extracting features from the image and generating a visual feature sequence from the extracted features; performing selective contextual refinement at least one selective contextual refinement block of a stack of selective contextual refinement blocks to generate a text prediction by: generating a contextual feature map and combining the contextual feature map with the visual feature sequence into a visual feature space, and applying a selective decoder that utilizes a two-step attention on the visual feature space to generate a text prediction, wherein the two-step attention includes performing a 1-D self-attention computation to generate attentional features and decoding the attentional features to generate the text prediction; and outputting the generated text prediction.


Find Patent Forward Citations

Loading…