The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 11, 2025

Filed:

Aug. 26, 2022
Applicant:

Oracle International Corporation, Redwood Shores, CA (US);

Inventors:

Liyu Gong, Austin, TX (US);

Yuying Wang, Seattle, WA (US);

Zhonghai Deng, Redmond, WA (US);

Iman Zadeh, Los Angeles, CA (US);

Jun Qian, Bellevue, WA (US);

Assignee:

Oracle International Corporation, Redwood Shores, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 40/263 (2020.01); G06V 10/82 (2022.01); G06V 30/246 (2022.01);
U.S. Cl.
CPC ...
G06V 30/246 (2022.01); G06F 40/263 (2020.01); G06V 10/82 (2022.01);
Abstract

The present embodiments relate to a language identification system for predicting a language and text content of text lines in an image-based document. The language identification system uses a trainable neural network model that integrates multiple neural network models in a single unified end-to-end trainable architecture. A CNN and an RNN of the model can process text lines and derive visual and contextual features of the text lines. The derived features can be used to predict a language and text content for the text line. The CNN and the RNN can be jointly trained by determining losses based on the predicted language and content and corresponding language labels and text labels for each text line.


Find Patent Forward Citations

Loading…