The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 29, 2025

Filed:

Nov. 28, 2022
Applicant:

Oracle International Corporation, Redwood Shores, CA (US);

Inventors:

Yazhe Hu, Seattle, WA (US);

Tao Sheng, Bellevue, WA (US);

Jun Qian, Bellevue, WA (US);

Assignee:

ORACLE INTERNATIONAL CORPORATION, Redwood Shores, CA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06V 30/19 (2022.01); G06V 30/148 (2022.01); G06V 30/41 (2022.01);
U.S. Cl.
CPC ...
G06V 30/19147 (2022.01); G06V 30/153 (2022.01); G06V 30/41 (2022.01);
Abstract

Automated techniques are for generating a large volume of diverse training data that can be used for training machine learning models to extract KV pairs from document images. Given a single input document image and associated annotation data, a large number of diverse synthetic training datapoints are automatically generated by a synthetic data generation system, each datapoint including a synthetic document image and associated annotation data. The generated synthetic training datapoints can be used to train and improve the performance of ML models for extracting KV pairs from document images. In certain implementations, multiple synthetic datapoints are generated by varying the values associated with a key for a content item within the input document image.


Find Patent Forward Citations

Loading…