The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Oct. 28, 2025
Filed:
Jul. 30, 2021
Applicant:
Tungsten Automation Corporation, Irvine, CA (US);
Inventor:
Hu Cao, Cypress, CA (US);
Assignee:
TUNGSTEN AUTOMATION CORPORATION, Irvine, CA (US);
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 30/19 (2022.01); G06V 30/18 (2022.01); G06V 30/412 (2022.01); G06V 30/413 (2022.01);
U.S. Cl.
CPC ...
G06V 30/19147 (2022.01); G06V 30/18181 (2022.01); G06V 30/19173 (2022.01); G06V 30/412 (2022.01); G06V 30/413 (2022.01);
Abstract
A machine learning based key-value extraction model extracts fields/entities from documents. The input images are processed through OCR. A list of words (uni-grams) and their coordinates are extracted from the original images. Following word cleaning and manipulation, n-gram creation (multi-words), and feature engineering, the transformed data is fed into a classification algorithm to predict if a uni-gram or n-gram is one of the target entities or a non-entity. Following the first step that includes unique feature engineering, a second step improves extraction accuracy among the fields/entities.