The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 23, 2024

Filed:

Sep. 02, 2021
Applicant:

Bank of America Corporation, Charlotte, NC (US);

Inventor:

Aftab Khan, Richardson, TX (US);

Assignee:

Bank of America Corporation, Charlotte, NC (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 30/12 (2022.01); G06N 20/00 (2019.01); G06F 40/284 (2020.01); G06F 40/242 (2020.01); G06V 30/413 (2022.01); G06F 18/214 (2023.01); G06F 18/21 (2023.01);
U.S. Cl.
CPC ...
G06V 30/133 (2022.01); G06F 18/214 (2023.01); G06F 18/217 (2023.01); G06F 40/242 (2020.01); G06F 40/284 (2020.01); G06N 20/00 (2019.01); G06V 30/413 (2022.01);
Abstract

An apparatus includes a memory and a processor. The memory stores a dictionary and a machine learning algorithm trained to classify text. The processor receives an image of a page, converts the image into a set of text, and identifies a plurality of tokens within the text. Each token includes one or more contiguous characters that are both preceded and followed by whitespace within the text. The processor identifies invalid tokens by removing tokens of the plurality of tokens that correspond to words of the dictionary. The processor calculates, based on a ratio of a total number of valid tokens to a total number of tokens, a score. In response to determining that the score is greater than a threshold, the processor applies the machine learning algorithm to classify the text into a category and stores the image and/or text in a database according to the category.


Find Patent Forward Citations

Loading…