The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:
Aug. 13, 2024

Filed:

Oct. 07, 2021

Applicant:

Cognistic, LLC, Gibsonia, PA (US);

Inventors:

Roshan Bhave, Pittsburgh, PA (US);

Sanjay Chopra, Gibsonia, PA (US);

Eric Nyberg, Pittsburgh, PA (US);

Longxiang Zhang, Pittsburgh, PA (US);

Ihor Markevych, Pittsburgh, PA (US);

Assignee:

COGNISTIC, LLC, Gibsonia, PA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06K 9/00 (2022.01); G06F 18/214 (2023.01); G06F 18/23213 (2023.01); G06N 20/00 (2019.01); G06V 30/18 (2022.01); G06V 30/413 (2022.01);
U.S. Cl.
CPC ...
G06F 18/23213 (2023.01); G06F 18/214 (2023.01); G06N 20/00 (2019.01); G06V 30/18 (2022.01); G06V 30/413 (2022.01);
Abstract

One embodiment provides a method for clustering documents based upon a structure of each of the documents, including: receiving, at a device utilizing a machine-learning model, at least one document, each including a plurality of characters and having a structure; converting, for each of the at least one document, each of the plurality of characters to one of a plurality of character representations, wherein the converting includes identifying an attribute of a character and selecting a character representation corresponding to the attribute; producing at least one array for each of the at least one document, wherein the at least one array includes the plurality of characters converted to the character representations; and clustering the at least one document into document clusters having similar structures by grouping the at least one array into groups of arrays having similarities, wherein each document cluster includes documents corresponding to the arrays within one of the groups of arrays.
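
The steps in the abstract (convert each character to a representation chosen by an attribute, build an array per document, group similar arrays) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the four attribute classes (letter, digit, whitespace, other), the fixed array length with zero padding, the position-wise match ratio used as the similarity measure, and the greedy threshold grouping, which stands in for whatever clustering method the patent actually claims.

```python
# Illustrative sketch only; the attribute classes, array length, similarity
# measure, and greedy grouping below are assumptions, not the patented method.

def char_representation(ch: str) -> int:
    """Pick a representation code based on one attribute of the character."""
    if ch.isalpha():
        return 1  # letter
    if ch.isdigit():
        return 2  # digit
    if ch.isspace():
        return 3  # whitespace (spaces, tabs, newlines)
    return 4      # punctuation and anything else


def document_to_array(text: str, length: int = 64) -> list[int]:
    """Convert the first `length` characters to codes, padding with 0."""
    codes = [char_representation(ch) for ch in text[:length]]
    return codes + [0] * (length - len(codes))


def similarity(a: list[int], b: list[int]) -> float:
    """Fraction of positions at which two equal-length arrays agree."""
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / len(a)


def cluster_documents(docs: list[str], threshold: float = 0.8,
                      length: int = 64) -> list[list[int]]:
    """Greedily group documents whose arrays match a cluster's first member.

    Returns a list of clusters, each a list of indices into `docs`.
    """
    clusters: list[tuple[list[int], list[int]]] = []
    for index, doc in enumerate(docs):
        array = document_to_array(doc, length)
        for representative, members in clusters:
            if similarity(array, representative) >= threshold:
                members.append(index)
                break
        else:
            # No existing cluster is similar enough; start a new one.
            clusters.append((array, [index]))
    return [members for _, members in clusters]


docs = [
    "Invoice 1234\nTotal: 56.78",
    "Invoice 9876\nTotal: 12.34",
    "Dear Bob, thank you for the note.",
]
clusters = cluster_documents(docs)
```

In this sketch the two invoice strings produce identical arrays because only the character classes are kept, not the characters themselves, so they fall into one cluster while the letter-style text starts its own. A real system would likely need a similarity measure that is less sensitive to padding and to small shifts in position.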

