The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 30, 2025

Filed:

Apr. 18, 2025
Applicant:

Citibank, N.a., New York, NY (US);

Inventors:

Ganesh Prasad Bhat, West Orange, NJ (US);

Ramee S. Karthikeyan, Monmouth Junction, NJ (US);

Cameron Paul Lim, Union City, NJ (US);

Alex Michael Eng, New York, NY (US);

Subramanian Sankaran, Flushing, NY (US);

Joshua Goldman, Merrick, NY (US);

Matthew Ryan Mitsui, New York, NY (US);

Wei Jie Ng, Jersey City, NJ (US);

James Myers, New York, NY (US);

John E. Ortega, New York, NY (US);

Alberto Cetoli, London, GB;

Minjeong Cho, London, GB;

Jason Ryan Engelbrecht, London, GB;

Ines Teixeira, London, GB;

Yael Man, Tel Aviv, IL;

Assignee:

CITIBANK, N.A., , NY (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/00 (2019.01); G06F 16/174 (2019.01); G06F 16/34 (2019.01); G06F 16/35 (2019.01);
U.S. Cl.
CPC ...
G06F 16/1748 (2019.01); G06F 16/345 (2019.01); G06F 16/35 (2019.01);
Abstract

The systems and methods disclosed herein obtain (e.g., via a user interface) a collection of unstructured data, where each document includes a content set. Using a first AI model set, multiple summaries are generated by categorizing each document into clusters based on vector comparisons of content sets and summarizing the content for each cluster. A second AI model set (same as or different from the first AI model set) identifies duplicate content within the unstructured data by generating similarity values between pairs of summaries and determining if the similarity values meet a predefined threshold. A report is generated (e.g., on the user interface) indicating the duplicate content sets and/or the collection of unstructured data.


Find Patent Forward Citations

Loading…