The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Sep. 16, 2025
Filed:
Dec. 27, 2021
Trustees of Dartmouth College, Hanover, NH (US);
Venkatramanan Subrahmanian, Evanston, IL (US);
Dongkai Chen, Mountain View, CA (US);
Haipeng Chen, Williamsburg, VA (US);
Deepti Poluru, Hanover, NH (US);
Almas Abdibayev, Hanover, NH (US);
Trustees of Dartmouth College, Hanover, NH (US);
Abstract
A computer-implemented method, system and computer program product for generating fake documents. A corpus of domain specific documents is built and word embeddings for each word in such documents are identified as embedding vectors. Concepts in the corpus are then clustered together by clustering the embedding vectors. A feasible candidate replacement set is generated for each concept using the clustered concepts in the corpus. After such pre-processing steps are accomplished, concepts are extracted from a document. The concept importance values are computed for these extracted concepts, in which the extracted concepts are clustered into bins based on such measurements. A joint optimization problem is solved to identify both the concepts in the document to be replaced using the clustered concepts in the bins as well as the corresponding replacement concepts obtained from the clustered concepts in the corpus. Such replacements are made to generate a fake document.