The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Nov. 29, 2022
Filed:
Jun. 12, 2019
International Business Machines Corporation, Armonk, NY (US);
Pathirage D. S. U. Perera, San Jose, CA (US);
Eitan D. Farchi, Pardes Hana, IL;
Orna Raz, Haifa, IL;
Ramani Routray, San Jose, CA (US);
Sheng Hua Bao, San Jose, CA (US);
Marcel Zalmanovici, Kiriat Motzkin, IL;
International Business Machines Corporation, Armonk, NY (US);
Abstract
A computer system trains a machine learning model. A vector representation is generated for each document in a collection of documents. The documents are clustered based on the vector representations of the documents to produce a plurality of clusters. A training set is produced by selecting one or more documents from each cluster, wherein the selected documents represent a sample of the collection of documents to train the machine learning model. The machine learning model is trained by applying the training set to the machine learning model. Embodiments of the present invention further include a method and program product for training a machine learning model in substantially the same manner described above.