The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Dec. 16, 2025

Filed:

Apr. 05, 2023

Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Ella Rabinovich, Hod Hasharon, IL;

Matan Vetzler, Givat Shmuel, IL;

Samuel Solomon Ackerman, Haifa, IL;

Ateret Anaby-Tavor, Givat Ada, IL;

Eitan Daniel Farchi, Pardes Hanna-Karkur, IL;

Orna Raz, Haifa, IL;

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06N 5/022 (2023.01); G10L 15/06 (2013.01); G10L 15/18 (2013.01);
U.S. Cl.
CPC ...
G10L 15/1815 (2013.01); G06N 5/022 (2013.01); G10L 15/063 (2013.01); G10L 2015/0631 (2013.01); G10L 2015/0638 (2013.01);
Abstract

Various systems and methods are presented regarding detecting data drift. The data of interest can be batches of utterances received at an interface (e.g., a chatbot). The batches of utterances can be compared with topics present in training data utilized to train a data classifier (e.g., an autoencoder), wherein topics identified in the batches of utterances that are not present in the training data can be considered to be novel topics. The greater the presence of novel topics in a batch of utterances, the greater the divergence of the batch of utterances from the content of the training data. The novel topics can be identified and subsequently applied to the training data such that the data classifier can be re-trained with the novel topics, thereby causing the data classifier to be contemporaneous with the novel topics. In an embodiment, the utterances can be short streams of text, symbols, and suchlike.
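The comparison described in the abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say how topics are extracted from utterances, so the sketch assumes each utterance has already been assigned a topic label. The function names and the `min_count` threshold are hypothetical and chosen only for illustration.

```python
from collections import Counter


def novel_topic_report(batch_topics, training_topics):
    """Compare the topic labels of one batch with those seen in training.

    Returns (novel_counts, fraction): a Counter of topics absent from the
    training data, and the share of the batch that carries such a topic.
    Assumption: every utterance already has exactly one topic label.
    """
    known = set(training_topics)
    novel_counts = Counter(t for t in batch_topics if t not in known)
    total = len(batch_topics)
    fraction = sum(novel_counts.values()) / total if total else 0.0
    return novel_counts, fraction


def extend_training_topics(training_topics, novel_counts, min_count=2):
    """Add novel topics seen at least min_count times to the training topics.

    min_count is a hypothetical threshold for ignoring one-off topics.
    """
    frequent = {t for t, n in novel_counts.items() if n >= min_count}
    return set(training_topics) | frequent


# Illustrative labels only; these are made-up examples, not patent data.
training = ["billing", "login"]
batch = ["billing", "crypto", "crypto", "login"]
novel, drift = novel_topic_report(batch, training)
updated_topics = extend_training_topics(training, novel)
```

A larger `drift` value corresponds to a batch that diverges further from the training data. In this sketch the updated topic set stands in for the re-training step, which the abstract mentions but does not detail.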

