The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The badge also contains a link to the full patent document (in Adobe Acrobat PDF format).

Date of Patent:

May 09, 2023

Filed:

Mar. 24, 2021

Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Arjun Natarajan, Old Tappan, NJ (US);

Ashish Kundu, San Jose, CA (US);

Roger C. Raphael, San Jose, CA (US);

Aniya Aggarwal, New Delhi, IN;

Rajesh M. Desai, San Jose, CA (US);

Joshua F. Payne, San Antonio, TX (US);

Mu Qiao, Belmont, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
H04L 9/40 (2022.01); G06N 3/088 (2023.01); G06N 3/045 (2023.01);
U.S. Cl.
CPC ...
H04L 63/0428 (2013.01); G06N 3/045 (2023.01); G06N 3/088 (2013.01);
Abstract

Preserving distributions of data values of a data asset in a data anonymization operation is provided. Anonymizing data values is performed by transforming sensitive data in a set of columns over rows of the data asset while preserving distribution of the data values in the set of transformed columns to a defined degree using a set of autoencoders and loss function. The autoencoders are base trained from preexisting data in a data assets catalog and actively trained during data dissemination. Parametric coefficients of the loss function are configured and the threshold is generated using policies from an enforcement decision for the data asset and data consumer. The loss function value of a selected row is compared to the threshold. Transformed data values of the selected row are transcribed to an output row when the loss function value is greater than the threshold and disseminated to the data consumer.
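
The row-selection step in the abstract can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `Policy` class stands in for an enforcement decision, `toy_transform` is a simple rounding function used in place of a trained autoencoder, and the weighted squared-difference loss is only one plausible form of a loss with parametric coefficients. The comparison direction (keep a row when its loss is greater than the threshold) follows the abstract as written.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

Row = Sequence[float]


@dataclass
class Policy:
    """Hypothetical stand-in for an enforcement decision (not from the patent)."""
    coefficients: Sequence[float]  # one weight per transformed column
    threshold: float               # rows are kept only above this loss value


def row_loss(original: Row, transformed: Row, coefficients: Sequence[float]) -> float:
    """Assumed loss: weighted squared difference, column by column."""
    return sum(c * (o - t) ** 2 for c, o, t in zip(coefficients, original, transformed))


def toy_transform(row: Row) -> List[float]:
    """Placeholder for an autoencoder: round each value to the nearest 10."""
    return [10 * round(v / 10) for v in row]


def anonymize(rows: Sequence[Row],
              transform: Callable[[Row], List[float]],
              policy: Policy) -> List[List[float]]:
    """Transform each row and keep it only if its loss exceeds the threshold."""
    output = []
    for row in rows:
        transformed = transform(row)
        if row_loss(row, transformed, policy.coefficients) > policy.threshold:
            output.append(transformed)  # transcribe to the output rows
    return output


policy = Policy(coefficients=[1.0, 0.5], threshold=2.0)
rows = [[34, 71], [40, 70], [41, 52]]
result = anonymize(rows, toy_transform, policy)
print(result)  # [[30, 70], [40, 50]]
```

In this toy run the second row is unchanged by the transform, so its loss is 0 and it is not disseminated; the other two rows have losses of 16.5 and 3.0 and pass the threshold. A real system would replace `toy_transform` with the trained autoencoders and derive the coefficients and threshold from the applicable policies.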

