The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Jul. 19, 2022

Filed:
Nov. 16, 2020

Applicant:
International Business Machines Corporation, Armonk, NY (US);

Inventors:
Killian Levacher, Dublin, IE;
Stefano Braghin, Dublin, IE;
Naoise Holohan, Maynooth, IE;

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 9/44 (2018.01); G06F 11/36 (2006.01); G06F 11/34 (2006.01);
U.S. Cl.
CPC ...
G06F 11/3692 (2013.01); G06F 11/3452 (2013.01); G06F 11/3684 (2013.01); G06F 11/3688 (2013.01);
Abstract

Embodiments include a method for one or more processors to receive an organic dataset and a domain knowledge base. The one or more processors identify private data entities present within the organic dataset. The one or more processors determine statistical properties of the private data entities identified within the organic dataset. The one or more processors create a plurality of test data templates by removing the private data entities from the organic dataset. The one or more processors select from the domain knowledge base, synthetic data entities that match a data type of the removed private data entities, respectively, and align with the statistical properties of the private data entities, and the one or more processors generate synthetic test data by inserting, respectively, the synthetic data entities of the matching data type for the removed private data entities in the test data templates.
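
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the regular-expression detectors, the `<TYPE>` placeholder format, the use of mean character length as the "statistical property", and all function names are assumptions made for illustration only.

```python
import re
from collections import defaultdict
from statistics import mean

# Hypothetical detectors: the abstract does not say how private entities are found.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.]+@[\w.]+\.\w+"),
    "PHONE": re.compile(r"\b\d{3}-\d{4}\b"),
}
# Matches the typed placeholders written by make_templates, e.g. <EMAIL>.
PLACEHOLDER = re.compile("<(" + "|".join(PATTERNS) + ")>")


def identify_entities(records):
    """Return (record_index, start, end, type, text) for each private entity."""
    found = []
    for i, record in enumerate(records):
        for etype, pattern in PATTERNS.items():
            for m in pattern.finditer(record):
                found.append((i, m.start(), m.end(), etype, m.group()))
    return found


def statistical_properties(entities):
    """Toy statistic (an assumption): mean character length per entity type."""
    lengths = defaultdict(list)
    for _, _, _, etype, text in entities:
        lengths[etype].append(len(text))
    return {etype: mean(vals) for etype, vals in lengths.items()}


def make_templates(records, entities):
    """Remove each private entity, leaving a typed placeholder in its place."""
    templates = list(records)
    # Work right-to-left within each record so earlier offsets stay valid.
    for i, start, end, etype, _ in sorted(entities, key=lambda e: (e[0], -e[1])):
        templates[i] = templates[i][:start] + "<" + etype + ">" + templates[i][end:]
    return templates


def select_synthetic(knowledge_base, etype, stats):
    """Pick the same-type candidate whose length is closest to the observed mean."""
    return min(knowledge_base[etype], key=lambda c: abs(len(c) - stats[etype]))


def generate_test_data(records, knowledge_base):
    """Run all steps: identify, measure, template, select, insert."""
    entities = identify_entities(records)
    stats = statistical_properties(entities)
    templates = make_templates(records, entities)
    return [
        PLACEHOLDER.sub(lambda m: select_synthetic(knowledge_base, m.group(1), stats), t)
        for t in templates
    ]
```

Here `records` stands in for the organic dataset (a list of strings) and `knowledge_base` for the domain knowledge base (a dict mapping each entity type to candidate synthetic values). This sketch always picks the single closest candidate for a type; a fuller version would presumably sample candidates so that the synthetic data reproduces a distribution rather than one value.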

