The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, issue date, filing date, title, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPC classifications, and abstract. The badge also links to the full patent document in PDF format.

Date of Patent:

Jul. 18, 2023

Filed:

Dec. 21, 2022

Applicant:

Intuit Inc., Mountain View, CA (US);

Inventors:

Sheer Dangoor, Tel Aviv, IL;

Yair Horesh, Tel Aviv, IL;

Assignee:

INTUIT INC., Mountain View, CA (US);

Attorney:

Primary Examiner:

Int. Cl.

CPC ...
G06F 40/279 (2020.01); G06F 40/166 (2020.01); G06F 21/62 (2013.01);

U.S. Cl.

CPC ...
G06F 40/166 (2020.01); G06F 21/6254 (2013.01); G06F 40/279 (2020.01);

Abstract

Systems and methods for k-anonymizing a corpus of documents using linguistic similarities and embeddings distances between words. For instance, a word pair is selected based on linguistic similarity (e.g., belonging to the same part of speech) and small embeddings distance. For the selected word pair, a plurality of words is retrieved, also based on linguistic similarity to, and embeddings distances from, the selected word pair. Out of the plurality of words, a third word is identified that has a closer linguistic similarity to the word pair and also has smaller embeddings distances from the word pair. Each word in the word pair is then replaced by the third word. The process is repeated until k-anonymity is achieved.
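
The loop described in the abstract can be illustrated with a small sketch. This is not the patented implementation. It assumes a hypothetical toy vocabulary (`VOCAB`) with hand-written 2-D embeddings, treats "linguistic similarity" as simply sharing a part-of-speech tag, and takes k-anonymity to mean that every document is identical to at least k-1 others. The function names are illustrative only.

```python
import math
from collections import Counter
from itertools import combinations

# Hypothetical toy vocabulary: word -> (part-of-speech tag, 2-D embedding).
VOCAB = {
    "doctor": ("NOUN", (1.0, 0.0)),
    "nurse": ("NOUN", (1.2, 0.1)),
    "clinician": ("NOUN", (1.1, 0.05)),
    "teacher": ("NOUN", (3.0, 1.0)),
    "visited": ("VERB", (0.0, 4.0)),
    "paris": ("PROPN", (5.0, 5.0)),
    "london": ("PROPN", (5.4, 5.0)),
    "berlin": ("PROPN", (5.2, 5.0)),
    "tokyo": ("PROPN", (9.0, 9.0)),
}


def distance(a, b):
    """Euclidean distance between the embeddings of two words."""
    return math.dist(VOCAB[a][1], VOCAB[b][1])


def same_pos(a, b):
    """Assumed stand-in for linguistic similarity: same part of speech."""
    return VOCAB[a][0] == VOCAB[b][0]


def is_k_anonymous(docs, k):
    """Assumed criterion: every distinct document occurs at least k times."""
    counts = Counter(tuple(d) for d in docs)
    return all(c >= k for c in counts.values())


def k_anonymize(docs, k):
    docs = [list(d) for d in docs]
    while not is_k_anonymous(docs, k):
        # Select the word pair with matching POS and the smallest distance.
        words = sorted({w for d in docs for w in d})
        pairs = [(a, b) for a, b in combinations(words, 2) if same_pos(a, b)]
        if not pairs:
            break  # nothing left to merge; k-anonymity is not reachable
        a, b = min(pairs, key=lambda p: distance(*p))
        # Retrieve candidate words with the same POS as the pair.
        candidates = [w for w in VOCAB if w not in (a, b) and same_pos(w, a)]
        # Identify the third word: smallest total distance to both words.
        # Falling back to `a` when no candidate exists is an assumption.
        third = min(candidates,
                    key=lambda w: distance(w, a) + distance(w, b),
                    default=a)
        # Replace each word of the pair by the third word.
        docs = [[third if w in (a, b) else w for w in d] for d in docs]
    return docs
```

With the toy data above, `k_anonymize([["doctor", "visited", "paris"], ["nurse", "visited", "london"]], 2)` first merges "doctor" and "nurse" into "clinician", then "paris" and "london" into "berlin", so both documents end up identical. Each pass reduces the number of distinct words in the corpus by at least one, so the loop always terminates.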

