The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 10, 2022

Filed:

Feb. 19, 2021
Applicant:

Sift Science, Inc., San Francisco, CA (US);

Inventors:

Wei Liu, Seattle, WA (US);

Jintae Kim, San Francisco, CA (US);

Michael Legore, San Francisco, CA (US);

Yong Fu, San Francisco, CA (US);

Cat Perry, San Francisco, CA (US);

Rachel Mitrano, San Francisco, CA (US);

James Volz, San Francisco, CA (US);

Liz Kao, San Francisco, CA (US);

Assignee:

Sift Science, Inc., San Francisco, CA (US);

Attorneys:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
H04L 29/06 (2006.01); G06N 20/00 (2019.01); G06N 5/04 (2006.01); G06F 16/21 (2019.01); G06F 16/28 (2019.01); G06F 16/2455 (2019.01);
U.S. Cl.
CPC ...
H04L 63/1425 (2013.01); G06F 16/217 (2019.01); G06F 16/2455 (2019.01); G06F 16/285 (2019.01); G06N 5/04 (2013.01); G06N 20/00 (2019.01); H04L 63/1416 (2013.01);
Abstract

A machine learning-based system and method for content clustering and content threat assessment includes generating embedding values for each piece of content of corpora of content data; implementing unsupervised machine learning models that: receive model input comprising the embeddings values of each piece of content of the corpora of content data; and predict distinct clusters of content data based on the embeddings values of the corpora of content data; assessing the distinct clusters of content data; associating metadata with each piece of content defining a member in each of the distinct clusters of content data based on the assessment, wherein the associating the metadata includes attributing to each piece of content within the clusters of content data a classification label of one of digital abuse/digital fraud and not digital abuse/digital fraud; and identifying members or content clusters having digital fraud/digital abuse based on querying the distinct clusters of content data.


Find Patent Forward Citations

Loading…