The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 13, 2022

Filed:

Sep. 14, 2021
Applicant:

Netskope, Inc., Santa Clara, CA (US);

Inventors:

Yihua Liao, Fremont, CA (US);

Ari Azarafrooz, Rancho Santa Margarita, CA (US);

Najmeh Miramirkhani, Santa Clara, CA (US);

Zhi Xu, Cupertino, CA (US);

Assignee:

Netskope, Inc., Santa Clara, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G08B 23/00 (2006.01); G06F 12/16 (2006.01); G06F 12/14 (2006.01); G06F 11/00 (2006.01); H04L 9/40 (2022.01); G06T 1/00 (2006.01); G06F 40/279 (2020.01); G06F 40/205 (2020.01); G06F 40/126 (2020.01); G06N 3/04 (2006.01);
U.S. Cl.
CPC ...
H04L 63/1483 (2013.01); G06F 40/126 (2020.01); G06F 40/205 (2020.01); G06F 40/279 (2020.01); G06N 3/04 (2013.01); G06T 1/0021 (2013.01); H04L 63/0281 (2013.01); H04L 63/1408 (2013.01);
Abstract

Disclosed is classifying a URL and a page accessed via the URL as phishing or not. URL embedder extracts characters in a predetermined set from the URL to produce a character string trained using ground truth classification of the URL, producing a URL embedding. HTML parser accesses content at the URL and extracts HTML tokens from the page. Further, HTML encoder, trained on HTML tokens extracted from pages at example URLs, each example URL accompanied by a ground truth image captured from the page accessed via the example URL, produces an HTML encoding of the extracted tokens. Also, phishing classifier layers, trained on the URL embedding and the HTML encoding of example URLs, processes a concatenated input of the URL embedding and the HTML encoding to produce a score of a phishing risk.


Find Patent Forward Citations

Loading…