The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 07, 2025

Filed:

May. 09, 2024
Applicant:

Microsoft Technology Licensing, Llc, Redmond, WA (US);

Inventors:

Jack Wilson Stokes, Iii, North Bend, WA (US);

Pranav Ravindra Maneriker, Columbus, OH (US);

Arunkumar Gururajan, Sammamish, WA (US);

Diana Anca Carutasu, Bellevue, WA (US);

Edir Vinicio Garcia Lazo, Seattle, WA (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
H04L 9/40 (2022.01); G06F 40/284 (2020.01); G06N 3/045 (2023.01); G06N 3/08 (2023.01);
U.S. Cl.
CPC ...
H04L 63/1483 (2013.01); G06F 40/284 (2020.01); G06N 3/045 (2023.01); G06N 3/08 (2013.01);
Abstract

The technology described herein can identify phishing URLs using transformers. The technology tokenizes useful features from the subject URL. The useful features can include the text of the URL and other data associated with the URL, such as certificate data for the subject URL, a referrer URL, an IP address, etc. The technology may build a joint Byte Pair Encoding for the features. The token encoding may be processed through a transformer, resulting in a transformer output. The transformer output, which may be described as a token embedding, may be input to a classifier to determine whether the URL is a phishing URL. Additional or improved URL training data may be generated by permuting token order, by simulating a homoglyph attack, and by simulating a compound word attack.


Find Patent Forward Citations

Loading…