The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 11, 2025

Filed:

Mar. 22, 2022
Applicant:

Microsoft Technology Licensing, Llc, Redmond, WA (US);

Inventors:

Oriana Riva, Redmond, WA (US);

Shweti Mahajan, Kirkland, WA (US);

Pratyay Banerjee, Tempe, AZ (US);

Kushal Arora, Montreal, CA;

Weiwei Yang, Seattle, WA (US);

Christopher Miles White, Seattle, WA (US);

Sahisnu Mazumder, Santa Clara, CA (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 9/451 (2018.01); G06F 40/284 (2020.01); G06F 40/40 (2020.01); G06V 10/774 (2022.01); G06V 10/80 (2022.01); G06V 40/40 (2022.01);
U.S. Cl.
CPC ...
G06F 9/451 (2018.02); G06F 40/284 (2020.01); G06F 40/40 (2020.01); G06V 10/774 (2022.01); G06V 10/811 (2022.01);
Abstract

A computer implemented method includes accessing training data that includes images, associated alternative text, and proximately located text providing instructions describing a user interface. The images are paired with text captions derived from the proximately located text and image's alternative text, training a vision and language model in a self-supervised manner using language masking, region masking, and image-text alignment techniques on respective image region features and tokenized text captions. Fine-tuning of the vision and language model is performed to obtain a specialized model representing user interface elements and associated functions.


Find Patent Forward Citations

Loading…