The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Apr. 29, 2025

Filed:

Jan. 15, 2021
Applicant:

Microsoft Technology Licensing, Llc, Redmond, WA (US);

Inventors:

Soyoung Peraud, Redmond, WA (US);

Alexandre Rochette, Montreal, CA;

Gabriel Arien Desgarennes, Issaquah, WA (US);

Niel Chah, Toronto, CA;

Abhishek Kumar, Redmond, WA (US);

Timothy James Hazen, Arlington, MA (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/09 (2022.12); G06F 18/22 (2022.12); G06F 18/23 (2022.12); G06F 18/2411 (2022.12); G06F 18/2431 (2022.12); G06N 3/084 (2022.12); G06N 20/00 (2018.12);
U.S. Cl.
CPC ...
G06N 20/00 (2018.12); G06F 18/22 (2022.12); G06F 18/23 (2022.12); G06F 18/2411 (2022.12); G06F 18/2431 (2022.12); G06N 3/084 (2012.12); G06N 3/09 (2022.12);
Abstract

A classifier may be trained with less than all datasets manually annotated with labels. A small subset of verbatims may be manually labeled with topic labels as seeds. Data augmentations can be used to acquire seed verbatim sets for known topics and to assign temporary pseudo labels to the rest of the verbatims based on their vector space proximity to the labeled seed verbatims. The training may involve classification epochs during which embeddings are updated with the assumption that the pseudo labels are ground-truth labels. The training may also involve labeling epochs during which the updated embeddings are used to update the vectors corresponding to the verbatims, and pseudo labels are updated based on updated vector coordinates in the vector space. As the training process progresses through the epochs, the embeddings will converge.


Find Patent Forward Citations

Loading…