The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 24, 2023

Filed:

Jan. 22, 2020
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

Yang Song, Bellevue, WA (US);

Raghav Gupta, Mountain View, CA (US);

Dengyong Zhou, Redmond, WA (US);

Sanqiang Zhao, Pittsburgh, PA (US);

Assignee:

GOOGLE LLC, Mountain View, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/088 (2023.01); G06F 40/284 (2020.01); G06N 3/045 (2023.01);
U.S. Cl.
CPC ...
G06N 3/088 (2013.01); G06F 40/284 (2020.01); G06N 3/045 (2023.01);
Abstract

Provided is a knowledge distillation technique for training a student language model that, relative to a larger teacher language model, has a significantly smaller vocabulary, lower embedding dimensions, and/or hidden state dimensions. Specifically, aspects of the present disclosure are directed to a dual-training mechanism that trains the teacher and student language models simultaneously to obtain optimal word embeddings for the student vocabulary. In some implementations, this approach can be combined with learning shared projection matrices that transfer layer-wise knowledge from the teacher language model to the student language model. Example experimental results have also demonstrated higher compression efficiency and accuracy when compared with other state-of-the-art compression techniques, including the ability to compress the BERTmodel by more than 60×, with only a minor drop in downstream task metrics, resulting in a language model with a footprint of under 7 MB.


Find Patent Forward Citations

Loading…