The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 11, 2021

Filed:

May. 20, 2020
Applicant:

Google Llc, Mountain View, CA (US);

Inventors:

Kenton Chiu Tsun Lee, Mountain View, CA (US);

Kelvin Gu, Mountain View, CA (US);

Zora Tung, Mountain View, CA (US);

Panupong Pasupat, Mountain View, CA (US);

Ming-Wei Chang, Mountain View, CA (US);

Assignee:

GOOGLE LLC, Mountain View, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 40/49 (2020.01); G06F 40/56 (2020.01); G06N 5/02 (2006.01); G06K 9/62 (2006.01);
U.S. Cl.
CPC ...
G06F 40/49 (2020.01); G06F 40/56 (2020.01); G06K 9/6259 (2013.01); G06N 5/022 (2013.01); G06N 5/025 (2013.01);
Abstract

Systems and methods for pre-training and fine-tuning of neural-network-based language models are disclosed in which a neural-network-based textual knowledge retriever is trained along with the language model. In some examples, the knowledge retriever obtains documents from an unlabeled pre-training corpus, generates its own training tasks, and learns to retrieve documents relevant to those tasks. In some examples, the knowledge retriever is further refined using supervised open-QA questions. The framework of the present technology provides models that can intelligently retrieve helpful information from a large unlabeled corpus, rather than requiring all potentially relevant information to be stored implicitly in the parameters of the neural network. This framework may thus reduce the storage space and complexity of the neural network, and also enable the model to more effectively handle new tasks that may be different than those on which it was pre-trained.


Find Patent Forward Citations

Loading…