The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:

Dec. 03, 2024

Filed:

Nov. 12, 2021

Applicant:

Adobe Inc., San Jose, CA (US);

Inventors:

Aniruddha Mahapatra, Kolkata, IN;

Sharmila Reddy Nangi, Telangana, IN;

Aparna Garimella, Telangana, IN;

Anandha velu Natarajan, Tamil Nadu, IN;

Assignee:

Adobe Inc., San Jose, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 40/289 (2020.01); G06F 18/214 (2023.01); G06F 40/211 (2020.01); G06F 40/284 (2020.01); G06F 40/30 (2020.01); G06F 40/42 (2020.01); G06F 18/22 (2023.01);
U.S. Cl.
CPC ...
G06F 40/289 (2020.01); G06F 18/214 (2023.01); G06F 40/211 (2020.01); G06F 40/284 (2020.01); G06F 40/30 (2020.01); G06F 40/42 (2020.01); G06F 18/22 (2023.01);
Abstract

Embodiments of the present invention provide systems, methods, and computer storage media for pre-training entity extraction models to facilitate domain adaptation in resource-constrained domains. In an example embodiment, a first machine learning model is used to encode sentences of a source domain corpus and a target domain corpus into sentence embeddings. The sentence embeddings of the target domain corpus are combined into a target corpus embedding. Training sentences from the source domain corpus within a threshold of similarity to the target corpus embedding are selected. A second machine learning model is trained on the training sentences selected from the source domain corpus.
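
The selection step described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation. Several pieces are assumptions made only for illustration: the toy bag-of-words `encode` function stands in for the first machine learning model, averaging stands in for how the target sentence embeddings are "combined", cosine similarity stands in for the similarity measure, and the threshold value and example sentences are invented. Training the second model on the selected sentences is not shown.

```python
import math
from collections import Counter


def encode(sentence, vocab):
    """Toy stand-in for the first model: L2-normalized bag-of-words vector."""
    counts = Counter(sentence.lower().split())
    vec = [float(counts[word]) for word in vocab]
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else vec


def mean_embedding(embeddings):
    """Combine sentence embeddings into one corpus embedding by averaging
    (an assumed choice; the abstract only says they are 'combined')."""
    n = len(embeddings)
    return [sum(column) / n for column in zip(*embeddings)]


def cosine(a, b):
    """Cosine similarity between two vectors; 0.0 if either is all zeros."""
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)


def select_training_sentences(source, target, threshold):
    """Keep source sentences whose embedding is at least `threshold`
    similar to the target corpus embedding."""
    vocab = sorted({w for s in source + target for w in s.lower().split()})
    target_vec = mean_embedding([encode(s, vocab) for s in target])
    return [s for s in source if cosine(encode(s, vocab), target_vec) >= threshold]


# Hypothetical corpora: a mixed source domain and a small legal target domain.
source = [
    "the court dismissed the appeal",
    "the striker scored a late goal",
    "the judge granted the motion",
]
target = [
    "the court granted the appeal",
    "the judge dismissed the motion",
]

selected = select_training_sentences(source, target, threshold=0.6)
print(selected)
```

With these toy inputs, the two legal-sounding source sentences pass the threshold and the sports sentence does not. In practice a learned sentence encoder would replace `encode`, and the threshold would be tuned to the corpora at hand.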

