The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Sep. 05, 2023

Filed:

Dec. 05, 2019
Applicant:

Meta Platforms, Inc., Menlo Park, CA (US);

Inventors:

Bichen Wu, Menlo Park, CA (US);

Peizhao Zhang, Fremont, CA (US);

Peter Vajda, Palo Alto, CA (US);

Xiaoliang Dai, Princeton, NJ (US);

Yanghan Wang, Sunnyvale, CA (US);

Yuandong Tian, San Carlos, CA (US);

Assignee:

META PLATFORMS, INC., Menlo Park, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/08 (2023.01); G06N 3/082 (2023.01); G06N 3/045 (2023.01); G06N 3/047 (2023.01); G06N 3/084 (2023.01);
U.S. Cl.
CPC ...
G06N 3/08 (2013.01); G06N 3/045 (2023.01); G06N 3/047 (2023.01); G06N 3/082 (2013.01); G06N 3/084 (2013.01);
Abstract

Computer implemented systems are described that implement a differentiable neural architecture search (DNAS) engine executing on one or more processors. The DNAS engine is configured with a stochastic super net defining a layer-wise search space having a plurality of candidate layers, each of the candidate layers specifying one or more operators for a neural network architecture. Further, the DNAS engine is configured to process training data to train weights for the operators in the stochastic super net based on a loss function representing a latency of the respective operator on a target platform, and to select a set of candidate neural network architectures from the trained stochastic super net. The DNAS engine may, for example, be configured to train the stochastic super net by traversing the layer-wise search space using gradient-based optimization of network architecture distribution.


Find Patent Forward Citations

Loading…