The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Apr. 13, 2021

Filed:

Jul. 22, 2019
Applicant:

Vmware, Inc., Palo Alto, CA (US);

Inventors:

Dev Nag, Palo Alto, CA (US);

Yanislav Yankov, Palo Alto, CA (US);

Dongni Wang, Palo Alto, CA (US);

Gregory T. Burk, Colorado Springs, CO (US);

Nicholas Mark Grant Stephen, Paris, FR;

Assignee:

VMware, Inc., Palo Alto, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 20/00 (2019.01); G06F 9/54 (2006.01); G06N 7/00 (2006.01);
U.S. Cl.
CPC ...
G06N 20/00 (2019.01); G06F 9/542 (2013.01); G06N 7/005 (2013.01);
Abstract

The current document is directed to automated reinforcement-learning-based application managers that that are trained using adversarial training. During adversarial training, potentially disadvantageous next actions are selected for issuance by an automated reinforcement-learning-based application manager at a lower frequency than selection of next actions, according to a policy that is learned to provide optimal or near-optimal control over a computing environment that includes one or more applications controlled by the automated reinforcement-learning-based application manager. By selecting disadvantageous actions, the automated reinforcement-learning-based application manager is forced to explore a much larger subset of the system-state space during training, so that, upon completion of training, the automated reinforcement-learning-based application manager has learned a more robust and complete optimal or near-optimal control policy than had the automated reinforcement-learning-based application manager been trained by simulators or using management actions and computing-environment responses recorded during previous controlled operation of a computing-environment.


Find Patent Forward Citations

Loading…