The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Sep. 05, 2023
Filed:
Feb. 18, 2019
Sumit Sanyal, Santa Cruz, CA (US);
Anil Hebbar, Santa Cruz, CA (US);
Abdul Puliyadan Kunnil Muneer, Bangalore, IN;
Abhinav Kaushik, Bangalore, IN;
Bharat Kumar Padi, Bangalore, IN;
Jeroen Bédorf, Heerhugowaard, NL;
Tijmen Tieleman, Diemen, NL;
Sumit Sanyal, Santa Cruz, CA (US);
Anil Hebbar, Santa Cruz, CA (US);
Abdul Puliyadan Kunnil Muneer, Bangalore, IN;
Abhinav Kaushik, Bangalore, IN;
Bharat Kumar Padi, Bangalore, IN;
Jeroen Bédorf, Heerhugowaard, NL;
Tijmen Tieleman, Diemen, NL;
Abstract
Reinforcement learning enables a framework of information technology assets that include software elements, computational hardware assets, and/or, bundled software and computational hardware systems and products. The performance of successive sessions of an inner loop reinforcement learning is directed and monitored by an outer loop reinforcement learning wherein the outer loop reinforcement learning is designed to reduce financial costs and computational asset requirements and/or optimize learning time in successive instantiations of inner loop reinforcement learning training sessions. The framework enables consideration of the license costs of domain specific simulators, the usage cost of hardware platforms, and the progress of a particular reinforcement learning training. The framework further enables reductions of these costs to orchestrate and train a neural network under budget constraints with respect to the available hardware and software licenses available at runtime. These improvements and optimizations may be performed by using heuristics and neural network algorithms.