The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 11, 2022

Filed:

Mar. 07, 2018
Applicant:

International Business Machines Corporation, Armonk, NY (US);

Inventors:

Tu-Hoa Pham, Koto, JP;

Giovanni De Magistris, Kawasaki, JP;

Ryuki Tachibana, Yokohama, JP;

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/08 (2006.01); G06N 5/04 (2006.01);
U.S. Cl.
CPC ...
G06N 3/08 (2013.01); G06N 5/04 (2013.01);
Abstract

A computer-implemented method, computer program product, and system are provided for deep reinforcement learning to control a subject device. The method includes training, by a processor, a neural network to receive state information of a target of the subject device as an input and provide action information for the target as an output. The method further includes inputting, by the processor, current state information of the target into the neural network to obtain current action information for the target. The method also includes correcting, by the processor, the current action information minimally to obtain corrected action information that meets a set of constraints. The method additionally includes performing an action by the subject device based on the corrected action information for the target to obtain a reward from the target.


Find Patent Forward Citations

Loading…