The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Nov. 01, 2022

Filed:
May 11, 2020

Applicant:
International Business Machines Corporation, Armonk, NY (US);

Inventor:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G05B 13/00 (2006.01); G06F 30/27 (2020.01); G05B 13/02 (2006.01); G05B 13/04 (2006.01); G06N 20/00 (2019.01); G06F 17/16 (2006.01); G06N 7/00 (2006.01); G06F 111/04 (2020.01); G06F 111/10 (2020.01);
U.S. Cl.
CPC ...
G06F 30/27 (2020.01); G05B 13/0265 (2013.01); G05B 13/048 (2013.01); G06F 17/16 (2013.01); G06N 7/005 (2013.01); G06N 20/00 (2019.01); G06F 2111/04 (2020.01); G06F 2111/10 (2020.01);
Abstract

A method for automatically reducing the dimensionality of a mathematical representation of a controlled application system is provided. The method includes receiving, at a control system, data corresponding to control action and system state variables relating to the controlled application system, fitting a constrained reinforcement learning (CRL) model to the controlled application system based on the data, and automatically identifying a subset of the system state variables by selecting control action variables of interest and identifying system state variables that drive the CRL model to recommend each control action variable of interest. The method also includes automatically performing state space dimensionality reduction of the CRL model using the subset of system state variables, estimating a transition probability matrix for a constrained Markov decision process (CMDP) model of the controlled application system, and formulating the CMDP model as a linear programming (LP) problem using the transition probability matrix and several costs.
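The last two steps of the abstract (estimating a transition probability matrix and casting the CMDP as an LP) can be illustrated with a minimal sketch. It is not the patented method: it assumes discrete, already-reduced states and actions, a count-based estimate of the transition probabilities, a discounted criterion, and the standard occupation-measure LP with a single side-cost constraint. All names (`estimate_transitions`, `build_cmdp_lp`, `side_cost`, `budget`, `gamma`, `mu`) and all numbers are hypothetical.

```python
def estimate_transitions(samples, n_states, n_actions):
    """Count-based estimate of P[s][a][s_next] from (s, a, s_next) samples."""
    counts = [[[0] * n_states for _ in range(n_actions)] for _ in range(n_states)]
    for s, a, s_next in samples:
        counts[s][a][s_next] += 1
    P = []
    for s in range(n_states):
        rows = []
        for a in range(n_actions):
            total = sum(counts[s][a])
            if total == 0:
                # Assumption: an unseen (state, action) pair gets a uniform row.
                rows.append([1.0 / n_states] * n_states)
            else:
                rows.append([c / total for c in counts[s][a]])
        P.append(rows)
    return P


def build_cmdp_lp(P, cost, side_cost, budget, gamma, mu):
    """Build LP data for a discounted CMDP over occupation measures x(s, a) >= 0.

    minimize    sum_{s,a} cost[s][a] * x(s, a)
    subject to  sum_a x(s', a) - gamma * sum_{s,a} P[s][a][s'] * x(s, a)
                    = (1 - gamma) * mu[s']          for every state s'
                sum_{s,a} side_cost[s][a] * x(s, a) <= budget
    """
    n_states, n_actions = len(P), len(P[0])
    n_vars = n_states * n_actions
    c = [0.0] * n_vars
    ub_row = [0.0] * n_vars
    A_eq = [[0.0] * n_vars for _ in range(n_states)]
    for s in range(n_states):
        for a in range(n_actions):
            j = s * n_actions + a  # column index of variable x(s, a)
            c[j] = cost[s][a]
            ub_row[j] = side_cost[s][a]
            for s_next in range(n_states):
                flow_out = 1.0 if s == s_next else 0.0
                A_eq[s_next][j] = flow_out - gamma * P[s][a][s_next]
    b_eq = [(1.0 - gamma) * mu[s] for s in range(n_states)]
    return c, [ub_row], [budget], A_eq, b_eq


# Toy data: two states, two actions, six observed transitions (made up).
samples = [(0, 0, 0), (0, 0, 1), (0, 1, 1), (1, 0, 0), (1, 1, 1), (1, 1, 1)]
P = estimate_transitions(samples, n_states=2, n_actions=2)
cost = [[1.0, 2.0], [0.5, 3.0]]
side_cost = [[0.0, 1.0], [0.0, 1.0]]
c, A_ub, b_ub, A_eq, b_eq = build_cmdp_lp(
    P, cost, side_cost, budget=0.5, gamma=0.9, mu=[0.5, 0.5]
)
```

The returned arrays follow the `c`, `A_ub`, `b_ub`, `A_eq`, `b_eq` layout that general-purpose LP solvers such as `scipy.optimize.linprog` accept. Under these assumptions, a policy could then be read off a solution by normalizing x(s, a) over the actions in each state.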

