The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

G05B 13/02 (2006.01); G06N 3/08 (2006.01); G06N 3/04 (2006.01); F24F 11/30 (2018.01); G05B 13/04 (2006.01); F24F 11/63 (2018.01);

U.S. Cl.

CPC ...

G05B 13/027 (2013.01); F24F 11/30 (2018.01); F24F 11/63 (2018.01); G05B 13/048 (2013.01); G06N 3/04 (2013.01); G06N 3/08 (2013.01);

Abstract

Systems and methods for training a reinforcement learning (RL) model for HVAC control are disclosed herein. A calibrated simulation model is used to train a surrogate model of the HVAC system operating within a building. The surrogate model is used to generate simulated experience data for the HVAC system. The simulated experience data can be used to train a reinforcement learning (RL) model of the HVAC system. The RL model is used to control the HVAC system based on the current state of the system and the best predicted action to perform in the current state. The HVAC system generates real experience data based on the actual operation of the HVAC system within the building. The real experience data is used to retrain the surrogate model, and additional simulated experience data is generated using the surrogate model. The RL model can be retrained using the additional simulated experience data.

Find Patent Forward Citations