The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, the date the patent was issued, the date it was filed, the title, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also links to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Feb. 07, 2023

Filed:
Dec. 23, 2019

Applicant:
Johnson Controls Tyco IP Holdings LLP, Milwaukee, WI (US);

Inventors:

Young M. Lee, Old Westbury, NY (US);

Zhanhong Jiang, Milwaukee, WI (US);

Viswanath Ramamurti, San Leandro, CA (US);

Sugumar Murugesan, Santa Clara, CA (US);

Kirk H. Drees, Cedarburg, WI (US);

Michael James Risbeck, Madison, WI (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G05B 13/02 (2006.01); G06N 3/08 (2006.01); G06N 3/04 (2006.01); F24F 11/30 (2018.01); G05B 13/04 (2006.01); F24F 11/63 (2018.01);
U.S. Cl.
CPC ...
G05B 13/027 (2013.01); F24F 11/30 (2018.01); F24F 11/63 (2018.01); G05B 13/048 (2013.01); G06N 3/04 (2013.01); G06N 3/08 (2013.01);
Abstract

Systems and methods for training a reinforcement learning (RL) model for HVAC control are disclosed herein. A calibrated simulation model is used to train a surrogate model of the HVAC system operating within a building. The surrogate model is used to generate simulated experience data for the HVAC system. The simulated experience data can be used to train a reinforcement learning (RL) model of the HVAC system. The RL model is used to control the HVAC system based on the current state of the system and the best predicted action to perform in the current state. The HVAC system generates real experience data based on the actual operation of the HVAC system within the building. The real experience data is used to retrain the surrogate model, and additional simulated experience data is generated using the surrogate model. The RL model can be retrained using the additional simulated experience data.
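
The loop described in the abstract can be illustrated with a toy sketch. This is not the patented implementation: the one-zone temperature dynamics, the per-action mean-change surrogate, the tabular Q-learning update, the reward, and every name below (calibrated_sim, real_building, fit_surrogate, and so on) are assumptions made only for illustration.

```python
import random

SETPOINT, T_MIN, T_MAX = 22, 18, 26   # toy zone temperatures (deg C), assumed
ACTIONS = (0, 1)                      # 0 = cooling off, 1 = cooling on

def calibrated_sim(temp, action):
    """Stand-in for the calibrated simulation model (hypothetical dynamics)."""
    return temp + (-1 if action else 1)

def real_building(temp, action):
    """Stand-in for the real HVAC system; cooling is stronger than simulated."""
    return temp + (-2 if action else 1)

def fit_surrogate(transitions):
    """Toy surrogate: mean temperature change observed for each action."""
    deltas = {}
    for temp, action, next_temp in transitions:
        deltas.setdefault(action, []).append(next_temp - temp)
    return {a: sum(d) / len(d) for a, d in deltas.items()}

def to_state(temp):
    """Clip to the modeled range and round to an integer table index."""
    return int(round(min(max(temp, T_MIN), T_MAX)))

def simulate_experience(surrogate, n, rng):
    """Generate simulated (state, action, reward, next_state) tuples."""
    data = []
    for _ in range(n):
        temp, action = rng.randint(T_MIN, T_MAX), rng.choice(ACTIONS)
        next_temp = temp + surrogate[action]
        # Assumed reward: comfort penalty plus a small energy cost for cooling.
        reward = -abs(next_temp - SETPOINT) - 0.1 * action
        data.append((temp, action, reward, to_state(next_temp)))
    return data

def train_rl(experience, q=None, alpha=0.5, gamma=0.9):
    """Tabular Q-learning over a batch of experience tuples."""
    if q is None:
        q = {(s, a): 0.0 for s in range(T_MIN, T_MAX + 1) for a in ACTIONS}
    for s, a, r, s2 in experience:
        target = r + gamma * max(q[(s2, b)] for b in ACTIONS)
        q[(s, a)] += alpha * (target - q[(s, a)])
    return q

def policy(q, temp):
    """Greedy action for the current state."""
    s = to_state(temp)
    return max(ACTIONS, key=lambda a: q[(s, a)])

rng = random.Random(0)

# 1. Train the surrogate on data from the calibrated simulation model.
sim_data = [(t, a, calibrated_sim(t, a))
            for t in range(T_MIN, T_MAX + 1) for a in ACTIONS]
surrogate = fit_surrogate(sim_data)

# 2. Train the RL model on simulated experience from the surrogate.
q = train_rl(simulate_experience(surrogate, 5000, rng))

# 3. Control the "real" system with the RL model and log real experience.
temp, real_data = 26, []
for _ in range(20):
    action = policy(q, temp)
    next_temp = real_building(temp, action)
    real_data.append((temp, action, next_temp))
    temp = next_temp

# 4. Retrain the surrogate on real experience, then retrain the RL model
#    on additional simulated experience.
surrogate.update(fit_surrogate(real_data))
q = train_rl(simulate_experience(surrogate, 5000, rng), q)
```

In this sketch the surrogate starts out matching the simulator and, after the retraining step, reflects the stronger cooling effect seen in the stand-in "real" system. A practical system would presumably use far richer models than a lookup of mean temperature changes and a Q-table.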