The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 08, 2024

Filed:

Sep. 23, 2020
Applicant:

Tata Consultancy Services Limited, Mumbai, IN;

Inventors:

Srinarayana Nagarathinam, Chennai, IN;

Avinash Achar, Chennai, IN;

Arunchandar Vasan, Chennai, IN;

Assignee:
Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G05B 13/02 (2006.01); F24F 11/62 (2018.01); G05B 15/02 (2006.01); G06N 3/088 (2023.01);
U.S. Cl.
CPC ...
G05B 13/027 (2013.01); F24F 11/62 (2018.01); G05B 15/02 (2013.01); G06N 3/088 (2013.01);
Abstract

Reinforcement Learning agent interacting with a real-world building to determine optimal policy may not be viable due to comfort constraints. Embodiments of the present disclosure provide multi-deep agent RL for dynamically controlling electrical equipment in buildings, wherein a simulation model is generated using design specification of (i) controllable electrical equipment (or subsystem) and (ii) building. Each RL agent is trained using simulation model and deployed in the subsystem. Reward function for each subsystem includes some portion of reward from other subsystem(s). Based on reward function of each RL agent, each RL agent learns an optimal control parameter during execution of RL agent in subsystem. Further, a global optimal control parameter list is generated using the optimal control parameter. The control parameters in the global optimal control parameters list are fine-tuned to improve subsystem's performance. Information on fine-tuning parameters of the subsystem and reward function are used for training RL agents.


Find Patent Forward Citations

Loading…