The patent badge is an abbreviated version of the USPTO patent document; it contains a link to the full patent document.

G06N 3/092 (2023.01); G06N 20/00 (2019.01); G05B 2219/32334 (2013.01); G05B 2219/33056 (2013.01); G05B 2219/34082 (2013.01); G05B 2219/40499 (2013.01); G06N 7/00 (2013.01)

Abstract

A method and an apparatus for exclusive reinforcement learning are provided. The method comprises: collecting information on states of an environment through a communication interface and performing a statistical analysis on the states using the collected information; determining, based on the results of the statistical analysis, a first state value of a first state among the states in a training phase and a second state value of a second state among the states in an inference phase; performing reinforcement learning by using one of a plurality of reinforcement learning units, which perform reinforcement learning from different perspectives, according to the first state value; and selecting one of the actions determined by the plurality of reinforcement learning units based on the second state value and applying the selected action to the environment.
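
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation, and everything concrete in it is an assumption made for illustration: scalar integer states, a binary state value obtained by comparing a state's distance from the mean against a multiple of the standard deviation, exactly two learning units, and tabular Q-learning inside each unit. The names `QUnit`, `ExclusiveRL`, `analyze`, `state_value`, `train_step`, `select_action` and `threshold` are hypothetical.

```python
import statistics
from collections import defaultdict


class QUnit:
    """Hypothetical tabular Q-learning unit (an assumption, not from the abstract)."""

    def __init__(self, n_actions, alpha=0.1, gamma=0.9):
        self.n_actions = n_actions
        self.alpha = alpha  # learning rate
        self.gamma = gamma  # discount factor
        self.q = defaultdict(lambda: [0.0] * n_actions)

    def act(self, state):
        # Greedy action; ties resolve to the lowest action index.
        values = self.q[state]
        return max(range(self.n_actions), key=values.__getitem__)

    def learn(self, state, action, reward, next_state):
        # Standard one-step Q-learning update.
        target = reward + self.gamma * max(self.q[next_state])
        self.q[state][action] += self.alpha * (target - self.q[state][action])


class ExclusiveRL:
    """Sketch: route each state to exactly one unit based on its state value."""

    def __init__(self, n_actions, threshold=1.0):
        # Two units that learn from different perspectives (assumed split).
        self.units = [QUnit(n_actions), QUnit(n_actions)]
        self.threshold = threshold
        self.mean = 0.0
        self.std = 1.0

    def analyze(self, collected_states):
        # Statistical analysis of the collected states (assumed: mean and std).
        self.mean = statistics.fmean(collected_states)
        self.std = statistics.pstdev(collected_states) or 1.0

    def state_value(self, state):
        # Assumed rule: 0 if the state lies near the mean, 1 otherwise.
        return int(abs(state - self.mean) > self.threshold * self.std)

    def train_step(self, state, action, reward, next_state):
        # Training phase: only the unit chosen by the state value learns.
        unit = self.units[self.state_value(state)]
        unit.learn(state, action, reward, next_state)

    def select_action(self, state):
        # Inference phase: every unit proposes an action, and the state
        # value decides which proposal is applied to the environment.
        candidates = [unit.act(state) for unit in self.units]
        return candidates[self.state_value(state)]


agent = ExclusiveRL(n_actions=2)
agent.analyze(list(range(11)))          # collected states 0..10
agent.train_step(5, 1, 1.0, 6)          # near the mean: unit 0 learns
agent.train_step(10, 1, 2.0, 10)        # far from the mean: unit 1 learns
chosen_central = agent.select_action(5)
chosen_peripheral = agent.select_action(10)
```

In this sketch, the two training transitions update different units, so each unit only ever sees states from its own region of the state distribution. How the real apparatus derives state values and how many units it uses are not specified in the abstract.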
