The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Mar. 03, 2020
Filed:
Jul. 19, 2019
Korea Internet & Security Agency, Jeollanam-do, KR;
Sung Taek Oh, Jeollanam-do, KR;
Woong Go, Jeollanam-do, KR;
Mi Joo Kim, Jeollanam-do, KR;
Jae Hyuk Lee, Jeollanam-do, KR;
Jun Hyung Park, Jeollanam-do, KR;
KOREA INTERNET & SECURITY AGENCY, Jeollanam-do, KR;
Abstract
There is provided a reinforcement learning method in which a discount factor is automatically adjusted, the method being executed by a computing device and comprising repeatedly training a reinforcement learning model, which determines an evaluation result of input data, using the input data, wherein the repeatedly training of the reinforcement learning model comprises obtaining first result data which is output as a result of inputting the input data to the reinforcement learning model. obtaining second result data which is the result of evaluating the input data using a first evaluation model. obtaining a first return which is the result of adding a discount factor to a first reward given in consideration of whether the first result data and the second result data match. training the reinforcement learning model using the first return and automatically adjusting the discount factor by considering the second result data.