The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
May. 09, 2023
Filed:
Sep. 13, 2018
Fujitsu Limited, Kawasaki, JP;
Okinawa Institute of Science and Technology School Corporation, Okinawa, JP;
Tomotake Sasaki, Kawasaki, JP;
Eiji Uchibe, Kunigami, JP;
Kenji Doya, Kunigami, JP;
Hirokazu Anai, Hachioji, JP;
Hitoshi Yanami, Kawasaki, JP;
Hidenao Iwane, Kawasaki, JP;
FUJITSU LIMITED KAWASAKI, JAPAN, Kawasaki, JP;
OKINAWA INSTITUTE OF SCIENCE AND TECHNOLOGY SCHOOL CORPORATION, Okinawa, JP;
Abstract
A non-transitory, computer-readable recording medium stores therein a reinforcement learning program that uses a value function and causes a computer to execute a process comprising: estimating first coefficients of the value function represented in a quadratic form of inputs at times in the past than a present time and outputs at the present time and the times in the past, the first coefficients being estimated based on inputs at the times in the past, the outputs at the present time and the times in the past, and costs or rewards that corresponds to the inputs at the times in the past; and determining second coefficients that defines a control law, based on the value function that uses the estimated first coefficients and determining input values at times after estimation of the first coefficients.