The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).
Patent No.:
Date of Patent:
Jul. 02, 2024
Filed:
Nov. 25, 2020
Beijing Didi Infinity Technology and Development Co., Ltd., Beijing, CN;
Kevin Knight, Los Angeles, CA (US);
Mariia Ryskina, Pittsburgh, PA (US);
Arkady Arkhangorodsky, Los Angeles, CA (US);
Ajay Nagesh, Los Angeles, CA (US);
Scot Fang, Los Angeles, CA (US);
Abstract
An automatic agent may be trained using reinforcement learning. A secret task may be obtained for a simulated user, and the secret task may be unknown to the automatic agent. At least one instruction to complete the secret task may be obtained from the simulated user according to at least one RL policy. At least one action may be generated by the automatic agent based on the at least one instruction and the at least one RL policy. Rewards may be determined for the simulated user and the automatic agent in response to determining that the at least one action successfully completes the secret task. The at least one RL policy may be adjusted based on the determined rewards.
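The loop described in the abstract can be illustrated with a toy sketch. This is not the patented method; it is a minimal tabular example under stated assumptions: tasks, instructions, and actions are small integers, the simulated user and the agent each keep their own value table (the abstract only says "at least one RL policy"), choices are epsilon-greedy, a failed episode yields a reward of 0, and the update is a simple running average toward the reward. All names (`N_TASKS`, `choose`, `run_episode`, `train`) are hypothetical.

```python
import random

N_TASKS = 3  # assumption: tasks, instructions, and actions are integers 0..N_TASKS-1


def choose(values, epsilon, rng):
    """Epsilon-greedy choice over a list of estimated values."""
    if rng.random() < epsilon:
        return rng.randrange(len(values))
    best = max(values)
    # Break ties between equally good options at random.
    return rng.choice([i for i, v in enumerate(values) if v == best])


def run_episode(user_q, agent_q, rng, epsilon=0.1, lr=0.2):
    """One episode: secret task -> instruction -> action -> shared reward -> update."""
    task = rng.randrange(N_TASKS)                        # secret task, hidden from the agent
    instruction = choose(user_q[task], epsilon, rng)     # simulated user's policy
    action = choose(agent_q[instruction], epsilon, rng)  # agent sees only the instruction
    reward = 1.0 if action == task else 0.0              # assumption: 0 reward on failure
    # Adjust both value tables toward the observed reward.
    user_q[task][instruction] += lr * (reward - user_q[task][instruction])
    agent_q[instruction][action] += lr * (reward - agent_q[instruction][action])
    return task, instruction, action, reward


def train(episodes=5000, seed=0):
    """Run many episodes and return both value tables plus the reward history."""
    rng = random.Random(seed)
    user_q = [[0.0] * N_TASKS for _ in range(N_TASKS)]
    agent_q = [[0.0] * N_TASKS for _ in range(N_TASKS)]
    rewards = [run_episode(user_q, agent_q, rng)[3] for _ in range(episodes)]
    return user_q, agent_q, rewards
```

Calling `train()` and averaging the most recent rewards gives a rough success rate for the pair; the sketch makes no claim about what that rate will be or how the patent's actual policies are parameterized.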