The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Feb. 20, 2024

Filed:
Dec. 22, 2020

Applicant:
Deepmind Technologies Limited, London, GB;

Inventors:
Gabriel Dulac-Arnold, Paris, FR;
Richard Andrew Evans, London, GB;
Benjamin Kenneth Coppin, Cottenham, GB;

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06N 3/08 (2023.01);
U.S. Cl.
CPC ...
G06N 3/08 (2013.01);
Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for selecting actions from large discrete action sets. One of the methods includes receiving a particular observation representing a particular state of an environment; and selecting an action from a discrete set of actions to be performed by an agent interacting with the environment, comprising: processing the particular observation using an actor policy network to generate an ideal point; determining, from the points that represent actions in the set, the k nearest points to the ideal point; for each nearest point of the k nearest points: processing the nearest point and the particular observation using a Q network to generate a respective Q value for the action represented by the nearest point; and selecting the action to be performed by the agent from the k actions represented by the k nearest points based on the Q values.
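
The selection procedure in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not specify the distance metric or how the Q values determine the final choice, so Euclidean distance and picking the highest Q value are assumptions made here. The names `k_nearest`, `select_action`, `toy_actor`, `toy_q`, and `ACTION_POINTS` are hypothetical, and the two toy functions are simple stand-ins for trained networks.

```python
import math
from typing import Callable, List, Sequence

Point = Sequence[float]


def k_nearest(ideal: Point, action_points: Sequence[Point], k: int) -> List[int]:
    """Return indices of the k action points closest to the ideal point.

    Assumption: Euclidean distance; the abstract does not name a metric.
    """
    order = sorted(range(len(action_points)),
                   key=lambda i: math.dist(ideal, action_points[i]))
    return order[:k]


def select_action(observation: Point,
                  action_points: Sequence[Point],
                  actor: Callable[[Point], Point],
                  q_network: Callable[[Point, Point], float],
                  k: int) -> int:
    """Pick one action index from a discrete set via actor, k-NN, and Q values."""
    # 1. The actor policy network maps the observation to an ideal point.
    ideal = actor(observation)
    # 2. Find the k points (each representing an action) nearest to it.
    candidates = k_nearest(ideal, action_points, k)
    # 3. Score each candidate with the Q network.
    q_values = {i: q_network(observation, action_points[i]) for i in candidates}
    # 4. Assumption: select the candidate with the highest Q value.
    return max(candidates, key=lambda i: q_values[i])


# Toy stand-ins for the trained networks (hypothetical, illustration only).
ACTION_POINTS = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (5.0, 5.0)]


def toy_actor(observation: Point) -> Point:
    # Pretend the ideal point is simply the observation itself.
    return tuple(observation)


def toy_q(observation: Point, action_point: Point) -> float:
    # Pretend actions with smaller coordinates are more valuable.
    return -(action_point[0] + action_point[1])


chosen = select_action((0.9, 0.1), ACTION_POINTS, toy_actor, toy_q, k=2)
print(chosen)  # prints 0
```

In this toy setup the nearest point to the ideal point is index 1, yet with k=2 the Q values favor index 0. This shows why the k candidates are re-scored rather than the single nearest neighbor being taken directly, while only k actions, not the whole set, are evaluated by the Q network.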
