The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 16, 2021

Filed:

Sep. 29, 2017
Applicant:

Alphaics Corporation, Wilmington, DE (US);

Inventor:

Nagendra Nagaraja, Bangalore, IN;

Assignee:

ALPHAICS CORPORATION, Wilmington, DE (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06N 20/00 (2019.01); G06N 3/08 (2006.01); G06N 5/04 (2006.01); G06N 7/00 (2006.01); G06N 3/04 (2006.01); G06N 3/00 (2006.01); G06N 3/06 (2006.01);
U.S. Cl.
CPC ...
G06N 3/08 (2013.01); G06N 3/004 (2013.01); G06N 3/0445 (2013.01); G06N 3/0454 (2013.01); G06N 3/06 (2013.01); G06N 5/043 (2013.01); G06N 7/005 (2013.01); G06N 20/00 (2019.01);
Abstract

The embodiments herein disclose a system and method for implementing reinforcement learning agents using a reinforcement learning processor. An application-domain specific instruction set (ASI) for implementing reinforcement learning agents and reward functions is created. Further, instructions are created by including at least one of the reinforcement learning agent ID vectors, the reinforcement learning environment ID vectors, and length of vector as an operand. The reinforcement learning agent ID vectors and the reinforcement learning environment ID vectors are pointers to a base address of an operations memory. Further, at least one of said reinforcement learning agent ID vector and reinforcement learning environment ID vector is embedded into operations associated with the decoded instruction. The instructions retrieved by agent ID vector indexed operation are executed using a second processor, and applied onto a group of reinforcement learning agents. The operations defined by the instructions are stored in an operations storage memory.


Find Patent Forward Citations

Loading…