The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 31, 2020

Filed:

Oct. 25, 2017
Applicant:

Dropbox, Inc., San Francisco, CA (US);

Inventors:

Aditi Jain, San Francisco, CA (US);

Manveer Singh Chawla, San Francisco, CA (US);

Thomas Berg, San Francisco, CA (US);

Swapnil Zarekar, San Francisco, CA (US);

Robert Kajic, San Francisco, CA (US);

Karandeep Johar, San Francisco, CA (US);

Aaron Feldstein, San Francisco, CA (US);

Walter Kim, San Francisco, CA (US);

Joe Nudell, San Francisco, CA (US);

Jenny Dong, San Francisco, CA (US);

Jared Wilson, San Francisco, CA (US);

Luke Thompson, San Francisco, CA (US);

David Kriegman, San Francisco, CA (US);

Assignee:

DROPBOX, INC., San Francisco, CA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
H04L 12/58 (2006.01); H04L 29/06 (2006.01); H04L 29/08 (2006.01); G06F 9/50 (2006.01);
U.S. Cl.
CPC ...
H04L 51/26 (2013.01); G06F 9/5038 (2013.01); H04L 51/36 (2013.01); H04L 63/102 (2013.01); H04L 67/22 (2013.01);
Abstract

Computer-implemented techniques include, during a delayed processing window, receiving reward data for arm actions taken, where the arm actions were chosen based on a previous version of an arm choice policy, and the previous version of the arm choice policy was determined based on a previous set of reward data for a previous set of arm actions taken. When the delayed processing window has closed, a new arm choice policy is determined based at least in part on the action-reward data, and the previous set of reward data and/or the previous arm choice policy. After a request to choose an arm choice is received, a particular arm action to take is determined based on the new arm choice policy. This chosen arm is provided in response to the request.


Find Patent Forward Citations

Loading…