The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 24, 2023

Filed:

Mar. 17, 2021
Applicant:

Hewlett Packard Enterprise Development Lp, Houston, TX (US);

Inventors:

Xiongbing Ou, Santa Clara, CA (US);

Thomas Anthony Phelan, Santa Clara, CA (US);

David Lee, Houston, TX (US);

Assignee:
Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 12/0811 (2016.01); G06F 16/182 (2019.01); G06F 12/121 (2016.01); G06F 9/54 (2006.01);
U.S. Cl.
CPC ...
G06F 12/0811 (2013.01); G06F 12/121 (2013.01); G06F 16/182 (2019.01); G06F 9/541 (2013.01); G06F 2212/1021 (2013.01);
Abstract

Embodiments described herein are generally directed to caching and data access improvements in a large scale data processing environment. According to an example, an agent running on a first worker node of a cluster receives a read request from a task. The worker node of the cluster to which the data at issue is mapped is identified. When the first worker node is the identified worker node, it is determined whether its cache contains the data; if so, the data is fetched from a remote data lake and the agent locally caches the data; otherwise, when the identified worker node is another worker node of the compute cluster, the data is fetched from a remote agent of that worker node. The agent responds to the read request with cached data, data returned by the remote data lake, or data returned by the remote data agent as the case may be.


Find Patent Forward Citations

Loading…