The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
May. 14, 2024

Filed:

Mar. 29, 2023
Applicant:

Snowflake Inc., Bozeman, MT (US);

Inventors:

Abdullah Al Mahmood, Bothell, WA (US);

Chong Han, Bellevue, WA (US);

Ganeshan Ramachandran Iyer, Bellevue, WA (US);

Jiaxing Liang, Bellevue, WA (US);

Nithin Mahesh, Kirkland, WA (US);

Yanrui Zhang, Mercer Island, WA (US);

Assignee:

Snowflake Inc., Bozeman, MT (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/23 (2019.01); G06F 16/174 (2019.01); G06F 16/27 (2019.01);
U.S. Cl.
CPC ...
G06F 16/2365 (2019.01); G06F 16/1748 (2019.01); G06F 16/27 (2019.01);
Abstract

Embodiments of the present disclosure provide techniques for deduplicating files during internal stage replication using a directory table of the replicated internal stage that is modified as a cache for storing and retrieving original file-level metadata for the replicated files. An initial list of candidate files for loading from the internal stage to a table of the target deployment is prepared based on the files listed in the internal stage, and refined using a directory table lookup. If there is any inconsistency between the files registered in the directory table and the files listed in the internal stage, the target deployment will inspect the user-defined file-level metadata to obtain original file-level metadata for each file that is present in the internal stage but not in the directory table. This information may be used during deduplication to ensure that no duplicate files are loaded.


Find Patent Forward Citations

Loading…