The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 13, 2026

Filed:

Feb. 09, 2024
Applicant:

Snap Inc., Santa Monica, CA (US);

Inventors:

Willi Menapace, Santa Monica, CA (US);

Aliaksandr Siarohin, Los Angeles, CA (US);

Ivan Skorokhodov, Los Angeles, CA (US);

Sergey Tulyakov, Santa Monica, CA (US);

Assignee:

Snap Inc., Santa Monica, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06T 11/00 (2006.01); G06T 3/4007 (2024.01); G06T 3/4046 (2024.01); G06T 5/60 (2024.01); G06T 5/70 (2024.01);
U.S. Cl.
CPC ...
G06T 11/00 (2013.01); G06T 3/4007 (2013.01); G06T 3/4046 (2013.01); G06T 5/60 (2024.01); G06T 5/70 (2024.01); G06T 2207/10016 (2013.01); G06T 2207/20016 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/20084 (2013.01);
Abstract

Hierarchical patch-wise diffusion models (HPDMs) use a diffusion paradigm that learns a hierarchical distribution of patches instead of whole videos for efficient patch-wise training of diffusion models. To enforce consistency between the patches, deep context fusion may be used to propagate the context information from low-scale to high-scale patches in a hierarchical manner. To accelerate patch-wise training and inference, adaptive computation also may be used to allocate more computational resources and network capacity towards coarse image details and to cheapen synthesis of high-frequency texture details. All the processing stages are jointly trained to provide spatially aligned global context to the higher levels of the cascade. As a result, the model does not operate on the full-resolution inputs, which allows the model to be trained on high-resolution video datasets in an end-to-end fashion.


Find Patent Forward Citations

Loading…