The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Dec. 26, 2023

Filed:

Jul. 12, 2021
Applicant:

Spree3d Corporation, Incline Village, NV (US);

Inventors:

Mohamed N. Moustafa, Metuchen, NJ (US);

Ahmed A. Ewais, New Cairo, EG;

Amr A. Ali, Cairo, EG;

Assignee:

Spree3D Corporation, Incline Village, NV (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G11B 27/02 (2006.01); G06N 3/08 (2023.01); G06T 9/00 (2006.01); G06N 3/045 (2023.01);
U.S. Cl.
CPC ...
G11B 27/02 (2013.01); G06N 3/045 (2023.01); G06N 3/08 (2013.01); G06T 9/002 (2013.01);
Abstract

Apparati, methods, and computer readable media for inserting identity information from a source image (static image or video) () into a destination video (), while mimicking motion of the destination video (). In an apparatus embodiment, an identity encoder () is configured to encode identity information of the source image (). When source image () is a multi-frame static image or a video, an identity code aggregator () is positioned at an output of the identity encoder (), and produces an identity vector (). A driver encoder () is coupled to the destination (driver) video (), and has two components: a pose encoder () configured to encode pose information of the destination video (), and a motion encoder () configured to separately encode motion information of the destination video (). The driver encoder () produces two vectors: a pose vector () and a motion vector (). A neural network generator () has three inputs: the identity vector (), the pose vector (), and the motion vector (). The neural network generator () is configured to generate, in response to these three inputs, a composite video () comprising identity information of the source image () inserted into the destination video (), where the composite video () has substantially the same temporal information as the destination video ().


Find Patent Forward Citations

Loading…