The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 01, 2025

Filed:

Dec. 15, 2020
Applicant:

Deepbrain Ai Inc., Seoul, KR;

Inventor:

Gyeongsu Chae, Seoul, KR;

Assignee:

DEEPBRAIN AI INC., Seoul, KR;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06V 20/40 (2022.01); G06V 10/82 (2022.01); G06V 40/16 (2022.01); G10L 15/02 (2006.01); G10L 25/30 (2013.01); G10L 25/57 (2013.01);
U.S. Cl.
CPC ...
G06V 20/46 (2022.01); G06V 10/82 (2022.01); G06V 40/171 (2022.01); G10L 15/02 (2013.01); G10L 25/30 (2013.01); G10L 25/57 (2013.01);
Abstract

A speech video generation device according to an embodiment includes a first encoder, which receives an input of a person background image that is a video part in a speech video of a predetermined person, and extracts an image feature vector from the person background image, a second encoder, which receives an input of a speech audio signal that is an audio part in the speech video, and extracts a voice feature vector from the speech audio signal, a combining unit, which generates a combined vector by combining the image feature vector output from the first encoder and the voice feature vector output from the second encoder, a first decoder, which reconstructs the speech video of the person using the combined vector as an input, and a second decoder, which predicts a landmark of the speech video using the combined vector as an input.


Find Patent Forward Citations

Loading…