The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 25, 2025

Filed:

Jul. 19, 2024
Applicant:

Nanjing Silicon Intelligence Technology Co., Ltd., Nanjing, CN;

Inventors:

Huapeng Sima, Nanjing, CN;

Maolin Zhang, Nanjing, CN;

Liyan Mao, Nanjing, CN;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06T 13/00 (2011.01); G06T 7/70 (2017.01); G06T 7/73 (2017.01); G06T 13/20 (2011.01); G06V 10/77 (2022.01); G06V 10/778 (2022.01); G06V 20/40 (2022.01); G06V 40/16 (2022.01); G10L 15/02 (2006.01); G10L 25/24 (2013.01);
U.S. Cl.
CPC ...
G06T 13/00 (2013.01); G06T 7/74 (2017.01); G06V 10/7715 (2022.01); G06V 10/778 (2022.01); G06V 20/46 (2022.01); G06V 40/174 (2022.01); G10L 15/02 (2013.01); G10L 25/24 (2013.01); G06T 2207/10016 (2013.01); G06T 2207/20081 (2013.01); G06T 2207/30201 (2013.01);
Abstract

Disclosed are a method for generating a dynamic image based on audio, a device, and a storage medium, relating to the field of natural human-computer interactions. The method includes: obtaining a reference image and reference audio input by a user; determining a target head pose feature and a target expression coefficient feature based on the reference image and a trained generation network model, and adjusting the trained generation network model based on the target head pose feature and the target expression coefficient feature, to obtain a target generation network model; and processing a to-be-processed image based on the reference audio, the reference image, and the target generation network model, to obtain a target dynamic image. An image object in the to-be-processed image is same as that in the reference image. In this case, a corresponding digital person can be obtained based on a single picture of a target person.


Find Patent Forward Citations

Loading…