The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Patent No.:

US 11514925 B1

Date of Patent:

Nov. 29, 2022

Filed:

Apr. 30, 2020

Using a predictive model to automatically enhance audio having various audio quality issues

Applicants:

Adobe Inc., San Jose, CA (US);

The Trustees of Princeton University, Princeton, NJ (US);

Inventors:

Zeyu Jin, San Jose, CA (US);

Jiaqi Su, San Jose, CA (US);

Adam Finkelstein, Princeton, NJ (US);

Assignees:

ADOBE INC., San Jose, CA (US);

THE TRUSTEES OF PRINCETON UNIVERSITY, Princeton, NJ (US);

Attorney:

Kilpatrick Townsend & Stockton LLP

Primary Examiner:

Linda Wong

Int. Cl.

CPC ...

G10L 21/0364 (2013.01); G10L 25/30 (2013.01); G10L 25/18 (2013.01); G06N 3/08 (2006.01); G06N 3/04 (2006.01);

U.S. Cl.

CPC ...

G10L 21/0364 (2013.01); G06N 3/0454 (2013.01); G06N 3/084 (2013.01); G10L 25/18 (2013.01); G10L 25/30 (2013.01);

Abstract

Operations of a method include receiving a request to enhance a new source audio. Responsive to the request, the new source audio is input into a prediction model that was previously trained. Training the prediction model includes providing a generative adversarial network including the prediction model and a discriminator. Training data is obtained including tuples of source audios and target audios, each tuple including a source audio and a corresponding target audio. During training, the prediction model generates predicted audios based on the source audios. Training further includes applying a loss function to the predicted audios and the target audios, where the loss function incorporates a combination of a spectrogram loss and an adversarial loss. The prediction model is updated to optimize that loss function. After training, based on the new source audio, the prediction model generates a new predicted audio as an enhanced version of the new source audio.

Find Patent Forward Citations