The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 28, 2023

Filed:

Apr. 14, 2020
Applicant:

Sony Interactive Entertainment Inc., Tokyo, JP;

Inventor:

Sudha Krishnamurthy, Foster City, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/68 (2019.01); G06N 20/00 (2019.01); G10L 15/16 (2006.01); G06N 3/08 (2006.01); G06N 3/04 (2006.01); G06N 3/084 (2023.01);
U.S. Cl.
CPC ...
G06N 3/084 (2013.01); G06F 16/68 (2019.01); G06N 3/0454 (2013.01); G06N 20/00 (2019.01); G10L 15/16 (2013.01);
Abstract

An automated method, system, and computer readable medium for generating sound effect recommendations for visual input by training machine learning models that learn audio-visual correlations from a reference image or video, a positive audio signal, and a negative audio signal. A machine learning algorithm is used with a reference visual input, a positive audio signal input or a negative audio signal input to train a multimodal clustering neural network to output representations for the visual input and audio input as well as correlation scores between the audio and visual representations. The trained multimodal clustering neural network is configured to learn representations in such a way that the visual representation and positive audio representation have higher correlation scores than the visual representation and a negative audio representation or an unrelated audio representation.


Find Patent Forward Citations

Loading…