The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 30, 2010

Filed:

Sep. 10, 2007
Applicants:

John R. Hershey, San Diego, CA (US);

Trausti Thor Kristajanson, Redmond, WA (US);

Hagai Attias, Seattle, WA (US);

Nebojsa Jojic, Redmond, WA (US);

Inventors:

John R. Hershey, San Diego, CA (US);

Trausti Thor Kristajanson, Redmond, WA (US);

Hagai Attias, Seattle, WA (US);

Nebojsa Jojic, Redmond, WA (US);

Assignee:

Microsoft Corporation, Redmond, WA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G10L 21/02 (2006.01); G10L 11/00 (2006.01); G06K 9/00 (2006.01);
U.S. Cl.
CPC ...
Abstract

A system and method facilitating speech detection and/or enhancement utilizing audio/video fusion is provided. The present invention fuses audio and video in a probabilistic generative model that implements cross-model, self-supervised learning, enabling rapid adaptation to audio visual data. The system can learn to detect and enhance speech in noise given only a short (e.g., 30 second) sequence of audio-visual data. In addition, it automatically learns to track the lips as they move around in the video.


Find Patent Forward Citations

Loading…