The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 27, 2019

Filed:

Nov. 13, 2006
Applicants:

J. Carl Cooper, Los Gatos, CA (US);

Mirko Dusan Vojnovic, Santa Clara, CA (US);

Jibanananda Roy, Kolkata, IN;

Saurabh Jain, Kolkata, IN;

Christopher Smith, Simsbury, CT (US);

Inventors:

J. Carl Cooper, Los Gatos, CA (US);

Mirko Dusan Vojnovic, Santa Clara, CA (US);

Jibanananda Roy, Kolkata, IN;

Saurabh Jain, Kolkata, IN;

Christopher Smith, Simsbury, CT (US);

Assignee:

Other;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
H04N 9/475 (2006.01); H04N 21/44 (2011.01); G06K 9/00 (2006.01); H04N 17/00 (2006.01); H04N 21/2368 (2011.01); H04N 21/434 (2011.01); H04N 5/04 (2006.01);
U.S. Cl.
CPC ...
H04N 21/44008 (2013.01); G06K 9/00335 (2013.01); H04N 17/00 (2013.01); H04N 21/2368 (2013.01); H04N 21/4341 (2013.01); H04N 5/04 (2013.01);
Abstract

Method, system, and program product for measuring audio video synchronization. This is done by first acquiring audio video information into an audio video synchronization system. The step of data acquisition is followed by analyzing the audio information, and analyzing the video information. Next, the audio information is analyzed to locate the presence of sounds therein related to a speaker's personal voice characteristics. In Analysis Phase Audio and Video MuEv-s are calculated from the audio and video information, and the audio and video information is classified into vowel sounds including AA, EE, OO, B, V, TH, F, silence, other sounds, and unclassified phonemes. The inner space between the lips are also identified and determined. This information is used to determine and associate a dominant audio class in a video frame. Matching locations are determined, and the offset of video and audio is determined.


Find Patent Forward Citations

Loading…