The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Oct. 28, 2014

Filed:

May. 21, 2002
Applicants:

Amit Bagga, Green Brook, NJ (US);

Jianying HU, Cranford, NJ (US);

Jialin Zhong, Berkeley Heights, NJ (US);

Inventors:

Amit Bagga, Green Brook, NJ (US);

Jianying Hu, Cranford, NJ (US);

Jialin Zhong, Berkeley Heights, NJ (US);

Assignee:

Avaya Inc., Basking Ridge, NJ (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
H04N 5/14 (2006.01); G06F 17/30 (2006.01); G06K 9/18 (2006.01); G06K 9/62 (2006.01); G06K 9/00 (2006.01); G06K 9/72 (2006.01); G06F 3/01 (2006.01); G11B 27/28 (2006.01); H04N 5/073 (2006.01); H04N 5/445 (2011.01);
U.S. Cl.
CPC ...
G06F 17/30796 (2013.01); H04N 5/145 (2013.01); H04N 5/144 (2013.01); G06K 9/18 (2013.01); G06K 9/6202 (2013.01); G06K 9/00664 (2013.01); G06K 9/6218 (2013.01); G06K 9/72 (2013.01); G06K 9/6203 (2013.01); G06K 9/00456 (2013.01); G06F 3/018 (2013.01); G06F 17/30802 (2013.01); G06K 9/00718 (2013.01); G11B 27/28 (2013.01); H04N 5/073 (2013.01); H04N 5/147 (2013.01); H04N 5/44504 (2013.01);
Abstract

Techniques are presented for analyzing audio-video segments, usually from multiple sources. A combined similarity measure is determined from text similarities and video similarities. The text and video similarities measure similarity between audio-video scenes for text and video, respectively. The combined similarity measure is then used to determine similar scenes in the audio-video segments. When the audio-video segments are from multiple audio-video sources, the similar scenes are common scenes in the audio-video segments. Similarities may be converted to or measured by distance. Distance matrices may be determined by using the similarity matrices. The text and video distance matrices are normalized before the combined similarity matrix is determined. Clustering is performed using distance values determined from the combined similarity matrix. Resulting clusters are examined and a cluster is considered to represent a common scene between two or more different audio-video segments when scenes in the cluster are similar.


Find Patent Forward Citations

Loading…