The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Apr. 04, 2017

Filed:

Jan. 14, 2016
Applicants:

Ramasamy Govindaraju Balamurali, Los Angeles, CA (US);

Chandra Rajagopal, Los Angeles, CA (US);

Inventors:

Ramasamy Govindaraju Balamurali, Los Angeles, CA (US);

Chandra Rajagopal, Los Angeles, CA (US);

Assignee:

AUDYSSEY LABORATORIES, INC., Los Angeles, CA (US);

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G10L 21/00 (2013.01); G10L 25/81 (2013.01); G10L 25/21 (2013.01); G10L 25/06 (2013.01); G10L 19/26 (2013.01);
U.S. Cl.
CPC ...
G10L 25/81 (2013.01); G10L 19/26 (2013.01); G10L 25/06 (2013.01); G10L 25/21 (2013.01);
Abstract

A speech/music discrimination method evaluates the standard deviation between envelope peaks, loudness ratio, and smoothed energy difference. The envelope is searched for peaks above a threshold. The standard deviations of the separations between peaks are calculated. Decreased standard deviation is indicative of speech, higher standard deviation is indicative of non-speech. The ratio between minimum and maximum loudness in recent input signal data frames is calculated. If this ratio corresponds to the dynamic range characteristic of speech, it is another indication that the input signal is speech content. Smoothed energies of the frames from the left and right input channels are computed and compared. Similar (e.g., highly correlated) left and right channel smoothed energies is indicative of speech. Dissimilar (e.g., un-correlated content) left and right channel smoothed energies is indicative of non-speech material. The results of the three tests are compared to make a speech/music decision.


Find Patent Forward Citations

Loading…