The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
May 13, 2025

Filed:

Mar. 10, 2022
Applicant:

Roku, Inc., San Jose, CA (US);

Inventors:

David Henry Friedman, Austin, TX (US);

Alan Robert Bithell, Santa Clara, CA (US);

Robert Caston Curtis, Napa, CA (US);

Assignee:

Roku, Inc., San Jose, CA (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G10L 25/81 (2013.01); G10L 15/08 (2006.01); G10L 21/02 (2013.01); G10L 25/18 (2013.01);
U.S. Cl.
CPC ...
G10L 21/02 (2013.01); G10L 15/08 (2013.01); G10L 25/18 (2013.01);
Abstract

A method for dynamically controlling enhancement of an audio stream is provided, where the audio stream defines a sequence of audio segments over time. Each audio segment defines a waveform having a plurality of waveform attributes. For each audio segment of the sequence of audio segments, the method includes: (i) determining a set of waveform-attribute values of the audio segment's waveform attributes, (ii) computing a first distance between the determined set of waveform-attribute values and a first predefined set of waveform-attribute values representative of speech, and computing a second distance between the determined set of waveform-attribute values and a second predefined set of waveform-attribute values representative of music, (iii) using the computed first and second distances as a basis to classify the audio segment as primarily speech or rather primarily music, and (iv) controlling, based on the classifying, whether or not to enhance the audio segment for output.
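
The four steps in the abstract can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration, not taken from the patent: the choice of two waveform attributes (zero-crossing rate and RMS level), the reference values in SPEECH_REF and MUSIC_REF, the use of Euclidean distance, and the policy of enhancing only segments classified as speech. The function names are hypothetical as well.

```python
import math
from typing import List, Sequence, Tuple

# Hypothetical reference attribute vectors (zero-crossing rate, RMS level).
# The abstract does not state which attributes or values are used.
SPEECH_REF = (0.12, 0.30)
MUSIC_REF = (0.05, 0.60)


def waveform_attributes(samples: Sequence[float]) -> Tuple[float, float]:
    """Step (i): compute illustrative attribute values for one segment."""
    n = len(samples)
    if n == 0:
        return (0.0, 0.0)
    # Count sign changes between consecutive samples.
    crossings = sum(
        1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0)
    )
    zcr = crossings / (n - 1) if n > 1 else 0.0
    rms = math.sqrt(sum(s * s for s in samples) / n)
    return (zcr, rms)


def classify(attrs: Sequence[float]) -> str:
    """Steps (ii) and (iii): compare distances to the two reference sets."""
    d_speech = math.dist(attrs, SPEECH_REF)  # first distance
    d_music = math.dist(attrs, MUSIC_REF)    # second distance
    # Ties go to speech; this tie-break is an arbitrary choice.
    return "speech" if d_speech <= d_music else "music"


def control_enhancement(
    segments: Sequence[Sequence[float]],
) -> List[Tuple[str, bool]]:
    """Step (iv): decide per segment whether to enhance.

    Assumed policy: enhance only segments classified as speech.
    """
    decisions = []
    for segment in segments:
        label = classify(waveform_attributes(segment))
        decisions.append((label, label == "speech"))
    return decisions
```

Calling control_enhancement with a list of sample sequences returns one (label, enhance) pair per segment, so a playback loop could consult the flag before applying any enhancement filter.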

