The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventors, assignee, attorney firm, primary examiner, assistant examiner, CPC classifications, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Oct. 05, 2021

Filed:
Dec. 30, 2020

Applicant:

SAS Institute Inc., Cary, NC (US);

Inventors:

Xiaozhuo Cheng, Cary, NC (US);

Xu Yang, Cary, NC (US);

Xiaolong Li, Cary, NC (US);

Assignee:

SAS INSTITUTE INC., Cary, NC (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/16 (2006.01); G10L 15/02 (2006.01); G10L 15/26 (2006.01); G10L 15/04 (2013.01); G10L 25/78 (2013.01); G06N 3/08 (2006.01); G06N 3/04 (2006.01); G10L 25/30 (2013.01);
U.S. Cl.
CPC ...
G10L 15/26 (2013.01); G06N 3/0454 (2013.01); G06N 3/08 (2013.01); G10L 15/02 (2013.01); G10L 15/04 (2013.01); G10L 25/30 (2013.01); G10L 25/78 (2013.01); G10L 2025/783 (2013.01);
Abstract

An apparatus includes processor(s) to: divide a speech data set into multiple data chunks that each represent a chunk of speech audio; derive a threshold amplitude based on at least one peak amplitude of the speech audio; designate each data chunk with a peak amplitude below the threshold amplitude a pause data chunk; within a set of temporally consecutive data chunks of the multiple data chunks, identify a longest subset of temporally consecutive pause data chunks; within the set of temporally consecutive data chunks, designate the longest subset of temporally consecutive pause data chunks as a likely sentence pause of a candidate set of likely sentence pauses; based on at least the candidate set, divide the speech data set into multiple data segments that each represent a speech segment of the speech audio; and perform speech-to-text conversion, to identify a sentence spoken in each speech segment.
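
The pause-detection steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the fixed chunk size, the choice of threshold as a fraction of the overall peak amplitude, the fixed-size window of consecutive chunks, and cutting at the midpoint of each pause are all assumptions made for illustration. The speech-to-text step is omitted.

```python
from typing import List, Optional, Tuple


def chunk_peaks(samples: List[float], chunk_size: int) -> List[float]:
    """Split samples into fixed-size chunks and return each chunk's peak amplitude."""
    return [
        max(abs(s) for s in samples[i:i + chunk_size])
        for i in range(0, len(samples), chunk_size)
    ]


def pause_flags(peaks: List[float], fraction: float = 0.1) -> List[bool]:
    """Mark chunks whose peak is below a threshold.

    Assumption: the threshold is a fixed fraction of the largest peak.
    """
    threshold = fraction * max(peaks)
    return [p < threshold for p in peaks]


def longest_pause_run(flags: List[bool], start: int, end: int) -> Optional[Tuple[int, int]]:
    """Return (first_index, length) of the longest run of pause chunks in flags[start:end]."""
    best: Optional[Tuple[int, int]] = None
    run_start: Optional[int] = None
    for i in range(start, end + 1):  # one extra step closes a run that reaches `end`
        is_pause = i < end and flags[i]
        if is_pause:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            length = i - run_start
            if best is None or length > best[1]:
                best = (run_start, length)
            run_start = None
    return best


def candidate_pauses(flags: List[bool], window: int) -> List[Tuple[int, int]]:
    """Pick the longest pause run inside each window of consecutive chunks.

    Assumption: windows are fixed-size and non-overlapping.
    """
    candidates = []
    for start in range(0, len(flags), window):
        run = longest_pause_run(flags, start, min(start + window, len(flags)))
        if run is not None:
            candidates.append(run)
    return candidates


def split_points(candidates: List[Tuple[int, int]], chunk_size: int) -> List[int]:
    """Convert each candidate pause to a sample index at the middle of the pause."""
    return [(first + length // 2) * chunk_size for first, length in candidates]


if __name__ == "__main__":
    audio = [5, 6, 0, 0, 0, 7, 8, 0, 9, 4, 3, 0, 0, 2, 0, 0, 0, 0, 6, 5]
    flags = pause_flags(chunk_peaks(audio, chunk_size=1))
    print(split_points(candidate_pauses(flags, window=10), chunk_size=1))
```

In this toy input, each window of ten chunks contributes its longest run of quiet chunks as one candidate sentence pause, and the audio would then be cut at the midpoint of each candidate before being passed to a speech-to-text stage.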

