The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:

Oct. 03, 2023

Filed:

Nov. 28, 2022

Applicant:

SAS Institute Inc., Cary, NC (US);

Inventors:

Xiaolong Li, Cary, NC (US);

Xiaozhuo Cheng, Cary, NC (US);

Samuel Norris Henderson, Raleigh, NC (US);

Xu Yang, Cary, NC (US);

Assignee:

SAS Institute Inc., Cary, NC (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 15/26 (2006.01); G10L 15/22 (2006.01); G10L 15/02 (2006.01); G10L 15/04 (2013.01); G10L 25/78 (2013.01); G10L 25/30 (2013.01);
U.S. Cl.
CPC ...
G10L 15/26 (2013.01); G10L 15/02 (2013.01); G10L 15/04 (2013.01); G10L 25/30 (2013.01); G10L 25/78 (2013.01); G10L 2025/783 (2013.01);
Abstract

An apparatus includes a processor to: receive a request to perform speech-to-text conversion of a speech data set; perform pause detection to identify a set of likely sentence pauses and/or a speaker diarization technique to identify a set of likely speaker changes; based on the set of likely sentence pauses and/or the set of likely speaker changes, divide the speech data set into data segments representing speech segments; use an acoustic model with the data segments to derive sets of probabilities of speech sounds uttered; store the sets of probabilities in temporal order within a buffer queue; distribute the sets of probabilities from the buffer queue in temporal order among threads of a thread pool; and within each thread, and based on set(s) of probabilities, derive one candidate word and select either the candidate word or an alternate candidate word derived from a language model as the next word most likely spoken.
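The flow described in the abstract can be illustrated with a small sketch. This is not the patented implementation: the function names, the energy-threshold pause rule, the confidence cutoff, and the toy probability tables are all assumptions made up for illustration, and word-level probabilities stand in for the "probabilities of speech sounds" an acoustic model would produce. The sketch only shows the general shape: segment on pauses, put probability sets into a queue in temporal order, let pool threads take them in that order, and choose between the acoustic candidate and a language-model alternate.

```python
from concurrent.futures import ThreadPoolExecutor
from queue import Empty, Queue

# Hypothetical acoustic-model output: one probability set per time slot.
PROB_SETS = [
    {"the": 0.9, "a": 0.1},
    {"cat": 0.45, "cap": 0.55},
    {"sat": 0.8, "sad": 0.2},
]
# Hypothetical language-model scores for each word.
LANGUAGE_MODEL = {"the": 0.3, "a": 0.3, "cat": 0.3, "cap": 0.05, "sat": 0.2, "sad": 0.1}


def split_on_pauses(energies, threshold=0.1, min_pause=2):
    """Group frame indices into segments, cutting wherever at least
    `min_pause` consecutive frames have energy below `threshold`."""
    segments, current, quiet = [], [], 0
    for i, energy in enumerate(energies):
        if energy < threshold:
            quiet += 1
            if quiet == min_pause and current:
                segments.append(current)
                current = []
        else:
            quiet = 0
            current.append(i)
    if current:
        segments.append(current)
    return segments


def choose_word(probs, language_model, min_confidence=0.6):
    """Keep the acoustic candidate if it is confident enough; otherwise
    pick the word with the best acoustic score times language-model score."""
    candidate = max(probs, key=probs.get)
    if probs[candidate] >= min_confidence:
        return candidate
    return max(probs, key=lambda w: probs[w] * language_model.get(w, 0.0))


def drain(buffer, language_model, results):
    """Worker loop: take items from the queue until it is empty."""
    while True:
        try:
            index, probs = buffer.get_nowait()
        except Empty:
            return
        results[index] = choose_word(probs, language_model)


def decode(prob_sets, language_model, workers=2):
    """Queue probability sets in temporal order and decode them on a thread pool."""
    buffer = Queue()
    for index, probs in enumerate(prob_sets):
        buffer.put((index, probs))  # FIFO queue preserves temporal order
    results = [None] * len(prob_sets)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [pool.submit(drain, buffer, language_model, results)
                   for _ in range(workers)]
        for future in futures:
            future.result()  # re-raise any exception from a worker
    return results


if __name__ == "__main__":
    print(split_on_pauses([0.5, 0.6, 0.0, 0.0, 0.7, 0.0, 0.8]))
    print(decode(PROB_SETS, LANGUAGE_MODEL))
```

In this toy data the second slot's acoustic candidate ("cap", 0.55) falls below the assumed cutoff, so the language-model rescoring selects "cat" instead. Each result is written back by its slot index, so the output stays in temporal order regardless of which thread finishes first.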

