The patent badge is an abbreviated version of the USPTO patent document. It lists the patent number, issue date, filing date, title, applicant, inventors, assignee, attorney firm, primary examiner, assistant examiner, CPC classifications, and abstract. The badge also links to the full patent document in PDF (Adobe Acrobat) format.

Patent No.:

US 12211517 B1

Date of Patent:

Jan. 28, 2025

Filed:

Sep. 15, 2021

Endpointing in speech processing

Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

Roland Maximilian Rolf Maas, Seattle, WA (US);

Bjorn Hoffmeister, Seattle, WA (US);

Ariya Rastrow, Seattle, WA (US);

James Garnet Droppo, Carnation, WA (US);

Veerdhawal Pande, Walpole, MA (US);

Maarten Van Segbroeck, San Diego, CA (US);

Gautam Tiwari, Fremont, CA (US);

Andrew Smith, Seattle, WA (US);

Eli Joshua Fidler, Toronto (CA);

Assignee:

Amazon Technologies, Inc., Seattle, WA (US);

Attorney:

Pierce Atwood LLP

Primary Examiner:

Daniel C Washburn

Assistant Examiner:

Athar N Pasha

Int. Cl.

CPC ...

G10L 25/78 (2013.01); G06N 3/045 (2023.01); G10L 15/26 (2006.01); G10L 25/30 (2013.01);

U.S. Cl.

CPC ...

G10L 25/78 (2013.01); G06N 3/045 (2023.01); G10L 15/26 (2013.01); G10L 25/30 (2013.01); G10L 2025/783 (2013.01);

Abstract

A speech-processing system may determine potential endpoints in a user's speech. Such endpoint prediction may include determining a potential endpoint in a stream of audio data, and may additionally include determining an endpoint score representing a likelihood that the potential endpoint represents an end of speech representing a complete user input. When the potential endpoint has been determined, the system may publish a transcript of speech that preceded the potential endpoint, and send it to downstream components. The system may continue to transcribe audio data and determine additional potential endpoints while the downstream components process the transcript. The downstream components may determine whether the transcript is complete; e.g., represents the entirety of the user input. Final endpoint determinations may be made based on the results of the downstream processing including automatic speech recognition, natural language understanding, etc.
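
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. Every name here (`Segment`, `EndpointDecision`, `find_final_endpoint`, `toy_is_complete`) is hypothetical, the `min_score` threshold is an assumption, and the downstream check runs sequentially rather than in parallel with continued transcription as the abstract describes.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Optional


@dataclass
class Segment:
    """Speech recognized up to a pause (hypothetical structure)."""
    text: str              # words recognized since the previous pause
    endpoint_score: float  # assumed likelihood in [0, 1] that the pause ends the input


@dataclass
class EndpointDecision:
    transcript: str         # transcript preceding the final endpoint
    segments_consumed: int  # number of segments heard before finalizing


def find_final_endpoint(
    segments: Iterable[Segment],
    is_complete: Callable[[str], bool],
    min_score: float = 0.5,  # assumed threshold, not taken from the patent
) -> Optional[EndpointDecision]:
    """Treat each pause as a potential endpoint and let a downstream
    completeness check decide whether it becomes the final endpoint."""
    words: List[str] = []
    for count, segment in enumerate(segments, start=1):
        words.append(segment.text)
        transcript = " ".join(words)
        if segment.endpoint_score < min_score:
            continue  # unlikely end of speech: keep listening
        # "Publish" the transcript to the downstream component.
        if is_complete(transcript):
            return EndpointDecision(transcript, count)
    return None  # no potential endpoint was confirmed


def toy_is_complete(transcript: str) -> bool:
    """Stand-in for downstream processing such as NLU: a transcript that
    ends on a dangling function word is treated as incomplete."""
    return transcript.split()[-1] not in {"by", "the", "to", "and"}


stream = [Segment("play music by", 0.7), Segment("the beatles", 0.9)]
decision = find_final_endpoint(stream, toy_is_complete)
```

In this toy run the first pause scores above the threshold but the downstream check rejects "play music by" as incomplete, so the sketch keeps accumulating speech and finalizes only at the second pause.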
