The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Aug. 08, 2023

Filed:

Jun. 29, 2021
Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

Stanislaw Ignacy Pasko, Zawonia, PL;

Pawel Zelazko, Gdansk, PL;

Cagdas Bak, Gdansk, PL;

Eli Joshua Fidler, Toronto, CA;

Michal Kowalczuk, Gdansk, PL;

Andrew Oberlin, Lynnwood, WA (US);

Ariya Rastrow, Seattle, WA (US);

Assignee:

Amazon Technologies, Inc., Seattle, WA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G10L 17/26 (2013.01); G10L 15/183 (2013.01); G10L 15/34 (2013.01); G10L 15/22 (2006.01);
U.S. Cl.
CPC ...
G10L 17/26 (2013.01); G10L 15/183 (2013.01); G10L 15/22 (2013.01); G10L 15/34 (2013.01);
Abstract

Some speech processing systems may handle some commands on-device rather than sending the audio data to a second device or system for processing. The first device may have limited speech processing capabilities sufficient for handling common language and/or commands, while the second device (e.g., an edge device and/or a remote system) may call on additional language models, entity libraries, skill components, etc. to perform additional tasks. An intermediate data generator may facilitate dividing speech processing operations between devices by generating a stream of data that includes a first-pass ASR output (e.g., a word or sub-word lattice) and other characteristics of the audio data such as whisper detection, speaker identification, media signatures, etc. The second device can perform the additional processing using the data stream; e.g., without using the audio data. Thus, privacy may be enhanced by processing the audio data locally without sending it to other devices/systems.


Find Patent Forward Citations

Loading…