
The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Patent No.:

US 12374326 B1

Date of Patent:

Jul. 29, 2025

Filed:

Apr. 28, 2023

Natural language generation

Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

Alexandros Potamianos, Santa Monica, CA (US);

Arijit Biswas, Dublin, CA (US);

Bonan Zheng, Torrance, CA (US);

Anushree Venkatesh, San Mateo, CA (US);

Yohan Jo, Sunnyvale, CA (US);

Vincent Auvray, Scotts Valley, CA (US);

Nikolaos Malandrakis, San Jose, CA (US);

Aaron Challenner, Melrose, MA (US);

Xinyan Zhao, Seattle, WA (US);

Angeliki Metallinou, Mountain View, CA (US);

David A Jara, Normandy Park, WA (US);

Jiahui Li, Sunnyvale, CA (US);

Ying Shi, Bellevue, WA (US);

Nikko Strom, Kirkland, WA (US);

Veerdhawal Pande, Walpole, MA (US);

Assignee:

Amazon Technologies, Inc., Seattle, WA (US);

Attorney:

Pierce Atwood LLP

Primary Examiner:

Abul K Azad

Int. Cl.

CPC ...

G10L 15/18 (2013.01); G10L 15/22 (2006.01);

U.S. Cl.

CPC ...

G10L 15/1815 (2013.01); G10L 15/22 (2013.01); G10L 2015/223 (2013.01);

Abstract

Techniques for determining when speech is directed at another individual of a dialog, and storing a representation of such user-directed speech for use as context when processing subsequently-received system-directed speech are described. A system receives audio data and/or video data and determines therefrom that speech in the audio data is user-directed. Based on this, the system determines whether the speech is able to be used to perform an action by the system. If the speech is able to be used to perform an action, the system stores a natural language representation of the speech. Thereafter, when the system receives system-directed speech, the system generates a rewrite of a natural language representation of the system-directed speech based on the previously-received user-directed speech. The system then determines output data responsive to the system-directed speech using the rewritten natural language representation.
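
The flow in the abstract can be illustrated with a toy sketch. This is not the patented implementation: the names Utterance, DialogContext, is_actionable, and ACTIONABLE_HINTS are hypothetical, the system_directed flag is assumed to come from an upstream audio/video classifier that is not shown, and the keyword check and string concatenation are simple stand-ins for whatever models the actual system uses to judge actionability and generate the rewrite.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Utterance:
    text: str              # natural language representation of the speech
    system_directed: bool  # assumed output of an upstream classifier (not shown)


# Hypothetical keyword list standing in for an "is this actionable?" model.
ACTIONABLE_HINTS = ("pizza", "movie", "music", "weather")


def is_actionable(text: str) -> bool:
    """Toy check for whether speech could be used to perform an action."""
    lowered = text.lower()
    return any(hint in lowered for hint in ACTIONABLE_HINTS)


@dataclass
class DialogContext:
    stored: List[str] = field(default_factory=list)

    def observe(self, utt: Utterance) -> Optional[str]:
        """Store actionable user-directed speech; rewrite system-directed speech."""
        if not utt.system_directed:
            if is_actionable(utt.text):
                self.stored.append(utt.text)
            return None  # user-directed speech produces no system response
        return self.rewrite(utt.text)

    def rewrite(self, text: str) -> str:
        """Toy rewrite: append the stored context to the request."""
        if not self.stored:
            return text
        return f"{text} (context: {'; '.join(self.stored)})"


ctx = DialogContext()
ctx.observe(Utterance("I feel like pizza tonight", system_directed=False))
ctx.observe(Utterance("How was your day?", system_directed=False))
rewritten = ctx.observe(Utterance("Order that for us", system_directed=True))
print(rewritten)
```

In this sketch only the first user-directed utterance is stored, because the second one does not pass the toy actionability check. The rewritten request would then be handed to whatever component determines the output data.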
