The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Feb. 04, 2025

Filed:

Mar. 22, 2023
Applicant:

Amazon Technologies, Inc., Seattle, WA (US);

Inventors:

Ning Xie, Bellevue, WA (US);

Han Li, Seattle, WA (US);

Qipin Chen, Bellevue, WA (US);

Yuan Chen, San Francisco, CA (US);

Lingyun Wang, Bothell, WA (US);

Assignee:

Amazon Technologies, Inc., Seattle, WA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/78 (2019.01); G06F 16/783 (2019.01); G06T 1/00 (2006.01); G06V 10/70 (2022.01); H04N 21/232 (2011.01);
U.S. Cl.
CPC ...
G06F 16/7867 (2019.01); G06F 16/783 (2019.01); G06T 1/0021 (2013.01); G06V 10/70 (2022.01); H04N 21/232 (2013.01);
Abstract

Techniques for performing a machine learning model based spatial-temporal adaptive shift for end-to-end text-video retrieval are described. According to some examples, a computer-implemented method includes receiving a video comprising a plurality of frames at a content delivery service; generating, by the content delivery service, a set of embeddings for each of a plurality of sections of each frame of the plurality of frames; determining, by a candidate selector machine learning model of the content delivery service, a proper subset of the plurality of sections of each frame of the plurality of frames for a time shift based on the set of embeddings; time shifting, by the content delivery service, the proper subset of the plurality of sections of each frame of the plurality of frames to generate time shifted frames; generating, by the content delivery service, an updated set of embeddings based on the time shifted frames; receiving a search request comprising input text from a user device; determining the video is a match for the search request based on the input text and the updated set of embeddings for the time shifted frames; and sending the video to the user device based on the match.


Find Patent Forward Citations

Loading…