The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Dec. 16, 2025
Filed:
Dec. 14, 2022
Scale Ai, Inc., San Francisco, CA (US);
Jihan Yin, San Francisco, CA (US);
Chiao-Lun Cheng, Taipei, TW;
Scale AI, Inc., San Francisco, CA (US);
Abstract
One embodiment of the present invention sets forth a technique for sampling from a dataset comprises. The technique includes determining a plurality of embeddings for a plurality of objects depicted in a plurality of images in the dataset. The technique also includes populating a tree structure with the plurality of embeddings by generating a first node that stores a first set of embeddings and generating a first plurality of nodes as children of the first node, where each node included in the first plurality of nodes stores a different subset of embeddings included in the first set of embeddings. The technique further includes sampling a subset of embeddings from the plurality of embeddings via a traversal of the tree structure and generating a sampled dataset that includes a subset of images based on the subset of embeddings and a number of images to be included in the sampled dataset.