The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jan. 07, 2025

Filed:

Sep. 19, 2023
Applicant:

Recruit Co., Ltd., Tokyo, JP;

Inventors:

Grace Fan, Wayne, PA (US);

Jin Wang, Cupertino, CA (US);

Yuliang Li, Cupertino, CA (US);

Dan Zhang, Sunnyvale, CA (US);

Renée J. Miller, Boston, MA (US);

Assignee:

RECRUIT CO., LTD., Toyko, JP;

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/00 (2019.01); G06F 16/22 (2019.01); G06F 16/242 (2019.01);
U.S. Cl.
CPC ...
G06F 16/221 (2019.01); G06F 16/243 (2019.01);
Abstract

Disclosed embodiments relate to systems, methods, and computer readable storage media for performing dataset discovery. Some embodiments may include accessing a data repository having a plurality of tables having cell values arranged in one or more columns and one or more rows, generating serialized sequences of the cell values that correspond to particular columns of the plurality of tables, inputting the serialized sequences into a natural language model, converting, using the natural language model, the serialized sequences into contextualized embeddings associated with the plurality of tables, storing the contextualized embeddings associated with the plurality of tables in one or more vector indices, receiving a query table, or generating an output of one or more candidate tables from the plurality of tables that are unionable with the received query table.


Find Patent Forward Citations

Loading…