The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 14, 2023

Filed:

Nov. 14, 2019
Applicant:

Microstrategy Incorporated, Vienna, VA (US);

Inventors:

Nannan Yu, Fairfax, VA (US);

Mohamed Diakite Pineda, Oakton, VA (US);

Ren-Jay Huang, Leesburg, VA (US);

Assignee:

MicroStrategy Incorporated, Vienna, VA (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/24 (2019.01); G06F 16/2455 (2019.01); G06F 17/16 (2006.01); G06F 17/18 (2006.01); G06K 9/62 (2022.01);
U.S. Cl.
CPC ...
G06F 16/2456 (2019.01); G06F 17/16 (2013.01); G06F 17/18 (2013.01); G06K 9/6215 (2013.01);
Abstract

Methods, systems, and apparatus, including computer programs encoded on computer-storage media, for inferring joins for data sets. In some implementations, a first data table and a second data table are identified. A first subset of records are selected from the first data table and a second subset of records are selected from the second data table. For fields of the first subset and the second subset, sets of feature values are generated indicating characteristics of the data in the fields. Based on the sets of feature values, one or more similarity score are determined, with each similarity score indicating a similarity of a column in the first data table with respect to a column in the second data table. Based on the one or more similarity scores, data indicating a recommendation to join one or more columns of the first data table with one or more columns of the second data table is provided for output by a computing device.


Find Patent Forward Citations

Loading…