The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also links to the full patent document (in Adobe Acrobat format, i.e., PDF).

Date of Patent:
Jan. 11, 2022

Filed:
May 5, 2020

Applicant:
International Business Machines Corporation, Armonk, NY (US);

Inventors:
Xin Tang, Ningbo, CN;
Kun Yan Yin, Ningbo, CN;
He Li, Beijing, CN;
Xueliang Zhao, Shanghai, CN;
Xin Xu, Beijing, CN;

Attorneys:
Primary Examiner:
Int. Cl.
CPC ...
G06F 16/33 (2019.01); G06F 16/93 (2019.01); G06F 16/338 (2019.01); G06F 16/35 (2019.01); G06F 16/9535 (2019.01); G06F 40/40 (2020.01); G06F 40/58 (2020.01);
U.S. Cl.
CPC ...
G06F 16/3344 (2019.01); G06F 16/338 (2019.01); G06F 16/353 (2019.01); G06F 16/93 (2019.01); G06F 16/9535 (2019.01); G06F 40/40 (2020.01); G06F 40/58 (2020.01);
Abstract

An approach is provided for searching multilingual documents. A first classification is determined that includes a first document and other document(s) by minimizing a first distance between a first numerical fixed length vector for the first document and other numerical fixed length vector(s) for other document(s). Based on a query and a natural language detected in the query, a second document is selected. A second stream modeling the second document is encoded as a second numerical fixed length vector. Based on a distance between the first and second numerical fixed length vectors being less than a threshold, the first classification is identified as including the second document. Documents in the first classification are ranked and presented as having content matching the second document's content. At least one of the ranked documents is expressed in a natural language different from the natural language of the second document.
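
The threshold test and ranking step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the Euclidean metric, the toy two-dimensional vectors, the document IDs, the threshold value, and the function name `rank_if_member` are all assumptions made for illustration, and the encoder that would turn a document stream into a fixed-length vector is omitted entirely.

```python
import math
from typing import Dict, List, Sequence, Tuple

Vector = Sequence[float]


def euclidean(a: Vector, b: Vector) -> float:
    """Euclidean distance between two fixed-length vectors (assumed metric)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def rank_if_member(
    first_id: str,
    classification: Dict[str, Vector],
    second_vec: Vector,
    threshold: float,
) -> List[Tuple[str, float]]:
    """Hypothetical helper: if the second document's vector lies within
    `threshold` of the first document's vector, treat the classification as
    including the second document and rank its members by distance to it.
    Otherwise return an empty list."""
    if euclidean(classification[first_id], second_vec) >= threshold:
        return []
    scored = [
        (doc_id, euclidean(vec, second_vec))
        for doc_id, vec in classification.items()
    ]
    # Closest documents first; ranking by distance is an assumption.
    return sorted(scored, key=lambda pair: pair[1])


# Toy classification whose members are written in different languages.
classification = {
    "doc_en": [0.0, 0.0],
    "doc_zh": [0.3, 0.4],
    "doc_de": [1.0, 0.0],
}
second_vec = [0.0, 0.1]  # made-up vector for the document selected by the query

ranked = rank_if_member("doc_en", classification, second_vec, threshold=0.5)
for doc_id, dist in ranked:
    print(f"{doc_id}: {dist:.3f}")
```

Because membership is decided purely on vector distance, a ranked result such as `doc_zh` can be in a different natural language from the query document, which is the cross-lingual behavior the abstract describes.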

