The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
Mar. 22, 2022
Filed:
Nov. 20, 2018
Applicant:
Wipro Limited, Bangalore, IN;
Inventors:
Cyrus Andre Dsouza, Mapusa, IN;
Manu Kuchhal, Gurgaon, IN;
Assignee:
Wipro Limited, Bangalore, IN;
Abstract
This disclosure relates generally to information retrieval technology, and more particularly to the creation of a taxonomy to facilitate subsequent search and retrieval of information. In one embodiment, an information retrieval device is disclosed that comprises a processor and a memory that stores instructions which, on execution, cause the processor to receive an input corpus. Thereafter, input document clusters are generated from top input n-grams associated with the input corpus. Further, top-ranked input n-grams are determined from the top input n-grams. Thereafter, an external corpus is identified based on the top-ranked input n-grams. An enriched corpus (the external corpus combined with the input corpus) is clustered based on top enriched n-grams associated with the enriched corpus to generate enriched document clusters. Further, for each n-gram of the enriched corpus, corresponding n-gram clusters are determined. Finally, a taxonomy is created based on the input document clusters, the enriched document clusters, the n-gram clusters, and the top-ranked input n-grams.
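The sequence of steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say how n-grams are ranked, how documents are clustered, or how the external corpus is found, so everything here is an assumption made for illustration. The sketch uses word bigrams, ranks them by document frequency, places a document in the cluster of every top n-gram it contains, simulates the external corpus by filtering a hypothetical `external_pool` list, and groups n-grams by co-occurrence among the top enriched n-grams only. All function and key names are hypothetical.

```python
from collections import Counter, defaultdict

def ngrams(text, n=2):
    """Return the word n-grams of a text as space-joined strings."""
    words = text.lower().split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]

def top_ngrams(corpus, k, n=2):
    """Top-k n-grams by document frequency (an assumed ranking criterion)."""
    df = Counter()
    for doc in corpus:
        df.update(set(ngrams(doc, n)))
    # Descending frequency, then alphabetical order to break ties.
    ranked = sorted(df.items(), key=lambda kv: (-kv[1], kv[0]))
    return [gram for gram, _ in ranked[:k]]

def cluster_documents(corpus, grams, n=2):
    """Map each top n-gram to the indices of the documents containing it."""
    clusters = defaultdict(list)
    for idx, doc in enumerate(corpus):
        present = set(ngrams(doc, n))
        for gram in grams:
            if gram in present:
                clusters[gram].append(idx)
    return dict(clusters)

def cluster_ngrams(corpus, grams, n=2):
    """Group each n-gram with the other top n-grams it co-occurs with."""
    related = {gram: set() for gram in grams}
    for doc in corpus:
        present = set(ngrams(doc, n)) & set(grams)
        for gram in present:
            related[gram] |= present - {gram}
    return {gram: sorted(others) for gram, others in related.items()}

def build_taxonomy(input_corpus, external_pool, k=5, top_ranked=2):
    """Follow the abstract's steps; each step uses a simplified stand-in."""
    top_input = top_ngrams(input_corpus, k)
    input_clusters = cluster_documents(input_corpus, top_input)
    ranked_input = top_input[:top_ranked]
    # Assumption: the "external corpus" is whatever in external_pool
    # mentions at least one top-ranked input n-gram.
    external = [d for d in external_pool
                if any(g in ngrams(d) for g in ranked_input)]
    enriched = list(input_corpus) + external
    top_enriched = top_ngrams(enriched, k)
    return {
        "top_ranked_input_ngrams": ranked_input,
        "input_document_clusters": input_clusters,
        "enriched_document_clusters": cluster_documents(enriched, top_enriched),
        "ngram_clusters": cluster_ngrams(enriched, top_enriched),
    }

input_corpus = [
    "machine learning models need data",
    "machine learning improves search",
    "search engines rank documents",
]
external_pool = [
    "machine learning for taxonomy creation",
    "cooking pasta at home",
]
taxonomy = build_taxonomy(input_corpus, external_pool, k=3, top_ranked=1)
```

In this toy run only the first document in `external_pool` joins the enriched corpus, because it is the only one containing the top-ranked bigram. A realistic system would likely replace the frequency ranking and membership-based clustering with weighted scoring and a proper clustering algorithm; the sketch only shows how the four outputs named in the abstract fit together.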