The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.
The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.
Patent No.:
Date of Patent:
Oct. 09, 2018
Filed:
Feb. 14, 2017
Tata Consultancy Services Limited, Mumbai, IN;
Mandar Shrikant Kulkarni, Pune, IN;
Anand Sriraman, Pune, IN;
Rahul Kumar, Pune, IN;
Kanika Kalra, Pune, IN;
Shirish Subhash Karande, Pune, IN;
Sachin Premsukh Lodha, Pune, IN;
Tata Consultancy Services Limited, Mumbai, IN;
Abstract
The present application provides a method and system for unsupervised word image clustering, comprises capturing one or more image wherein the one or more image comprises at least one word images. Extracting at least one feature vector using an untrained convolution neural network architecture, wherein the convolution filters are initialized by random filter based deep learning techniques using Gaussian random variable with zero mean and unit standard deviation, and wherein the convolution filters are constrained to sum to zero. The extracted feature vectors are used for clustering, wherein clustering is performed in two stages. First stage includes clustering word images which are similar using a graph connected component. Second stage clustering includes clustering a remaining word images which are not clustered during the first stage by evaluating the remaining images against the clusters formed during the first stage and assigning them to clusters based on the evaluation.