The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e., PDF).
Patent No.:
Date of Patent:
Apr. 23, 2019
Filed:
Mar. 31, 2017
Applicant:
Tata Consultancy Services Limited, Mumbai, IN;
Inventors:
Purushotam Gopaldas Radadia, Pune, IN;
Kanika Kalra, Pune, IN;
Rahul Kumar, Pune, IN;
Anand Sriraman, Pune, IN;
Gangadhara Reddy Sirigireddy, Pune, IN;
Shrikant Joshi, Pune, IN;
Shirish Subhash Karande, Pune, IN;
Sachin Premsukh Lodha, Pune, IN;
Assignee:
Tata Consultancy Services Limited, Mumbai, IN;
Abstract
The disclosure generally relates to transcription of spoken words, and more particularly to a system and method for transcription of spoken words using a multilingual mismatched crowd. The process comprises collecting multi-scripted noisy transcriptions of the spoken word from workers of the multilingual mismatched crowd who are unfamiliar with the spoken language. The collected transcriptions are mapped to phoneme sequences in the source language using a script-specific grapheme-to-phoneme model. Further, the process builds script-specific, worker-specific, and global insertion-deletion-substitution (IDS) channels for the multi-scripted transcriptions. Furthermore, the disclosure determines the reputation of workers in order to allocate the transcription task. The reputation is determined from the word belief. The word belief is the ratio of the likelihood of the mapped phoneme sequences of the transcriptions given the current estimate of the word to the sum of the likelihoods of the mapped phoneme sequences of the transcriptions given the phoneme sequence of each dictionary word.
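The word-belief ratio described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The function names, the phoneme symbols, the toy dictionary, and the channel probabilities (`p_sub`, `p_ins`, `p_del`) are all assumptions made for illustration. The channel score uses only the single best alignment, which is a simplification of a full IDS channel that would sum over all alignments.

```python
import math


def ids_log_likelihood(observed, reference, p_sub=0.1, p_ins=0.05, p_del=0.05):
    """Toy IDS channel score (hypothetical parameters, not from the patent).

    Returns the log-score of the best alignment that turns the `reference`
    phoneme sequence into the `observed` one using matches, substitutions,
    insertions, and deletions.
    """
    p_match = 1.0 - p_sub - p_del
    n, m = len(reference), len(observed)
    dp = [[float("-inf")] * (m + 1) for _ in range(n + 1)]
    dp[0][0] = 0.0
    for i in range(n + 1):
        for j in range(m + 1):
            if i > 0 and j > 0:
                # Reference phoneme kept (match) or replaced (substitution).
                p = p_match if reference[i - 1] == observed[j - 1] else p_sub
                dp[i][j] = max(dp[i][j], dp[i - 1][j - 1] + math.log(p))
            if i > 0:
                # Reference phoneme dropped by the worker (deletion).
                dp[i][j] = max(dp[i][j], dp[i - 1][j] + math.log(p_del))
            if j > 0:
                # Extra phoneme added by the worker (insertion).
                dp[i][j] = max(dp[i][j], dp[i][j - 1] + math.log(p_ins))
    return dp[n][m]


def word_belief(transcriptions, dictionary, estimate):
    """Likelihood of the transcriptions given `estimate`, divided by the
    sum of their likelihoods given every dictionary word."""
    log_like = {
        word: sum(ids_log_likelihood(t, phonemes) for t in transcriptions)
        for word, phonemes in dictionary.items()
    }
    # Log-sum-exp keeps the normalisation numerically stable.
    peak = max(log_like.values())
    log_total = peak + math.log(sum(math.exp(v - peak) for v in log_like.values()))
    return math.exp(log_like[estimate] - log_total)


# Hypothetical dictionary and noisy worker transcriptions, already mapped to phonemes.
dictionary = {
    "cat": ["k", "ae", "t"],
    "cot": ["k", "aa", "t"],
    "dog": ["d", "ao", "g"],
}
transcriptions = [["k", "ae", "t"], ["k", "ae", "d"], ["k", "aa", "t"]]
beliefs = {w: word_belief(transcriptions, dictionary, w) for w in dictionary}
```

In this sketch the beliefs over the dictionary sum to one, so a belief near one signals that the workers' transcriptions agree strongly on the current estimate. How that belief is converted into a worker reputation is not specified in the abstract and is left out here.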