Palo Alto, CA, United States of America

Sauraj Goswami


Average Co-Inventor Count = 1.9

ph-index = 5

Forward Citations = 109(Granted Patents)


Company Filing History:


Years Active: 2008-2016

where 'Filed Patents' based on already Granted Patents

11 patents (USPTO):

Title: Innovations of Sauraj Goswami

Introduction

Sauraj Goswami is a notable inventor based in Palo Alto, California. He has made significant contributions to the field of technology, particularly in document processing and language identification. With a total of 11 patents to his name, his work has had a considerable impact on how documents are analyzed and processed.

Latest Patents

One of his latest patents focuses on the clustering of near-duplicate documents. This innovation involves clustering documents that are likely to be near-duplicates based on document vectors representing word-occurrence patterns in a low-dimensional space. The edit distance between documents is defined by comparing their document vectors. In this process, initial clusters are formed by applying a first edit-distance constraint relative to a root document of each cluster. These initial clusters can then be merged based on a second edit-distance constraint that limits the maximum edit distance between any two documents in the cluster. This second constraint can be determined by comparing cluster structures rather than individual documents.

Another significant patent by Goswami addresses language identification for documents containing multiple languages. This invention allows for the identification of multiple non-overlapping languages within a single document. For each candidate language, a set of non-overlapping languages is defined. The document is analyzed under the hypothesis that it is entirely in one language or partially in one language while the rest is in a different, non-overlapping language. The languages of the document are identified by comparing these competing hypotheses across various language pairs. Additionally, transitions between non-overlapping character sets are used to segment the document, with each segment scored separately for a subset of candidate languages.

Career Highlights

Throughout his career, Sauraj Goswami has worked with prominent companies, including IBM and Stratify, Inc. His experience in these organizations has contributed to his expertise in the field of technology and innovation.

Collaborations

Goswami has collaborated with notable individuals such as You-Chin Gene Fuh and James Zu-Chia Teng. These collaborations have further enriched his work and contributed to the advancements in his areas of expertise.

Conclusion

Sauraj Goswami's contributions to technology through his patents and collaborations highlight his innovative spirit and dedication to improving document processing and language identification. His work continues to influence the field and pave the way for future advancements.

This text is generated by artificial intelligence and may not be accurate.
Please report any incorrect information to support@idiyas.com
Loading…