The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Mar. 15, 2016

Filed:

Dec. 01, 2014
Applicant:

Xerox Corporation, Norwalk, CT (US);

Inventors:

Nicolas Monet, Montbonnot-Saint-Martin, FR;

Johan Clier, Meylan, FR;

Assignee:

XEROX CORPORATION, Norwalk, CT (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
H04L 9/00 (2006.01); G06F 21/60 (2013.01);
U.S. Cl.
CPC ...
H04L 9/008 (2013.01); G06F 21/60 (2013.01);
Abstract

A system and method for text language identification allow private information of a server and a client to be kept secret from each other. An encrypted score for each of a plurality of languages is received by the server from the client. The encrypted scores are generated by homomorphic addition of encrypted frequencies of n-grams in a list of n-grams extracted from text. The unencrypted list is not provided to the server. The encrypted frequencies of the n-grams in the list are extracted using encrypted resources which, for each of the plurality of languages, include an encrypted frequency for each of a set of n-grams. At the server, the encrypted scores are decrypted to generate unencrypted scores and information is provided to the client based on the unencrypted scores from which the client is able to identify a language for the text.


Find Patent Forward Citations

Loading…