The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent: Jan. 12, 2016

Filed: Jan. 14, 2013

Applicant: Xerox Corporation, Norwalk, CT (US)

Inventors: Markos Mylonakis, Grenoble, FR; Nicola Cancedda, Grenoble, FR

Assignee: XEROX CORPORATION, Norwalk, CT (US)

Attorney:

Primary Examiner:

Int. Cl. CPC ... G06F 17/27 (2006.01); G06F 17/28 (2006.01)

U.S. Cl. CPC ... G06F 17/2854 (2013.01); G06F 17/2818 (2013.01)

Abstract

A method adapted to multiple corpora includes training a statistical machine translation model which outputs a score for a candidate translation, in a target language, of a text string in a source language. The training includes learning a weight for each of a set of lexical coverage features that are aggregated in the statistical machine translation model. The lexical coverage features include a lexical coverage feature for each of a plurality of parallel corpora. Each of the lexical coverage features represents a relative number of words of the text string for which the respective parallel corpus contributed a biphrase to the candidate translation. The method may also include learning a weight for each of a plurality of language model features, the language model features comprising one language model feature for each of the domains.
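
As a rough illustration of the lexical coverage features described in the abstract, the sketch below computes, for each parallel corpus, the fraction of source words covered by biphrases that the corpus contributed, and then aggregates the features as a weighted sum. It is a minimal sketch under stated assumptions, not the patented implementation: the `Biphrase` class, the function names, the corpus names, and the hand-picked weights are all hypothetical, and the weight-learning step that the abstract describes is not shown.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Tuple


@dataclass(frozen=True)
class Biphrase:
    """Hypothetical biphrase: a source phrase, its target phrase, and the
    set of parallel corpora assumed to have contributed it."""
    source_words: Tuple[str, ...]
    target_words: Tuple[str, ...]
    corpora: FrozenSet[str]


def lexical_coverage(derivation: List[Biphrase], corpora: List[str]) -> Dict[str, float]:
    """For each corpus, return the fraction of source words covered by
    biphrases that the corpus contributed to the candidate translation."""
    total = sum(len(bp.source_words) for bp in derivation)
    if total == 0:
        return {c: 0.0 for c in corpora}
    return {
        c: sum(len(bp.source_words) for bp in derivation if c in bp.corpora) / total
        for c in corpora
    }


def weighted_score(features: Dict[str, float], weights: Dict[str, float]) -> float:
    """Aggregate the features as a weighted sum (assumed linear combination)."""
    return sum(weights[name] * value for name, value in features.items())


# Toy candidate translation built from three biphrases (illustrative data only).
derivation = [
    Biphrase(("le", "brevet"), ("the", "patent"), frozenset({"legal"})),
    Biphrase(("est",), ("is",), frozenset({"news"})),
    Biphrase(("accordé",), ("granted",), frozenset({"legal", "news"})),
]

features = lexical_coverage(derivation, ["legal", "news"])
# Weights are hand-picked here; in the abstract they are learned during training.
score = weighted_score(features, {"legal": 2.0, "news": 1.0})
print(features, score)
```

In this toy case the "legal" corpus covers three of the four source words and the "news" corpus covers two, so a biphrase found in both corpora counts toward both features.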