The patent badge is an abbreviated version of the USPTO patent document. The patent badge does contain a link to the full patent document.

The patent badge is an abbreviated version of the USPTO patent document. The patent badge covers the following: Patent number, Date patent was issued, Date patent was filed, Title of the patent, Applicant, Inventor, Assignee, Attorney firm, Primary examiner, Assistant examiner, CPCs, and Abstract. The patent badge does contain a link to the full patent document (in Adobe Acrobat format, aka pdf). To download or print any patent click here.

Date of Patent:
Jul. 15, 2014

Filed:

Jul. 25, 2011
Applicants:

Nicola Cancedda, Grenoble, FR;

Sara Stymne, Linköping, SE;

Inventors:

Nicola Cancedda, Grenoble, FR;

Sara Stymne, Linköping, SE;

Assignee:

Xerox Corporation, Norwalk, CT (US);

Attorney:
Primary Examiner:
Int. Cl.
CPC ...
G06F 17/28 (2006.01);
U.S. Cl.
CPC ...
Abstract

A method and a system for making merging decisions for a translation are disclosed which are suited to use where the target language is a productive compounding one. The method includes outputting decisions on merging of pairs of words in a translated text string with a merging system. The merging system can include a set of stored heuristics and/or a merging model. In the case of heuristics, these can include a heuristic by which two consecutive words in the string are considered for merging if the first word of the two consecutive words is recognized as a compound modifier and their observed frequency fas a closed compound word is larger than an observed frequency fof the two consecutive words as a bigram. In the case of a merging model, it can be one that is trained on features associated with pairs of consecutive tokens of text strings in a training set and predetermined merging decisions for the pairs. A translation in the target language is output, based on the merging decisions for the translated text string.


Find Patent Forward Citations

Loading…