The patent badge is an abbreviated version of the USPTO patent document. It covers the following: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).

Date of Patent:
Mar. 02, 2010

Filed:
May 26, 2005

Applicants:

Cyril Goutte, Le Versoud, FR;

Michel Simard, Meylan, FR;

Kenji Yamada, Redondo Beach, CA (US);

Eric Gaussier, Eybens, FR;

Arne Mauser, Aachen, DE;

Inventors:

Cyril Goutte, Le Versoud, FR;

Michel Simard, Meylan, FR;

Kenji Yamada, Redondo Beach, CA (US);

Eric Gaussier, Eybens, FR;

Arne Mauser, Aachen, DE;

Assignee:

Xerox Corporation, Norwalk, CT (US);

Attorney:
Primary Examiner:
Assistant Examiner:
Int. Cl.
CPC ...
G06F 17/28 (2006.01); G06F 17/27 (2006.01);
U.S. Cl.
CPC ...
Abstract

Methods are disclosed for performing proper word alignment that satisfy constraints of coverage and transitive closure. Initially, a translation matrix which defines word association measures between source and target words of a corpus of bilingual translations of source and target sentences is computed. Subsequently, in a first method, the association measures in the translation matrix are factorized and orthogonalized to produce cepts for the source and target words, which resulting matrix factors may then be, optionally, multiplied to produce an alignment matrix. In a second method, the association measures in the translation matrix are thresholded, and then closed by transitivity, to produce an alignment matrix, which may then be, optionally, factorized to produce cepts. The resulting cepts or alignment matrices may then be used by any number of natural language applications for identifying words that are properly aligned.
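
The second method in the abstract (threshold the association measures, then close by transitivity) can be illustrated with a minimal sketch. The function names, the threshold value, and the toy matrix below are assumptions made for illustration and are not taken from the patent. The sketch treats source and target words as nodes of a bipartite graph and reads "closed by transitivity" as making every connected component fully aligned, so each component plays the role of one cept. It does not enforce the coverage constraint.

```python
def threshold(matrix, tau):
    """Binarize association measures: 1 where the measure is >= tau, else 0."""
    return [[1 if value >= tau else 0 for value in row] for row in matrix]


def close_by_transitivity(align):
    """Return the transitive closure of a binary source-by-target alignment.

    Source word i and target word j end up aligned whenever they are
    connected through any chain of alignment links (union-find on the
    bipartite graph of source and target nodes).
    """
    n_src = len(align)
    n_tgt = len(align[0]) if align else 0
    # Nodes 0..n_src-1 are source words; n_src..n_src+n_tgt-1 are target words.
    parent = list(range(n_src + n_tgt))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i in range(n_src):
        for j in range(n_tgt):
            if align[i][j]:
                parent[find(i)] = find(n_src + j)

    return [
        [1 if find(i) == find(n_src + j) else 0 for j in range(n_tgt)]
        for i in range(n_src)
    ]


# Toy association measures (rows: source words, columns: target words).
association = [
    [0.9, 0.1, 0.0],
    [0.8, 0.7, 0.0],
    [0.0, 0.0, 0.6],
]
aligned = threshold(association, 0.5)
closed = close_by_transitivity(aligned)
```

In this toy case the first source word is linked to the first target word, and the second source word is linked to the first and second target words, so the closure also aligns the first source word with the second target word. The third source and target words form a separate group.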

