The patent badge is an abbreviated version of the USPTO patent document. It covers the following fields: patent number, date the patent was issued, date the patent was filed, title of the patent, applicant, inventor, assignee, attorney firm, primary examiner, assistant examiner, CPCs, and abstract. The patent badge also contains a link to the full patent document (in Adobe Acrobat format, i.e. PDF).
Patent No.:
Date of Patent:
May 31, 2011
Filed:
Jul. 01, 2004
Heather D. Achilles, Hudson, NH (US);
Steven R. Willis, Acton, MA (US);
Patrick R. McManus, Boston, MA (US);
Charles R. Morgan, Manchester, MA (US);
Jan-Christian Nelson, Boston, MA (US);
Kenneth R. Ballou, Framingham, MA (US);
Eugene Kuznetsov, Cambridge, MA (US);
Heather D. Achilles, Hudson, NH (US);
Steven R. Willis, Acton, MA (US);
Patrick R. McManus, Boston, MA (US);
Charles R. Morgan, Manchester, MA (US);
Jan-Christian Nelson, Boston, MA (US);
Kenneth R. Ballou, Framingham, MA (US);
Eugene Kuznetsov, Cambridge, MA (US);
International Business Machines Corporation, Armonk, NY (US);
Abstract
Systems, methods and apparatus provide a character processor for processing markup language data, such as XML data, by receiving a character stream of markup language data and applying sequences of characters of the character stream to a set of state machines. The set of state machines includes a plurality of construct state machines responsible for processing respective markup language constructs identified by the sequences of characters. The character processor produces, from application of the sequences of characters to the set of state machines, an intermediate representation of the markup language constructs identified by the sequences of characters of the character stream of markup language data. The intermediate representation contains encoded items containing type, length, value (TLV) representations representative of constructs within the character stream of markup language data.
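The idea in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation: the type codes, the 1-byte type and 2-byte length layout, and the names `to_intermediate`, `encode_tlv`, and `decode_tlv` are all assumptions made for illustration, and the "set of state machines" is collapsed into a single small machine that recognizes only start tags, end tags, and text.

```python
from enum import Enum, auto

# Hypothetical type codes; the abstract does not specify any.
T_START_TAG = 1
T_END_TAG = 2
T_TEXT = 3


class State(Enum):
    TEXT = auto()       # collecting character data between tags
    TAG_OPEN = auto()   # just saw '<', deciding between start and end tag
    START_TAG = auto()  # collecting the content of a start tag
    END_TAG = auto()    # collecting the name of an end tag


def encode_tlv(type_code: int, value: str) -> bytes:
    """Encode one item as 1 type byte, 2 big-endian length bytes, then UTF-8 value.

    The layout is an assumption; values longer than 65535 bytes are not supported.
    """
    data = value.encode("utf-8")
    return bytes([type_code]) + len(data).to_bytes(2, "big") + data


def to_intermediate(chars: str) -> bytes:
    """Apply characters one at a time to a state machine and emit TLV items."""
    state = State.TEXT
    buf = []
    out = bytearray()

    def flush(type_code: int) -> None:
        # Emit the buffered characters as one encoded item, if any.
        if buf:
            out.extend(encode_tlv(type_code, "".join(buf)))
            buf.clear()

    for ch in chars:
        if state is State.TEXT:
            if ch == "<":
                flush(T_TEXT)
                state = State.TAG_OPEN
            else:
                buf.append(ch)
        elif state is State.TAG_OPEN:
            if ch == "/":
                state = State.END_TAG
            else:
                buf.append(ch)
                state = State.START_TAG
        elif state is State.START_TAG:
            if ch == ">":
                flush(T_START_TAG)
                state = State.TEXT
            else:
                buf.append(ch)
        elif state is State.END_TAG:
            if ch == ">":
                flush(T_END_TAG)
                state = State.TEXT
            else:
                buf.append(ch)

    if state is State.TEXT:
        flush(T_TEXT)  # trailing text after the last tag
    return bytes(out)


def decode_tlv(data: bytes) -> list:
    """Walk the encoded bytes and return (type, length, value) tuples."""
    items = []
    i = 0
    while i < len(data):
        type_code = data[i]
        length = int.from_bytes(data[i + 1:i + 3], "big")
        value = data[i + 3:i + 3 + length].decode("utf-8")
        items.append((type_code, length, value))
        i += 3 + length
    return items


if __name__ == "__main__":
    print(decode_tlv(to_intermediate("<a>hi</a>")))
```

Under these assumptions, the input `<a>hi</a>` yields three encoded items: a start tag `a`, text `hi`, and an end tag `a`. A fuller treatment along the lines of the abstract would use a separate state machine per construct (attributes, comments, processing instructions, and so on) rather than one combined machine.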