A system and method for translating an annotated source document in a first natural language into a target document in a second natural language having corresponding annotations. The system includes computer storage; a computer receiving module that receives input textual information in the first language, including annotations, and stores it in the computer storage; and a translation engine that creates a first token string including first language tokens, annotation tokens that apply to the first language tokens, and ending (end-of-sentence) tokens. Prior to translation, the annotation tokens are removed from the first token string and a plurality of annotation records are created for the first language tokens, each annotation record linking one of the first language tokens to each of the annotation tokens that apply to it. After the first language tokens have been translated into second language tokens forming a second token string, the annotation tokens are inserted at appropriate locations in the second token string. These locations are determined using the end-of-sentence tokens and the annotation records. A target document is then produced in the second language from the second token string and the annotations, thus recreating the annotations of the source document in the target document.
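The strip, translate, and reinsert flow described above can be illustrated with a minimal sketch. Everything named here is hypothetical and not taken from the patent: the `Token` class, the `strip_annotations`, `toy_translate`, and `reinsert_annotations` functions, and the tag-style rendering of annotations are all assumptions made for illustration. The "translation" step is only a stand-in that uppercases words and reverses their order within each sentence, so that reordering is exercised; a real translation engine would supply its own source-to-target token alignment.

```python
from dataclasses import dataclass


@dataclass
class Token:
    kind: str  # "word", "open", "close", or "eos" (end of sentence)
    text: str


def strip_annotations(tokens):
    """Remove annotation tokens and build one annotation record per covered word.

    Assumes well-formed input (every "close" has a matching earlier "open").
    Returns the plain token list and a dict mapping a plain-token index to
    the tuple of annotation names that apply to that word.
    """
    plain, records, active = [], {}, []
    for tok in tokens:
        if tok.kind == "open":
            active.append(tok.text)
        elif tok.kind == "close":
            active.remove(tok.text)
        else:
            if tok.kind == "word" and active:
                records[len(plain)] = tuple(active)
            plain.append(tok)
    return plain, records


def toy_translate(plain):
    """Stand-in for a translation engine (NOT a real translator).

    Uppercases each word and reverses word order within each sentence.
    Each output item is (target_token, source_index), i.e. an alignment.
    """
    target, sentence = [], []
    for i, tok in enumerate(plain):
        if tok.kind == "eos":
            target.extend(reversed(sentence))
            target.append((tok, i))
            sentence = []
        else:
            sentence.append((Token("word", tok.text.upper()), i))
    target.extend(reversed(sentence))  # trailing words with no eos token
    return target


def reinsert_annotations(target, records):
    """Reinsert annotations around target words using the records.

    Open annotations are closed whenever the applicable set changes and
    always before an end-of-sentence token, so none crosses a sentence.
    """
    out, active = [], ()
    for tok, src in target:
        wanted = records.get(src, ()) if tok.kind == "word" else ()
        if wanted != active:
            out.extend(f"</{name}>" for name in reversed(active))
            out.extend(f"<{name}>" for name in wanted)
            active = wanted
        out.append(tok.text)
    out.extend(f"</{name}>" for name in reversed(active))
    return " ".join(out)


source = [
    Token("open", "b"), Token("word", "big"), Token("close", "b"),
    Token("word", "dog"), Token("eos", "."),
]
plain, records = strip_annotations(source)
print(reinsert_annotations(toy_translate(plain), records))
# prints: DOG <b> BIG </b> .
```

In this sketch the bold annotation follows the word "big" to its new position after reordering, and the end-of-sentence token forces any open annotation to close before the sentence boundary. Keying the records by source-token index is one possible design; it relies on the translation step reporting which source token each target token came from.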