GALE: Translation
LDC creates translations for a large volume of newswire, broadcast news and broadcast conversation transcripts, and weblog and newsgroup text. All translations are created by outsourcing to selected agencies.
All incoming data undergoes a rigid quality control procedure to ensure translation quality, and LDC staff perform manual corrections on devtest and evaluation data. Enhancement of evaluation data includes the insertion of translation variants (alternates) into the translations to represent ambiguity in the source.
LDC also collects a large volume of parallel text from existing sources.
LDC manually identifies Sentence Units (SUs) in the source text before it is sent out for translation, so all incoming translations are perfectly aligned.
Guidelines:
GALE Arabic Translation Guidelines V2.4 - Updated 06/14/2007
GALE Chinese Translation Guidelines V2.5 - Updated 08/13/2007
GALE Arabic Alternation Guidelines V1.0 - Updated 11/10/2006
GALE Chinese Alternation Guidelines V1.1 - Updated 11/27/2006














