EARS Arabic Website


EARS January 16, 2004 Meeting (LDC)


Collection


Transcription Tool

AMADAT is an Arabic Multi-Dialectal Transcription tool designed and developed at LDC in 2003. The tool's name, AMADAT, relates the English acronym 'AMDT' to the Arabic roots AMD and AMDD, which signify 'extent/extending' and 'scope-span/spanning' (cf. Al-Mawrid Arabic English Dictionary, 2000, page 168). These semantic notions are closely related to the tool's main transcription and annotation functions.

AMADAT provides a multi-layered transcription that spans the diglossic gap and linguistic distance which exist between any and all Arabic dialects and Modern Standard Arabic (MSA) by extending links between the two sets of linguistic structures and connecting transcribed dialectal forms to their underlying MSA-based forms whenever applicable. AMADAT is designed to describe the linguistic distance existing between MSA and any given Arabic dialect by keeping a close annotation of the complicated and idiosyncratic linguistic variation features occurring between individual Arabic speakers in multidialectal communication.

AMADAT uses a two-tier transcription, which provides a Modern Standard Arabic-based transcription (MSAT) of speech data in a first pass followed by a second annotation pass, which uses an Arabic Orthographic System- based Transliteration (AOST) and provides pertinent phonemic and main missing pronunciation features of the target dialect (such as distinctive dialect short vowels, consonantal sociolinguistic variation, shaddah, etc.) AMADAT has three mutually-exclusive operation modes: (a) GREEN PASS for the MSAT, which uses an Arabic keyboard; (b) YELLOW PASS for the AOST, which uses a Latin keyboard; and (c) RED PASS which uses a Latin keyboard and serves for editing and correction of MSAT or/and AOST errors and typos.


Transcription Guidelines

 

References

 

RT-04 Guidelines: Levantine Arabic Telephone Speech Transcription conventions
(This is a short version of the RT-03 guidelines)