Annotation Tools

To support efficient large-scale, multi-user manual annotation, LDC has developed a suite of tools for a wide range of annotation tasks including transcription, metadata extraction and entity, relation and event tagging.

The Annotation Graph Toolkit (AGTK) is a primary resource for annotation tool development at LDC. AGTK provides a collection of software components that support rapid development of task-specific annotation tools. Unlike the traditional approach of designing and implementing data structures and user interfaces for new tasks from scratch, AGTK allows developers to quickly prototype tools and define data formats.  The flexible nature of the AG model means that data representations can be rapidly modified in response to evolving annotation task definitions.  Unlike monolithic, general-purpose tools that handle a variety of annotation tasks, AGTK allows for rapid deployment of highly specialized, task-specific tools that maximize user interface ergonomics and improve the speed and accuracy of annotation.

The current AGTK toolbase contains annotation tools developed to support corpus development for DARPA TIDES, EARS and GALE; REFLEX LCTL and ACE, NIST RT, and several other programs.

The current AGTK toolbase is available for free download by external users here.