Acknowledgments This wiki describes tools and formats for creating and managing linguistic annotations. `Linguistic annotation<nowiki>‘</nowiki> covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions -- audio, video and/or physiological recordings -- or it may be textual. The added notations may include transcriptions of all sorts (from phonetic features to discourse structures), part-of-speech and sense tagging, syntactic analysis, "named entity" identification, co-reference annotation, and so on. The focus is on tools which have been widely used for constructing annotated linguistic databases, and on the formats commonly adopted by such tools and databases.
For quicker reference, there's a page with transcription and annotation Tools only.
|Annotation Graph Toolkit (AGTK)||X||X||X|
|Classical Text Editor||X||X|
*DAISY (FTP/U,W) --- DAMSL (FTRC/U,W) --- Delta (TP/U,W) --- Dexter (T) --- DRI (R) E *EAGLES (FR) --- ELAN (FTD) --- E-MELD (R) --- Emu (FTDP/U,W) --- EXMARaLDA (FTDP/U,W,M) F *Festival (TD/U) --- FLEX (Fieldworks Language Explorer) --- FORM (C) --- FSA's (TD) G *GATE (FTDP/U) --- Gsearch (T/U) H *HIAT (FTDPRC/W) --- HIAT-DOS (T) (Review) --- Hyperlex (TP/U) I *Intex (F/U,W,M) --- ISIP (TDP/U) --- ISLE K *Knowtator (TDPC/W,U,M) L *LACITO Linguistic Data Archiving Project (FTD) --- LAF Linguistic Annotation Framework --- LDC (FTDPRC) --- LT (T/U,W) M *MacShapa (TP) --- MacVissta (TD) --- MATE (FT) --- MediaStreams (P) --- MediaTagger (P) --- MICASE (TDC/W) --- MMAX (TD) --- MPEG (FPR) --- MPI (FT/UWM) --- Multitext (F) N *NEGRA (FTPC/U) --- NITE O *Observer (T/W) P *Partitur (FT) --- PAULA (F) --- Praat (TD/U,W,M) R *RSTTool (TD) S *SABLE (FP) --- SACODEYL Transcriptor and Annotator --- SAMPA (C) --- SGREP (TDP/U,W) --- SignStream (TDP/M) --- SIL (TDPF/W,M) --- SLAM (TDP/W) --- SMDL (P) --- SNACK (TDP/U,W,M) --- SUSANNE (CP) --- SyncWriter (T) (Review) T *TalkBank (R) --- TASX (TD/U,W,M) --- TEI (F) --- Tipster (F) --- Transcriber (TDP/U,W,M) (Review) --- Transana (T) (Review) --- Transformer (TDP) --- TransTool (TD/U,W) --- Treebank (C) --- TSNLP (FT) --- TUSNELDA U *Unicode (RC) V
F: a systematically-documented annotation format T: an available tool for creation, display or search (W=Windows, U=Unix, M=MacOS) D: a tool is downloadable P: there is a citeable paper which documents the format/system R: other kinds of resource, such as books and associations C: methods and standards for transcribing content