==Linguistic Annotation Wiki==
  
 
This wiki describes tools and formats for creating and managing ''linguistic annotations''. "Linguistic annotation" covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions -- audio, video and/or physiological recordings -- or it may be textual. The added notations may include transcriptions of all sorts (from phonetic features to discourse structures), part-of-speech and sense tagging, syntactic analysis, "named entity" identification, co-reference annotation, and so on. The focus is on tools which have been widely used for constructing annotated linguistic databases, and on the formats commonly adopted by such tools and databases.
 
*[http://www.language-archives.org/ Open Language Archives Community]
*[http://www.ldc.upenn.edu/exploration/ Linguistic Exploration]

----
For quicker reference, see the page that lists only transcription and annotation [[Tools]].
  
 
'''A'''
*[[Alembic Workbench]] (DT/U,W)
*[[Annotation Graph Toolkit (AGTK)]] (TDP)
*[[ANNIS]]
*[[annotate]] (TD)
*[[Anvil]] (TP)
*[[ATLAS]] (FP)
'''C'''
*[[CA]] (P)
*[[Callisto]] (TD/W,U,M)
*[[C-BAS]] (T/W)
*[[CES]] (FC)
*[[CHILDES]] (FTDPRC/W,M)
'''D'''
 
*[[DAMSL]] (FTRC/U,W)
*[[Delta]] (TP/U,W)
*[[Dexter]] (T)
*[[DRI]] (R)
'''E'''
*[[EAGLES]] (FR)
*[[ELAN]] (FTD)
*[[E-MELD]] (R)
*[[Emu]] (FTDP/U,W)
*[[EXMARaLDA]] (FTDP/U,W,M)
'''F'''
*[[Festival]] (TD/U)
*[[FLEX (Fieldworks Language Explorer)]]
*[[FORM]] (C)
*[[FSA's]] (TD)
'''H'''
*[[HIAT]] (FTDPRC/W)
*[[HIAT-DOS]] (T) ([[HIAT-DOS (Review)|Review]])
*[[Hyperlex]] (TP/U)
'''I'''
'''L'''
*[[LACITO]] Linguistic Data Archiving Project (Boyd Michailovsky, John B. Lowe, Michel Jacobson) (FTD)
*[[LAF]] Linguistic Annotation Framework
*[[LDC]] (FTDPRC)
*[[LT]] (T/U,W)
'''M'''
*[[MacShapa]] (TP)
*[[MacVissta]] (TD)
*[[MATE]] (FT)
*[[MediaStreams]] (P)
*[[MediaTagger]] (P)
*[[MICASE]] (TDC/W)
*[[MMAX]] (TD)
*[[MPEG]] (FPR)
*[[MPI]] (FT/UWM)
'''P'''
*[[Partitur]] (FT)
*[[PAULA]] (F)
*[[Praat]] (TD/U,W,M)
'''R'''
*[[RSTTool]] (TD)
'''S'''
*[[SABLE]] (FP)
*[[SNACK]] (TDP/U,W,M)
*[[SUSANNE]] (CP)
*[[SyncWriter]] (T) ([[SyncWriter (Review)|Review]])
'''T'''
*[[TalkBank]] (R)
*[[TEI]] (F)
*[[Tipster]] (F)
*[[Transcriber]] (TDP/U,W,M) ([[Transcriber (Review)|Review]])
*[[Transana]] (T) ([[Transana (Review)|Review]])
*[[Transformer]] (TDP)
*[[TransTool]] (TD/U,W)
*[[Treebank]] (C)
'''V'''
 
*[[vPrism]] (T/W)
  
=Key=

F: a systematically documented annotation format

T: an available tool for creation, display or search (W=Windows, U=Unix, M=MacOS)

D: a tool is downloadable

Latest revision as of 11:14, 15 November 2007

Linguistic Annotation Wiki

This wiki describes tools and formats for creating and managing linguistic annotations. "Linguistic annotation" covers any descriptive or analytic notations applied to raw language data. The basic data may be in the form of time functions -- audio, video and/or physiological recordings -- or it may be textual. The added notations may include transcriptions of all sorts (from phonetic features to discourse structures), part-of-speech and sense tagging, syntactic analysis, "named entity" identification, co-reference annotation, and so on. The focus is on tools which have been widely used for constructing annotated linguistic databases, and on the formats commonly adopted by such tools and databases.

This wiki is based on these webpages:

These are no longer maintained. Used with permission.

Related pages:


For quicker reference, see the page that lists only transcription and annotation Tools.

A

C

D

E

F

G

H

I

L

  • LACITO Linguistic Data Archiving Project (Boyd Michailovsky, John B. Lowe, Michel Jacobson) (FTD)
  • LAF Linguistic Annotation Framework
  • LDC (FTDPRC)
  • LT (T/U,W)

M

N

O

P

R

S

T

U

V

Key

F: a systematically documented annotation format

T: an available tool for creation, display or search (W=Windows, U=Unix, M=MacOS)

D: a tool is downloadable

P: there is a citeable paper which documents the format/system

R: other kinds of resource, such as books and associations

C: methods and standards for transcribing content
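
The codes in parentheses after each entry combine the letters above, optionally followed by a slash and platform letters, as in (TD/U,W,M). A minimal sketch of decoding such a code in Python follows, assuming that layout holds for every entry. The names `decode_tag`, `CATEGORIES` and `PLATFORMS` are illustrative and not part of the wiki.

```python
# Illustrative only: decode_tag, CATEGORIES and PLATFORMS are hypothetical names.
CATEGORIES = {
    "F": "systematically documented annotation format",
    "T": "available tool for creation, display or search",
    "D": "tool is downloadable",
    "P": "citeable paper documents the format/system",
    "R": "other kind of resource",
    "C": "methods and standards for transcribing content",
}
PLATFORMS = {"W": "Windows", "U": "Unix", "M": "MacOS"}


def decode_tag(tag):
    """Split a code such as '(TD/U,W,M)' into category and platform names."""
    inner = tag.strip().strip("()")
    # Everything before the slash is categories; the platform part is optional.
    cats, _, plats = inner.partition("/")
    # Platform letters appear with commas ('U,W') or without ('UWM').
    plat_letters = [ch for ch in plats if ch.isalpha()]
    return (
        [CATEGORIES[ch] for ch in cats],
        [PLATFORMS[ch] for ch in plat_letters],
    )


categories, platforms = decode_tag("(TD/U,W,M)")
print(platforms)  # ['Unix', 'Windows', 'MacOS']
```

A letter that is not in the key raises `KeyError`, which makes malformed codes easy to spot when checking the list.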