From LinguisticAnnotation
[http://www.sfb632.uni-potsdam.de/annis/ ANNIS]
ANNIS is an open-source web application that provides access to multi-layer richly annotated corpora. It provides search and visualization functionality for complex constellations of token and token span-based annotations, hierarchical graph structures such as syntax trees or rhetorical annotations, arbitrary labeled pointing relations, used for example in coreference annotation or dependency syntax, and also for metadata. Multimodal data can also be aligned with corpora, and parallel texts can be aligned at the word, phrase or sentence level.
  
ANNIS is the name of the web application that provides access to the data of the SFB 632 ("Information structure: the linguistic means for structuring utterances, sentences and texts").
ANNIS3, the current iteration of the ANNIS software, is being used and developed at the Humboldt University of Berlin and at Georgetown University in Washington, DC. A number of different projects collect and annotate data according to multiple annotation standards. This data, which is annotated using a growing set of annotation tools ([[EXMARaLDA]], [[annotate]] (TigerXML), [[MMAX]], [[rstWeb]], [[WebAnno]], [[Vakyartha]] (a.k.a. Arborator)), can be mapped to the multilayer standoff format [[PAULA]] (Potsdamer Austauschformat für Linguistische Annotation / Potsdam Interchange Format for Linguistic Annotation), which serves as the basis for further processing for many corpora. Alternatively, corpora in any of the individual formats mentioned above can be converted directly to ANNIS using [[SaltNPepper]].
In the SFB, a number of different projects collect and annotate data according to the common SFB Annotation Standard. This data, which is annotated using a small set of annotation tools ([[EXMARaLDA]], [[annotate]], [[MMAX]], [[RSTTool]]), is mapped to the encoding standard of the SFB, [[PAULA]] (Potsdamer Austauschformat für Linguistische Annotation / Potsdam Interchange Format for Linguistic Annotation), which serves as the basis for further processing. ANNIS provides the means for visualizing and retrieving this data.
 
==Literature/References==
* [http://corpus-tools.org/annis ANNIS Homepage]
* Zeldes, Amir & Krause, Thomas (2016): [http://dx.doi.org/10.1093/llc/fqu057 ANNIS3. A New Architecture for Generic Corpus Query and Visualization]. Digital Scholarship in the Humanities 31(1), 118-139.
* Zeldes, Amir, Ritz, Julia, Lüdeling, Anke & Chiarcos, Christian (2009): [http://www.linguistik.hu-berlin.de/institut/professuren/korpuslinguistik/mitarbeiter-innen/amir/pdf/CL2009_ANNIS_pre.pdf Annis: A search tool for multi-layer annotated corpora]. In: Proceedings of Corpus Linguistics 2009, July 20-23, Liverpool, UK.
* Chiarcos, Christian, Dipper, Stefanie, Götze, Michael, Leser, Ulf, Lüdeling, Anke, Ritz, Julia & Stede, Manfred (2008): [http://www.atala.org/IMG/pdf/TAL-2008-49-2-08-Chiarcos.pdf A Flexible Framework for Integrating Annotations from Different Tools and Tag Sets]. In: Traitement Automatique des Langues 49(2), 271-293.
* Dipper, Stefanie & Götze, Michael (2005): [http://www.ling.uni-potsdam.de/~dipper/papers/ltc05.pdf Accessing Heterogeneous Linguistic Data - Generic XML-Based Representation and Flexible Visualization]. In: Proceedings of the 2nd Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics. Poznań, Poland, 206-210.

Latest revision as of 15:47, 11 May 2016
