From LinguisticAnnotation
Latest revision as of 12:03, 2 May 2013

EXMARaLDA is an acronym for "Extensible Markup Language for Discourse Annotation". It is a system of concepts, data formats and tools for the computer-assisted transcription and annotation of spoken language. EXMARaLDA is being developed in a project at the Collaborative Research Center "Multilingualism" (Sonderforschungsbereich "Mehrsprachigkeit" - SFB 538) at the University of Hamburg. The system's software tools - an editor for transcriptions in musical score notation, a corpus manager for administering corpus metadata, and a concordancing tool - are freely available to users outside the SFB.

The main features of EXMARaLDA are:

  • XML-based data formats - All EXMARaLDA transcriptions are stored in XML files. Using this W3C standard keeps the data flexible to use and suitable for long-term archiving.
  • Java-based tools - All software tools for creating and working with EXMARaLDA data are Java applications. This makes them suitable for all currently used operating systems (Windows, Macintosh, Linux, Unix).
  • Interoperability - The EXMARaLDA concept is loosely based on the annotation graph framework (Bird/Liberman 2001) and thus aims at maximal exchangeability and reusability of transcription data. Hence, EXMARaLDA data can be created and edited not only with the system's own tools, but also with other popular software (such as Praat, ELAN, Transcriber, CHILDES, WinPitch or the TASX Annotator). An import and export facility for TEI data is also available, as well as import filters for SyncWriter and HIAT-DOS data.

Furthermore, EXMARaLDA data can be transformed into a number of widely used presentation formats (RTF, HTML, PDF, SVG) for web-based or printed publication. Finally, EXMARaLDA supports several important transcription systems (HIAT, DIDA, GAT, CHILDES) through a number of parameterised functions.
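
To make the idea of an XML-stored, annotation-graph-style transcription more concrete, the following minimal Python sketch reads a small transcription in which events on speaker tiers point to shared timeline points. The element and attribute names used here (transcription, timeline, point, tier, event) and the helper load_events are hypothetical and chosen only for illustration; they are not taken from the actual EXMARaLDA schema, which should be consulted for the real format.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified transcription. The element and attribute names
# are assumptions for illustration, NOT the official EXMARaLDA schema.
SAMPLE = """<transcription>
  <timeline>
    <point id="T0" time="0.0"/>
    <point id="T1" time="1.2"/>
    <point id="T2" time="2.5"/>
  </timeline>
  <tier speaker="A" category="verbal">
    <event start="T0" end="T1">Hello there.</event>
    <event start="T1" end="T2">How are you?</event>
  </tier>
  <tier speaker="B" category="verbal">
    <event start="T1" end="T2">Fine.</event>
  </tier>
</transcription>"""


def load_events(xml_text):
    """Return (start_time, end_time, speaker, text) tuples, sorted by time."""
    root = ET.fromstring(xml_text)
    # Timeline points act as the nodes of the graph: id -> time in seconds.
    times = {p.get("id"): float(p.get("time")) for p in root.iter("point")}
    events = []
    for tier in root.iter("tier"):
        for ev in tier.iter("event"):
            # Each event is an arc between two timeline points.
            events.append(
                (
                    times[ev.get("start")],
                    times[ev.get("end")],
                    tier.get("speaker"),
                    ev.text or "",
                )
            )
    return sorted(events)


events = load_events(SAMPLE)
for start, end, speaker, text in events:
    print(f"{start:5.1f}-{end:5.1f}  {speaker}: {text}")
```

Because every event refers to points on one shared timeline, overlapping speech by different speakers (here A and B between T1 and T2) falls out of the data structure directly, which is the kind of property that makes conversion between tools and output formats tractable.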

Literature/References