Revision as of 11:23, 11 October 2006
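[http://www.coli.uni-saarland.de/projects/sfb378/negra-corpus/negra-corpus.html NEGRA Corpus] (Thorsten Brants)
The NEGRA corpus consists of approximately 10,000 sentences of German newspaper text. It is a treebank with a novel [http://www.coli.uni-sb.de/%7Ethorsten/publications/Skut-ea-ANLP97.ps.gz annotation scheme for discontinuous constituents]. An example tree showing the visual format, the annotation format, and the Penn Treebank equivalent is available [http://www.coli.uni-sb.de/sfb378/negra-corpus/sentno3.html here]. [http://www.coli.uni-sb.de/sfb378/negra-corpus/annotate.html Annotate] is a sophisticated tool that supports human-machine collaboration on the construction of syntactic trees.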