Each text element corresponds to the text extracted from one Web page.
The text has been automatically split into sentences, tokenised,
part-of-speech tagged and lemmatised with TreeTagger.
The TreeTagger tags have then also been mapped to the common, MULTEXT-based
SPOOK tagset.
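
As an illustration of the annotation layers described above, the following is a minimal sketch of how such a text could be read into memory. It assumes a one-token-per-line ("vertical") layout with tab-separated word, tag and lemma columns, which is TreeTagger's usual output, wrapped in <text> and <s> elements. The element names, the sample sentence, its tags and the function name parse_vertical are assumptions made for this example only; the actual file layout of the corpus may differ, and the mapping to the SPOOK tagset is not shown.

```python
import re
from typing import NamedTuple


class Token(NamedTuple):
    word: str
    tag: str
    lemma: str


# Matches a structural line such as <text>, <s>, </s> or <text id="...">.
TAG = re.compile(r"^<(/?)(\w+)[^>]*>$")

# Hypothetical sample: one text containing one sentence (tags are illustrative).
SAMPLE = """<text>
<s>
The\tDT\tthe
cat\tNN\tcat
sleeps\tVBZ\tsleep
.\tSENT\t.
</s>
</text>
"""


def parse_vertical(lines):
    """Return a list of texts; each text is a list of sentences,
    and each sentence is a list of Token tuples."""
    texts, sentences, tokens = [], [], []
    for raw in lines:
        line = raw.rstrip("\n")
        fields = line.split("\t")
        if len(fields) == 3:
            # Token line: word, part-of-speech tag, lemma.
            tokens.append(Token(*fields))
            continue
        match = TAG.match(line.strip())
        if not match:
            continue  # blank or unrecognised line
        closing, name = match.group(1) == "/", match.group(2)
        if name == "text":
            if closing:
                texts.append(sentences)
            else:
                sentences = []
        elif name == "s":
            if closing:
                sentences.append(tokens)
            else:
                tokens = []
    return texts


if __name__ == "__main__":
    for sentence in parse_vertical(SAMPLE.splitlines())[0]:
        print(" ".join(f"{t.word}/{t.tag}/{t.lemma}" for t in sentence))
```

Token lines are recognised by their three tab-separated fields before any tag matching is attempted, so a token that itself begins with "<" is not mistaken for markup.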