Powered by TEITOK
Maarten Janssen, 2014-
CzeSL – Czech as a Second Language
- Work in progress: new texts, transcription, annotation, imports of data in other formats
- This corpus is meant to include all available Czech texts written by non-native learners.
- The currently searchable texts are annotated automatically in the czesl-sgt style: forms identified as incorrect are normalized and assigned one or more error tags, the original and normalized forms are tagged and lemmatized.
- The corpus comes with metadata and scans of manuscripts (where available) for registered users.
- Manual annotation already available for some texts will be added, the automatic normalization for other texts will be checked and extended manually.
Přepis žákovských textů
Oprava žákovských textů
More information about how to set-up TEITOK can be found in the help pages: http://teitok.corpuswiki.org/site/index.php?action=help