PoLKo - the Polish Learner Corpus

PoLKo is an academic project. The primary goal of the project is to collect learners’ writings in Polish as a foreign language at various levels of language proficiency. The collected material will be a basis for analysing the learners’ language, identifying the most common language errors, creating classroom materials, and improving modern teaching methods.

In the first step, we are going to collect all available electronic texts, so as to gather a sizeable amount of starting material in the shortest possible time. In the second step of our project, we intend to focus more on hand-written texts and the rules for transcribing such texts in a computer-readable format, and on balancing the entire corpus in terms of first language and language level (CEFR).

How to cite the corpus?

Kaczmarska, E., & Zasina, A. J. (2020). Błędy walencyjne w tekstach obcokrajowców uczących się języka polskiego w świetle korpusu PoLKoPrace Filologiczne, 75(1), 197–213. https://doi.org/10.32798/pf.657

Kaczmarska, E., & Zasina, A. J. (2021). Język polski w tekstach osób czeskojęzycznych na podstawie korpusu uczniowskiego PoLKo. ROSSICA OLOMUCENSIA, LX(2), 5–17.

Zasina, A. J., & Kaczmarska, E. (2020). Infrastructure of the Polish Learner Corpus PoLKo. Retrieved from https://www.researchgate.net/publication/342888260_Infrastructure_of_the_Polish_Learner_Corpus_PoLKo. https://doi.org/10.13140/RG.2.2.23874.40648 

For the collaboration, please contact Dr. Adrian Zasina.

Would you like to access our corpus through your own Login and add your own texts? Do not hesitate to contac us!

More information about how to set-up TEITOK can be found in the help pages: http://www.teitok.org/index.php?action=help