PoLKo - the Polish Learner Corpus

PoLKo is an academic project. The primary goal of the project is to collect learners’ writings in Polish as a foreign language at various levels of language proficiency. The collected material will be a basis for analysing the learners’ language, identifying the most common language errors, creating classroom materials, and improving modern teaching methods.

In the first step, we are collecting all available electronic texts, so as to gather a sizeable amount of starting material in the shortest possible time. In the second step, we intend to focus more on handwritten texts and the rules for transcribing such texts into a computer-readable format, and on balancing the entire corpus in terms of first language and language level (CEFR).

How to cite the corpus?

Kaczmarska, E., & Zasina, A. J. (2020). Błędy walencyjne w tekstach obcokrajowców uczących się języka polskiego w świetle korpusu PoLKo. Prace Filologiczne, 75(1), 197–213. https://doi.org/10.32798/pf.657

Kaczmarska, E., & Zasina, A. J. (2021). Język polski w tekstach osób czeskojęzycznych na podstawie korpusu uczniowskiego PoLKo. Rossica Olomucensia, LX(2), 5–17.

Zasina, A. J., & Kaczmarska, E. (2020). Infrastructure of the Polish Learner Corpus PoLKo. Retrieved from https://www.researchgate.net/publication/342888260_Infrastructure_of_the_Polish_Learner_Corpus_PoLKo. https://doi.org/10.13140/RG.2.2.23874.40648

Zasina, A. J., & Kaczmarska-Zglejszewska, E. (2022). Czech errors in writing based on the Polish Learner Corpus PoLKo: Pilot study. Retrieved from https://www.researchgate.net/publication/363762748_Czech_errors_in_writing_based_on_the_Polish_Learner_Corpus_PoLKo_Pilot_study

For collaboration, please contact Dr. Adrian Zasina.

Would you like to access our corpus with your own login and add your own texts? Do not hesitate to contact us!


Creative Commons License
PoLKo - the Polish Learner Corpus by Adrian Jan Zasina & Elżbieta Kaczmarska is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Please inform us about any publications in which you have used our data.

More information about how to set up TEITOK can be found in the help pages: http://www.teitok.org/index.php?action=help