Rozdíly

Zde můžete vidět rozdíly mezi vybranou verzí a aktuální verzí dané stránky.

Odkaz na výstup diff

Both sides previous revision Předchozí verze
Následující verze
Předchozí verze
Poslední revize Both sides next revision
czesl:czesl [2020/12/18 18:22]
rosen
czesl:czesl [2021/03/11 19:53]
rosen [Tools]
Řádek 1: Řádek 1:
 +{{ :​czesl:​logolink_op_vvv_hor_barva_eng.jpg?​600 |}}
 +
 ======= CzeSL – a Learner Corpus of Czech ======= ======= CzeSL – a Learner Corpus of Czech =======
  
Řádek 15: Řádek 17:
 ===== Available versions ===== ===== Available versions =====
  
-| ^  Thousands of tokens in  ^^^^  ​annotation ​ ^^^^^ Metadata ^  Access ​ ^  Year  ^ +| ^  Thousands of tokens in  ^^^^  ​Annotation ​ ^^^^^ Metadata ^  Access ​ ^  Year  ^ 
-| ::: ^  non-native ​ ^^  ethnolect ​ ^  𝚺  ^  ​Error  ​^^  ​Linguistic ​ ​^^^:::​^:::​^:::​^+| ::: ^  non-native ​ ^^  ethnolect ​ ^  𝚺  ^  ​error  ​^^  ​linguistic ​ ​^^^:::​^:::​^:::​^
 | ::: ^  essays ​ ^  theses ​ ^ ::: ^ ::: ^  Tags  ^  TH  ^  T0  ^  T1  ^  T2  ^ ::: ^ ::: ^ ::: ^ | ::: ^  essays ​ ^  theses ​ ^ ::: ^ ::: ^  Tags  ^  TH  ^  T0  ^  T1  ^  T2  ^ ::: ^ ::: ^ ::: ^
 ^ CzeSL-plain |  1,315 |  732 |  428 |  2,475 |  --  |  --  |  --  |  --  |  --  |  --  |  SD  |  2012  | ^ CzeSL-plain |  1,315 |  732 |  428 |  2,475 |  --  |  --  |  --  |  --  |  --  |  --  |  SD  |  2012  |
Řádek 89: Řádek 91:
     * Each text with its annotation consists of several related files.     * Each text with its annotation consists of several related files.
     * Some of the texts are independently annotated twice.     * Some of the texts are independently annotated twice.
 +    * Includes also flat version (files named *.vert), see CzeSL-man v2 below.
   * **CzeSL-man v1 searchable**:​   * **CzeSL-man v1 searchable**:​
     * Searchable by KonText: https://​kontext.korpus.cz/​first_form?​corpname=czesl-man     * Searchable by KonText: https://​kontext.korpus.cz/​first_form?​corpname=czesl-man
Řádek 104: Řádek 107:
   * Apart from the error annotation, the content and metadata are the same as in CzeSL-man v1.   * Apart from the error annotation, the content and metadata are the same as in CzeSL-man v1.
   * Linguistic annotation (tags and lemmas) is provided for all tokens at Tier 0 and Tier 2.   * Linguistic annotation (tags and lemmas) is provided for all tokens at Tier 0 and Tier 2.
 +  * Downloadable from https://​bitbucket.org/​czesl/​czesl-man/​ (files named *.vert).
  
 === CzeSL-TH === === CzeSL-TH ===
Řádek 153: Řádek 157:
   * Multi-level concordancer [[http://​utkl.ff.cuni.cz/​czesl/​selaq.html|SeLaQ]],​ used for basic searching in CzeSL-man   * Multi-level concordancer [[http://​utkl.ff.cuni.cz/​czesl/​selaq.html|SeLaQ]],​ used for basic searching in CzeSL-man
   * Standard concordancer [[http://​wiki.korpus.cz/​doku.php/​en:​manualy:​kontext:​index|Manatee/​KonText]],​ used for searching in CzeSL-plain and CzeSL-SGT   * Standard concordancer [[http://​wiki.korpus.cz/​doku.php/​en:​manualy:​kontext:​index|Manatee/​KonText]],​ used for searching in CzeSL-plain and CzeSL-SGT
-  * General corpus tool [[http://beta.clul.ul.pt/​teitok/​site/​|TEITOK]], currently used for building, editing and viewing learner corpora hosted by the Institute of Theoretical and Computational linguistics (see [[http://​utkl.ff.cuni.cz/​teitok/​|Learner corpora at ICTL]])+  * General corpus tool [[http://www.teitok.org|TEITOK]], currently used for building, editing and viewing learner corpora hosted by the Institute of Theoretical and Computational linguistics (see [[http://​utkl.ff.cuni.cz/​teitok/​|Learner corpora at ICTL]])
  
 ===== Bibliography ===== ===== Bibliography =====
Řádek 166: Řádek 170:
 ===== Acknowledgement ===== ===== Acknowledgement =====
  
-{{ :​czesl:​logolink_op_vvv_hor_barva_eng.jpg?​600 |}} 
 This work was supported by the European Regional Development Fund project “Creativity and Adaptability as Conditions of the Success of Europe in an Interrelated World” (reg. no.: CZ.02.1.01/​0.0/​0.0/​16_019/​0000734). This work was supported by the European Regional Development Fund project “Creativity and Adaptability as Conditions of the Success of Europe in an Interrelated World” (reg. no.: CZ.02.1.01/​0.0/​0.0/​16_019/​0000734).

QR Code
QR Code czesl:czesl (generated for current page)