Corpora are available from our corpus server and require a username and password, which can be created online.
Corpus of Learner English in Malta (CLEM)
Corpus of ca. 1 million tokens, consisting of English essays by
students. These essays were sampled from examination scripts by
Maltese students written during the 2011-2013 examinations sessions
of the Matriculation and Secondary Education Certificate (MATSEC) Examinations Board.
The corpus is stratified by gender, school type, candidate's region of residence, date of birth and mark/grade. Tokens are annotated with part of speech, lemma and orthographic errors.