Downloads

Download the latest version of the Korpus Malti for research purposes (CC BY-NC-SA).
Format
The files are in vertical format. This means that every word is on a separate line, and each word has its part of speech tag (see Tagset Malti v3.0), lemma and morphological root. e.g.:
imniedi PART-PASS mniedi n-d-j
mill- PREP-DEF minn null
Inizjamed NOUN-PROP Inizjamed null
Korpus Malti v4.2

Academic Section (111.4MB)
Administration Section (549.7MB)
Blogs Section (68.7MB)
Comics Section (207kB)
Jurisprudence Section (46.8MB)
Law Section (316.5MB)
Parliament Section (314.9MB)
Press Section (158MB)
Speeches Section (233kB)
Web Section (66.9MB)
Wiki Section (8MB)
All Sections (1.54GB)

Korpus Malti v4 other

Find other versions of Korpus Malta v4 on HuggingFace.