PLANNED_DATA.md

Planned datasets

Data subsets are devided according to single languages or language pairs, which are labeled with the respective ISO code.

hun

Hungarian translations of Pite Saami spoken texts collected by Ignác Halász.

kpv

Komi-Zyrian. The texts were digitalized by the Fennougrica project, proofread bu FU-Lab and processed by the Izhva Komi Documentation Project.

sia

sia-sms

Gospel of Matthew by Arvid Genetz (Akkala) and Konstantin Ščekoldin (Skolt)

sjd

sjd-sms

Gospel of Matthew by Arvid Genetz (Akkala) and Konstantin Ščekoldin (Skolt)

sje-hun

Pite Saami spoken texts with Hungarian translations collected and published by Ignác Halász. The texts were digitalized by the Pite Saami Documentation Project; annotations are being added in that project (and its descendants) on an on-going basis.

sms

sjt

Ter Saami spoken texts collected and published by Arvid Genetz. The texts were digitalized by the Kola Saami Documentation Project.



langdoc/uralic documentation built on May 29, 2019, 3:41 a.m.