Wikipedia dataset containing cleaned articles of all the languages on Wikipedia, including Faroese. The datasets are built from the Wikipedia dump (https://dumps.wikimedia.org/) with one split per language. You can also download a newer version of the dataset by using the Hugging Face interface (click the button below).



