“stt_fo_quartznet15x5_sp_ep163_100h” is an acoustic model created with NeMo for Automatic Speech Recognition (ASR) in Faroese.
It is the result of fine-tuning the model “QuartzNet15x5Base-En.nemo” with 100 hours of Faroese data developed by the Ravnur Project from the Faroe Islands and curated by Carlos Mena during 2022. Most of the data is available in public repositories such as Clarin.is and Hugging Face.
The specific corpus used to fine-tune the model is this one.
Release: 2022
Contact: carlos.mena@ciempiess.org
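
As a minimal usage sketch (not an official recipe), the model can probably be loaded like other QuartzNet checkpoints, through NeMo's `EncDecCTCModel.restore_from`. The local checkpoint filename, the audio file `sample_fo.wav` and the helper names below are assumptions made for illustration. The input is assumed to be 16 kHz mono WAV, as is usual for QuartzNet models. The return type of `transcribe` varies between NeMo versions (plain strings in older releases, hypothesis objects in newer ones).

```python
from pathlib import Path

# Assumed local filename of the downloaded checkpoint (hypothetical path).
CHECKPOINT = Path("stt_fo_quartznet15x5_sp_ep163_100h.nemo")


def existing_wavs(paths):
    """Return, as strings, only the paths that exist and end in .wav."""
    return [
        str(p)
        for p in map(Path, paths)
        if p.suffix.lower() == ".wav" and p.is_file()
    ]


def transcribe(checkpoint, wav_paths):
    """Restore the CTC model from a .nemo file and transcribe the given files."""
    # Imported lazily so the helper above works without NeMo installed.
    import nemo.collections.asr as nemo_asr

    model = nemo_asr.models.EncDecCTCModel.restore_from(
        restore_path=str(checkpoint)
    )
    model.eval()
    # Passed positionally because the keyword name differs across NeMo versions.
    return model.transcribe(wav_paths)


if __name__ == "__main__":
    wavs = existing_wavs(["sample_fo.wav"])  # hypothetical audio file
    if CHECKPOINT.is_file() and wavs:
        for path, text in zip(wavs, transcribe(CHECKPOINT, wavs)):
            print(path, text)
    else:
        print("Checkpoint or audio file not found; nothing to transcribe.")
```

The script only calls NeMo when both the checkpoint and at least one audio file are present, so it can be run safely before the model has been downloaded.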




