

I was just at a networking/research technology conference in Helsinki (TNC26) where the topic of nordic languages— especially minority ones—being under-represented by current automated transcription/translation tools came up in one of the side talks I attended. There’s some effort by various European NRENs and universities to train models on these languages so those tools can be more widely available to students, academics, and the public. The talk was about “Scribe” by SUNET (Swedish Research Network) hosting whisper models for this purpose.
That should be especially good for things like the multiple sapmi languages! At least in finnish you can already write in the proper “book language” and get pretty accurate translations, even though the dialects still escape that.
Also, Finnish is fucking hard lol. I can usually pick up a bit of language wherever I travel, basic phrases usually. But DAMN trying to nail the epiglottal sounds of even “Hyvää yötä” threw me!
It is usually especially hard to learn for indo-european speakers, so it is not just you struggling! Haha :)





No, they are not usually written except people sometimes doing it casually in social media, but because our lettering system is almost fully phonetic, it is very easy to write and read them if you just speak finnish. Also if you are native speaker, you kind of learn the certain fluidity in the core of the language, so you can pretty much understand the words even if they vary a lot (except people from Rauma, nobody understands them).
I really thought it would have been cracked by AI because it can translate finnish pretty accurately (not always…) and if you can do that dialects aren’t hard at all, but I was surprised to find that it still cannot! I am assuming it really is just because there are not enough written sources to teach from.
//Oh, and as a summary my main points were, that AI most definitely has not “solved” learning and translating languages as it yet cannot even translate a lot of things, and I guess also, that you cannot trust AI translations if the text translated is some obscure language you do not know. They can sound convincing and form coherent sentences, but the meaning can be fully incorrect.