Supported Languages
The Perl extension Lingua::Lid implements an interface to lid - a C/C++ library that currently supports 42 languages and transliterations in a variety of modern, common and legacy character encodings.
Support for additional languages and character encodings is added regularly as the development of the underlying lid library proceeds. However, if you need a specific language or character encoding supported, feel free to contact us so we can support it quickly.
| Language | ISO 639-3 Code | Character Encodings | |
|---|---|---|---|
| Bulgarian | ![]() |
bul | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-5, Windows-1251, MacCyrillic, CP 855, CP 866, KOI8-R |
| Bulgarian (DIN 1460 transliteration) |
![]() |
bul | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, Windows-1250, ASCII |
| Bulgarian (ISO 9 transliteration) |
![]() |
bul | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ASCII |
| Bulgarian (Streamlined System transliteration) |
![]() |
bul | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, Windows-1250, ASCII |
| Czech | ![]() |
ces | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, Windows-1250, MacCentralEuropean, CP 852 |
| Czech (Common transliteration) |
![]() |
ces | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII |
| Danish | ![]() |
dan | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, Windows-1252, MacRoman, CP 850, ASCII |
| Dutch | ![]() |
nld | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP 850, ASCII |
| English | ![]() |
eng | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, Windows-1252, MacRoman, CP 850, ASCII |
| Estonian | ![]() |
est | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-4, Windows-1257, MacCentralEuropean, CP 775, ASCII |
| Finnish | ![]() |
fin | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP 850, ASCII |
| French | ![]() |
fra | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP 850, ASCII |
| German | ![]() |
deu | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP 850, ASCII |
| German (Common transliteration) |
![]() |
deu | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII |
| Greek | ![]() |
ell | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-7, Windows-1253, MacGreek, CP 737 |
| Greek (DIN 31634 transliteration) |
![]() |
ell | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ASCII |
| Greek (Greeklish transliteration) |
![]() |
ell | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII |
| Greek (ISO 843 transliteration) |
![]() |
ell | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ASCII |
| Hungarian | ![]() |
hun | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, ISO-8859-16, Windows-1250, CP 852, MacCentralEuropean |
| Irish (Gaelic) | ![]() |
gle | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, Windows-1252, MacRoman, CP 850, ASCII |
| Italian | ![]() |
ita | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-16, Windows-1252, MacRoman, CP 850, ASCII |
| Latvian | ![]() |
lav | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-4, Windows-1257, MacCentralEuropean, CP 775, ASCII |
| Lithuanian | ![]() |
lit | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-4, Windows-1257, MacCentralEuropean, CP 775, ASCII |
| Maltese | ![]() |
mlt | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-3 |
| Mandarin (Chinese) | ![]() |
cmn | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, Big5, GB2312 |
| Polish | ![]() |
pol | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, ISO-8859-16, Windows-1250, MacCentralEuropean, CP 852 |
| Polish (Common transliteration) |
![]() |
pol | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII |
| Portuguese | ![]() |
por | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP 850, ASCII |
| Romanian | ![]() |
ron | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, Windows-1250, MacRomanian, CP 852 |
| Romanian (Common transliteration) |
![]() |
ron | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII |
| Russian | ![]() |
rus | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-5, Windows-1251, MacCyrillic, CP 855, CP 866, KOI8-R |
| Russian (DIN 1460 transliteration) |
![]() |
rus | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8 |
| Russian (ISO 9 transliteration) |
![]() |
rus | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8 |
| Slovak | ![]() |
slk | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, Windows-1250, MacCentralEuropean, CP 852 |
| Slovak (Common transliteration) |
![]() |
slk | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII |
| Slovenian | ![]() |
slv | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, ISO-8859-16, Windows-1250, MacCentralEuropean, CP 852, ASCII |
| Slovenian (Common transliteration) |
![]() |
slv | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII |
| Spanish | ![]() |
spa | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP 850, ASCII |
| Swedish | ![]() |
swe | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, Windows-1252, MacRoman, CP 850, ASCII |
| Ukrainian | ![]() |
ukr | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, Windows-1251, MacUkrainian, KOI8-U |
| Ukrainian (DIN 1460 transliteration) |
![]() |
ukr | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8 |
| Ukrainian (ISO 9 transliteration) |
![]() |
ukr | UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8 |



























