Supported Character Encodings
lidc currently supports a variety of 35 distinct character encodings. These cover both every modern and common encoding for any given language and a set of legacy encodings.
lidc supports all common Unicode Transformation Formats, namely "UTF-8", "UTF-16BE", "UTF-16LE", "UTF-32BE" and "UTF-32LE" - for any language and transliteration! There is, however, no support for UTF-16 and UTF-32 within emails.
| Character Encoding | Languages |
|---|---|
| ASCII | Bulgarian (DIN 1460 transliteration), Bulgarian (ISO 9 transliteration), Bulgarian (Streamlined System transliteration), Czech (Common transliteration), Danish, Dutch, English, Estonian, Finnish, French, German, German (Common transliteration), Greek (DIN 31634 transliteration), Greek (Greeklish transliteration), Greek (ISO 843 transliteration), Irish (Gaelic), Italian, Latvian, Lithuanian, Polish (Common transliteration), Portuguese, Romanian (Common transliteration), Slovak (Common transliteration), Slovenian, Slovenian (Common transliteration), Spanish, Swedish |
| Big5 | Mandarin (Chinese) |
| CP 737 | Greek |
| CP 775 | Estonian, Latvian, Lithuanian |
| CP 850 | Bokmål (Norwegian), Danish, Dutch, English, Finnish, French, German, Irish (Gaelic), Italian, Nynorsk (Norwegian), Portuguese, Spanish, Swedish |
| CP 852 | Czech, Hungarian, Polish, Romanian, Slovak, Slovenian |
| CP 855 | Bulgarian, Russian |
| CP 866 | Bulgarian, Russian |
| GB2312 | Mandarin (Chinese) |
| ISO-8859-1 | Bokmål (Norwegian), Czech (Common transliteration), Danish, Dutch, English, Finnish, French, German, German (Common transliteration), Greek (Greeklish transliteration), Irish (Gaelic), Italian, Nynorsk (Norwegian), Polish (Common transliteration), Portuguese, Romanian (Common transliteration), Slovak (Common transliteration), Slovenian (Common transliteration), Spanish, Swedish |
| ISO-8859-15 | Dutch, Finnish, French, German, Portuguese, Spanish |
| ISO-8859-16 | Hungarian, Italian, Polish, Slovenian |
| ISO-8859-2 | Czech, Hungarian, Polish, Romanian, Slovak, Slovenian |
| ISO-8859-3 | Maltese |
| ISO-8859-4 | Estonian, Latvian, Lithuanian |
| ISO-8859-5 | Bulgarian, Russian |
| ISO-8859-7 | Greek |
| KOI8-R | Bulgarian, Russian |
| KOI8-U | Ukrainian |
| MacCentralEuropean | Czech, Estonian, Hungarian, Latvian, Lithuanian, Polish, Slovak, Slovenian |
| MacCyrillic | Bulgarian, Russian |
| MacGreek | Greek |
| MacRoman | Bokmål (Norwegian), Danish, Dutch, English, Finnish, French, German, Irish (Gaelic), Italian, Nynorsk (Norwegian), Portuguese, Spanish, Swedish |
| MacRomanian | Romanian |
| MacUkrainian | Ukrainian |
| UTF-16BE | Bokmål (Norwegian), Bulgarian, Bulgarian (DIN 1460 transliteration), Bulgarian (ISO 9 transliteration), Bulgarian (Streamlined System transliteration), Czech, Czech (Common transliteration), Danish, Dutch, English, Estonian, Finnish, French, German, German (Common transliteration), Greek, Greek (DIN 31634 transliteration), Greek (Greeklish transliteration), Greek (ISO 843 transliteration), Hungarian, Irish (Gaelic), Italian, Latvian, Lithuanian, Maltese, Mandarin (Chinese), Nynorsk (Norwegian), Polish, Polish (Common transliteration), Portuguese, Romanian, Romanian (Common transliteration), Russian, Russian (DIN 1460 transliteration), Russian (ISO 9 transliteration), Slovak, Slovak (Common transliteration), Slovenian, Slovenian (Common transliteration), Spanish, Swedish, Ukrainian, Ukrainian (DIN 1460 transliteration), Ukrainian (ISO 9 transliteration) |
| UTF-16LE | Bokmål (Norwegian), Bulgarian, Bulgarian (DIN 1460 transliteration), Bulgarian (ISO 9 transliteration), Bulgarian (Streamlined System transliteration), Czech, Czech (Common transliteration), Danish, Dutch, English, Estonian, Finnish, French, German, German (Common transliteration), Greek, Greek (DIN 31634 transliteration), Greek (Greeklish transliteration), Greek (ISO 843 transliteration), Hungarian, Irish (Gaelic), Italian, Latvian, Lithuanian, Maltese, Mandarin (Chinese), Nynorsk (Norwegian), Polish, Polish (Common transliteration), Portuguese, Romanian, Romanian (Common transliteration), Russian, Russian (DIN 1460 transliteration), Russian (ISO 9 transliteration), Slovak, Slovak (Common transliteration), Slovenian, Slovenian (Common transliteration), Spanish, Swedish, Ukrainian, Ukrainian (DIN 1460 transliteration), Ukrainian (ISO 9 transliteration) |
| UTF-32BE | Bokmål (Norwegian), Bulgarian, Bulgarian (DIN 1460 transliteration), Bulgarian (ISO 9 transliteration), Bulgarian (Streamlined System transliteration), Czech, Czech (Common transliteration), Danish, Dutch, English, Estonian, Finnish, French, German, German (Common transliteration), Greek, Greek (DIN 31634 transliteration), Greek (Greeklish transliteration), Greek (ISO 843 transliteration), Hungarian, Irish (Gaelic), Italian, Latvian, Lithuanian, Maltese, Mandarin (Chinese), Nynorsk (Norwegian), Polish, Polish (Common transliteration), Portuguese, Romanian, Romanian (Common transliteration), Russian, Russian (DIN 1460 transliteration), Russian (ISO 9 transliteration), Slovak, Slovak (Common transliteration), Slovenian, Slovenian (Common transliteration), Spanish, Swedish, Ukrainian, Ukrainian (DIN 1460 transliteration), Ukrainian (ISO 9 transliteration) |
| UTF-32LE | Bokmål (Norwegian), Bulgarian, Bulgarian (DIN 1460 transliteration), Bulgarian (ISO 9 transliteration), Bulgarian (Streamlined System transliteration), Czech, Czech (Common transliteration), Danish, Dutch, English, Estonian, Finnish, French, German, German (Common transliteration), Greek, Greek (DIN 31634 transliteration), Greek (Greeklish transliteration), Greek (ISO 843 transliteration), Hungarian, Irish (Gaelic), Italian, Latvian, Lithuanian, Maltese, Mandarin (Chinese), Nynorsk (Norwegian), Polish, Polish (Common transliteration), Portuguese, Romanian, Romanian (Common transliteration), Russian, Russian (DIN 1460 transliteration), Russian (ISO 9 transliteration), Slovak, Slovak (Common transliteration), Slovenian, Slovenian (Common transliteration), Spanish, Swedish, Ukrainian, Ukrainian (DIN 1460 transliteration), Ukrainian (ISO 9 transliteration) |
| UTF-8 | Bokmål (Norwegian), Bulgarian, Bulgarian (DIN 1460 transliteration), Bulgarian (ISO 9 transliteration), Bulgarian (Streamlined System transliteration), Czech, Czech (Common transliteration), Danish, Dutch, English, Estonian, Finnish, French, German, German (Common transliteration), Greek, Greek (DIN 31634 transliteration), Greek (Greeklish transliteration), Greek (ISO 843 transliteration), Hungarian, Irish (Gaelic), Italian, Latvian, Lithuanian, Maltese, Mandarin (Chinese), Nynorsk (Norwegian), Polish, Polish (Common transliteration), Portuguese, Romanian, Romanian (Common transliteration), Russian, Russian (DIN 1460 transliteration), Russian (ISO 9 transliteration), Slovak, Slovak (Common transliteration), Slovenian, Slovenian (Common transliteration), Spanish, Swedish, Ukrainian, Ukrainian (DIN 1460 transliteration), Ukrainian (ISO 9 transliteration) |
| Windows-1250 | Bulgarian (DIN 1460 transliteration), Bulgarian (Streamlined System transliteration), Czech, Hungarian, Polish, Romanian, Slovak, Slovenian |
| Windows-1251 | Bulgarian, Russian, Ukrainian |
| Windows-1252 | Bokmål (Norwegian), Danish, Dutch, English, Finnish, French, German, Irish (Gaelic), Italian, Nynorsk (Norwegian), Portuguese, Spanish, Swedish |
| Windows-1253 | Greek |
| Windows-1257 | Estonian, Latvian, Lithuanian |
For more information on character encodings, have a look at the Unicode mappings provided in our knowledge base.


