Supported Charsets

Input

AutoUniConv currently supports a variety of 35 distinct character sets from 6 families. These cover both modern and common charsets and a set of legacy charsets.

All common Unicode Transformation Formats (UTF), namely "UTF-8", "UTF-16BE", "UTF-16LE", "UTF-32BE" and "UTF-32LE" are supported both as an input and output.

Family Character Set
Unicode UTF-16BE, UTF-16LE, UTF-32BE, UTF-32LE, UTF-8
ISO ISO-8859-1, ISO-8859-15, ISO-8859-16, ISO-8859-2, ISO-8859-3, ISO-8859-4, ISO-8859-5, ISO-8859-7
Macintosh MacCentralEuropean, MacCyrillic, MacGreek, MacRoman, MacRomanian, MacUkrainian
DOS Code Pages CP 737, CP 775, CP 850, CP 852, CP 855, CP 866
Windows Windows-1250, Windows-1251, Windows-1252, Windows-1253, Windows-1257
Natinonal ASCII, Big5, GB2312, KOI8-R, KOI8-U

Output

The input may be converted to different Unicode Transformation Formats (UTF):

  • UTF-8
  • UTF-16LE (Little-Endian)
  • UTF-16BE (Big-Endian)
  • UTF-32LE (Little-Endian)
  • UTF-32BE (Big-Endian)

For more information on character encodings, have a look at the Unicode mappings provided in our knowledge base.