Sprachinspektor Language Detector

Sprachinspektor Mascot

The language detector Sprachinspektor provides two important pieces of information: the language and character encoding a text is written in.

Sprachinspektor detects the language and character encoding with high accuracy. Even very short input of about five words is in most cases sufficient to identify the language correctly.

Sprachinspektor is available as...

Download
Free Application
Request
Quote for SDK
Request
SDK Evaluation

Supported Languages and Character Encodings

Sprachinspektor currently detects 29 languages and additionally 16 languages in transliterated form.

The supported character encodings cover both every modern and common encoding for any given language and a set of legacy encodings. All together Sprachinspektor detects 39 character encodings.

Support for additional languages and character encodings is added regularly. However, if you need a specific language or character encoding supported, feel free to contact us.

If you are interested in the conversion of character encoding, please have a look at our Unicode converter AutoUniConv.

Language Encodings
- Arabic UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-6, Windows-1256, MacArabic, CP720
bul Bulgarian UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-5, Windows-1251, MacCyrillic, CP855, CP866, KOI8-R
bul Bulgarian
(ISO 9 transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ASCII
bul Bulgarian
(DIN 1460 transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, Windows-1250, ASCII
bul Bulgarian
(Streamlined System transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, Windows-1250, ASCII
ces Czech UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, Windows-1250, MacCentralEuropean, CP852
ces Czech
(Common CES transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII
cmn Mandarin (Chinese) UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, Big5, GB2312
dan Danish UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, Windows-1252, MacRoman, CP850, ASCII
deu German UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP850, ASCII
deu German
(Common DEU transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII
eng English UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, Windows-1252, MacRoman, CP850, ASCII
est Estonian UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-4, Windows-1257, MacCentralEuropean, CP775, ASCII
ell Greek UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-7, Windows-1253, MacGreek, CP737
ell Greek
(ISO 843 transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ASCII
ell Greek
(DIN 31634 transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ASCII
ell Greek
(Greeklish transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII
fin Finish UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP850, ASCII
fra French UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP850, ASCII
gle Irish (Gaelic) UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, Windows-1252, MacRoman, CP850, ASCII
hun Hungarian UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, ISO-8859-16, Windows-1250, CP852, MacCentralEuropean
ita Italian UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-16, Windows-1252, MacRoman, CP850, ASCII
lav Latvian UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-4, Windows-1257, MacCentralEuropean, CP775, ASCII
lit Lithuanian UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-4, Windows-1257, MacCentralEuropean, CP775, ASCII
mlt Maltese UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-3
nld Dutch UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP850, ASCII
nno Nynorsk (Norwegian) UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, Windows-1252, MacRoman, CP850
nob Bokmål (Norwegian) UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, Windows-1252, MacRoman, CP850
pol Polish UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, ISO-8859-16, Windows-1250, MacCentralEuropean, CP852
pol Polish
(Common POL transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII
por Portuguese UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP850, ASCII
ron Romanian UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, Windows-1250, MacRomanian, CP852
ron Romanian
(Common RON transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII
rus Russian UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-5, Windows-1251, MacCyrillic, CP855, CP866, KOI8-R
rus Russian
(DIN 1460 transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8
rus Russian
(ISO 9 transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8
slk Slovak UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, Windows-1250, MacCentralEuropean, CP852
slk Slovak
(Common SLK transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII
slv Slovenian UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-2, ISO-8859-16, Windows-1250, MacCentralEuropean, CP852, ASCII
slv Slovenian
(Common SLV transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ASCII
spa Spanish UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, ISO-8859-15, Windows-1252, MacRoman, CP850, ASCII
swe Swedish UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, ISO-8859-1, Windows-1252, MacRoman, CP850, ASCII
ukr Ukrainian UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8, Windows-1251, MacUkrainian, KOI8-U
ukr Ukrainian
(DIN 1460 transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8
ukr Ukrainian
(ISO 9 transliteration)
UTF-32BE, UTF-32LE, UTF-16BE, UTF-16LE, UTF-8