News
2010-07-29: Lingua::Translit v0.18 supports GOST
Lingua::Translit v0.18 supports GOST 7.79:2000 transliterations for Russian, Ukrainian and Old Russian. These allow plain ASCII transliterations of Cyrillic letters.
(Go to product)
2010-07-08: lid v3.1.0 identifies Mandarin (Chinese)
lid v3.1.0 now supports Mandarin (Chinese) in both simplified and traditional variants. Besides the popular Unicode encodings, the frequently used charsets Big5 and GB2312 can be identified.
(Go to product)
2010-06-18: Lingua::Lid v0.02 released
Lingua::Lid is thread-safe if compiled against lid version 3.0.0 or above.
(Go to product)
2010-06-18: lid v3.0.0 thread-safe
lid v3.0.0 is thread-safe and reentrant and can therefore easily be deployed in modern, threaded applications. In addition, the language and character encoding identification algorithms have been optimized further, resulting in speed-ups of up to 100%.
The set of supported systems has been extended to cover not only Debian GNU/Linux, Solaris and FreeBSD, but also Windows (32/64-bit) and Ubuntu GNU/Linux 10.04 (LTS).
(Go to product)
2010-06-04: AutoUniConv v1.0.0 released
AutoUniConv is a C/C++ library that automatically converts text encoded in various charsets to Unicode.
It features robust and fast processing, support for a large set of input charsets, support for all common Unicode Transformation Formats and thread-safety. It is available for many Unix-like systems (Linux, Solaris, FreeBSD) as well as for Windows.
(Go to product)
2010-03-08: lidc v1.1.0 released
Thanks to a new version of the underlying library lid, lidc v1.1.0 introduces support for two more languages: lidc can now identify Russian and Ukrainian in a variety of character encodings.
(Go to product)
2010-03-01: lid v2.1.0 released
lid v2.1.0 covers two more languages, Russian and Ukrainian, and a set of related transliterations. Additionally, processing speed on large inputs has been improved and memory efficiency has been increased.
(Go to product)
2010-01-20: Lingua::Translit v0.17 released
Added a Ukrainian transliteration ("DIN 1460 UKR").
(Go to product)
2010-01-04: Lingua::Translit v0.16 released
Added another Russian transliteration ("DIN 1460 RUS") and a PDF developer manual.
(Go to product)
2009-12-15: lidc v1.0.1 released
lidc v1.0.1 detects the language and character encoding of textual input in a variety of formats (email, XML, HTML, ...). It is provided for Linux, Solaris and FreeBSD.
(Go to product)