lidc - A Language Identifier (Preview)

lidc is a command line application for Unix-like operating systems (Linux, Solaris, FreeBSD) that allows you to identify the language and character encoding of an input. Based on the lid library, it provides accurate identification results and high performance. However, lidc implements a significant amount of new features on top of those provided by lid, namely the parsing of common input formats. These include:

lidc is currently under development and will be released in a few month. The complete set of supported formats will be published in lidc's software specification as soon as it will have been released.

Watch the following screencast to get a preview of the current state of development and find out more about the usage and capabilities of lidc.

The requested screencast cannot be displayed by your browser:

  • either JavaScript is disabled
  • or you do not have a Flash player installed - download.

In case you may download this screencast to disk directly: /screencasts/lidc-a-language-identifier-preview.swf

Posted 2009-09-25 16:43   by Alex Linke   Link: Permalink
Tags: lid  lidc  HTML  email  screencast  language-identifier  language  charset  software