libtextcat has been added to the cygwin distribution.
It classifies text into languages (to a limited amount locale also), to
be used as C or C++ library.
Additionally to the upstream release (and from suse, ubuntu, altlinux)
there's a user binary /usr/bin/textcat to classify text from
the command line.
$ cat /usr/share/doc/Cygwin/CYGWIN_LICENSE | textcat
[english]
The default for the config file is changed to /etc/libtextcat.conf
I attached the README for full info.
If you have questions or comments, please send them to
the Cygwin mailing list at: cygwin@cygwin.com .
I'll answer only there and I don't answer private mails.
*** CYGWIN-ANNOUNCE UNSUBSCRIBE INFO ***
If you want to unsubscribe from the cygwin-announce
mailing list, look at the "List-Unsubscribe: " tag in
the email header of this message. Send email to the
address specified there. It will be in the format:
[EMAIL PROTECTED]
If you need more information on unsubscribing, start
reading here:
http://sources.redhat.com/lists.html#unsubscribe-simple
Please read *all* of the information on unsubscribing
that is available starting at this URL.
libtextcat
-------------------------------
Text Classification Library (language guessing)
Runtime requirements: (versions given or later)
cygwin-1.5.x
libtextcat0
Build requirements: (versions given or later)
cygwin-1.5.x
zlib-1.2.x
gcc-core
gcc-g++
libtool
binutils
make
bash
fileutils
sed
Canonical homepage:
http://software.wise-guys.nl/libtextcat/
Canonical download:
http://software.wise-guys.nl/download/libtextcat-<VER>.tar.gz
-------------------------------------------------------------------------------
Build instructions:
If you use setup to install this src package, it will be
unpacked under /usr/src automatically.
cd /usr/src
tar xfvj libtextcat-<VER>-src.tar.bz2
cygport libtextcat-<VER>-<CYGREL>.cygport almostall
This will create:
/usr/src/libtextcat-<VER>-<CYGREL>.tar.bz2
/usr/src/libtextcat-<VER>-<CYGREL>-src.tar.bz2
Or use 'cygport libtextcat-<VER>-<CYGREL>.cygport prep' to get a patched source
directory
-------------------------------------------------------------------------------
Files included in the libtextcat package:
etc/libtextcat.conf
usr/bin/textcat.exe
usr/share/doc/Cygwin/libtextcat-2.2.README
usr/share/doc/libtextcat-2.2/ChangeLog
usr/share/doc/libtextcat-2.2/LICENSE
usr/share/doc/libtextcat-2.2/README
usr/share/doc/libtextcat-2.2/TODO
usr/share/libtextcat/*.lm
Files included in the libtextcat0 package:
usr/bin/cygtextcat-0.dll
Files included in the libtextcat-devel package:
usr/bin/createfp.exe
usr/include/textcat.h
usr/lib/libtextcat.a
usr/lib/libtextcat.dll.a
usr/lib/libtextcat.la
usr/share/libtextcat/ShortTexts/*.txt
-------------------------------------------------------------------------------
Port Notes:
----- version 2.2-2 -----
new binary textcat, derived from texttextcat
rearrange layout:
etc/libtextcat.conf
usr/bin/textcat.exe
usr/share/libtextcat/*.lm
----- version 2.2-1 -----
Initial cygwin release.
Various c++ fixes: remove extern, warp around extern "C" { }, proper c++ casts
Install textcat.h
Install langclass: LM into langclass and ShortTexts as subdir
Cygwin port maintained by: Reini Urban <[EMAIL PROTECTED]>
Cygwin port questions to the mailinglist <cygwin@cygwin.com>
--
Unsubscribe info: http://cygwin.com/ml/#unsubscribe-simple
Problem reports: http://cygwin.com/problems.html
Documentation: http://cygwin.com/docs.html
FAQ: http://cygwin.com/faq/