Package: wnpp Severity: wishlist Owner: Joost van Baal <joostvb-debian-b...@mdcc.cx>
* Package name : frog Version : 0.1 Upstream Author : ILK Research Group, Tilburg University, http://ilk.uvt.nl * URL : http://ilk.uvt.nl/tadpole * License : GPL-3 Programming Lang: C++, Python Description : tagger and parser for Dutch language Memory-Based Learning (MBL) is a machine-learning method applicable to a wide range of tasks in Natural Language Processing (NLP). . Frog is a modular system integrating a morphosyntactic tagger, lemmatizer, morphological analyzer, and dependency parser for the Dutch language. It is based upon it's predecessor TADPOLE (TAgger, Dependency Parser, and mOrphoLogical analyzEr). Using Memory-Based Learning techniques, Tadpole tokenizes, tags, lemmatizes, and morphologically segments word tokens in incoming Dutch UTF-8 text files, and assigns a dependency graph to each sentence. Tadpole is particularly targeted at the increasing need for fast, automatic NLP systems applicable to very large (multi-million to billion word) document collections that are becoming available due to the progressive digitization of both new and old textual data. . Frog is a product of the ILK (Induction of Linguistic Knowledge) research group of the Tilburg University and the CNTS research group of the University of Antwerp. . If you do scientific research in NLP, Frog will likely be of use to you. -------------- Frog depends upon the TiMBL, TimblServer, and Mbt software packages. These are available from deb http://apt.ticc.uvt.nl lenny main deb-src http://apt.ticc.uvt.nl lenny main . ITP's for these packages are planned. Frog has not yet been publically released (Tadpole has, see http://ilk.uvt.nl/downloads/pub/software/tadpole-0.6.tar.gz .) Bye, Joost
signature.asc
Description: Digital signature