On Tue, Aug 28, 2007 at 05:05:24PM +0200, Giampaolo Tomassoni wrote: > > TextCat? > > Does it yield the probability for each language? Or it just yields a single > result? (i.e.: language id). > > We would need the probability of the text being in any given language. Or, > better, something close to an array of probabilities where the indexes are > the various languages for which the module can work.
To be honest, I haven't looked at that plugin in ages, so I don't remember exactly what it does. As I recall, it gives a list of possible languages, which means that internally it would have to know probabilities. -- Randomly Selected Tagline: "Jab, Jab, Oooh. O(n log n)! Ha! Tail recursion! Thrust! Parry! <BOING>" - Jim Flanagan
pgpE8zJyCk2Ur.pgp
Description: PGP signature