Re: [SAtalk] RFC: ok_languages patch

2002-05-04 Thread Olivier Nicole
Besides the interest in selecting languages, I see another general use for that piece of code: applying rules depending on the language. Why try to find "click below" if the message is detected to be in French? That could lead to building rules with language variants, one single CLICKBELOW r
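To make the idea of per-language rule variants concrete, here is a minimal sketch. It assumes a language code has already been guessed for the message; the rule table layout, the `rule_hits` helper, and the French pattern are all hypothetical illustrations, not part of the patch under discussion.

```python
import re

# Hypothetical rule table: one rule name, one pattern per language.
# Neither the layout nor the patterns come from SpamAssassin itself.
CLICKBELOW = {
    "en": re.compile(r"\bclick\s+below\b", re.IGNORECASE),
    "fr": re.compile(r"\bcliquez\s+ci-dessous\b", re.IGNORECASE),
}


def rule_hits(rule, body, lang):
    """Apply only the variant of `rule` that matches the detected language."""
    pattern = rule.get(lang)
    if pattern is None:
        # No variant for this language: skip the rule rather than
        # running an English pattern against non-English text.
        return False
    return pattern.search(body) is not None
```

With this shape, a message detected as French is never tested against the English "click below" pattern, which is the saving described above.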

Re: [SAtalk] RFC: ok_languages patch

2002-05-03 Thread Craig R Hughes
Daniel Quinlan wrote:
DQ> There were a bunch of test files distributed with TextCat.
We can probably paste some of them into an email body then I guess.
DQ> Having the GA score this would be nice. My last rule was almost a
DQ> "fiver" after the GA got done with it. :-)
It will be interesting

Re: [SAtalk] RFC: ok_languages patch

2002-05-03 Thread Daniel Quinlan
Craig R Hughes writes:
> thanks, great work. It's getting late now, and I have a big
> breakfast meeting early tomorrow, so I'll take a look at this
> sometime after noon. Is it kosher to roll this with the
> language-detection stuff and all into the SA distribution then?
> Sounds like you've g

Re: [SAtalk] RFC: ok_languages patch

2002-05-03 Thread Craig R Hughes
Daniel, thanks, great work. It's getting late now, and I have a big breakfast meeting early tomorrow, so I'll take a look at this sometime after noon. Is it kosher to roll this with the language-detection stuff and all into the SA distribution then? Sounds like you've got the upstream author's

[SAtalk] RFC: ok_languages patch

2002-05-02 Thread Daniel Quinlan
I'm basically finished adapting TextCat, an open source language guesser, for use in SA. Thanks to the upstream author, it is now licensed under the same terms as Perl. At this point, I'm looking for testing help and comments.
- 76 different languages are currently recognized.
- The level o
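For readers unfamiliar with how a TextCat-style guesser works, the following is a rough sketch of the rank-order n-gram comparison that this family of tools is based on. It is not the code from the patch: the two tiny training samples, the function names, and the profile size of 300 are illustrative assumptions, and a real guesser uses much larger per-language profiles.

```python
from collections import Counter


def ngram_profile(text, max_n=3, top=300):
    """Return character n-grams (length 1..max_n), most frequent first."""
    counts = Counter()
    for word in text.lower().split():
        padded = f"_{word}_"  # mark word boundaries
        for n in range(1, max_n + 1):
            for i in range(len(padded) - n + 1):
                counts[padded[i:i + n]] += 1
    # Sort by descending count, then alphabetically so ties are stable.
    ranked = sorted(counts, key=lambda g: (-counts[g], g))
    return ranked[:top]


def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences; n-grams missing from the language profile
    cost a fixed maximum penalty."""
    rank = {g: i for i, g in enumerate(lang_profile)}
    penalty = len(lang_profile)
    return sum(
        abs(i - rank[g]) if g in rank else penalty
        for i, g in enumerate(doc_profile)
    )


# Toy training data (assumption): real profiles come from large corpora.
SAMPLES = {
    "en": "the quick brown fox jumps over the lazy dog and then it is "
          "time to click below for the offer",
    "fr": "le renard brun saute par dessus le chien paresseux et puis "
          "il est temps de cliquer ci-dessous pour une offre",
}
PROFILES = {lang: ngram_profile(text) for lang, text in SAMPLES.items()}


def guess_language(text):
    """Pick the language whose profile is closest to the text's profile."""
    doc = ngram_profile(text)
    return min(PROFILES, key=lambda lang: out_of_place(doc, PROFILES[lang]))
```

With profiles this small, guesses on unseen text are unreliable; the sketch is only meant to show the shape of the comparison.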