Besides the interest in selected languages, I see another general
use for that piece of code: applying rules depending on the
language.
Why try to find "click below" if the message is detected to be in
French?
That could lead to building rules with language variants, one single
CLICKBELOW r
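
To make the idea of language variants concrete, here is a rough sketch in Python. It is only an illustration, not SpamAssassin's rule syntax or API: the table name, the function name, the language codes, and the French phrase are all assumptions made up for the example. A single rule name maps to one pattern per language, and only the pattern for the detected language is tried.

```python
import re

# Hypothetical table: one rule (CLICKBELOW), one pattern per language code.
# The French phrase is an assumed translation used only for illustration.
CLICKBELOW_VARIANTS = {
    "en": re.compile(r"\bclick\s+below\b", re.IGNORECASE),
    "fr": re.compile(r"\bcliquez\s+ci-dessous\b", re.IGNORECASE),
}


def clickbelow_hits(body, language):
    """Return True if the variant for the detected language matches the body.

    `language` is assumed to come from a separate language guesser.
    If no variant exists for that language, the rule does not fire.
    """
    pattern = CLICKBELOW_VARIANTS.get(language)
    if pattern is None:
        return False
    return pattern.search(body) is not None
```

With this layout, a message detected as French is never tested against the English phrase, which is the saving described above.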
Daniel Quinlan wrote:
DQ> There were a bunch of test files distributed with TextCat.
We can probably paste some of them into an email body then I guess.
DQ> Having the GA score this would be nice. My last rule was almost a
DQ> "fiver" after the GA got done with it. :-)
It will be interesting
Craig R Hughes writes:
> thanks, great work. It's getting late now, and I have a big
> breakfast meeting early tomorrow, so I'll take a look at this
> sometime after noon. Is it kosher to roll this with the
> language-detection stuff and all into the SA distribution then?
> Sounds like you've g
Daniel,
thanks, great work. It's getting late now, and I have a big breakfast meeting
early tomorrow, so I'll take a look at this sometime after noon. Is it kosher
to roll this with the language-detection stuff and all into the SA distribution
then? Sounds like you've got the upstream author's
I'm basically finished adapting TextCat, an open source language
guesser, for use in SA. Thanks to the upstream author, it is now
licensed under the same terms as Perl. At this point, I'm looking for
testing help and comments.
- 76 different languages are currently recognized.
- The level o