Modifying IDF

2010-01-28 Thread Franz Allan Valencia See
misunderstanding lucene's tf/idf? :-) Thanks, -- Franz Allan Valencia See | Java Software Engineer franz@gmail.com LinkedIn: http://www.linkedin.com/in/franzsee Twitter: http://www.twitter.com/franz_see

Re: Modifying IDF

2010-01-29 Thread Franz Allan Valencia See
How should I go about identifying the domain? Thanks, -- Franz Allan Valencia See | Java Software Engineer franz@gmail.com LinkedIn: http://www.linkedin.com/in/franzsee Twitter: http://www.twitter.com/franz_see On Fri, Jan 29, 2010 at 6:42 PM, Ian Lea wrote: > Instead of playing aro

Re: Modifying IDF

2010-02-01 Thread Franz Allan Valencia See
Hmm My Analyzer is a Dictionary-based Analyzer. And so, it only recognizes tokens in its dictionary. Adding every url (or domain) is not a viable solution. So how could I include that to my analyzer? Lucene Filter? FilterReader? Thanks, -- Franz Allan Valencia See | Java Software Engineer