Injecting custom synonyms in Lucene

2013-07-12 Thread gtkesh
newly created SynonymMap in the SynonymFilter. What am I doing wrong?If you’d like to have a look at the code, here it is: CustomAnalyzer.java <https://gist.github.com/gtkesh/5986851> I'd appreciate your suggestions. Thanks in advance! -- View this message in context: ht

Adding BM25 in Lucene

2013-07-11 Thread gtkesh
Hi everyone! I have two questions: 1. What are the cases where Lucene's default tf-idf overperforms BM25? What are the best use cases where I should use tf-idf or BM25? 2. Are there any user-friendly guide or something about how can I use BM25 algorithm instead of Lucene's default tf-idf? I trie

Help with document design for indexing/searching

2013-07-03 Thread gtkesh
Hi everyone! This is my first post here and I'm new to Lucene, so I would appreciate your ideas with the design of lucene document I came up with. *What is my goal* I'm trying to index the collection of xml documents and all have the same structure like this: Each tag can itself have tag which