ReaderManager, more drama with things not being closed before closing the Directory

2016-10-19 Thread Trejkaz
Hi all. I seem to have a situation where ReaderManager is reducing a refCount to 0 before it actually releases all its references. It's difficult because it's all mixed up in our framework for multiple ReaderManagers, which I'm still not convinced works because the concurrency is impossible to fi

ApacheCon is now less than a month away!

2016-10-19 Thread Rich Bowen
Dear Apache Enthusiast, ApacheCon Sevilla is now less than a month out, and we need your help getting the word out. Please tell your colleagues, your friends, and members of related technical communities, about this event. Rates go up November 3rd, so register today! ApacheCon, and Apache Big Dat

Re: How to add ASCIIFoldingFilter in ClassicAnalyzer

2016-10-19 Thread Adrien Grand
You would need to override the wrapComponents method in order to wrap the tokenstream. See for instance Lucene's LimitTokenCountAnalyzer. Le mar. 18 oct. 2016 à 18:46, Kumaran Ramasubramanian a écrit : > Hi Adrien > > How to do this? Any Pointers? > > ​ > > If it is fine to add the ascii folding

Re: POS tagging in Lucene

2016-10-19 Thread Tommaso Teofili
I think it might be helpful to handle POS tags as TypeAttributes so that the input and output texts would cleaner and you can still filter and retrieve tokens by type (e.g. with TypeTokenFilter). My 2 cents, Tommaso Il giorno mer 19 ott 2016 alle ore 11:56 Niki Pavlopoulou ha scritto: > Hi Ste

Re: POS tagging in Lucene

2016-10-19 Thread Niki Pavlopoulou
Hi Steve, thank you for your answer. I created a custom Lucene Analyser in the end. Just to clarify on what I mean, Lucene works perfectly for pure words, but since it does not support POS tagging some workaround needs to be done for the analysis of tokens with POS tags. For example: Input withou