Re: How to delete a token that comes exactly after a token

2014-02-26 Thread Bernhard Kraft
Hello, On 2014-02-26 19:37, Furkan KAMACI wrote: [...] If there is no such an implementation I can implement a patch for it? If you "really" want to implement this in Lucene/Java I guess you should have a look at existing token filters in: lucene/analysis/common/src/java/org/apache/lucene/

Re: How to delete a token that comes exactly after a token

2014-02-26 Thread Jack Krupansky
: java-user@lucene.apache.org Subject: Re: How to delete a token that comes exactly after a token Hi; I'm parsing a wiki dump file. There are some special definitions. In example: link:km so when I parse my text I have that tokens: "link" and "km". I want to remove "lin

Re: How to delete a token that comes exactly after a token

2014-02-26 Thread Furkan KAMACI
Hi; I'm parsing a wiki dump file. There are some special definitions. In example: link:km so when I parse my text I have that tokens: "link" and "km". I want to remove "link" and it is a stopword for my situation. However I want to remove "km" too if km is followed by token of "link". If there i

Re: How to delete a token that comes exactly after a token

2014-02-26 Thread Jack Krupansky
Sounds like a custom filter. Or maybe an option for stop filter or a specialization of stop filter. Or maybe it could be even more generalized. What are some practical example token sequences? -- Jack Krupansky -Original Message- From: Furkan KAMACI Sent: Wednesday, February 26, 201