Re: Can we configure analyzers to not exclude specific characters

2015-01-29 Thread Michael Sokolov
ts = new UAX29URLEmailTokenizer(Version.LUCENE_48,myreader); Regards, Shiv -Original Message- From: "Michael Sokolov" Sent: ‎29-‎01-‎2015 01:32 AM To: "java-user@lucene.apache.org" Subject: Re: Can we configure analyzers to not exclu

RE: Can we configure analyzers to not exclude specific characters

2015-01-29 Thread Shivashankar Maddanimath
quot;Michael Sokolov" Sent: ‎29-‎01-‎2015 01:32 AM To: "java-user@lucene.apache.org" Subject: Re: Can we configure analyzers to not exclude specific characters It's a bit of a hack, but we do this: On 1/28/2015 2:00 AM, Shivashankar Maddanimath wrote:

Re: Can we configure analyzers to not exclude specific characters

2015-01-28 Thread Michael Sokolov
It's a bit of a hack, but we do this: pattern="([A-Za-z])\+\+" replacement="$1plusplus" /> pattern="([A-Za-z])\#" replacement="$1sharp" /> On 1/28/2015 2:00 AM, Shivashankar Maddanimath wrote: Hi, I am using Lucene standard and uax29urlemailtokenizer. These analysers are exc

Can we configure analyzers to not exclude specific characters

2015-01-27 Thread Shivashankar Maddanimath
Hi, I am using Lucene standard and uax29urlemailtokenizer. These analysers are excluding some characters like "+" ( I can't search C++). Is there any way we can configure analyzers to include specific characters in analyzers while tokenising? Regards, Shiv -Original Message- From: "