Re: Incorrect tokenizing in the UAX29URLEmailAnalyzer analyzer?

2014-07-23 Thread Steve Rowe
On Jul 23, 2014, at 7:43 PM, Milind wrote: >>> input=esl2.gbr >>> output=[esl2.gb][r] >>> >>> This is a bug, which was fixed in Lucene 4.7 - see < > https://issues.apache.org/jira/browse/LUCENE-5391> > > BTW, I changed the POM dependency to 4.7.1, but I'm still seeing the same > output. I

Re: Incorrect tokenizing in the UAX29URLEmailAnalyzer analyzer?

2014-07-23 Thread Milind
>>input=esl2.gbr >>output=[esl2.gb][r] > > >> This is a bug, which was fixed in Lucene 4.7 - see < https://issues.apache.org/jira/browse/LUCENE-5391> BTW, I changed the POM dependency to 4.7.1, but I'm still seeing the same output. I can't go beyond 4.7 since it seems 4.8 onwards, Lucene

Re: Incorrect tokenizing in the UAX29URLEmailAnalyzer analyzer?

2014-07-23 Thread Milind
Brilliant. Thanks! On Wed, Jul 23, 2014 at 6:12 PM, Steve Rowe wrote: > See PerFieldAnalyzerWrapper, which is itself an Analyzer: < > http://lucene.apache.org/core/4_4_0/analyzers-common/org/apache/lucene/analysis/miscellaneous/PerFieldAnalyzerWrapper.html > > > > Steve > > On Jul 23, 2014, at

Re: Incorrect tokenizing in the UAX29URLEmailAnalyzer analyzer?

2014-07-23 Thread Steve Rowe
See PerFieldAnalyzerWrapper, which is itself an Analyzer: Steve On Jul 23, 2014, at 6:00 PM, Milind wrote: > Thanks Steve, that helped. I had forgotten about the URL pa

Re: Incorrect tokenizing in the UAX29URLEmailAnalyzer analyzer?

2014-07-23 Thread Milind
Thanks Steve, that helped. I had forgotten about the URL part of the Analyzer since I was using it for the email field. I need to see if it's possible to use different analyzers for different fields. If so, then I'll use the UAX29URLEmailAnalyzer only for the email field and use StandardAnalyzer

Re: Seeking Additional Moderator Volunteers for java-user@lucene

2014-07-23 Thread Yalamarthi, Vineel
Can I be volunteer too vineelyalamar...@gmail.com On 7/23/14, 2:49 PM, "Chris Hostetter" wrote: > >Thanks folks, plenty of new volunteers > >https://issues.apache.org/jira/browse/INFRA-8082 > > > >-Hoss >http://www.lucidworks.com/ > >-

Re: Seeking Additional Moderator Volunteers for java-user@lucene

2014-07-23 Thread Chris Hostetter
Thanks folks, plenty of new volunteers https://issues.apache.org/jira/browse/INFRA-8082 -Hoss http://www.lucidworks.com/ - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-

Re: Seeking Additional Moderator Volunteers for java-user@lucene

2014-07-23 Thread Arvind Gupta
I can help arvind.gupt...@gmail.com On Wed, Jul 23, 2014 at 11:02 PM, Alejandro Narancio wrote: > I can help: ale.naran...@gmail.com > > > On Wed, Jul 23, 2014 at 2:17 PM, Steve Rowe wrote: > > > Sign me up: sar...@gmail.com > > > > Steve > > > > On Jul 23, 2014, at 1:02 PM, Chris Hostetter >

Re: Incorrect tokenizing in the UAX29URLEmailAnalyzer analyzer?

2014-07-23 Thread Steve Rowe
Hi Milind, On Jul 23, 2014, at 1:49 PM, Milind wrote: > The UAX29URLEmailAnalyzer analyzer in Lucene 4.4 is not working as I > expected. Is this a bug in the analyzer or is this working as designed? > > If I use the UAX29URLEmailAnalyzer, it tokenizes the following strings as >input=bwl-es

Incorrect tokenizing in the UAX29URLEmailAnalyzer analyzer?

2014-07-23 Thread Milind
The UAX29URLEmailAnalyzer analyzer in Lucene 4.4 is not working as I expected. Is this a bug in the analyzer or is this working as designed? If I use the UAX29URLEmailAnalyzer, it tokenizes the following strings as input=bwl-esl2.gbr.hp.com output=[bwl-esl2.gbr.hp.com] input=esl2.gbr

Re: Seeking Additional Moderator Volunteers for java-user@lucene

2014-07-23 Thread Alejandro Narancio
I can help: ale.naran...@gmail.com On Wed, Jul 23, 2014 at 2:17 PM, Steve Rowe wrote: > Sign me up: sar...@gmail.com > > Steve > > On Jul 23, 2014, at 1:02 PM, Chris Hostetter > wrote: > > > > > We're doing some housekeeping of the moderators of this list, and > looking for any new folks that

Re: Seeking Additional Moderator Volunteers for java-user@lucene

2014-07-23 Thread Steve Rowe
Sign me up: sar...@gmail.com Steve On Jul 23, 2014, at 1:02 PM, Chris Hostetter wrote: > > We're doing some housekeeping of the moderators of this list, and looking for > any new folks that would like to volunteer. (we currently have 3 active > moderators, 1-2 additional mods would be helpfu

Seeking Additional Moderator Volunteers for java-user@lucene

2014-07-23 Thread Chris Hostetter
We're doing some housekeeping of the moderators of this list, and looking for any new folks that would like to volunteer. (we currently have 3 active moderators, 1-2 additional mods would be helpful for good coverage) If you'd like to volunteer to be a moderator, please reply back to this th

IndexReader from SearchManager.

2014-07-23 Thread Umashanker, Srividhya
Which is the efficient way to create faster searches? 1. Single large index (or) 2. Multiple small indexes (opened with SearchManager, multireaders) When creating the IndexSearcher(multireader), how do we quickly get the list of IndexReaders. Should we use DirectoryReader.open(wr

Re: Invalid fieldsStream maxPointer (file truncated?): maxPointer=2966205946, length=2966208512

2014-07-23 Thread Robert Muir
Hey, thank you for following up! On Wed, Jul 23, 2014 at 8:46 AM, Harald Kirsch wrote: > On 23.07.2014 13:38, Robert Muir wrote: >> >> On Wed, Jul 23, 2014 at 7:29 AM, Harald Kirsch >> wrote: >>> >>> >>> >>> (As a side note: after truncating the file to the expected size+16, at >>> least >>> the

Re: Invalid fieldsStream maxPointer (file truncated?): maxPointer=2966205946, length=2966208512

2014-07-23 Thread Harald Kirsch
On 23.07.2014 13:38, Robert Muir wrote: On Wed, Jul 23, 2014 at 7:29 AM, Harald Kirsch wrote: (As a side note: after truncating the file to the expected size+16, at least the core starts up again. Have not tested anything else yet.) After applying your truncation-fix, Is it possible for you

Re: Invalid fieldsStream maxPointer (file truncated?): maxPointer=2966205946, length=2966208512

2014-07-23 Thread Robert Muir
On Wed, Jul 23, 2014 at 7:29 AM, Harald Kirsch wrote: > File system is xfs hosted on a corporate file share somewhere. > Sorry, i forgot to ask: how do you access this? is it mounted over nfs? - To unsubscribe, e-mail: java-use

Re: Invalid fieldsStream maxPointer (file truncated?): maxPointer=2966205946, length=2966208512

2014-07-23 Thread Robert Muir
On Wed, Jul 23, 2014 at 7:29 AM, Harald Kirsch wrote: > > > (As a side note: after truncating the file to the expected size+16, at least > the core starts up again. Have not tested anything else yet.) After applying your truncation-fix, Is it possible for you to run the checkindex tool (and show

Re: Invalid fieldsStream maxPointer (file truncated?): maxPointer=2966205946, length=2966208512

2014-07-23 Thread Harald Kirsch
Hello Robert, thanks for showing interest in this case. Find my answer below. On 23.07.2014 12:58, Robert Muir wrote: On Wed, Jul 23, 2014 at 6:03 AM, Harald Kirsch wrote: Hi, below is an exception I get from one Solr core. According to https://issues.apache.org/jira/browse/LUCENE-5617 the c

Re: Invalid fieldsStream maxPointer (file truncated?): maxPointer=2966205946, length=2966208512

2014-07-23 Thread Robert Muir
On Wed, Jul 23, 2014 at 6:03 AM, Harald Kirsch wrote: > Hi, > > below is an exception I get from one Solr core. According to > https://issues.apache.org/jira/browse/LUCENE-5617 the check that leads to > the exception was introduced recently. > > Two things are worth mentioning: > > a) contrary to

Invalid fieldsStream maxPointer (file truncated?): maxPointer=2966205946, length=2966208512

2014-07-23 Thread Harald Kirsch
Hi, below is an exception I get from one Solr core. According to https://issues.apache.org/jira/browse/LUCENE-5617 the check that leads to the exception was introduced recently. Two things are worth mentioning: a) contrary to the expectation expressed in the message (file truncated?), the a