RE: Lucene-analyzer 3.3.0 and Lucene snowball 3.0.1

2012-12-17 Thread Adrien RUFFIE
Hello Steve, Sorry for the 1st point. No it is an old version of solr 3.3.0 and Maven/Ivy/Gradle not used, the library has been added by hands by hands By another developer. Do you have an idea if I can remove snowball without worries ? Bien cordialement, Adrien Ruffié LD : +33 1 73 03 29 50

Re: how to get term docs in lucene 4.0

2012-12-17 Thread Hu Jing
thanks. 2012/12/18 Steve Rowe > Check out MIGRATE.txt: > > http://lucene.apache.org/core/4_0_0/MIGRATE.html > > Steve > > On Dec 17, 2012, at 7:18 PM, Hu Jing wrote: > > > in lucene 3.0,i can get term doc by using indexreader termdocs. how to > > implement this in lucene 4.0 > > > ---

Looking for case studies for 'Lucene and Solr: The Definitive Guide' from O'Reilly

2012-12-17 Thread Jason Rutherglen
This is a great chance to get your Lucene/Solr project included in 'Lucene and Solr: The Definitive Guide' from O'Reilly. Your case study should be between 2 - 15 pages in length and may include code from any programming language, diagrams, schemas, etc. Topics of interest for the case studies c

Re: how to get term docs in lucene 4.0

2012-12-17 Thread Steve Rowe
Check out MIGRATE.txt: http://lucene.apache.org/core/4_0_0/MIGRATE.html Steve On Dec 17, 2012, at 7:18 PM, Hu Jing wrote: > in lucene 3.0,i can get term doc by using indexreader termdocs. how to > implement this in lucene 4.0

Re: Russiam stemmer?

2012-12-17 Thread dokondr
I have found it: .../lucene-4.0.0/analysis/common/src/java/org/apache/lucene/analysis/ru Sorry for confusion :( On Tue, Dec 18, 2012 at 3:58 AM, dokondr wrote: > Thanks Steve! > It is weird, but I cant > find org.apache.lucene.analysis.ru.RussianLightStemmer ! > I have just downloaded Lucene 4.

Re: Russiam stemmer?

2012-12-17 Thread dokondr
Thanks Steve! It is weird, but I cant find org.apache.lucene.analysis.ru.RussianLightStemmer ! I have just downloaded Lucene 4.0.0 sources and 'analysis' directory does not have 'ru'. All it has: analysis-module-build.xml common kuromojiphoneticsmartcn uima build.xml ic

Re: Russiam stemmer?

2012-12-17 Thread Steve Rowe
Hi Dima, Javadocs for Lucene 4.0.0 are here, organized by module: There are two separate Russian stemmers in the analyzers-common module: 1. org.apache.lucene.analysis.ru.RussianLightStemmer, used by RussianLightStemFilter; and 2. The Russian Snowball ste

Re: java-user-subscribe

2012-12-17 Thread Devon H. O'Dell
2012/12/17 dokondr : > java-user-subscribe Sorry, I let this message through forgetting that the allow / accept addresses just send the message and don't actually subscribe the user. If you would like to subscribe to the list, please send an email to java-user-subscr...@lucene.apache.org. --dho

Re: Lucene-analyzer 3.3.0 and Lucene snowball 3.0.1

2012-12-17 Thread Steve Rowe
Hi Adrien, Three comments and a question: 1. From : When starting a new discussion on a mailing list, please do not reply to an existing message, instead start a fresh email. Even if you change the subject line of your email, other mai

Lucene-analyzer 3.3.0 and Lucene snowball 3.0.1

2012-12-17 Thread Adrien RUFFIE
Hello all, I beginning with an application and nobody knows with Lucene-analyzer 3.3.0.jar and Lucene snowball 3.0.1.jar are both included Its do same thing ? I how can I be sure that excluding Lucene-snowball jar in my application, it is not used (the idea that it is Lucene-analyzer is used in

Lucene 4.0.0 - find offsets for phrase queries

2012-12-17 Thread Vitaly_Artemov
Hi all, I use Lucene 4.0. I try to find offsets for phrase queries. My code works then I search for one word but then I call it for some phrase I didn't get offsets. termsEnum.seekExact returns false for phrase queries. reader = DirectoryReader.open( mIndexDir ); IndexSearcher sear

Re: Boolean and SpanQuery: different results

2012-12-17 Thread Carsten Schnober
Am 17.12.2012 11:54, schrieb Carsten Schnober: > Might this have to do with the docbase? I collect the document IDs from > the BooleanQuery through a Collector, adding the actual ID to the > current AtomicReaderContext.docbase. In the corresponding SpanQuery, I > pass these document IDs as a DocId

TermVectors and Attributes in Lucene 4.0

2012-12-17 Thread Piotr Pęzik
Hi, I've been trying to enumerate over all terms in all documents in a Lucene 4.0 index in order to retrieve their attributes (payloads, positions etc.). I have an index with documents containing stored, tokenized fields with term vectors, offsets and payloads. Below is what I have tried s

Re: Boolean and SpanQuery: different results

2012-12-17 Thread Carsten Schnober
Am 13.12.2012 18:00, schrieb Jack Krupansky: > Can you provide some examples of terms that don't work and the index > token stream they fail on? > > Make sure that the Analyzer you are using doesn't do any magic on the > indexed terms - your query term is unanalyzed. Maybe multiple, but > distinct