"read past EOF" when merge

2012-10-30 Thread superruiye
The exception "read past EOF" Bothering me a long time, trace at below. Exception in thread "Lucene Merge Thread #7" org.apache.lucene.index.MergePolicy$MergeException: java.io.IOException: read past EOF at org.apache.lucene.index.ConcurrentMergeScheduler.handleMergeException(ConcurrentM

Re: WFST/Analyzing Suggesters: foreign keys, user-supplied filter, highlighting

2012-10-30 Thread Michael McCandless
On Tue, Oct 30, 2012 at 3:52 PM, Dawid Weiss wrote: >> https://issues.apache.org/jira/browse/LUCENE-4491 ? Could you simply >> stuff your ISBN onto the end of the suggestion (ie enroll Lucene in >> Action|1933988177)? > > Just remember that if your suffixes are unique then you'll be > expanding

Re: WFST/Analyzing Suggesters: foreign keys, user-supplied filter, highlighting

2012-10-30 Thread Dawid Weiss
> https://issues.apache.org/jira/browse/LUCENE-4491 ? Could you simply > stuff your ISBN onto the end of the suggestion (ie enroll Lucene in > Action|1933988177)? Just remember that if your suffixes are unique then you'll be expanding the automaton quite a bit (unique suffix paths). D.

RE: Norms and Term Vectors in Lucene 4.0

2012-10-30 Thread Scott Smith
Thanks Simon. Appears I had it mostly figured out correctly--except for the last question :-) Thanks for the suggestion on caching the fieldtype. Cheers Scott -Original Message- From: Simon Willnauer [mailto:simon.willna...@gmail.com] Sent: Tuesday, October 30, 2012 2:10 AM To: java-

Re: WFST/Analyzing Suggesters: foreign keys, user-supplied filter, highlighting

2012-10-30 Thread Michael McCandless
Ideas/comments below: On Tue, Oct 30, 2012 at 9:40 AM, Oliver Christ wrote: > I'm currently researching using a WFST suggester on e.g. book titles. > While our basic use cases are well covered, there seem to be at least > three which aren't: > > * The possibility to associate a "foreign

WFST/Analyzing Suggesters: foreign keys, user-supplied filter, highlighting

2012-10-30 Thread Oliver Christ
Hi, I'm currently researching using a WFST suggester on e.g. book titles. While our basic use cases are well covered, there seem to be at least three which aren't: * The possibility to associate a "foreign key" with a string (rather: final node) in the WFST (in addition to the rank

Re:Delete file index entries that do not exist physically

2012-10-30 Thread qibaoyuan
Maybe Luke(code.google.com/p/luke/) could help you out without coding. At 2012-10-30 19:41:57,"ViTi No" wrote: > >Hello, >I have a Lucene index created, and I would like to know how to delete the file >index entries that do not already exist on the computer. Is there any way from >Lu

Delete file index entries that do not exist physically

2012-10-30 Thread ViTi No
Hello, I have a Lucene index created, and I would like to know how to delete the file index entries that do not already exist on the computer. Is there any way from Lucene or have to go file by file opening a file, and by checking for java? the example is as follows: I have a directory which con

Re: SpanQuery, Filter, BooleanQuery

2012-10-30 Thread Carsten Schnober
Am 29.10.2012 13:40, schrieb Carsten Schnober: > Now, I'd like to add the option to filter the resulting Spans object by > another WildcardQuery on a different field that contains document > titles. My intuitive approach would have been to use a filter like this: I'd like to conclude my previous

Re: Term Positions added to one document forward

2012-10-30 Thread Ivan Vasilev
Thanks Simon! On 29.10.2012 г. 21:38, Simon Willnauer wrote: you should call currDocsAndPositions.nextPosition() before you call currDocsAndPositions.getPayload() payloads are per positions so you need to advance the pos first! simon On Mon, Oct 29, 2012 at 6:44 PM, Ivan Vasilev wrote: Hi G

Re: Norms and Term Vectors in Lucene 4.0

2012-10-30 Thread Simon Willnauer
hey scott, On Mon, Oct 29, 2012 at 11:56 PM, Scott Smith wrote: > Converting some code to lucene 4.0, it appears that we can no longer set > whether we want to store norms or termvectors using the "sugared" Field > classes (e.g., StringField() and TextField). I gather the defaults are to > st