Miles,
I understand you are trying to solve your problem by changing the index
contents (removing documents). Would it be possible to workaround it and
to achieve this during search, by only returning the relevant documents
and ignore the rests?
Just my 2 cents...
Tom
Miles Efron wrote:
> This question surely shows how new I am to Lucene... but I'm
interested in removing terms from a lucene index. In particular, I'd
like to be able to delete all terms that appear in fewer than x
documents (say x=3). This is in efforts to reduce the feature set for
some research I'm doing.
>
> I found a post to this effect on the list from a while back:
> http://www.gossamer-threads.com/lists/lucene/java-user/9538#9538
> but I couldn't find any responses to it.
>
> The only thing I can think of is to re-index the collection, using
the undesired words as a sort of stoplist. But surely there's a better
way to do it (the inverted index structure seems like this should be
natural). Any pointers would be most helpful.
>
> Thanks,
> -Miles
>
> Andrzej Bialecki wrote:
>
>> Huinan wrote:
>>
>>> Thanks, Ronnie. But why it works in some cases (when there is a
small number
>>> of documents inside the index) ?
>>
>>
>> The Hits class retrieves the first 50 results, and caches them.
>>
>
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]