do i need a key if not going to query by key or update the document

2016-09-12 Thread Cam Bazz
Hello, Do I need to add a key, if I will not be a. updating the document b. will not fetch the document by key? What could be the possible downside of not using a key that uniquely identifies the document? I am building a log processor, and all I will do is sort and iterate. Best regards, C.

Re: do i need a key if not going to query by key or update the document

2016-09-12 Thread Michael McCandless
No, you don't need a key for append-only usage. Lucene itself doesn't care if you have a unique key per document, and in fact it's very costly, especially if your documents are otherwise tiny, to add one if you don't really need it. Mike McCandless http://blog.mikemccandless.com On Mon, Sep 12,

Re: do i need a key if not going to query by key or update the document

2016-09-12 Thread Cam Bazz
Thank you for the quick response. But I still can delete that document - based on docId right? I understand docId may change from time to time, but if I iterate a bunch of docs and then fetch the docId from the doc I want to delete and then delete that doc will it still work? Best regards, C. On

Is it possible to search for a paragraph in Lucene?

2016-09-12 Thread szzoli
Hi All, Is it possible to search for a paragraph in Lucene? Thx Zoli -- View this message in context: http://lucene.472066.n3.nabble.com/Is-it-possible-to-search-for-a-paragraph-in-Lucene-tp4295705.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. --

Re: Is it possible to search for a paragraph in Lucene?

2016-09-12 Thread Ahmet Arslan
Hi, If you have some tool/mechanism to detect paragraph boundaries, yes it is possible to search for a paragraph. But Lucene it self cannot detect sentence/paragraph for you. There are other libraries for this. Ahmet On Monday, September 12, 2016 1:06 PM, szzoli wrote: Hi All, Is it possibl

Re: Is it possible to search for a paragraph in Lucene?

2016-09-12 Thread szzoli
Hi, thanks for the hint. My question exatly is: Can I use a paragraph of a document to use as a term to search in the index? Does Lucene create an inde only on word level, or can it be set to index on phrase, or paragraph level? Is it the question of indexing or of searching to search for severa

Re: Is it possible to search for a paragraph in Lucene?

2016-09-12 Thread Erick Erickson
First, _you_ define a "paragraph". It's one of those tricky concepts that's totally obvious to a human but is surprisingly hard to implement in code. What's a paragraph in Chinese? Hebrew? Even in English it's tricky.. How does a PDF signal a paragraph? Is that consistent with Word? Open Office? Ho