Re: Rendexing problem: Indexing folder size is keep on growing for same remote folder

2013-10-02 Thread gudiseashok
Thank you very much for your time sir, I follow your suggestion. -- View this message in context: http://lucene.472066.n3.nabble.com/Rendexing-problem-Indexing-folder-size-is-keep-on-growing-for-same-remote-folder-tp4092835p4093136.html Sent from the Lucene - Java Users mailing list archive at

Re: Rendexing problem: Indexing folder size is keep on growing for same remote folder

2013-10-02 Thread Ian Lea
Yes, as I suggested, you could search on your unique id and not index if already present. Or, as Uwe suggested, call updateDocument instead of add, again using the unique id. -- Ian. On Tue, Oct 1, 2013 at 6:41 PM, gudiseashok wrote: > I am really sorry if something made you confuse, as I sai

Re: Rendexing problem: Indexing folder size is keep on growing for same remote folder

2013-10-01 Thread gudiseashok
I am really sorry if something made you confuse, as I said I am indexing a folder which contains mylogs.log,mylogs1.log,mylogs2.log etc, I am not indexing them as a flat file. I have tokenized my each line of text with regex and storing them as fields like "messageType", "timeStamp","message". So

Re: Rendexing problem: Indexing folder size is keep on growing for same remote folder

2013-10-01 Thread Ian Lea
I'm still a bit confused about exactly what you're indexing, when, but if you have a unique id and don't want to add or update a doc that's already present, add the unique id to the index and search (TermQuery probably) for each one and skip if already present. Can't you change the log rotation/co

Re: Rendexing problem: Indexing folder size is keep on growing for same remote folder

2013-10-01 Thread gudiseashok
Hi Basically my log folder consists of four log files like abc.log,abc1.log,abc2.log,abc3.log, as my log appender is doing. Every 30 minutes content will be changed of all these file , for example after 30 minutes refresh my conent of abc1.log will be replaced with existing abc.log content and ab

Re: Rendexing problem: Indexing folder size is keep on growing for same remote folder

2013-10-01 Thread Ian Lea
milliseconds as unique keys are a bad idea unless you are 100% certain you'll never be creating 2 docs in the same millisecond. And are you saying the log record A1 from file a.log indexed at 14:00 will have the same unique id as the same record from the same file indexed at 14:30 or will it be di

RE: Rendexing problem: Indexing folder size is keep on growing for same remote folder

2013-10-01 Thread gudiseashok
I am afraid, my document in the above code has already a unique-key (will milli-seconds I hope this is enough to differentiate with another records). My requirement is simple, I have a folder with a.log,b.log and c.log files which will be updated every 30 minutes, I want to update the index of the

RE: Rendexing problem: Indexing folder size is keep on growing for same remote folder

2013-10-01 Thread Uwe Schindler
You have to call updateDocument with the unique key of the document to update. The unique key must be a separate, indexed, not necessarily stored key. addDocument just adds a new instance of the document to the index, it cannot determine if it’s a duplicate. - Uwe Schindler H.-H.-Meier-Alle