IndexDivisor

2009-11-26 Thread Ganesh
Hello all, I am using Lucene v2.9.1, If I open my reader with positive value for termInfosIndexDivisor then the search works fine. If i set to -1, then search throws exception "terms index was not loaded when this reader was created". Is this the intend behaviour? I thought by setting this va

Re: Is it a lucene bug?

2009-11-26 Thread Wilson Wu
Hi, Thanks for your inspiration. What version(lucene 2.4 or 2.9 or others) are you used in your project. Can you give more details fo your suggestion, thanks. Wilson 2009/11/27 fulin tang : > Maybe you should take a look at the Scorer and Similarit

Re: Is it a lucene bug?

2009-11-26 Thread Wilson Wu
Hi, I am afraid I didn't describe clearly enough in my last mail. Let me describe it again. For example, there are 5 documents as doc1,doc2,doc3,doc4,doc5 in the search hits. And their updateTimes are respectively t1 = doc1.updateTime = 2009-01-01 12:45:00 t2 = doc2.updateTime = 2009-01-01

Re: To exit the while loop if match is found

2009-11-26 Thread fulin tang
Make sure your analyzer has called strToLower when index the data 2009/11/26 Savvas-Andreas Moysidis : > hi, > > maybe you could not break but rather collect them in an ordered collection. > If it's a web app you are having you can then pass it up your view layer > wrapped in json? > > savvas > >

Re: Need help regarding implementation of autosuggest using jquery

2009-11-26 Thread fulin tang
By the way , we search Chinese words, so Trie tree looks not perfect for us either 2009/11/27 fulin tang : > We have the same needs in our music search, and we found this is not a > good approach for performance reason . > > Did any one have experience of implement the autosuggestion in a heavy >

Re: Need help regarding implementation of autosuggest using jquery

2009-11-26 Thread fulin tang
We have the same needs in our music search, and we found this is not a good approach for performance reason . Did any one have experience of implement the autosuggestion in a heavy product environment ? Any suggestions ? 2009/11/26 Anshum : > Try this, > Change the code as required: > -

Re: Is it a lucene bug?

2009-11-26 Thread fulin tang
Maybe you should take a look at the Scorer and Similarity series classes , they will show you how the score is calculated , make some change of them, and you will get what you want. We have the same problem and we get it done by write subclasses of DefaultSimilarity and BooleanScorer 2009/11

Re: Is it a lucene bug?

2009-11-26 Thread Wilson Wu
hi Thank you vary much! I have another question.As is mentioned in document class: if I set a boost for a document, it is used when indexing the field and multiplied to each field. Here is a case: sometimes I want the boost to be a factor of score, but sometimes I want to ignore the boost

Re: best way to ensure IndexWriter won't corrupt the index?

2009-11-26 Thread Michael McCandless
The leftover write.lock is expected on ungraceful shutdown; you just have to remove it. Or, switch to NativeFSLockFactory, which correctly detects when the lock is no longer in use. Next time this happens, try running CheckIndex on the index. Are you sure the Luke version you're using is new eno

Re: best way to ensure IndexWriter won't corrupt the index?

2009-11-26 Thread Khosro Asgharifard
>>Right, in 2.4, if you kill -9, pull power, OS crashes, etc., it should >>not corrupt the index. >>Can you share details on what corruption you see? Is it possible >>there are two IndexWriters open on the index at once? Our apps is multithread,and sometimes when we shutdown the Tomcat , the wr

Re: best way to ensure IndexWriter won't corrupt the index?

2009-11-26 Thread Michael McCandless
Right, in 2.4, if you kill -9, pull power, OS crashes, etc., it should not corrupt the index. Can you share details on what corruption you see? Is it possible there are two IndexWriters open on the index at once? Mike On Thu, Nov 26, 2009 at 2:08 PM, Khosro Asgharifard wrote: > Hello, >>>Befor

Re: best way to ensure IndexWriter won't corrupt the index?

2009-11-26 Thread Khosro Asgharifard
Hello, >>Before 2.4 it was possible that a crash of the OS, or sudden power >>loss to the machine, could corrupt the index. But that's been fixed >>with 2.4. Did you mean that Lucene does not have this issue in 2.4.1? We are running the program that index some data ,and sometime we must shutdown

Re: Inside Lucene

2009-11-26 Thread Benjamin Heilbrunn
Hello, if you are searching for information about lucenes file structure you can find something here: http://lucene.apache.org/java/3_0_0/fileformats.html Benjamin - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.o

RE: Is it a lucene bug?

2009-11-26 Thread Uwe Schindler
Read the documentation of the Document class: if you set a boost for a document, it is used when indexing the fields and multiplied to each field. For the document itself no boost value is stored, so you cannot get it (only so called stored fields are retrievable). - Uwe Schindler H.-H.-Meier-

Re: Is it a lucene bug?

2009-11-26 Thread Savvas-Andreas Moysidis
hi, I’m not exactly sure I understand they the type of sorting you are trying to achieve. You have an updateTime field and you mention that "We want the new document in the front and also want high score document in the front". My take on this is that you want to first sort by the updateTime a

Re: MergePolicy$MergeException CorruptIndexException in lucene2.4.1

2009-11-26 Thread jm
The process is still running and ops dont want to stop it. As soon as stops I'll try checkindex. Its created brand new with 2.4.1 On Thu, Nov 26, 2009 at 12:42 PM, Michael McCandless wrote: > I think you're using a JRE that has the fix for the issue found in > LUCENE-1282. > > Can you run Check

Is it a lucene bug?

2009-11-26 Thread Wilson Wu
Hi, Recently, there is a requirement to sort the hits by both the scores of documents and the updateTime which is a field of document to mark the document's update time. We want the new document in the front and also want high score document in the front,in other words, we want to mix the sco

Re: MergePolicy$MergeException CorruptIndexException in lucene2.4.1

2009-11-26 Thread Michael McCandless
I think you're using a JRE that has the fix for the issue found in LUCENE-1282. Can you run CheckIndex on your index and post the output? Was this index created from scratch on Lucene 2.4.1? Or, created from an earlier Lucene version? Mike On Thu, Nov 26, 2009 at 6:03 AM, jm wrote: > or are w

Re: MergePolicy$MergeException CorruptIndexException in lucene2.4.1

2009-11-26 Thread jm
or are we really? I think we are on 1.6 update 14 right?? sorry Im lost right now on jdk version numbering On Thu, Nov 26, 2009 at 12:01 PM, jm wrote: > on second thought...I hadnt noticed the jdk numbers properly, we are > using using b28, and JDK 6 Update 10 (b28) is the one fixing this... > >

Re: MergePolicy$MergeException CorruptIndexException in lucene2.4.1

2009-11-26 Thread jm
on second thought...I hadnt noticed the jdk numbers properly, we are using using b28, and JDK 6 Update 10 (b28) is the one fixing this... ok forget this then thanks! On Thu, Nov 26, 2009 at 11:55 AM, jm wrote: > Hi, > > Dont know if this should be here or in java-dev, posting to this one > first

Re: Need help regarding implementation of autosuggest using jquery

2009-11-26 Thread Anshum
Try this, Change the code as required: - import java.io.IOException; import org.apache.lucene.index.CorruptIndexException; import org.apache.lucene.index.IndexReader; import org.apache.lucene.index.Term; import org.apache.lucene.index.TermEnum; /** * @author anshum * */ public class

MergePolicy$MergeException CorruptIndexException in lucene2.4.1

2009-11-26 Thread jm
Hi, Dont know if this should be here or in java-dev, posting to this one first. In one of our installations, we have encountered an exception: Exception in thread "Lucene Merge Thread #0" org.apache.lucene.index.MergePolicy$MergeException: org.apache.lucene.index.CorruptIndexException: docs out o

[ANNOUNCE] Apache Lucene Java 3.0.0 released

2009-11-26 Thread Uwe Schindler
Hello Lucene users, On behalf of the Lucene dev community (a growing community far larger than just the committers) I would like to announce the release of Lucene Java 3.0.0: The new version is mostly a cleanup release without any new features. All deprecations targeted to be removed in version

Re: To exit the while loop if match is found

2009-11-26 Thread Savvas-Andreas Moysidis
hi, maybe you could not break but rather collect them in an ordered collection. If it's a web app you are having you can then pass it up your view layer wrapped in json? savvas 2009/11/26 DHIVYA M > Thanks for your suggested code sir. > > But it displays only one word. > > Ex: > if the input i

Re: Is Lucene a good choice for PB scale mailbox search?

2009-11-26 Thread Ian Lea
If you are planning on using lucene only for searching then you don't need to store much data at all - just the message id or whatever you use to identify messages. And there won't be much point in compressing that. If on the other hand you plan on storing data in lucene, perhaps for displaying h

To exit the while loop if match is found

2009-11-26 Thread DHIVYA M
Thanks for your suggested code sir.   But it displays only one word.   Ex: if the input is "z"   actual output must be   zing Zohar   but am getting zing alone --- On Thu, 26/11/09, Uwe Schindler wrote: From: Uwe Schindler Subject: RE: Need help regarding implementation of autosuggest using jq

RE: Need help regarding implementation of autosuggest using jquery

2009-11-26 Thread Uwe Schindler
You can fix this if you just create the initial term not with "", instead with your prefix: TermEnum tenum = reader.terms(new Term(field,prefix)); And inside the while loop just break out, if (!termText.startsWith(prefix)) break; - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://ww

to exit the while loop if match is found

2009-11-26 Thread DHIVYA M
Sir,   This is my code sir. I have given break statement.    String s1= request.getParameter("indtext");     IndexReader reader = IndexReader.open("newindex");     String field="contents";     //field = field.intern();     TermEnum tenum = reader.term

Re: Need help regarding implementation of autosuggest using jquery

2009-11-26 Thread Anshum
Just add a check in the while statement to exit as soon as the pattern of the term changes. You could check if the term does not start with your input and exit from the while loop there. It would exit wherever the term start changes from what you want. -- Anshum Gupta Naukri Labs! http://ai-cafe.b

RE: Need help regarding implementation of autosuggest using jquery

2009-11-26 Thread DHIVYA M
Sir,   Your suggestion was fantastic.   I tried the below mentioned code but it is showing me the entire result of indexed words starting from the letter that i give as input. Ex: if i give "fo" am getting all the indexes from the word starting with fo upto words starting with z. i.e. it starts d

RE: API changes in 2.9

2009-11-26 Thread Uwe Schindler
> Thanks. > > I used NumberTool to convert the int to string. In 2.9, NumericField is to > be used but it stores it based on some precision. I think the way the > integer data gets indexed is changed from 2.4 to 2.9. This means my > customers using 2.4.1 should rebuild their index. Exactly. Becau

Re: API changes in 2.9

2009-11-26 Thread Ganesh
Thanks. I used NumberTool to convert the int to string. In 2.9, NumericField is to be used but it stores it based on some precision. I think the way the integer data gets indexed is changed from 2.4 to 2.9. This means my customers using 2.4.1 should rebuild their index. Whether 2.9 could able

RE: API changes in 2.9

2009-11-26 Thread Uwe Schindler
You can use these APIs. They are only marked as experimental not because they are unstable or something like that, it means the API may change suddenly (methods get removed and so on). In 3.0 (which was released *now*) it is still experimental. So it's more "subject to change" than "unstable". ---

API changes in 2.9

2009-11-26 Thread Ganesh
Hello all, I am planning to upgrade from 2.4.1 to 2.9.1. I could see some API changes and older APIs are deprecated. In 2.4, I use 'NumberTools' class to convert long to String and now this API is deprecated. I think i have to use 'NumericField' or 'NumericUtils', but in both the class there is