IllegalStateEx thrown when calling close

2008-10-27 Thread Jed Wesley-Smith
All, We have seen the following stacktrace in production with Lucene 2.3.2: java.lang.IllegalStateException: abort() can only be called when IndexWriter was opened with autoCommit=false at org.apache.lucene.index.IndexWriter.abort(IndexWriter.java:2009) at org.apache.lucene.index.IndexWr

RE: How can I get to the Document for architecture of lucene index.

2008-10-27 Thread Kwon, Ohsang
Dear Karsten, thank for your help. It`s so difficult. ^^; Thank you again. -Original Message- From: Karsten F. [mailto:[EMAIL PROTECTED] Sent: Monday, October 27, 2008 4:54 PM To: java-user@lucene.apache.org Subject: Re: How can I get to the Document for architecture of lucene index. H

Re: Multiple values in field

2008-10-27 Thread agatone
I just hope indexing one extra field isn't gonna be performance issue later. Ty for your replies. Erick Erickson wrote: > > then you'll need to index another field that records that intention, or > just "know" which fields are intended to be multiple. > > Best > Erick > > On Sat, Oct 25, 200

Newbie Question: Query Creation Best Approach

2008-10-27 Thread JulieSoko
I am having some strange behavior and want to rule out that the query that I am constructing is the correct way to deal with this type of query. This is what I am working with: I am creating a query that contains 3 different fields Field 1: value - String Field 2: type- String Field 3:

Re: instantiated index in 2.4

2008-10-27 Thread Darren Govoni
Has anyone gotten some initial performance observations about instantiated index? I replaced my RAMDirectory searcher with one and it was slower or about the same. The note about it claims 100x possible performance improvement. Maybe there is a data size beyond which its performance excels. thank

Change the merge factor for an existing index?

2008-10-27 Thread Tom Saulpaugh
Hello, We are currently using lucene v2.1 and we are planning to upgrade to lucene v2.4. Can we change the merge factor for an existing index and then add more documents to that index? Is there some kind of upgrade path like using optimize to move an existing index to a different merge factor?

RE: example on RegexQuery

2008-10-27 Thread Steven A Rowe
Hi Aashish, On 10/26/2008 at 11:36 PM, Agrawal, Aashish (IT) wrote: > I am searching a sample file like below - > --- > agrawal fdfdf > fsdfafasf 3495549584 > fsfsfs fsffsf r4e3fdere j4343 > - > > when I search this file with pattern - > .*4343* > .*[a-z]4343 > j4343 > or even search for

RE: Lucene 2.2.0 Strange QueryParser behavior

2008-10-27 Thread Bob Rhodes
Thanks for the tip! I used the StandardAnalyser(Set stopWords) constructor and passed in an empty stop words set. Now my queries in OR look like this: +lastname:rhodes +state:or I think that solved the problem. Bob -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] S

Re: Lucene Input/Output error

2008-10-27 Thread Michael McCandless
Sorry, those methods do indeed require 2.4.0 -- I forgot it was only with 2.4.0 that we made them public. If for some reason you can't upgrade, then you can always put your own incRef/decRef wrapper around the IndexReader. Mike JulieSoko wrote: I was able to upgrade to the 2.3.2 versi

Re: Lucene Input/Output error

2008-10-27 Thread JulieSoko
I was able to upgrade to the 2.3.2 version of Lucene... We do not have the 2.4.0 version in house ... I am trying to facilitate downloading that version. I did notice that the incRef and decRef methods for the IndexReader are protected in both the 2.3.1 and 2.3.2 versions... How did you get aroun

Re: Lucene 2.2.0 Strange QueryParser behavior

2008-10-27 Thread Erick Erickson
OR is both a stopword and a reserved word In this case you *probably* are getting hit by it being a stopword. Changing your analyzer to not use stopwords is probably the way to go, but beware that you should probably use the same analyzer at index AND query time. If you're using StandardAnalyz

Re: Search on tag / category / label / keyword ...

2008-10-27 Thread Israel Tsadok
On Mon, Oct 27, 2008 at 11:21 AM, T. H. Lin <[EMAIL PROTECTED]> wrote: > I would like to search a collection of "keyword"s with lucene. > > A Document has one or many keywords. The keywords appear only once in a > document. (tf = 1) > for example: > Document_1 : ( "aa" "bb" "cc" ) > Docu

Lucene 2.2.0 Strange QueryParser behavior

2008-10-27 Thread Bob Rhodes
Hi all, I'm hoping this is an easy question. I have a Lucene index that contains names and states, and I'm having an issue when the state is Oregon, specifically OR. Below is an example of the query I pass in and the query string that QueryParser builds below it. This works for every state except

Re: API to get the matched lines in a file?

2008-10-27 Thread Erick Erickson
No, you have to build that in yourself It can be a bit tricky defining how you indicate that lines are in a file, but this has been discussed extensively so searching the mail archive should get you some pointers. Best Erick On Mon, Oct 27, 2008 at 2:45 AM, Agrawal, Aashish (IT) < [EMAIL PROT

Re: example on RegexQuery

2008-10-27 Thread Erick Erickson
query.toString() is your friend, as is Luke . Two things are often the problem here. 1> your index doesn't contain what you think it does. Luke helps a LOT here. 2> your regex isn't doing what you think, toString() whould help here Best Erick On Sun, Oct 26, 2008 at 11:36 PM, Agrawal, Aashish (IT

Re: Memory problem dealing with indexsearcher and topdocs

2008-10-27 Thread Erick Erickson
Are you opening/closing your searcher and writer for each document? If so, it sounds like you're not closing all of them appropriately and that would be the cause of your memory increase. But you shouldn't have to do that anyway. Why not just use the same IndexReader to search and delete all your d

eclipse birt with lucene

2008-10-27 Thread blured blured
Hi, I'd like to use Eclipse Birt in order to make reports based on Lucene index content as if I use a database. The idea is to use SQL statement to read this index with a JDBC wrapper like the one existing for LDAP (http://www.openldap.org/jdbcldap/). Perhaps somebody already have done a custo

Re: Multiple values in field

2008-10-27 Thread Erick Erickson
then you'll need to index another field that records that intention, or just "know" which fields are intended to be multiple. Best Erick On Sat, Oct 25, 2008 at 4:12 AM, agatone <[EMAIL PROTECTED]> wrote: > > Yeah but if it happens that for a certain document field1 has only one > value > and in

RE: Use SQL frontend to read lucene index

2008-10-27 Thread Karsten F.
Hi Blured, sorry I don't know anything about eclipse birt. I recommend to start a new thread "eclipse birt with lucene" where you describe your problem again in detail. be aware that lucene don't know numerical values. lucene only knows strings. best regards Karsten blured blured wrote: >

RE: Use SQL frontend to read lucene index

2008-10-27 Thread blured blured
Hi, Thanks for this response. The fact is I'd like to make some select request to read documents already indexed in a Lucene index in order to base a Eclipse Birt dataSource and dataset on this index. Regards, Blured. > Date: Mon, 27 Oct 2008 02:15:55 -0700> From: [EMAIL PROTECTED]> To: >

Search on tag / category / label / keyword ...

2008-10-27 Thread T. H. Lin
I would like to search a collection of "keyword"s with lucene. A Document has one or many keywords. The keywords appear only once in a document. (tf = 1) for example: Document_1 : ( "aa" "bb" "cc" ) Document_2 : ( "bb" "cc" ) Document_3 : ( "cc" "dd" ) D

Search on tag / category / label / keyword ...

2008-10-27 Thread T. H. Lin
I would like to search a collection of "keyword"s with lucene. A Document has one or many keywords. The keywords appear only once in a document. (tf = 1) for example: Document_1 : ( "aa" "bb" "cc" ) Document_2 : ( "bb" "cc" ) Document_3 : ( "cc" "dd" ) D

Re: Use SQL frontend to read lucene index

2008-10-27 Thread Karsten F.
Hi Blured, if you are asking about integration of lucene and a DBMS, possible compass is something for you http://www.nabble.com/Lucene-vs.-Database-tp19755932p19758736.html if you think about using hibernate: I think there already exist a lucene connector, so you don't have to use jdbc. if you

Use SQL frontend to read lucene index

2008-10-27 Thread blured blured
Hi, Do you know if a plugin or a third party software allow to read Lucene index using sql statements ? Regards, Blured. _ Discover the new Windows Vista http://search.msn.com/results.aspx?q=windows+vista&mkt=en-US&form=QBRE

Re: How can I get to the Document for architecture of lucene index.

2008-10-27 Thread Karsten F.
Hi Ohsang, are you looking for http://lucene.apache.org/java/2_4_0/fileformats.html ? Best regards Karsten Kwon, Ohsang wrote: > > I want to know how the lucene stored the data in the index internally. > > (Lucene`s index format changed very often.) > > > > I can not find this informati

How can I get to the Document for architecture of lucene index.

2008-10-27 Thread Kwon, Ohsang
I want to know how the lucene stored the data in the index internally. (Lucene`s index format changed very often.) I can not find this information in wiki. Where can I get that? Does the same document or picture for index fomat? plz help me.