Re: Indexsearcher - one instance in PHP via javabridge?

2006-08-01 Thread Chris Hostetter
: I'm trying to follow these instructions, but I have zero experience with : Java, JVMs, Tomcat, etc. Could somebody help me with this one? Thanks in : advance! if you want to eliminate your need to write java code (or servlets) completely take a look at Solr ... it provides a webservicesish API

Indexsearcher - one instance in PHP via javabridge?

2006-08-01 Thread Michael Imbeault
Hello everyone, I'm having tons of fun right now with Lucene indexing a large (15 million documents) library. I'm developing the web front end, and I read on this mailing list that it's better to have one instance of IndexSearcher. I'm using Lucene in PHP via JavaBridge (and Tomcat), but I
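
The "one instance" advice can be sketched on the Java side as a small holder that creates its object once and hands the same reference to every caller. This is only an illustration using the JDK: `SharedInstance` is a hypothetical name, and the `Supplier` stands in for whatever code would actually open the IndexSearcher; nothing here is Lucene or JavaBridge API.

```java
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Supplier;

/** Hypothetical holder: builds its value once, then returns the same reference. */
class SharedInstance<T> {
    private final Supplier<T> factory;
    private volatile T instance;

    SharedInstance(Supplier<T> factory) {
        this.factory = factory;
    }

    /** Thread-safe lazy creation (double-checked locking on a volatile field). */
    T get() {
        T local = instance;
        if (local == null) {
            synchronized (this) {
                local = instance;
                if (local == null) {
                    local = factory.get();
                    instance = local;
                }
            }
        }
        return local;
    }

    public static void main(String[] args) {
        AtomicInteger opens = new AtomicInteger();
        // The lambda is a stand-in for opening a searcher (an assumption).
        SharedInstance<String> shared =
                new SharedInstance<>(() -> "searcher-" + opens.incrementAndGet());
        System.out.println(shared.get() == shared.get()); // same reference both times
        System.out.println(opens.get());                  // factory ran once
    }
}
```

In a servlet container the holder would typically live in a static field or in application scope, so that every request reaches the same object.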

Re: About search performance

2006-08-01 Thread zhongyi yuan
My question is about dealing with a multi-clause BooleanQuery: the number of clauses is huge and hurts performance, so I wanted some other method to replace this query and improve performance. I have now achieved that goal with a filter. Thanks for the suggestions. --

RE: Sorting

2006-08-01 Thread Chris Hostetter
: I'm with you now. So you do seeks in your comparator. For a large index you : might as well use java.io.RandomAccessFile for the "array", because there : would be little value in buffering when the comparator is liable to jump all yep .. that's what i was getting at ... but i'm not so sure that

Re: Does lucene performance suffer with a lot of empty fields ?

2006-08-01 Thread Chris Hostetter
: From what I gather, I can go ahead & create an Index & for each Document & : only add the relevant fields. Is this correct? : I should still be able to search with queries like "mel Movies:braveheart". : Right ? : : Would this impact the search performance ? : Any other words of caution for me ?

RE: Search with accents

2006-08-01 Thread Zhang, Lisheng
Hi, In this case I guess we may need to find out what exactly BrazilianAnalyzer does to the input string: BrazilianAnalyzer braAnalyzer = new BrazilianAnalyzer(); TokenStream ts1 = braAnalyzer.tokenStream("text", new StringReader(queryStr)); ... // what does BrazilianAnalyzer do? Also what exactly ISO
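
Independently of the analyzer, it can help to look at what bytes the query string produces under different charsets. The sketch below uses only the JDK; the class name `AccentCheck` and the helper names are hypothetical, and whether a charset mismatch is the actual cause in this thread is an assumption the snippets do not confirm.

```java
import java.nio.charset.StandardCharsets;
import java.text.Normalizer;

/** Hypothetical helper for inspecting accented query strings. */
class AccentCheck {
    /** Hex dump of a byte array, two lowercase digits per byte. */
    static String hex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (byte b : bytes) {
            sb.append(String.format("%02x", b & 0xff));
        }
        return sb.toString();
    }

    /** Simulates UTF-8 bytes being read back as ISO-8859-1. */
    static String misdecode(String s) {
        return new String(s.getBytes(StandardCharsets.UTF_8), StandardCharsets.ISO_8859_1);
    }

    /** Decomposes the string (NFD) and drops the combining marks. */
    static String stripAccents(String s) {
        String decomposed = Normalizer.normalize(s, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}", "");
    }

    public static void main(String[] args) {
        String word = "ma\u00e7\u00e3"; // the word from the thread, written with escapes
        System.out.println(hex(word.getBytes(StandardCharsets.UTF_8)));      // 6d61c3a7c3a3
        System.out.println(hex(word.getBytes(StandardCharsets.ISO_8859_1))); // 6d61e7e3
        System.out.println(misdecode(word).equals(word));                    // false
        System.out.println(stripAccents(word));                              // maca
    }
}
```

If the text was indexed under one encoding and the query arrives under another, the terms differ at the byte level before any analyzer sees them, so comparing the two dumps is a cheap first check.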

Re: Search with accents

2006-08-01 Thread Eduardo S. Cordeiro
Yes...here's how I create my QueryParser: QueryParser parser = new QueryParser("text", new BrazilianAnalyzer()); 2006/8/1, Zhang, Lisheng <[EMAIL PROTECTED]>: Hi, Have you used the same BrazilianAnalyzer when searching? Best regards, Lisheng -Original Message- From: Eduardo S. Cordei

RE: Search with accents

2006-08-01 Thread Zhang, Lisheng
Hi, Have you used the same BrazilianAnalyzer when searching? Best regards, Lisheng -Original Message- From: Eduardo S. Cordeiro [mailto:[EMAIL PROTECTED] Sent: Tuesday, August 01, 2006 1:40 PM To: java-user@lucene.apache.org Subject: Search with accents Hello there, I have a brazilia

Search with accents

2006-08-01 Thread Eduardo S. Cordeiro
Hello there, I have a brazilian portuguese index, which has been analyzed with BrazilianAnalyzer. When searching words with accents, however, they're not found -- for instance, if the index contains some text with the word "maçã" and I search for that very word, I get no hits, but if I search "ma

Re: Search matching

2006-08-01 Thread Erik Hatcher
Rajiv, Have a look at the details provided by IndexSearcher.explain() for those documents, and you'll get some insight into the factors used to rank them. Since both scores are 1.0, you'll probably want to implement your own custom Similarity and override the lengthNorm() to adjust that

Re: FileNotFoundException

2006-08-01 Thread Michael McCandless
For the indexing process I use the IndexModifier class. The error happens when I try to search the index while the indexing process is still running. The code for indexing: System.setProperty("org.apache.lucene.lockDir", System.getProperty("user.dir"));

Re: Search matching

2006-08-01 Thread Rajiv Roopan
Ok, this is how I'm indexing. Both in indexing and searching I'm using SimpleAnalyzer() String loc = "New York, NY"; doc.add(new Field("location", loc, Field.Store.NO, Field.Index.TOKENIZED)); String loc2 = "New York Mills, NY"; doc.add(new Field("location", loc2, Field.Store.NO, Field.Index.TOK

Re: Search matching

2006-08-01 Thread Simon Willnauer
I guess so, but without any information about your code nobody can tell what. If you provide more information you will get help!! regards simon On 8/1/06, Rajiv Roopan <[EMAIL PROTECTED]> wrote: Hello, I have an index of locations for example. I'm indexing one field using SimpleAnalyzer. doc1

Re: FileNotFoundException

2006-08-01 Thread Michael McCandless
Yes, I use the NFS mount to share the index with other search instances, and all the instances have the same lock directory configured. The only difference is that the NFS mount is read-only, so I have to disable the lock mechanism for the search instances; locking is enabled only for index modif

Re: FileNotFoundException

2006-08-01 Thread Supriya Kumar Shyamal
Yes, I use the NFS mount to share the index with other search instances, and all the instances have the same lock directory configured. The only difference is that the NFS mount is read-only, so I have to disable the lock mechanism for the search instances; locking is enabled only for index modific

RE: FileNotFoundException

2006-08-01 Thread WATHELET Thomas
Ok thanks a lot. -Original Message- From: Michael McCandless [mailto:[EMAIL PROTECTED] Sent: 01 August 2006 17:19 To: java-user@lucene.apache.org Subject: Re: FileNotFoundException > Ok, if I understood correctly, I have to put the lock file in the same place for > my indexing process and searc

Re: FileNotFoundException

2006-08-01 Thread Michael McCandless
Ok, if I understood correctly, I have to put the lock file in the same place for my indexing process and searching process. That's correct. And, that place can't be an NFS mounted directory (until we fix locking implementation...). The two different processes will use this lock file to make sure

RE: FileNotFoundException

2006-08-01 Thread WATHELET Thomas
Ok, if I understood correctly, I have to put the lock file in the same place for my indexing process and searching process. -Original Message- From: Michael McCandless [mailto:[EMAIL PROTECTED] Sent: 01 August 2006 17:14 To: java-user@lucene.apache.org Subject: Re: FileNotFoundException > Yes

Re: FileNotFoundException

2006-08-01 Thread Michael McCandless
Yes Yes, you're certain you have the same lock dir for both modifier & search process? Or, Yes you're using NFS as your lock dir? Or, both? Mike

RE: FileNotFoundException

2006-08-01 Thread WATHELET Thomas
Yes -Original Message- From: Michael McCandless [mailto:[EMAIL PROTECTED] Sent: 01 August 2006 17:10 To: java-user@lucene.apache.org Subject: Re: FileNotFoundException > I think its a directory access synchronisation problem, I have also > posted about this before. The scenario can be

Re: FileNotFoundException

2006-08-01 Thread Michael McCandless
I think it's a directory access synchronisation problem; I have also posted about this before. The scenario can be like this: when an IndexWriter object is created, it reads the segment information from the file "segments", which is nothing but a list of files with .cfs or many more types, at the same

Re: FileNotFoundException

2006-08-01 Thread Supriya Kumar Shyamal
I should say not exactly. The temporary solution I made is that I always copy the existing index to a different directory, run the modification or optimization task, and then copy it back, something like a flip-flop mechanism: current index <-- searcher copy to --> temp index <-- run optimization tem
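
The copy, modify, copy-back cycle described here can be sketched with plain file operations. The snippet is truncated, so this is a guess at the general shape rather than the poster's code: `FlipFlopIndex`, `mirrorFlat`, and `flipFlop` are hypothetical names, the index directory is assumed to be flat (no subdirectories), and the `task` callback stands in for the modification or optimization run.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

/** Hypothetical sketch of a copy / modify / copy-back cycle for a flat directory. */
class FlipFlopIndex {
    /** Lists the regular files directly inside a directory. */
    static List<Path> regularFiles(Path dir) throws IOException {
        List<Path> result = new ArrayList<>();
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(dir)) {
            for (Path p : stream) {
                if (Files.isRegularFile(p)) {
                    result.add(p);
                }
            }
        }
        return result;
    }

    /** Makes 'to' hold exactly the regular files of 'from': copy all, then delete extras. */
    static void mirrorFlat(Path from, Path to) {
        try {
            Files.createDirectories(to);
            for (Path f : regularFiles(from)) {
                Files.copy(f, to.resolve(f.getFileName()), StandardCopyOption.REPLACE_EXISTING);
            }
            for (Path f : regularFiles(to)) {
                if (!Files.exists(from.resolve(f.getFileName()))) {
                    Files.delete(f);
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Copies current to temp, runs the task on temp, then mirrors temp back to current. */
    static void flipFlop(Path current, Path temp, Consumer<Path> task) {
        mirrorFlat(current, temp);
        task.accept(temp);
        mirrorFlat(temp, current);
    }
}
```

The copy back is not atomic, so a searcher reading the current directory at that moment could still see a half-updated set of files; this sketch does not address that window.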

RE: FileNotFoundException

2006-08-01 Thread WATHELET Thomas
Have you solved this problem? -Original Message- From: Supriya Kumar Shyamal [mailto:[EMAIL PROTECTED] Sent: 01 August 2006 16:30 To: java-user@lucene.apache.org Subject: Re: FileNotFoundException I think it's a directory access synchronisation problem, I have also posted about this befo

Re: FileNotFoundException

2006-08-01 Thread Supriya Kumar Shyamal
I think it's a directory access synchronisation problem; I have also posted about this before. The scenario can be like this: when an IndexWriter object is created, it reads the segment information from the file "segments", which is nothing but a list of files with .cfs or many more types, at the same t

RE: FileNotFoundException

2006-08-01 Thread WATHELET Thomas
I'm sure that it's the right location. When the indexing process is finished, I can access the index. I know why but I don't know how to solve it. While I am indexing, a lot of files with the extension cfs are created, and after a few seconds the files are merged into another file. E.g.: I have a file with this na

Re: searching oracle databse records using apache Lucene

2006-08-01 Thread karl wettin
On Tue, 2006-08-01 at 15:32 +0200, Vasily Borisov wrote: > the presentation layer for them is never good since the user is > exposed to the data model complexity Isn't that why we have facades?

Re: searching oracle databse records using apache Lucene

2006-08-01 Thread Erick Erickson
I agree completely. I was mostly responding to what appeared to be an attempt to use lucene to actually execute a database query, which is entirely different from restructuring legacy data into a more-usable form as you point out, and in which case all bets are off. Erick On 8/1/06, Vasily Borisov

Re: FileNotFoundException

2006-08-01 Thread Erick Erickson
So it sounds like you're not writing the index to the place you think you are. Have you just looked in the directories and checked that there are files there? If Luke can't find them, they're not where you think they are. Especially if your writer had closed before you looked. Erick On 8/1/06, W

Re: searching oracle databse records using apache Lucene

2006-08-01 Thread Vasily Borisov
Eric, I'm sure that is entirely true. E.g. in E&P industry we have a bunch of legacy relational databases that are tremendously complex. Therefore the presentation layer for them is never good since the user is exposed to the data model complexity every time he uses this database. So, giving up

RE: FileNotFoundException

2006-08-01 Thread WATHELET Thomas
It's the same when I try to open the index with luke -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: 01 August 2006 15:24 To: java-user@lucene.apache.org Subject: Re: FileNotFoundException two things come to mind 1> are you absolutely sure that your reader a

Re: FileNotFoundException

2006-08-01 Thread Erick Erickson
two things come to mind 1> are you absolutely sure that your reader and writer are pointing to the same place? Really, absolutely, positively sure? You've hard-coded the path into both writer and reader just to be really, absolutely positively sure? Or, you could let the writer close and *the

Re: searching oracle databse records using apache Lucene

2006-08-01 Thread Erick Erickson
You're absolutely right, lucene is a text searching tool, not a database tool. There's no point in trying to jump through hoops to use lucene if your database already works for you. If you're trying to do text searches, particularly if you want to ask questions like "find the words biggest and large

Re: Does lucene performance suffer with a lot of empty fields ?

2006-08-01 Thread Erick Erickson
I can't speak to performance, but there's no problem having different fields for different documents. Stated differently, you don't need to have all fields in all documents. It took me a while to get my head out of database tables and accept this. I doubt there's a problem with speed, but as

Re: Seach In slide with lucene

2006-08-01 Thread Erik Hatcher
I believe you'll need to inquire with the Slide community, which unfortunately is a bit inactive lately. Erik On Aug 1, 2006, at 7:31 AM, aslam bari wrote: Dear All, I am facing an unknown situation. I am using webdav search, it is working fine, i know it is slower than lucene.

RE: FileNotFoundException

2006-08-01 Thread WATHELET Thomas
For the indexing process I use the IndexModifier class. The error happens when I try to search the index while the indexing process is still running. The code for indexing: System.setProperty("org.apache.lucene.lockDir", System.getProperty("user.dir"));

Re: FileNotFoundException

2006-08-01 Thread Michael McCandless
When the indexing process is still running on an index and I try to search that index, I get this error message: java.io.FileNotFoundException: \\tradluxstmp01\JavaIndex\tra\index_EN\_2hea.fnm (The system cannot find the file specified) How can I solve this. Could you provide some

Re: searching oracle databse records using apache Lucene

2006-08-01 Thread amit_kkumar
Hi Sandip, first fetch all the fields you want to search on and store them in variables, then index those variables, then run your search query. Regards, Amit Kumar

Seach In slide with lucene

2006-08-01 Thread aslam bari
Dear All, I am facing an unknown situation. I am using webdav search, it is working fine, i know it is slower than lucene. I am using jakarta-slide-2.1 and lucene-2.1. I have configured my domain.xml file as:- ./index I saw that the store/index folder is getting created. But i

searching oracle databse records using apache Lucene

2006-08-01 Thread Sandip
Hi All, I am confused about Apache Lucene. I want to search my database table records using Apache Lucene, but what I found is that Lucene is a full-text search engine. Does this mean it is only used to search document text, or anything else? I want to search my database like e.g. select * from tableNa

RE: Sorting

2006-08-01 Thread Rob Staveley (Tom)
> file seeks instead of array lookups I'm with you now. So you do seeks in your comparator. For a large index you might as well use java.io.RandomAccessFile for the "array", because there would be little value in buffering when the comparator is liable to jump all around the file. This sounds ver
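
The idea of replacing an in-memory array with file seeks can be sketched as a file of fixed-width records indexed by document number. This is a JDK-only illustration, not Lucene's sorting API: `FileBackedIntArray` and its methods are hypothetical names, and 4-byte int sort keys are an assumption made for the example.

```java
import java.io.IOException;
import java.io.RandomAccessFile;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

/** Hypothetical "array on disk": one 4-byte int sort key per document number. */
class FileBackedIntArray implements AutoCloseable {
    private static final int RECORD_BYTES = 4;
    private final RandomAccessFile file;

    FileBackedIntArray(Path path) {
        try {
            this.file = new RandomAccessFile(path.toFile(), "rw");
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Convenience factory backed by a temporary file that is removed on JVM exit. */
    static FileBackedIntArray createTemp() {
        try {
            Path path = Files.createTempFile("sortkeys", ".bin");
            path.toFile().deleteOnExit();
            return new FileBackedIntArray(path);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Stores the key for a document at offset index * 4. */
    void set(int index, int value) {
        try {
            file.seek((long) index * RECORD_BYTES);
            file.writeInt(value);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** One seek plus one read per lookup, instead of an array access. */
    int get(int index) {
        try {
            file.seek((long) index * RECORD_BYTES);
            return file.readInt();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Comparator-style ordering of two documents by their stored keys. */
    int compare(int docA, int docB) {
        return Integer.compare(get(docA), get(docB));
    }

    @Override
    public void close() {
        try {
            file.close();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Each comparison costs two seeks, which is the trade-off under discussion: memory stays flat regardless of index size, while sorting many hits becomes bound by disk access.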

Re: dash-words

2006-08-01 Thread Martin Braun
Hi Yonik, >> So a Phrase search to "The xmen story" will fail. With a slop of 1 the >> doc will be found. >> >> But when generating the query I won't know when to use a slop. So adding >> slops isn't a nice solution. > > If you can't tolerate slop, this is a problem. I use the WordDelimiterFilte

Re: EMAIL ADDRESS: Tokenize (i.e. an EmailAnalyzer)

2006-08-01 Thread Chris Hostetter
: Sure I would love to! Can you ping me at [EMAIL PROTECTED] and : let me know what I need to do? Do I just post it to JIRA? instructions on submitting code can be found in the wiki.. http://wiki.apache.org/jakarta-lucene/HowToContribute note in particular that since you are primarily submit

RE: Sorting

2006-08-01 Thread Chris Hostetter
: I take your point that Berkeley DB would be much less clumsy, but an : application that's already using a relational database for other purposes : might as well use that relational database, no? if you already have some need to access data about each matching doc from a relational DB, then sure

FileNotFoundException

2006-08-01 Thread WATHELET Thomas
When the indexing process is still running on an index and I try to search that index, I get this error message: java.io.FileNotFoundException: \\tradluxstmp01\JavaIndex\tra\index_EN\_2hea.fnm (The system cannot find the file specified) How can I solve this.

Does lucene performance suffer with a lot of empty fields ?

2006-08-01 Thread Mek
I have 1 generic index, but am Indexing a lot of different things, like actors, politicians, scientists, sportsmen. And as you can see that though there are some common fields, like name & DOB, there are also fields for each of these types of people that are different. e.g. Actors will have "Movi