: I'm trying to follow these instructions, but I have zero experience with
: Java, JVMs, Tomcat, etc. Could somebody help me with this one? Thanks in
: advance!
if you want to eliminate your need to write java code (or servlets)
completely take a look at Solr ... it provides a webservicesish API
Hello everyone,
I'm having tons of fun right now with Lucene indexing a large (15
millions documents) library. I'm developing the web front end, and I
read on this mailing list that it's better to have one instance of
IndexSearcher.
I'm using Lucene in PHP via JavaBridge (and Tomcat), but I
My question is about deal with the multi clauses booleanQuery, the
num of clauses is giant and induce the performance.So I want some
other method to replace this query to improve the performance. Now
through filter achieve the goal.
Thanks for the suggestions.
--
: I'm with you now. So you do seeks in your comparator. For a large index you
: might as well use java.io.RandomAccessFile for the "array", because there
: would be little value in buffering when the comparator is liable to jump all
yep .. that's what i was getting at ... but i'm not so sure that
: >From what I gather, I can go ahead & create an Index & for each Document &
: only add the relevant fields. Is this correct?
: I should still be able to search with queries like "mel Movies:braveheart".
: Right ?
:
: Would this impact the search performance ?
: Any other words of caution for me ?
Hi,
In this case I guess we may need to find out what
exactly BrazilianAnalyzer do on the input string:
BrazilianAnalyzer braAnalyser = new BrazilianAnalyzer();
TokenStream ts1 = braAnalyzer.tokenStream("text", new StringReader(queryStr));
... // what BrazilianAnalyzer do?
Also what exactly ISO
Yes...here's how I create my QueryParser:
QueryParser parser = new QueryParser("text", new BrazilianAnalyzer());
2006/8/1, Zhang, Lisheng <[EMAIL PROTECTED]>:
Hi,
Have you used the same BrazilianAnalyzer when
searching?
Best regards, Lisheng
-Original Message-
From: Eduardo S. Cordei
Hi,
Have you used the same BrazilianAnalyzer when
searching?
Best regards, Lisheng
-Original Message-
From: Eduardo S. Cordeiro [mailto:[EMAIL PROTECTED]
Sent: Tuesday, August 01, 2006 1:40 PM
To: java-user@lucene.apache.org
Subject: Search with accents
Hello there,
I have a brazilia
Hello there,
I have a brazilian portuguese index, which has been analyzed with
BrazilianAnalyzer. When searching words with accents, however, they're
not found -- for instance, if the index contains some text with the
word "maçã" and I search for that very word, I get no hits, but if I
search "ma
Rajiv,
Have a look at the details provided by IndexSearcher.explain() for
those documents, and you'll get some insight into the factors used to
rank them. Since both scores are 1.0, you'll probably want to
implement your own custom Similarity and override the lengthNorm() to
adjust that
For the index process I use IndexModifier class.
That happens when I try to search something into the index in the same
time that the index process still running.
the code for indexing:
System.setProperty("org.apache.lucene.lockDir", System
.getProperty("user.dir"));
Ok, this is how I'm indexing. Both in indexing and searching I'm using
SimpleAnalyzer()
String loc = "New York, NY";
doc.add(new Field("location", loc, Field.Store.NO, Field.Index.TOKENIZED));
String loc2 = "New York Mills, NY";
doc.add(new Field("location", loc2, Field.Store.NO, Field.Index.TOK
I guess so, but without any information about your code nobody can tell what.
If you provide more information you willl get help!!
regards simon
On 8/1/06, Rajiv Roopan <[EMAIL PROTECTED]> wrote:
Hello, I have an index of locations for example. I'm indexing one field
using SimpleAnalyzer.
doc1
Yes, I use the nfs mount to share the index for other search instance
and all the instances have same lock directory configured, but the only
the difference is that nfs mount is read-only mount, so I have to
disable the lock mechanism for search instances, only lock is enabled
for index modif
Yes, I use the nfs mount to share the index for other search instance
and all the instances have same lock directory configured, but the only
the difference is that nfs mount is read-only mount, so I have to
disable the lock mechanism for search instances, only lock is enabled
for index modific
Ok thanks a lot.
-Original Message-
From: Michael McCandless [mailto:[EMAIL PROTECTED]
Sent: 01 August 2006 17:19
To: java-user@lucene.apache.org
Subject: Re: FileNotFoundException
> Ok if I well understood I have to put the lock file at the same place
in
> my indexing process and searc
Ok if I well understood I have to put the lock file at the same place in
my indexing process and searching process.
That's correct.
And, that place can't be an NFS mounted directory (until we fix locking
implementation...).
The two different processes will use this lock file to make sure
Ok if I well understood I have to put the lock file at the same place in
my indexing process and searching process.
-Original Message-
From: Michael McCandless [mailto:[EMAIL PROTECTED]
Sent: 01 August 2006 17:14
To: java-user@lucene.apache.org
Subject: Re: FileNotFoundException
> Yes
Yes
Yes, you're certain you have the same lock dir for both modifier &
search process?
Or, Yes you're using NFS as your lock dir?
Or, both?
Mike
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail
Yes
-Original Message-
From: Michael McCandless [mailto:[EMAIL PROTECTED]
Sent: 01 August 2006 17:10
To: java-user@lucene.apache.org
Subject: Re: FileNotFoundException
> I think its a directory access synchronisation problem, I have also
> posted about this before. The scenario can be
I think its a directory access synchronisation problem, I have also
posted about this before. The scenario can be like this ..
When Indexwriter object is created it reads the segment information from
the file "segments" which nothing but list of files with .cfs or mayn
more type, at teh same
I should say not exactly, the temporary solution I made is that, I
always copy the existing index to different directory run the
modification or optimization task and then copy back, somethign like
flip flop mechanism..
current index <-- searcher
copy to --> temp index <-- run optimization
tem
Have you solved thisproblem?
-Original Message-
From: Supriya Kumar Shyamal [mailto:[EMAIL PROTECTED]
Sent: 01 August 2006 16:30
To: java-user@lucene.apache.org
Subject: Re: FileNotFoundException
I think its a directory access synchronisation problem, I have also
posted about this befo
I think its a directory access synchronisation problem, I have also
posted about this before. The scenario can be like this ..
When Indexwriter object is created it reads the segment information from
the file "segments" which nothing but list of files with .cfs or mayn
more type, at teh same t
I'm sure that it's the good location.
When the index process is finished then I can access the index.
I know why but I don't know how to solve it.
When I indexing a lot of file with the extension cfs are created and
after few second the file are merge in an other file
ex:
I have a file with this na
On Tue, 2006-08-01 at 15:32 +0200, Vasily Borisov wrote:
> the presentation layer for them is never good since the user is
> exposed to the data model complexity
Isn't that why we have facades?
-
To unsubscribe, e-mail: [EMAIL
I agree completely. I was mostly responding to what appeared to be an
attempt to use lucene to actually execute a database query, which is
entirely different from restructing legacy data into a more-usable form as
you point out, and in which case all bets are off.
Erick
On 8/1/06, Vasily Borisov
So it sounds like you're not writing the index to the place you think you
are. Have you just looked in the directories and checked that there are
files there? If Luke can't find them, they're not where you think they are.
Especially if your writer had closed before you looked.
Erick
On 8/1/06, W
Eric,
I'm sure that is entirely true. E.g. in E&P industry we have a bunch of
legacy relational databases that are tremendously complex.
Therefore the presentation layer for them is never good since
the user is exposed to the data model complexity every time he uses this
database.
So, giving up
It's the same when I try to open the index with luke
-Original Message-
From: Erick Erickson [mailto:[EMAIL PROTECTED]
Sent: 01 August 2006 15:24
To: java-user@lucene.apache.org
Subject: Re: FileNotFoundException
two things come to mind
1> are you absolutely sure that your reader a
two things come to mind
1> are you absolutely sure that your reader and writer are pointing to the
same place? Really, absolutely, positively sure? You've hard-coded the path
into both writer and reader just to be really, absolutely positively sure?
Or, you could let the writer close and *the
You're absolutely right, lucene is a text searching tool, not a database
tool. There's no point in trying to jump through hoops to use lucene if your
database already works for you.
If you're trying to do text searches, particularly if want to ask questions
like "find the words biggest and large
I can't speak to performance, but there's no problem having different fields
for different documents. Stated differently, you don't need to have all
fields in all documents. It took me a while to get my head out of database
tables and accept this
I doubt there's a problem with speed, but as
I believe you'll need to inquire with the Slide community, which
unfortunately is a bit inactive lately.
Erik
On Aug 1, 2006, at 7:31 AM, aslam bari wrote:
Dear All,
I am facing a unknown situaltion. I am using webdav search, it is
working fine, i know it is slower than lucene.
For the index process I use IndexModifier class.
That happens when I try to search something into the index in the same
time that the index process still running.
the code for indexing:
System.setProperty("org.apache.lucene.lockDir", System
.getProperty("user.dir"));
When the indexing process still running on a index and I try to search
something on this index I retrive this error message:
java.io.FileNotFoundException:
\\tradluxstmp01\JavaIndex\tra\index_EN\_2hea.fnm (The system cannot find
the file specified)
How can I solve this.
Could you provide some
hi sandip,
first get all those fields on which you want search and store
it in some var.
then apply indexing with these var.
then fire ur search query
regards
amit kumar
DISCLAIMER
==
This e-mail may contain privileged and confidential information which is the
property of Persistent
Dear All,
I am facing a unknown situaltion. I am using webdav search, it is working
fine, i know it is slower than lucene. I am using jakarta-slide-2.1 and
lucene-2.1. I have configured my domain.xml file as:-
./index
I saw that in store/index folder is getting created. But i
Hi All,
I am confused with Apache Lucene.
I want to search my databse table records using apache lucene.
But what i found is that Lucene is full-text search engine.This means is it
only used to search documents text or anything else ?
I want to search my databse like e.g.
select * from tableNa
> file seeks instead of array lookups
I'm with you now. So you do seeks in your comparator. For a large index you
might as well use java.io.RandomAccessFile for the "array", because there
would be little value in buffering when the comparator is liable to jump all
around the file. This sounds ver
Hi Yonik,
>> So a Phrase search to "The xmen story" will fail. With a slop of 1 the
>> doc will be found.
>>
>> But when generating the query I won't know when to use a slop. So adding
>> slops isn't a nice solution.
>
> If you can't tolerate slop, this is a problem.
I use the WordDelimiterFilte
: Sure I would love to! Can you ping me at [EMAIL PROTECTED] and
: let me know what I need to do? Do I just post it to JIRA?
instructions on submitting code can be found in the wiki..
http://wiki.apache.org/jakarta-lucene/HowToContribute
note in particular that since you are primarily submit
: I take your point that Berkley DB would be much less clumsy, but an
: application that's already using a relational database for other purposes
: might as well use that relational database, no?
if you already have some need to access data about each matching doc from
a relational DB, then sure
When the indexing process still running on a index and I try to search
something on this index I retrive this error message:
java.io.FileNotFoundException:
\\tradluxstmp01\JavaIndex\tra\index_EN\_2hea.fnm (The system cannot find
the file specified)
How can I solve this.
I have 1 generic index, but am Indexing a lot of different things, like
actors, politicians, scientists, sportsmen.
And as you can see that though there are some common fields, like name &
DOB, there are also fields for each of these types of people that are
different.
e.g. Actors will have "Movi
45 matches
Mail list logo