Luke site is down?

2009-03-04 Thread Ruslan Sivak
I'm not getting anything when I go to http://www.getopt.org/luke/, or http://www.getopt.org. Does anyone know how long the site is expected to be down and is there an alternate download location for luke? Russ - To unsubscri

Re: Luke site is down?

2009-03-04 Thread Ruslan Sivak
Steven A Rowe wrote: When I initially tried an hour or so ago to access , I was able to see the site. But now, I'm not able to get anything. Here's the response I see using curl: PROMPT$ curl http://www.getopt.org/luke/ curl: (52) Empty reply from server Loo

IndexReader locking index

2007-11-30 Thread Ruslan Sivak
I am using MoreLikeThis functionality in my code. This code is running on four separate servers. When I ran tests, it seemed to be fine, but looks like under heavy use, the index file is always locked, and when I reindex all the docs, it doubles the size of the index (my guess is the old file

Refreshing RAMDirectory

2007-12-11 Thread Ruslan Sivak
I have an index of about 10mb. Since it's so small, I would like to keep it loaded in memory, and reload it about every minute or so, assuming that it has changed on disk. I have the following code, which works, except it doesn't reload the changes. protected String indexName; protected Ind

Re: Advice regarding fuzzy phrase searching

2007-12-11 Thread Ruslan Sivak
Look into SpanNearQuery. It has a slop which lets you say how close you want the terms to be. For a single document, if you are going to be doing a lot of these searches, I recommend using a MemoryIndex. Russ Jose Luna wrote: Hello, I am looking for some advice regarding which tools I migh

Re: Refreshing RAMDirectory

2007-12-11 Thread Ruslan Sivak
ffers changes and periodically flushes them out to disk. So the writer may not have flushed your data, depending upon how it's written. Best Erick On Dec 11, 2007 5:37 PM, Ruslan Sivak <[EMAIL PROTECTED]> wrote: I have an index of about 10mb. Since it's so small, I would like to kee

Re: Refreshing RAMDirectory

2007-12-11 Thread Ruslan Sivak
Michael McCandless wrote: Ruslan Sivak wrote: I have an index of about 10mb. Since it's so small, I would like to keep it loaded in memory, and reload it about every minute or so, assuming that it has changed on disk. I have the following code, which works, except it doesn't

Re: Refreshing RAMDirectory

2007-12-12 Thread Ruslan Sivak
re/when it was opened? Perhaps the easiest thing to do if you can't restructure your code as Michael suggested is just employ a singleton pattern to give you complete control over when/where a reader is opened. Best Erick On Dec 12, 2007 5:36 AM, Michael McCandless <[EMAIL PROTECTED]>

Re: Refreshing RAMDirectory

2007-12-12 Thread Ruslan Sivak
d not have to "open a new reader for every method call" -- you only need to open a new reader (and in your case, RAMDirectory) whenever the underlying index has changed. Mike Ruslan Sivak wrote: Thank you to everyone for your comments. I didn't realize that readers need to be ke

Re: Refreshing RAMDirectory

2007-12-12 Thread Ruslan Sivak
Michael McCandless wrote: Ruslan Sivak wrote: This seems to be problematic though. There are other things that depend on the reader that is not so obvious. For example, IndexReader reader=getReader(); IndexSearcher searcher=new IndexSearcher(reader); Hits hits=searcher.search(query

Re: Refreshing RAMDirectory

2007-12-13 Thread Ruslan Sivak
Michael McCandless wrote: Ruslan Sivak wrote: Michael McCandless wrote: Ruslan Sivak wrote: Since my app would be multithreaded, there could be multiple threads accessing the reader, while i'm reloading it. This means that if I close the reader, and another thread is using it, it

Distributed Indexes

2008-02-07 Thread Ruslan Sivak
I'm wondering if this is a problem that lucene users have already tackled. I have four copies of the application using a lucene index. They are located on two physical servers with two copies on each server accessing two copies of the lucene index. I use Windows FRS (File Replication Service

Re: Distributed Indexes

2008-02-07 Thread Ruslan Sivak
those files will not be deleted, while not blocking further updates to the index. Mike Ruslan Sivak wrote: I'm wondering if this is a problem that lucene users have already tackled. I have four copies of the application using a lucene index. They are located on two physical servers with

Re: Distributed Indexes

2008-02-07 Thread Ruslan Sivak
d work for this. It basically protects a consistent snapshot of your index, ensuring those files will not be deleted, while not blocking further updates to the index. Mike Ruslan Sivak wrote: I'm wondering if this is a problem that lucene users have already tackled. I have fou

Re: Distributed Indexes

2008-02-08 Thread Ruslan Sivak
e you hitting it with and how complex are the queries? Because unless you have *very* high query rate, I'd look at why my queries were taking so long before complexifying things this way. Best Erick On Feb 7, 2008 4:52 PM, Ruslan Sivak <[EMAIL PROTECTED]> wrote: My index is only 4mb.

Re: Distributed Indexes

2008-02-10 Thread Ruslan Sivak
So nobody's run into anything like this before? The need to share the index between many copies of the app possibly running on multiple servers? Russ Ruslan Sivak wrote: The app does other things then search the index. I'm basically using ColdFusion for the website and have four

Re: Distributed Indexes

2008-02-11 Thread Ruslan Sivak
n Feb 10, 2008, at 12:55 PM, Ruslan Sivak wrote: So nobody's run into anything like this before? The need to share the index between many copies of the app possibly running on multiple servers? Russ Ruslan Sivak wrote: The app does other things then search the index. I'm basically u

Re: Distributed Indexes

2008-02-11 Thread Ruslan Sivak
Cedric Ho wrote: On Feb 9, 2008 12:07 AM, Ruslan Sivak <[EMAIL PROTECTED]> wrote: The app does other things then search the index. I'm basically using ColdFusion for the website and have four instances running on two servers for load balancing. Each app does the searches, and

Simple Web Search

2008-06-16 Thread Ruslan Sivak
I would like to provide a simple web search of our web site. We have a lot of dynamic content, and instead of searching through the database, I would like to use some some sort of crawler to index the site. Is there one available that generates Lucene indexes? I looked into Nutch, but it loo

Re: Simple Web Search

2008-06-17 Thread Ruslan Sivak
bsight.net demo: http://search.dbsight.com Lucene Database Search in 3 minutes: http://wiki.dbsight.com/index.php?title=Create_Lucene_Database_Search_in_3_minutes DBSight customer, a shopping comparison site, (anonymous per request) got 2.6 Million Euro funding! On Mon, Jun 16, 2008 at 4:38 PM,

Re: Huge Index

2007-01-11 Thread Ruslan Sivak
Alice, If you have a computer that crashes once you put a lot of load on it, I'd say you have bigger problems then the speed of the indexing. A computer should not crash, no matter how much load you put on it. If you have such a huge database, I can't believe that you don't have access to o

optimizing single document searches

2007-02-27 Thread Ruslan Sivak
I am using Lucene in a little bit weird way, instead of searching all the documents for a specific query, I am searching a single document for many specific queries. On a single document of 10k characters, doing about 40k searches takes about 5 seconds. This is not bad, but I was wondering if

Re: optimizing single document searches

2007-02-28 Thread Ruslan Sivak
karl wettin wrote: 28 feb 2007 kl. 00.49 skrev Russ: Thanks, I will try it tommorow... Is it significantly different from using a standard index on a ramdir? A bit different. You can also try LUCENE-550. It has about the same speed as contrib/memory but can handle multiple documents and

retrieving matched slop

2007-03-20 Thread Ruslan Sivak
I have an app that searches a single document against many queries. Lets say the document was The quick brown fox jumped over the lazy dog. and my queries are SpanNearQuery("quick","brown",50) SpanNearQuery("quick","fox",50) I would like to retrieve the slop or some sort of score that was ma