Batch deletions of Records from index

2005-11-24 Thread Manish.Chowdhary
hii erik and other gurus, I have a big index with millions of document where every indexed document points to some system files and database rows. we are archiving the system files and database rows older then three months. Need is to device a solution that will delete all the old records ( thre

Re: Is It a Good Idea to Save Frequently Search Results in Database to Make It Faster?

2005-11-24 Thread Victor Lee
No, I am in the middle of development, actually more like design phase. So I don't know the speed of my searches. The final product will have heavy load on SELECT query if using database with possibility of joining 2 tables. I don't worry about speed of searches during light traffic. I am wo

Re: Is It a Good Idea to Save Frequently Search Results in Database to Make It Faster?

2005-11-24 Thread Erik Hatcher
How fast are your searches currently? With just a pure searcher.search(Query) using a basic query type like TermQuery, I very seriously doubt you'd beat MySQL performance. What kind of Query are you using for your searches? Erik On 24 Nov 2005, at 17:54, Victor Lee wrote: Sorry

Re: Is It a Good Idea to Save Frequently Search Results in Database to Make It Faster?

2005-11-24 Thread Victor Lee
Sorry, actually I meant all search results, not just frequent results. And there is only one search term per search, it's the stuff that belongs to the search terms change often. Victor Lee <[EMAIL PROTECTED]> wrote: Hi, I use Lucene to index stuff that are changed very often but don't need

Is It a Good Idea to Save Frequently Search Results in Database to Make It Faster?

2005-11-24 Thread Victor Lee
Hi, I use Lucene to index stuff that are changed very often but don't need to be real-time to searchers. e.g. the search result can be changed couple times per minute, but I only need to show the change every 5 minutes or so. Is it a good idea to save the search result to a database like m

Re: http://www.textmining.org/ is "hacked"

2005-11-24 Thread Guilherme Barile
I have some issues with textmining extracting text from odd word documents (ok, any .doc file is odd) The project seems somehow abandoned, and has no clear licence, so if you wish to improve it (maybe create a dev.java.net project for it) i'd be glad to help Also if you find something else (cross p

Re: http://www.textmining.org/ is "hacked"

2005-11-24 Thread Patrick Kimber
Thanks for the very quick response. On 24/11/05, Guilherme Barile <[EMAIL PROTECTED]> wrote: > I have it here, uploaded it to rapidshare > http://rapidshare.de/files/8097202/textmining.zip.html > > c ya > > > On Thu, 2005-11-24 at 16:46 +, Patrick Kimber wrote: > > Hi > > I am trying to downlo

Re: http://www.textmining.org/ is "hacked"

2005-11-24 Thread Guilherme Barile
I have it here, uploaded it to rapidshare http://rapidshare.de/files/8097202/textmining.zip.html c ya On Thu, 2005-11-24 at 16:46 +, Patrick Kimber wrote: > Hi > I am trying to download the source code for > tm-extractors-0.4.jar > from > http://www.textmining.org/ > > Looks like the site h

http://www.textmining.org/ is "hacked"

2005-11-24 Thread Patrick Kimber
Hi I am trying to download the source code for tm-extractors-0.4.jar from http://www.textmining.org/ Looks like the site has been hacked. Does anyone know the location of the CVS or SVN repository? Thanks for your help... Pat - T

Re: RE how to change scoring tactics?

2005-11-24 Thread Arjen van der Weijden
Merci beaucoup Bertrand, it works! salut Arjen |-+> | | "Bertrand VENZAL"| | | | | || | | 11/22/2005 04:07 | | | PM | |

RE: Searching for "keyword" fields using QueryParser

2005-11-24 Thread Tim.Wright
Excellent, that's exactly what I needed. Many thanks! Cheers, Tim. -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: 24 November 2005 14:51 To: java-user@lucene.apache.org Subject: Re: Searching for "keyword" fields using QueryParser Tim, The trick is to use PerFi

Re: Searching for "keyword" fields using QueryParser

2005-11-24 Thread Erik Hatcher
Tim, The trick is to use PerFieldAnalyzerWrapper with QueryParser, using StandardAnalyzer as the default, and using KeywordAnalyzer for each of the fields that should not be analyzed. KeywordAnalyzer is in the trunk of Subversion right now, not in a released version.

Searching for "keyword" fields using QueryParser

2005-11-24 Thread Tim.Wright
Hi, Our index has a large text field, and a number of "keyword" fields with things such as the publication code, article reference and so on. We're analysing using the StandardAnalyzer, which works well. Obviously the fields which are defined as Field.Keyword don't run through the analyzer. Th

Re: Search clustering question

2005-11-24 Thread Dawid Weiss
clusters documents and even labels them, and it takes a few seconds (up to two seconds) to cluster 100 search results snippets. Thanks for the info, Lorenzo. I'm just a little surprised at the timings you gave -- that's on a slow machine, right? On a PIII, 3GHZ the timings are between 250ms

Re: Search clustering question

2005-11-24 Thread Lorenzo Viscanti
Clustering is an intensive task. Carrot2 is an excellent framework that clusters documents and even labels them, and it takes a few seconds (up to two seconds) to cluster 100 search results snippets. If you are going to cluster entire documents you'll have to wait longer. Lorenzo On 11/23/05, Supr