Re: Lucene on SQL 2005

2006-12-04 Thread Lukas Vlcek
Hi, You should consider using Compass . Lukas On 12/5/06, Saroj K M <[EMAIL PROTECTED]> wrote: Dear All, I am a new user to Lucene. I am having a requirement as follows. I am using SQL Server 2005 database, The Database having a Table named --- Prod

Lucene on SQL 2005

2006-12-04 Thread Saroj K M
Dear All, I am a new user to Lucene. I am having a requirement as follows. I am using SQL Server 2005 database, The Database having a Table named --- Product and its columns are 1 Prod_id 2 Prod_name 3 Prod_desc 4. Prod_Price I need to index the table and search. I should be Pas

Lucene on SQL 2005

2006-12-04 Thread Saroj K M
Dear .., I am a new user to Lucene. I am having a requirement as follows. I am using SQL Server 2005 database, The Database having a Table named --- Product and its columns are 1 Prod_id 2 Prod_name 3 Prod_desc 4. Prod_Price I need to index the table and search. I should be Pass

Lucene search performance: linear?

2006-12-04 Thread Zhang, Lisheng
Hi, I indexed first 220,000, all with a special keyword, I did a simple query and only fetched 5 docs, with Hits.length()=220,000. Then I indexed 440,000 docs, with the same keyword, query it again and fetched a few docs, with Hits.length(0=440,000. I found that search time is about linear: 2nd

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Risov, Maria
It's in Contributions rather than being in the core Lucene folder. Marie Risov -Original Message- From: Erick Erickson [mailto:[EMAIL PROTECTED] Sent: Monday, December 04, 2006 2:35 PM To: java-user@lucene.apache.org Subject: Re: Problem: "The selected method Keyword was not found"

Re: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Erick Erickson
It's a secret, do you have the secret code ??? Try SnowballAnalyzer Erick On 12/4/06, Aaron Shaw <[EMAIL PROTECTED]> wrote: thank you both for your help, got it working now One additional thing, how do I get it to use stemming? I'm presuming I have to use one of the analyzers other tha

Re: Dreaded optimize (again!)

2006-12-04 Thread Michael McCandless
Stanislav Jordanov wrote: How much free disk space should be there (with respect to the index size) in order for the optimize to complete successfully? Good question! Really this detail should be included in the Javadoc for optimize (and more generally addDocument, addIndexes(*), etc.). I wi

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Aaron Shaw
thank you both for your help, got it working now One additional thing, how do I get it to use stemming? I'm presuming I have to use one of the analyzers other than "simpleanalyzer" but I can't seem to find out which? Samir Abdou wrote: > > In your classpath ! > > -Message d'origine-

Re: lucene - general question

2006-12-04 Thread Erick Erickson
I also think Lucene is overkill, assuming that you don't need to persist the XML you're talking about. Why not just use Java Sets? See http://java.sun.com/docs/books/tutorial/collections/interfaces/set.html for how to do set intersection, union, difference, etc. And what's the difference between

Re: Full disk space during indexing process with 120 gb of free disk space

2006-12-04 Thread Ben Litchfield
PDFBox version 0.6 is quite old and there have been many improvements, you should look at moving to the newest version 0.7.3, although from the description of your problem it probably would not resolve it. If there are a large number of temp files with "pdfbox" in the name then you are most li

Full disk space during indexing process with 120 gb of free disk space

2006-12-04 Thread Ariel Isaac Romero Cartaya
Hi every body: I am getting a problem during the indexing process, I am indexing big amounts of texts most of them in pdf format I am using pdf box 0.6 version. The space in hard disk before that the indexing process begin is around 120 Gb but incredibly even when my lucene index doesn't have y

Re: Incremental Index and Comparing different Scores from different Index

2006-12-04 Thread Soeren Pekrul
Hello Nils, how about having one index for all documents with two fields "date" and "content"? You can search documents for a specific date and the score uses the global idf of all documents. Sören Nils Höller schrieb: I thought of making the idf function a NOOP, since this is somehow one o

Re: Problem: "The selected method Keyword was not found"

2006-12-04 Thread David Sheldon
On Mon, Dec 04, 2006 at 02:43:13AM -0800, Aaron Shaw wrote: > field = CreateObject("java", "org.apache.lucene.document.Field"); > > I'm assuming the next line needs to be the equivalent of this: > > Field field = new Field("url","http://localhost:8500/cfdocs/dochome.htm";, > > however I'm not s

Re: Incremental Index and Comparing different Scores from different Index

2006-12-04 Thread Nils Höller
Am Freitag, den 01.12.2006, 11:54 -0800 schrieb Chris Hostetter: > the short answer is you can't, not with the DefaultSimilarity, but you > might be able to write a custom Similarity that makes the scores > comparable by making the idf function a NOOP (of course, then your scores > won't be as

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Samir Abdou
In your classpath ! -Message d'origine- De : Aaron Shaw [mailto:[EMAIL PROTECTED] Envoyé : lundi, 4. décembre 2006 12:02 À : java-user@lucene.apache.org Objet : RE: Problem: "The selected method Keyword was not found" Is there any quick way of telling whether I'm using 1.9 or 2? I want

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Aaron Shaw
Is there any quick way of telling whether I'm using 1.9 or 2? I want to make sure I successfully removed the old version Samir Abdou wrote: > > If you're trying to use the 1.9 version it's OK! The 'Keyword' static > method > is not available in the 2.0 version. > > Field field = new Field("ur

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Samir Abdou
If you're trying to use the 1.9 version it's OK! The 'Keyword' static method is not available in the 2.0 version. Field field = new Field("url","http://localhost:8500/cfdocs/dochome.htm";, >...is this creating a new object or calling a method of my >existing field object? This is creating a ne

Re: Multiple character wildcard search

2006-12-04 Thread Bhavin Pandya
Dont use " * " in term Query query = new PrefixQuery(new Term("name","z")); - Bhavin pandya - Original Message - From: Eshwaramoorthy Babu To: java-user@lucene.apache.org ; Bhavin Pandya Sent: Monday, December 04, 2006 3:16 PM Subject: Re: Multiple character wildcard sea

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Aaron Shaw
Hi, thanks for your help. Still having a problem after looking at that documentation though, it still seems to have Field.Keyword etc shown in the API? I've tried turning that into cf script however I'm not quite sure whether I'm doing it correctly I'm creating a new field object like so: fie

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Samir Abdou
Field field = new Field("url","http://localhost:8500/cfdocs/dochome.htm";, Field.Store.YES, Field.Index.UN_TOKENIZED); You need tot translate this to cf script :-) For more information, take a look to http://lucene.apache.org/java/1_9_1/api/index.html Welcome, Samir -Message d'origine---

Re: Multiple character wildcard search

2006-12-04 Thread Eshwaramoorthy Babu
HI, Do I have to use any specific analyser to use PrefixQuery. I am using WhitespaceAnalyzer and below is how I am populating the fileds when dding to writer. Document contactDocument = new Document(); contactDocument.add(new Field("type",contact.getType(),Field.Store.NO, Field.Index.TOKENIZED

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Aaron Shaw
Hi, I've tried it in 2.0 and 1.9 What syntax should I be using to add documents in the new version instead? Thanks, Aaron Samir Abdou wrote: > > Hi, > > The method Field.Keyword is not longer available in Lucene's last version! > > Which version are you using? > > Regards, > Samir > >

RE: Problem: "The selected method Keyword was not found"

2006-12-04 Thread Samir Abdou
Hi, The method Field.Keyword is not longer available in Lucene's last version! Which version are you using? Regards, Samir -Message d'origine- De : Aaron Shaw [mailto:[EMAIL PROTECTED] Envoyé : lundi, 4. décembre 2006 11:08 À : java-user@lucene.apache.org Objet : Problem: "The selected

Problem: "The selected method Keyword was not found"

2006-12-04 Thread Aaron Shaw
Hi, I'm attempting to use Lucene under Coldfusion MX, however when I try to create and index I am coming up against the following error message when I try to add a document: The selected method Keyword was not found. I'm using the following lines of code to try to create the index: analyzer =

Dreaded optimize (again!)

2006-12-04 Thread Stanislav Jordanov
Guys, there's another aspect of the index optimize operation, that confuses us a lot - the free disk space it requires to complete successfully. Initially we thought that an amount of free disk space equal to the index size (prior to optimization) should suffice. Then it became clear that havin

Re: lucene - general question

2006-12-04 Thread Eshwaramoorthy Babu
Hi Buics, Thanks for your response.. I will receive 2 xml files, I have to compare these 2 and generate a xml report with below 1. Matching id's from both xml 2. Duplicate id's from both xml The requirement is for reconcilation of 2 application data. For this I have to get the get all id's fr

Re: Multiple character wildcard search

2006-12-04 Thread Eshwaramoorthy Babu
Hi Bhavin, Thanks for your response. I tried the below Query query = new PrefixQuery(new Term("name", "Z*")); but it still the query returns 0 result. Also can you please tell me how to search form JAVA collection? Thanks, Babu On 12/4/06, Bhavin Pandya <[EMAIL PROTECTED]> wrote: Babu,

Re: lucene - general question

2006-12-04 Thread [EMAIL PROTECTED]
Hi Babu, your sample xml schema contains only few fields, why not consider to use db (mysql) todo: read your xml file, then use digester to convert to java object after that insert it your db. when your done with your insert stuff, you can simply query your db anytime you like. cheers, Buics

Re: Multiple character wildcard search

2006-12-04 Thread Bhavin Pandya
Babu, Use "PrefixQuery" and if you are looking for phrase also then "PhrasePrefixQuery"... check api for usage - Bhavin pandya - Original Message - From: "Eshwaramoorthy Babu" <[EMAIL PROTECTED]> To: Sent: Monday, December 04, 2006 2:15 PM Subject: Multiple character wildcard

Re: lucene - general question

2006-12-04 Thread Lukas Vlcek
Hi, Try to look at Groovy (I haven't used it yet but some people say it is much easire to work with XML file in Groovy then in Java). It produces class files so it can be integrated with your exisitng Java code. 6MB file is not that much unless you are working in limited environment (like mobile

Multiple character wildcard search

2006-12-04 Thread Eshwaramoorthy Babu
Hi, Can anyone please tell me how to specify multiple character wildcard searches in "Term" Below is my requirement 1) I want search all names that starts with Z (Z*) 2) My programme will receive list of names in JAVA collection (Vector or ArrayList or Hashtable), I want to search for all

Re: lucene - general question

2006-12-04 Thread Eshwaramoorthy Babu
Hi Lukas, Thanks for your response. I was planning to search for 1st xml ID's in 2nd XML. so I thought of using lucene for search. Can you please suggest me some scripting solution. Is perl right solution? Thanks, Babu On 12/4/06, Lukas Vlcek <[EMAIL PROTECTED]> wrote: Hi Babu, Sorry but