Re: Re: wild card with keyword fileld

2005-07-19 Thread Rahul D Thakare
  Hi Ian, Yes, I did implement Eric's suggestion last week, but couldn't help. I am using a demo program from Lucene.jar to test this, let me put a code here. doc.add(Field.Keyword("keywords", "MAIN BOARD")); while indexing and for retrieving PerFieldAnalyzerWrapper analyzer = new PerF

Corrupt index?

2005-07-19 Thread Kipping, Peter
Hi, I rebuilt my index with only a few minor changes, and now when I search on it I get nothing but out of memory exceptions. I don't know what would cause this but is there a way you can test the integrity of your index? Thanks, Peter

Re: Searching for similar documents

2005-07-19 Thread Erik Hatcher
On Jul 19, 2005, at 12:42 PM, Kadlabalu, Hareesh wrote: If someone could someone please extract a version of this file from source control that corresponds to lucene 1.4.3 or if this can file can be back-ported, it would be greatly helpful. The old Jakarta Lucene Sandbox is still available vi

Re: QueryParser, phrases and stopwords

2005-07-19 Thread Stephen Halsey
Hi, I'm just writing to ask if you know if the the change discussed below is likely to be in the next version of Lucene as a default for StopFilter. I'm happy to apply the diff supplied by Mike Barry on my own source code to stop "climate control" matching "climate of control", but if its lik

Re: retrieving raw scores

2005-07-19 Thread yahootintin . 11533894
Thanks. I'll try that... --- java-user@lucene.apache.org wrote: Use HitCollector's collect method: > > http://lucene.apache.org/java/docs/api/org/apache/lucene/search/HitCollector.html#collect(int,%20float) > > Otis > > > --- [EMAIL PROTECTED] wrote: > > > hi, > > > > i need to retrieve th

Re: retrieving raw scores

2005-07-19 Thread Otis Gospodnetic
Use HitCollector's collect method: http://lucene.apache.org/java/docs/api/org/apache/lucene/search/HitCollector.html#collect(int,%20float) Otis --- [EMAIL PROTECTED] wrote: > hi, > > i need to retrieve the raw scores (3.6, 2.8, etc) for a hit and not > the normalized score (1.0, 0.8, etc). c

Re: New line

2005-07-19 Thread Otis Gospodnetic
I may be misunderstanding you, but \n is the "newline" character. http://www.google.com/search?q=newline%20character%20java Otis --- christopher may <[EMAIL PROTECTED]> wrote: > > I am using text files in my index. What can be used as the new line > character ? Say I have > A batch of apples

RE: Searching for similar documents

2005-07-19 Thread Otis Gospodnetic
Hareesh - you should be able to search for "David Spencer" and MoreLikeThis in the mailing list archives. He contributed that class with 1.4.3 support originally. Otis --- "Kadlabalu, Hareesh" <[EMAIL PROTECTED]> wrote: > I want to use Lucene version 1.4.3 and upgrade to the current SVN > vers

New line

2005-07-19 Thread christopher may
I am using text files in my index. What can be used as the new line character ? Say I have A batch of apples Apples . So the doc is returned as Apples and the summary is A batch of apples. If I want to then on the next line of the file put A state out west Arizona. This all blends together. W

RE: Searching for similar documents

2005-07-19 Thread Kadlabalu, Hareesh
I want to use Lucene version 1.4.3 and upgrade to the current SVN version later on. I was hoping if there is some way to use 'morelikethis' code in 1.4.3. If someone could someone please extract a version of this file from source control that corresponds to lucene 1.4.3 or if this can file can be

retrieving raw scores

2005-07-19 Thread yahootintin . 11533894
hi, i need to retrieve the raw scores (3.6, 2.8, etc) for a hit and not the normalized score (1.0, 0.8, etc). commenting out the normalizing code in Hits.java does what i want. is there a better way to do this? i'm wondering about adding a method to Similarity.java that looks like this: boole

RE: Searching for similar documents

2005-07-19 Thread Otis Gospodnetic
You need to point to the new Lucene Jar (from SVN, not the 1.4.3) version. Otis --- Derek Westfall <[EMAIL PROTECTED]> wrote: > I've been trying to compile the .java into a .class and I am getting > the > error below. > > Any idea what I am missing? > > Thanks, > > Derek > > > C:\CFusionM

RE: Searching for similar documents

2005-07-19 Thread Derek Westfall
I've been trying to compile the .java into a .class and I am getting the error below. Any idea what I am missing? Thanks, Derek C:\CFusionMX\wwwroot\WEB-INF\classes>javac morelikethis.java -classpath c:\cfusi onmx\lib\lucene-1.4.3.jar morelikethis.java:509: cannot resolve symbol symbol : vari

Re: Lucene and numerical fields search

2005-07-19 Thread Ray Tsang
Hi, That'd be great! I will post the code as is, hope to get feedbacks. There are some problems with BitSetQuery by itself with MultiSearcher due to different offsets, so it's in a bit of a mess right now, and cannot work by itself unless used with a custom ParallelMultiSearcher. However, It sho

QueryParser handling of backslash characters

2005-07-19 Thread Jeff Davis
Hi, I'm seeing some strange behavior in the way the QueryParser handles consecutive backslash characters. I know that backslash is the escape character in Lucene, and so I would expect "" to match fields that have two consecutive backslashes, but this does not seem to be the case. The fields

Summary

2005-07-19 Thread christopher may
How can I personalize the summary results ? Where and how does Lucene retrieve this data. The source location would be great to know but any help would be appreciated. Thanks - To unsubscribe, e-mail: [EMAIL PROTECTED] For ad

RE: n-gram indexing

2005-07-19 Thread Rajesh Munavalli
Let me explain a scenario where I would need to add the n-grams at indexing time. Consider two documents: Document 1: "united states is United airlines operates in 50 states. United states government." Document 2: "united states is United airlines operates in 50 states. United som

Re: Lucene and numerical fields search

2005-07-19 Thread Andrew Boyd
I second the motion. It sounds like a good solution to TooManyClauses exception. -Original Message- From: Otis Gospodnetic <[EMAIL PROTECTED]> Sent: Jul 16, 2005 5:59 PM To: java-user@lucene.apache.org, Ray Tsang <[EMAIL PROTECTED]> Subject: Re: Lucene and numerical fields search Hi Ray

Using QueryParser with a single field

2005-07-19 Thread Eyal
Hi, In my client application I allow the user to build a query by selecting a field from a combobox and entering a value to search by. I want the user to enter free text queries for each field, but I don't want to parse it myself so I thought I'd use QueryParser for that. My problem is that if th

RE: BOOLEAN OPERATOR HOWTO

2005-07-19 Thread Karthik N S
Hi Apologies... Given a Search word = 'erik OR hatcher AND otis OR gospodnetic' , Is it possible to RETURN COUNT occurances for each of the word with in the Searched documents. This would give me the Each word's Term Frequency. How to achieve this Thx in advance karthik -Original

Re: wild card with keyword fileld

2005-07-19 Thread Ian Lea
Have you tried Erik's suggestion from last week? http://mail-archives.apache.org/mod_mbox/lucene-java-user/200507.mbox/[EMAIL PROTECTED] There is certainly some case confusion in your examples there. Personally, I tend to just lowercase all text on indexing and searching. -- Ian. On 19 Jul 200