Updating documents in index with some fields not stored

2006-05-10 Thread Chun Wei Ho
I would like to make some updates to values within my large index. I understand that I have to delete and re-insert each document to be changed to do that. However I do have some large fields that are unstored (only indexed and no, these are not the fields that I am wanting to change), which means

Re: Searching across spaces

2006-05-10 Thread Robert Young
Yes, I looked at the synonym sollution from Lucene in Action but, as you point out, I have to know about it ahead of time. The only sollution I've had so far is to index the term without the spaces as well and then run two searches, one with spaces and one without. It would work but it just seems

Re: Searching across spaces

2006-05-10 Thread Erick Erickson
I suspect you have to do some fancy indexing. That is, index the following terms: sponge bob square pants spongebob squarepants. But this requires that you understand all the variations you want to hit on ahead of time. Or, you could conceivably deal with wildcard queries, but I think this is th

Re: wildcards in phrase searches

2006-05-10 Thread Erik Hatcher
On May 10, 2006, at 2:12 PM, [EMAIL PROTECTED] wrote: Thanks for your reply. Is there any sample code that would demonstrate how to use these classes properly to get the desired effect of what im looking for? I am not able to access Lucene's svn at the moment, but here's my local copy of

Re: (Lucene) tools/algorithms for co-occurrence terms computation

2006-05-10 Thread Grant Ingersoll
Take a look at my ApacheCon example code at http://www.cnlp.org/apachecon2005. In particular there is some sample code in the file IndexAnalysis.java that demonstrates what Karl is talking about. I don't think it is exactly what you want, but it shows how to get co-occurrence information from

RE: wildcards in phrase searches

2006-05-10 Thread Lee_Gary
Thanks for your reply. Is there any sample code that would demonstrate how to use these classes properly to get the desired effect of what im looking for? Thanks -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Wednesday, May 10, 2006 11:00 AM To: java-user@lucene.a

Searching across spaces

2006-05-10 Thread Robert Young
Hi, How can I search accross spaces in the document when the spaces aren't present in the search. For example, if the document contains "spongebob squarepants" but the user searches on "sponge bob" I would like to get the result. Thanks Rob --

Re: (Lucene) tools/algorithms for co-occurrence terms computation

2006-05-10 Thread karl wettin
On Wed, 2006-05-10 at 10:26 -0700, Xiaocheng Luan wrote: > Is there any Lucene tools Not that I know. > (or general tools/algorithms) that can compute the co-occurrence terms > for a given query (or term)? Might be slow, but you can work the TermFreqVector. It would probably be best to store th

Re: wildcards in phrase searches

2006-05-10 Thread Erik Hatcher
On May 10, 2006, at 1:47 PM, [EMAIL PROTECTED] wrote: Is it possible to have wildcards in a phrase search? For example, if my object is indexed with a phrase "benchmark properties", is there a way to specify a phrase search that uses wildcards, such as "benchmar* properties" or "benchmark

wildcards in phrase searches

2006-05-10 Thread Lee_Gary
Is it possible to have wildcards in a phrase search? For example, if my object is indexed with a phrase "benchmark properties", is there a way to specify a phrase search that uses wildcards, such as "benchmar* properties" or "benchmark pro*"? I have tried using WildcardQuery, but it doesnt seem to

RE: Lucene - index writer

2006-05-10 Thread Satuluri, Venu_Madhav
Hi, I don't think it should cause any conflicts in the index itself (the indexing process proper is decoupled from the analyzing), and if you can decide as to which analyzer to use when you're searching based on the field/kind of search, then it should be fine. Regards, Venu -Original Messa

Lucene - index writer

2006-05-10 Thread MAHADEVAN SAMBAMURTHY
Hi, I am new to Lucene and recently started making use of it. Can anyone please clarify the following: I have 2 services and need to have index on the fetched datas from respective services. I am trying to create the index for both of them in the same directory with 2 different analyzers like t

(Lucene) tools/algorithms for co-occurrence terms computation

2006-05-10 Thread Xiaocheng Luan
Hi, Is there any Lucene tools (or general tools/algorithms) that can compute the co-occurrence terms for a given query (or term)? For example, if the user types in "avian flu", the top co-occurrence terms may include "Hong Kong", "vaccine", "H5N1", or "pandemic", depending on the underlying da

Re: Lucene Index Stored on Database instead of FileSystem

2006-05-10 Thread Otis Gospodnetic
Nikhil, This will help: http://www.google.com/search?q=jdbcdirectory%20lucene We need to add this to the FAQ. Otis - Original Message From: Nikhil Goel <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Wednesday, May 10, 2006 11:38:03 AM Subject: Lucene Index Stored on Database i

Lucene Index Stored on Database instead of FileSystem

2006-05-10 Thread Nikhil Goel
Hi, By default, Lucene create a directory for index on FileSystem. Is there a plugin or support provided by Lucene to store the index in Databse instead of FileSystem, where:- 1) Directory can be equal to table in DB . 2) Each file in the index directory represents one row in that table. 3) Data

Re: Adding a new search field but needs searching for all

2006-05-10 Thread Erik Hatcher
Try a MatchAllDocsQuery combined in a BooleanQuery with a negative clause for b == 1. Erik On May 10, 2006, at 5:32 AM, Chun Wei Ho wrote: I have a large Lucene index that I am planning on adding one or more search fields, and perform searches on them. How do I include results from

RE: Lucene query question

2006-05-10 Thread Kinnar Kumar Sen, Noida
Hi Erik Is there any way to do a case insensitive search without modifying the index Regards and Thanks Kinnar Kumar Sen HCL Technologies Ltd. Sec-60, Noida-201301 Ph: - 09313297423 TO SUCEED BE DIFFERENT BE DARING AND BE THERE FIRST -Original Message- From: Erick Erickson

Re: Lucene query question

2006-05-10 Thread Mike Richmond
Hi Erick, I appreciate the help. I am using the "StandardAnalyzer" for both the query and for indexing. --Mike On 5/10/06, Erick Erickson <[EMAIL PROTECTED]> wrote: I'll take a quick stab at it. What analyzer are you using with the query? In the search page of Luke, near the upper right ther

Re: Lucene query question

2006-05-10 Thread Erick Erickson
I'll take a quick stab at it. What analyzer are you using with the query? In the search page of Luke, near the upper right there's the "Analyzer to use for query parsing:" box. You might try the WhitespaceAnalyzer, since that shouldn't do anything "interesting". Also, below the search box on the s

Re: Lucene query question

2006-05-10 Thread Mike Richmond
Mr. Gospodnetic, Thanks for the quick response. You make a good point about the field being tokenized, and I initially had the e-mail field UN_TOKENIZED but it did not change the result of the query (my example search was still failing). Do you have any ideas on what could be causing that? Th

lowering score of doc if synonyms matched (synonyms indexed)

2006-05-10 Thread zzzzz shalev
i am currently adding synonyms at index time (and not expanding the query), i fear that there is a problem with this implementation: is there a way to lower the score of a document if it was found due to a synonyms match and not due to a match of the word queried. from what i understand th

RE: lowering score of doc if synonyms matched (synonyms indexed)

2006-05-10 Thread Ziv Gome
As you might have already seen, Andrew Schetinin and I have published (at http://mail-archives.apache.org/mod_mbox/lucene-java-user/200603.mbox/[EMAIL PROTECTED]) a source code that handles synonyms at search time (query expansion). This code includes also a de-boost factor for synonyms (compa

Adding a new search field but needs searching for all

2006-05-10 Thread Chun Wei Ho
I have a large Lucene index that I am planning on adding one or more search fields, and perform searches on them. How do I include results from the other documents that do not have the new field? For example, I have 10 million documents in a index, and I update 200 of them adding the field "b" =