Re: Wording on FAQ entry

2005-10-27 Thread Chris Hostetter
: So, the part in question is "Multiple search threads may access the : index concurrently" : : However, I think the question is really asking "Can multiple threads : access the same SEARCHER at the same time?" yes, I've clarified the answer... Yes, IndexSearcher is thread-safe. Multiple sear

Re: trying to boost a phrase higher than its individual words

2005-10-27 Thread Chris Hostetter
: I've constructed the following query, using boost values... : : [+(content:"classical music"^5.0 content:classical^0.1 : content:music^0.1)] : : ...but the boost values don't seem to affect the order of the search : results. What you're describing should work. What does the value of Index

Re: trying to boost a phrase higher than its individual words

2005-10-27 Thread jian chen
Hi, It seems what you want to achieve could be implemented using the Cover Density algorithm. I am not sure if any existing query classes in the Lucene distribution does this already. But in case not, this is what I am think about: Make a custom query class, called CoverDensityQuery, which is mod

trying to boost a phrase higher than its individual words

2005-10-27 Thread Andy Lee
I have a situation where I want to search for individual words in a phrase as well as the phrase itself. For example, if the user enters ["classical music"] (with quotes) I want to find documents that contain "classical music" (the phrase) *and* the individual words "classical" and "music"

Hello

2005-10-27 Thread jaikumar Ganesh
Hello I was playing with the lucene based search engine at http://lucene.apache.org/java/docs/index.html I have been able to create the index and search for the query string at the cmd line. However, for the html version when i develop and set the index as "/opt/lucene/index" and then use the dem

Re: How to combine multiple fielded queries in Lucene

2005-10-27 Thread msftblows
You can write an expansionQuery class and enumerate through the fields and send in a query that does that. I did something like that not long ago, it would send a query that would first be "" then AND then OR... -Original Message- From: Xin Herbert Wu <[EMAIL PROTECTED]> To: java-use

Search problems

2005-10-27 Thread Sharma, Siddharth
Hi My index has 4 keyword fields and one unindexed field. I want to search by the 4 keyword fields and return the one unindexed field. I can iterate over the documents via Luke. But when I search for the same values that I see via Luke, it does not find the document. Out of the 4 fields, 2 are a

Re: Index File Size

2005-10-27 Thread Erik Hatcher
On 27 Oct 2005, at 10:21, Chandramohan wrote: In general, will index size be equal to the size of the document? Also, does Lucene employ any index compression schemes? I am a relatively new user of Lucene and I just love it! It depends on how you create Field's. The general rule of thumb I

Re: How to combine multiple fielded queries in Lucene

2005-10-27 Thread Erik Hatcher
On 27 Oct 2005, at 11:32, Peter Kim wrote: If you're using the QueryParser, you just prefix the field-specific part with the field. For example, the query in your email should be submitted to the query parser as: (Title:"baseball" OR Summary:"white sox") AND Publication_date:"199501 TO 200

Re: Better analysis of hyphenated words

2005-10-27 Thread Erik Hatcher
On 27 Oct 2005, at 12:13, Rob Young wrote: I'm using StandardAnalyzer during indexing and I have noticed that it splits hyphenated words in two, ditching the hyphen. This is messing up some of my search results. I would like to keep using StandardAnalyzer because it's very good on the whole

Re: Terms contain spaces

2005-10-27 Thread Volodymyr Bychkoviak
Hi. You can extend QueryParser and overwrite method getFieldQuery(String field, String queryText) and define your own behavior. But be careful not to call static method parse on it because you'll get old QP. Mike Streeton wrote: I have been given an index with a term that has been stored a

Terms contain spaces

2005-10-27 Thread Mike Streeton
I have been given an index with a term that has been stored as a keyword and contains spaces. We are parsing a query using QueryParser but given 'myfield:"abc def"' it generates a PhraseQuery for myfield:abc and myfield:def. What is needed is a TermQuery(new Term(myfield,"abc def")). Can you tell q

RE: How to combine multiple fielded queries in Lucene

2005-10-27 Thread Peter Kim
If you're using the QueryParser, you just prefix the field-specific part with the field. For example, the query in your email should be submitted to the query parser as: (Title:"baseball" OR Summary:"white sox") AND Publication_date:"199501 TO 200412" or if you're creating is programmatically, th

How to combine multiple fielded queries in Lucene

2005-10-27 Thread Xin Herbert Wu
I am developing a search application via Lucene and ran into a problem can be described as: Assume we have document schema with these fields: Title, Author, Summary, Publication_date(Date), Content The user query is like this: Find all documents where either Title field has word "baseba

Better analysis of hyphenated words

2005-10-27 Thread Rob Young
Hi, I'm using StandardAnalyzer during indexing and I have noticed that it splits hyphenated words in two, ditching the hyphen. This is messing up some of my search results. I would like to keep using StandardAnalyzer because it's very good on the whole, however I would like to add an extra te

Wording on FAQ entry

2005-10-27 Thread Dan Quaroni
Hi, I just wanted to check the wording on this FAQ entry to see if it means what it says, or what I'd prefer it to mean. The entry is: "Is the IndexSearcher thread-safe? Yes, IndexSearcher is thread-safe. Multiple search threads may access the index concurrently without any problems. " So, the

Index File Size

2005-10-27 Thread Chandramohan
In general, will index size be equal to the size of the document? Also, does Lucene employ any index compression schemes? I am a relatively new user of Lucene and I just love it! thanks! __ Yahoo! Mail - PC Magazine Editors' Choice 200

Re: java on 64 bits

2005-10-27 Thread Roxana Angheluta
Hello everyone! Here are the conclusions we got after digging more into the problem, maybe they help someone: 1) Filling of the hard-drive was not due to java 64, this was coincidentally. 2) The intermediate files Yonik talked about (*.f*) were present because the indexing process was mergin