Accessing "term frequency information" for documents

2006-07-18 Thread ericbae
Hello. What I want to access through Lucene is this. I search for documents by inserting a particular query and for each result that is returned, I want to view its term frequency information. For example, if documents A and B are returned, is there an easy way to check which words appear in A a

Re: Index-Format difference between 1.4.3 and 2.0

2006-07-18 Thread kevin
Hi, how do I highlight the keyword in the search result summary? Can I use the /highlight/ package? Thanks!

Re: Index-Format difference between 1.4.3 and 2.0

2006-07-18 Thread Nicolas Lalevée
On Tuesday 18 July 2006 20:53, lude wrote: > Hello, > > sorry, didn't find the information elsewhere: > > 1.) Did the format of the lucene-index change between version 1.4.3 and > 2.0? 2.) Is it possible to use the old Luke-Tool with a new lucene 2 index? > > Thanks > lude AFAIK, the format ch

Index-Format difference between 1.4.3 and 2.0

2006-07-18 Thread lude
Hello, sorry, didn't find the information elsewhere: 1.) Did the format of the lucene-index change between version 1.4.3 and 2.0? 2.) Is it possible to use the old Luke-Tool with a new lucene 2 index? Thanks lude

Re: FileNotFoundException: occurs during the optimization of index

2006-07-18 Thread Michael McCandless
Can anybody suggest how to avoid this problem and concurrently access the index across the network while at the same time maintaining the index. Unfortunately, there are known issues with locking and NFS. The lock files (and underlying locking protocol) do not work reliably when used over NFS

Re: Empty fields ...

2006-07-18 Thread Erick Erickson
Quoting the guys "it depends" ... At root, a filter is a bitset. So size-wise, you are using 1 bit/doc (plus some small overhead). Both the storage required and the time to construct are dependent on the characteristics of your corpus. I guess the only way you can answer that for your particular
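To put rough numbers on the "1 bit/doc" point, here is a minimal sketch of the arithmetic. The class and method names are made up for illustration, the corpus size is a hypothetical figure, and the estimate ignores the "small overhead" mentioned above.

```java
public class FilterMemoryEstimate {

    /** Approximate bytes for one filter: one bit per document, rounded up to whole bytes. */
    static long bytesPerFilter(int maxDoc) {
        return ((long) maxDoc + 7) / 8;
    }

    /** Approximate bytes for several cached filters over the same index. */
    static long bytesForFilters(int filterCount, int maxDoc) {
        return filterCount * bytesPerFilter(maxDoc);
    }

    public static void main(String[] args) {
        int maxDoc = 1_000_000; // hypothetical corpus size, not taken from the thread
        System.out.println("one filter:  " + bytesPerFilter(maxDoc) + " bytes");
        System.out.println("10 filters:  " + bytesForFilters(10, maxDoc) + " bytes");
        System.out.println("200 filters: " + bytesForFilters(200, maxDoc) + " bytes");
    }
}
```

Under these assumptions, 200 filters over a million documents come to about 25 MB. Construction time still depends on the corpus, so measuring on the real index remains the only reliable answer.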

Re: Part of Index (spezial Field) into Memory

2006-07-18 Thread neils
Damn, of course - sorry & thanks :-)) I will try this tomorrow :-)

Re: Part of Index (spezial Field) into Memory

2006-07-18 Thread Yonik Seeley
On 7/18/06, neils <[EMAIL PROTECTED]> wrote: ok, I made the mentioned indexes for name, and the rest (2 indexes). When I now make a search with MultiSearcher on the name field, I can only access the values in the first (name) index. Not from the second one. Don't use a MultiSearcher or a ParallelMulti

Re: Empty fields ...

2006-07-18 Thread Dragon Fly
Thanks for the quick reply, Erick. A couple of follow-up questions though. If I had 200 fields (instead of 10), would you still recommend the same approach? Would 200 filters use up too much memory? Would 200 filters be too slow to construct/search? From: "Erick Erickson" <[EMAIL PROTECTED]> Re

Re: Building easy to use search guis? How to save queries...

2006-07-18 Thread mark harwood
Have you looked at the XML Query Parser in the "contrib" section of the SVN repository? I have been using XSLT recently to mix dynamic content with query templates expressed as XML and then parsed by this framework. If you don't like the default XML syntax used, the parser can be configured to s

Re: Empty fields ...

2006-07-18 Thread Erick Erickson
You could use a filter. You'd just construct a filter with the bits on for all docs that had that term. You could cache these to make future searches easier. I envision, say, caching one filter for each field and keeping them around so you could re-use them whenever you wanted. See ConstantScore
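The idea of one cached bitset per field can be sketched in plain Java. This is only a stand-in under stated assumptions: it does not use the Lucene API, documents are modelled as simple maps, and the class and method names (NotEmptyFilters, hasValue, restrict) are hypothetical.

```java
import java.util.BitSet;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class NotEmptyFilters {
    // Stand-in for an index: document number -> (field name -> value).
    private final List<Map<String, String>> docs;
    // One cached bitset per field, built on first use and then re-used.
    private final Map<String, BitSet> cache = new HashMap<>();

    NotEmptyFilters(List<Map<String, String>> docs) {
        this.docs = docs;
    }

    /** Bits are on for every document that has a non-empty value in the field. */
    BitSet hasValue(String field) {
        return cache.computeIfAbsent(field, f -> {
            BitSet bits = new BitSet(docs.size());
            for (int i = 0; i < docs.size(); i++) {
                String value = docs.get(i).get(f);
                if (value != null && !value.isEmpty()) {
                    bits.set(i);
                }
            }
            return bits;
        });
    }

    /** Keeps only the matches whose required fields are all non-empty. */
    BitSet restrict(BitSet matches, String... requiredFields) {
        BitSet result = (BitSet) matches.clone(); // leave the caller's bits untouched
        for (String field : requiredFields) {
            result.and(hasValue(field));
        }
        return result;
    }

    public static void main(String[] args) {
        NotEmptyFilters filters = new NotEmptyFilters(List.of(
                Map.of("Name", "Jane", "Addr", "1 Main St", "Zip", "90210"),
                Map.of("Name", "Jane", "Addr", "", "Zip", "10001"),
                Map.of("Name", "Jane", "Zip", "90210")));

        BitSet nameIsJane = new BitSet();
        nameIsJane.set(0, 3); // pretend the query Name:Jane matched all three docs

        // Name:Jane AND Addr IS NOT NULL AND Zip IS NOT NULL
        System.out.println(filters.restrict(nameIsJane, "Addr", "Zip")); // prints {0}
    }
}
```

In a real index the per-field bitset would be filled from the documents that contain any term in that field rather than from a map, and the cache would have to be dropped whenever the index is reopened.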

Empty fields ...

2006-07-18 Thread Dragon Fly
Hi, I have 10 fields in my index and some of the fields can be empty. I'd like to be able to do something like "IS NOT NULL" in SQL. For example: "Name:Jane AND Addr IS NOT NULL AND Zip IS NOT NULL" or "Zip:90210 AND Name IS NOT NULL". Is there an easy way to do this? Thank you.

Re: Part of Index (spezial Field) into Memory

2006-07-18 Thread neils
Hi, ok, I made the mentioned indexes for name, and the rest (2 indexes). When I now make a search with MultiSearcher on the name field, I can only access the values in the first (name) index. Not from the second one. When I search in the street field, I can access the values like street, phonenumber, etc b

Re: Building easy to use search guis? How to save queries...

2006-07-18 Thread Michael J. Prichard
That is really cool. But I am looking for something that I could save and then recreate. I am thinking of building an XML representation such as: or something similar. I just want to see if anyone has done something like this before even up to th

Re: Question about Field.TermVector

2006-07-18 Thread freeman
http://www.lucene.com.cn/fc.htm maybe it's helpful. On 7/18/06, Liao Xuefeng <[EMAIL PROTECTED]> wrote: hi, i'm using lucene 2.0. To index a very long text i use Field.Index.TOKENIZED & Field.Store.NO. I don't know how to get its content (actually, only need words near keywords, like google's

Question about Field.TermVector

2006-07-18 Thread Liao Xuefeng
hi, i'm using lucene 2.0. To index a very long text i use Field.Index.TOKENIZED & Field.Store.NO. I don't know how to get its content (actually, only need words near keywords, like google's results: ... found this keyword here...) without querying database? Someone told me using term vector to p