Re: bunch of newbie queries, PS

2008-10-12 Thread Chris Hostetter
: the "anonymous" SVN (http://svn.apache.org/repos/asf/lucene/java/trunk/) does not work for me (I am using Eclipse 3.3, and have the subversion plug-in, v. 1.2.4, and have successfully checked out code using SVN from other repositories). Apparently here I need a user-id and pwd -- what is

Re: Enumerating all the terms of a particular field

2008-10-12 Thread Chris Hostetter
Someone just asked this question a week ago (unfortunately they asked it on the wrong list)... http://www.nabble.com/Can-I-filter-the-results-returned-by-IndexReader.terms%28field%29-using-a-field--to19849593.html#a19849593 : Subject: Enumerating all the terms of a particular field -Hoss --

Re: Searching sets of documents

2008-10-12 Thread 叶双明
all folders which match "A AND Y", do you search by file name? If yes, A or Y in "A AND Y" is a String too, so you can do it like this: construct a Lucene Document for each folder, with the names of the files under the folder as the search data. 2008/10/13 <[EMAIL PROTECTED]> > Hi, > > I want to search for sets
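
A minimal sketch of the one-Document-per-folder idea from the reply above, in plain Java rather than Lucene itself. The class name FolderSearchSketch and the methods indexFolders and matchAll are hypothetical, the per-folder "document" is modeled as the union of the terms of everything in the folder, and the sample data is taken from the original question. With a real index the merged terms would go into a field of a Lucene Document and the lookup would be a boolean AND query.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration: one searchable entry per folder, not per file.
class FolderSearchSketch {

    // Merge the terms of every document in a folder into a single term set.
    static Map<String, Set<String>> indexFolders(Map<String, List<Set<String>>> folders) {
        Map<String, Set<String>> index = new LinkedHashMap<>();
        for (Map.Entry<String, List<Set<String>>> e : folders.entrySet()) {
            Set<String> merged = new HashSet<>();
            for (Set<String> docTerms : e.getValue()) {
                merged.addAll(docTerms);
            }
            index.put(e.getKey(), merged);
        }
        return index;
    }

    // Return the folders whose merged term set contains all required terms (AND semantics).
    static List<String> matchAll(Map<String, Set<String>> index, String... required) {
        List<String> hits = new ArrayList<>();
        for (Map.Entry<String, Set<String>> e : index.entrySet()) {
            if (e.getValue().containsAll(Arrays.asList(required))) {
                hits.add(e.getKey());
            }
        }
        return hits;
    }

    static Set<String> terms(String... t) {
        return new HashSet<>(Arrays.asList(t));
    }

    public static void main(String[] args) {
        // Sample data from the question: two folders with two documents each.
        Map<String, List<Set<String>>> folders = new LinkedHashMap<>();
        folders.put("folder A", Arrays.asList(terms("X", "Y"), terms("Y", "Z")));
        folders.put("folder B", Arrays.asList(terms("X", "Y"), terms("A", "Z")));

        Map<String, Set<String>> index = indexFolders(folders);
        System.out.println(matchAll(index, "A", "Y"));
    }
}
```

Note that in this model a folder matches even when the required terms come from different files in it, which is the point of indexing at folder granularity.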

Enumerating all the terms of a particular field

2008-10-12 Thread Khawaja Shams
Hello, How can I get a list of all the terms for a particular field? Is the right approach to extend FilteredTermEnum? Thanks!!
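
One way to picture the usual answer, sketched in plain Java rather than against the Lucene API: the term dictionary is kept sorted by field and then by term text, so listing one field's terms means seeking to the smallest possible term of that field and reading forward until the field changes. To my understanding this is what IndexReader.terms(Term) allows in the Lucene releases of that period, but the classes below (FieldTermSketch, FieldTerm, termsForField) are hypothetical stand-ins, not Lucene types.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeSet;

// Hypothetical model of a sorted term dictionary; not the Lucene classes.
class FieldTermSketch {

    // A (field, text) pair ordered by field first, then by text.
    static final class FieldTerm implements Comparable<FieldTerm> {
        final String field;
        final String text;

        FieldTerm(String field, String text) {
            this.field = field;
            this.text = text;
        }

        @Override
        public int compareTo(FieldTerm other) {
            int c = field.compareTo(other.field);
            return c != 0 ? c : text.compareTo(other.text);
        }
    }

    // Collect every term text that belongs to the given field, in sorted order.
    static List<String> termsForField(TreeSet<FieldTerm> dictionary, String field) {
        List<String> result = new ArrayList<>();
        // Seek to the first entry >= (field, ""), the smallest possible term in the field.
        for (FieldTerm t : dictionary.tailSet(new FieldTerm(field, ""), true)) {
            if (!t.field.equals(field)) {
                break; // walked past the last term of this field
            }
            result.add(t.text);
        }
        return result;
    }

    public static void main(String[] args) {
        TreeSet<FieldTerm> dictionary = new TreeSet<>();
        dictionary.add(new FieldTerm("body", "lucene"));
        dictionary.add(new FieldTerm("title", "search"));
        dictionary.add(new FieldTerm("title", "index"));
        dictionary.add(new FieldTerm("url", "apache"));

        System.out.println(termsForField(dictionary, "title"));
    }
}
```

The early break is what keeps the scan proportional to the number of terms in the one field rather than the whole dictionary.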

Detecting why a collection of documents matched a query

2008-10-12 Thread Khawaja Shams
Hello, I noticed that the indexSearcher.explain() method is not supposed to be run for a large collection of documents, so I am looking for an alternative that just explains why a document matched, without all the scoring information. Basically, I would like to know which field of the document was resp

Searching sets of documents

2008-10-12 Thread spring
Hi, I want to search for sets of documents. For instance I index some folders with documents in it and now I do not want to find certain documents but folders. Sample: folder A doc 1, contains X, Y doc 2, contains Y, Z folder B doc 3, contains X, Y doc 4, contains A, Z Now I want to fi

Re: Access Scoring Values of Lucene for Post-Processing

2008-10-12 Thread Grant Ingersoll
Have a look at the o.a.lucene.search.function package and the ValueSourceQuery. You will probably be able to factor in those pieces during scoring, so no need to resort at all. -Grant On Oct 8, 2008, at 11:15 AM, excitingComm2 wrote: Hi everybody, I am using Lucene for searching items i

Re: Retrieving Top Terms for a subset of the index (or for all results of a query)

2008-10-12 Thread Grant Ingersoll
How large of a subset are you talking about? You might look at the FilteredTermEnum class, but you will probably have to do some work on it to extend it to what you want. If you are talking about a smallish subset (say, at most a couple hundred docs), then you could store Term Vectors and use the TermVe

Re: Is lucene right for us

2008-10-12 Thread Grant Ingersoll
Lucene should work quite well for this; you'll just need some infrastructure around it to get the file and extract the contents (see Lucene's Tika project). And, yes, Lucene is thread-safe, so you can index safely as you describe. On Oct 11, 2008, at 10:22 AM, Mag Gam wrote: Hello All,