Re: regarding FieldSelector

2007-09-14 Thread Chris Hostetter
: well, I can't see any doc() method with FieldSelector argument, perhaps this : is provided in nightly builds of Lucene, currently I am using Lucene v2.1.0 2.2 was released in June, in it the Searchable interface defines a doc method which takes a FieldSelector... http://lucene.apache.org/java

Re: Java Heap Space -Out Of Memory Error

2007-09-14 Thread Sebastin
HI testn, it gives performance improvement while optimizing the Index. Now i seprate the IndexStore on a daily basis.(ie) For Every Day it create a new Index store ,sep- 08-2007,sep-09-2007 like wise it will minimize the size of the IndexStore.could you give me an idea on how to open every day

Lucene indexes and relationship

2007-09-14 Thread Mohammad Norouzi
Hello, In our application, we have many categories (indexes) in which different kind of information have been indexed. we provided a facility for our users to opt their category to search and we also provided a way that they select more than one category to search, afterwards, we must return back t

Re: regarding FieldSelector

2007-09-14 Thread Mohammad Norouzi
well, I can't see any doc() method with FieldSelector argument, perhaps this is provided in nightly builds of Lucene, currently I am using Lucene v2.1.0 I am using org.apache.lucene.search.Searcher and new IndexSearcher(a_directory) to instantiate an instance of it On 9/14/07, Grant Ingersoll <[E

Re: UTF-8/unicode input in querying in Lucene

2007-09-14 Thread Chris Hostetter
: > I don't believe the QueryParser actually treats \uN as a special : > syntax : : LUCENE-716 added unicode escapes. doh! that's what i get for assuming the random solr port i used to sanity check my assumption was relatively up to date. LUCENE-1000 -Hoss ---

Re: UTF-8/unicode input in querying in Lucene

2007-09-14 Thread Yonik Seeley
On 9/14/07, Chris Hostetter <[EMAIL PROTECTED]> wrote: > I don't believe the QueryParser actually treats \uN as a special > syntax LUCENE-716 added unicode escapes. -Yonik - To unsubscribe, e-mail: [EMAIL PROTECTED] For addi

Re: UTF-8/unicode input in querying in Lucene

2007-09-14 Thread Chris Hostetter
: The page http://lucene.apache.org/java/docs/queryparsersyntax.html does not : mention that \u Unicode syntax is supported. : For example, \u0048\u0045\u004c\u004c\u004f is HELLO. : : Please add this to the page, it took experimentation to discover it. I don't believe the QueryParser actua

UTF-8/unicode input in querying in Lucene

2007-09-14 Thread Lance Norskog
Hi- The page http://lucene.apache.org/java/docs/queryparsersyntax.html does not mention that \u Unicode syntax is supported. For example, \u0048\u0045\u004c\u004c\u004f is HELLO. Please add this to the page, it took experimentation to discover it. Thanks, Lance Norskog

Re: Java Heap Space -Out Of Memory Error

2007-09-14 Thread testn
So did you see any improvement in performance? Sebastin wrote: > > It works finally .i use Lucene 2.2 in my application.thanks testn and > Mike > > Michael McCandless-2 wrote: >> >> >> It sounds like there may be a Lucene version mismatch? When Luke was >> used >> it was likely based on Luc

Problem compiling with JBoss portal server

2007-09-14 Thread Vijay Kumar
Hello all, I ran into an issue "can not resolve symbol" while compiling my class that uses MultiFieldQueryParser class. It turns out that the version of Jboss portal 2.6 Beta 1 that we are using includes a portal-cms.sar that has the older version of lucene.jar, This version does not have the

Re: how best to limit search result to a subset of document ids?

2007-09-14 Thread mark harwood
Hi Chris, Have a look at the "TermsFilter" in contrib/queries. The code would look something like this: TermsFilter filter=new TermsFilter() //for( all 5 id terms filter.addTerm(new Term("id", id[i])); hits=searcher.search(moreLikeThisQuery, filter); Cheers Mark - O

how best to limit search result to a subset of document ids?

2007-09-14 Thread Chris Sizemore
hello-- i'm trying to figure out the best way to limit a query to a small subset if a lucene index's documents, and i'm a bit of a novice and a bit stumped... my scenario is: 1) search the index for a term (very boring normal case) 2) take the top 5 results from that search result set 3) next

Re: Oracle-Lucene integration (OJVMDirectory and Lucene Domain Index) - LONG

2007-09-14 Thread Chris Lu
Hi, Joaquin, Very interested to know the indexing performance inside Oracle JVM, especially with large amount of data. -- Chris Lu - Instant Scalable Full-Text Search On Any Database/Application site: http://www.dbsight.net demo: http://search.dbsight.com Lucene Database

multiple Tokens in a Tokengroup not matching

2007-09-14 Thread Dave Schneider
Hi, I've inherited some Lucene 1.9.1 code, and have run into the following problem: I have a TokenGroup with multiple tokens in it, and a query that should match against multiple tokens (e.g. X and Y) in the TokenGroup. However, when I look in the Hit that results, I see that one of the To

Fwd: Oracle-Lucene integration (OJVMDirectory and Lucene Domain Index) - LONG

2007-09-14 Thread Marcelo Ochoa
From: J. Delgado <[EMAIL PROTECTED]> Date: Sep 13, 2007 7:27 PM Subject: Oracle-Lucene integration (OJVMDirectory and Lucene Domain Index) - LONG To: [EMAIL PROTECTED] Cc: [EMAIL PROTECTED] I'm very happy to announce the partial rework and extension to LUCENE-724 (Oracle-Lucene Integration), prim

Re: regarding FieldSelector

2007-09-14 Thread Grant Ingersoll
Searcher is a Searchable and Searchable defines the doc() method with FieldSelector, but I suppose we could add an abstract declaration of it to Searcher, since it has to be implemented on all derived classes anyway due to it being on the Searchable interface. So, you can either cast to a

Re: Treating values of numeric fields as numbers

2007-09-14 Thread Ivan Vasilev
Hi Eric, Yes they help for particularly this problem - they shrik all avilable Longs to 14 character strings. So they free me from the limitation that I wrote about (1970 - 2280 years). So now the advantage that I have with treating fields representing UTC dates as numbers (but not as strings)

HTMLParser and Chinese

2007-09-14 Thread Jennifer May
Hello! I want to index an HTML document with the lucene demo, but have problems parsing some Chinese files. I changed code in the HTMLDocument class as to be able to define the encoding of the document to be parsed: InputStreamReader fis = new InputStreamReader(new FileInputStream(f), IndexH

Re: regarding FieldSelector

2007-09-14 Thread Mohammad Norouzi
actually, I show the result with pagination support, and they have option to choose the number of records per page. and yes, I should provide a test program, but about the HitCollector, I already created one, and collect all lucene's document id and also my needed ID that stored in the index >> yo