Re: Help interpreting explanation

2006-03-02 Thread Yonik Seeley
On 3/2/06, Eugene Ezekiel <[EMAIL PROTECTED]> wrote: > Thanks Yonik for the reply. I got just a couple more questions, > > 1) Why does the explanantion print so many times? Because it was a compound query with multiple parts to it. It's one explanation with multiple parts. >From the explain out

Re: Help interpreting explanation

2006-03-02 Thread Eugene Ezekiel
Thanks Yonik for the reply. I got just a couple more questions, 1) Why does the explanantion print so many times? 2) Since my query is made up of multiple terms how do I know what term "x" is referring to? On 3/3/06, Yonik Seeley <[EMAIL PROTECTED]> wrote: > > I think Lucene in Action does a

RE: Accented characters problem

2006-03-02 Thread Seeta Somagani
When I had this problem, I found out that the characters that I'm entering were in UTF-8 format and java converts numbers to a cp1252 encoding. I took care of this using xml.getBytes("UTF-8") for writing and similarly new String(buffer,0,bytes_read,"UTF8") for reading. This solved my problem.

Accented characters problem

2006-03-02 Thread David denBoer
Hi all, We are havign a small problem searching for text with accents in the query. Our index has a word like 'agréé', and when we search for it, we get no results. The query parses (using Snowball) to : 'name:"agr\213 \213"' Using the ISOLatin filter, we get : 'name:agra' neither gets an

Re: Storing byte arrays into an index

2006-03-02 Thread Yonik Seeley
Methinks it's due to http://issues.apache.org/jira/browse/LUCENE-511 It will be fixed shortly. If you know how to build Lucene, it would be helpful if you could try the patch yourself. -Yonik On 3/2/06, Petr Hrebejk <[EMAIL PROTECTED]> wrote: > Hi, > > > > I'm trying to use following call: > > d

Re: Storing byte arrays into an index

2006-03-02 Thread Daniel Naber
On Donnerstag 02 März 2006 12:48, Petr Hrebejk wrote: > I'm trying to use following call: > > doc.add(new Field("contents", array, Field.Store.YES ) ); > > When array.length > 1024, then I get following exception: Could you post a small, self-contained example? I cannot reproduce the problem her

Re: Help interpreting explanation

2006-03-02 Thread Yonik Seeley
I think Lucene in Action does a good job of it. There is also a formula given in the javadoc for DefaultSimilarity http://lucene.apache.org/java/docs/api/org/apache/lucene/search/Similarity.html See my comments below (inline) On 3/2/06, Eugene <[EMAIL PROTECTED]> wrote: > Hi All, > > I'm not sure

Re: Ignoring XML tags when creating an index with Lucene

2006-03-02 Thread varun sood
Hi, You can use SAX2 / DOM parser to parse XML document before submiting it to IndexWriter (Luncene) and select only those tags which you want to index. Its fairly easy to implement XML parser. Hope this helps. Varun ** On 3/2/06, Cian O'Maidin <[EMAIL PROTECTED]> wrote: > > Hi , > >

Help interpreting explanation

2006-03-02 Thread Eugene
Hi All, I'm not sure how to interpret the result of the toString method of Explanation. I'm trying to see the values of each component of the Default Similarity formula for a particular query and a doc. Given below is a sample of my Explanation output. Many thanks if anyone could help expla

Re: javadoc lookup

2006-03-02 Thread Yonik Seeley
On 3/2/06, Larry Ogrodnek <[EMAIL PROTECTED]> wrote: > I do like the idea of breaking on case, this would be especially great > for finding interface implementations (searching for readers and > writers, e.g.). WordDelimiterFilter does this, but it's not in core lucene yet... you'd have to get it

RE: javadoc lookup

2006-03-02 Thread Larry Ogrodnek
Thanks for checking it out! My implementation is pretty simple. Building the index, I only look at elements from the generated HTML, which conveniently contains all class/interface names, as well as methods and constants. The last two I'm currently indexing, but not yet searching, mostly becaus

Ignoring XML tags when creating an index with Lucene

2006-03-02 Thread Cian O'Maidin
Hi , I am currently trying to Full-Text-Search Enable an application server that deals solely in one type of XML document.(Of my design) Currently I’m sending bitmap files embedded in an tag in the XML, the bitmaps are encoded using Base64. I want to exclude these from the index

Storing byte arrays into an index

2006-03-02 Thread Petr Hrebejk
Hi, I'm trying to use following call: doc.add(new Field("contents", array, Field.Store.YES ) ); When array.length > 1024, then I get following exception: [java] at org.apache.lucene.index.FieldsReader.doc(FieldsReader.java:88) [java] at org.apache.lucene.index.SegmentReader.doc

Compressed fields and gcj-build lucene: does it work?

2006-03-02 Thread Michele Bini
Hello, I'm trying to use compressed fields with a gcj build of yesterday trunk lucene sources, but it crashes with a java exception. The same crash also occurred with a checkout made on October, 8th.The relevant backtrace for the exception is: java.lang.ArrayIndexOutOfBoundsException a