Re: boosting indexing terms

2007-06-26 Thread yu
Thanks, Karl, for your link/suggestion. I did not know about Payloads and will check them out. Jay karl wettin wrote: On 27 Jun 2007 at 04:55, yu wrote: I like Lucene's ability to boost each query term at query time. However, for indexing, the smallest unit you can boost is a field. I wonder if

Re: boosting indexing terms

2007-06-26 Thread karl wettin
On 27 Jun 2007 at 04:55, yu wrote: I like Lucene's ability to boost each query term at query time. However, for indexing, the smallest unit you can boost is a field. I wonder if anyone here has a good idea or suggestion on how one might boost terms without putting them in separate fields?

boosting indexing terms

2007-06-26 Thread yu
I like Lucene's ability to boost each query term at query time. However, for indexing, the smallest unit you can boost is a field. I wonder if anyone here has a good idea or suggestion on how one might boost terms without putting them in separate fields? Thanks. Jay

Re: Standard Tokenizer Question

2007-06-26 Thread John Paul Sondag
So, out of curiosity, what exactly are startLine and endLine? --JP On 6/26/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote: In fact, it should be: Standardtoken.image = Token.termText, Standardtoken.beginColumn = Token.startOffset, Standardtoken.endColumn = Token.endOffset. You can reference Standard

RE: Standard Tokenizer Question

2007-06-26 Thread Liu_Andy2
In fact, it should be: Standardtoken.image = Token.termText, Standardtoken.beginColumn = Token.startOffset, Standardtoken.endColumn = Token.endOffset. You can reference StandardTokenizer.java, about line 73. Andy -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf
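
A minimal sketch of what the offsets in the mapping above mean, assuming (as with Lucene's Token) that startOffset is the index of a token's first character in the source text and endOffset is one past its last character. SimpleToken and tokenize are hypothetical stand-ins written in plain Java, not Lucene or StandardTokenizer code, and the whitespace split is for illustration only.

```java
import java.util.ArrayList;
import java.util.List;

public class OffsetSketch {

    // Hypothetical stand-in for a token; NOT a Lucene class.
    static final class SimpleToken {
        final String text;
        final int startOffset; // index of the first character in the source
        final int endOffset;   // one past the index of the last character

        SimpleToken(String text, int startOffset, int endOffset) {
            this.text = text;
            this.startOffset = startOffset;
            this.endOffset = endOffset;
        }
    }

    // Splits on whitespace only (an assumption for illustration) and
    // records where each token starts and ends in the source string.
    static List<SimpleToken> tokenize(String source) {
        List<SimpleToken> tokens = new ArrayList<>();
        int i = 0;
        int n = source.length();
        while (i < n) {
            // Skip whitespace between tokens.
            while (i < n && Character.isWhitespace(source.charAt(i))) {
                i++;
            }
            if (i >= n) {
                break;
            }
            int start = i;
            // Consume the token's characters.
            while (i < n && !Character.isWhitespace(source.charAt(i))) {
                i++;
            }
            tokens.add(new SimpleToken(source.substring(start, i), start, i));
        }
        return tokens;
    }

    public static void main(String[] args) {
        String source = "skip to a specific token";
        List<SimpleToken> tokens = tokenize(source);

        // Jump straight to the fourth token using its recorded offsets.
        SimpleToken t = tokens.get(3);
        System.out.println(t.text + " [" + t.startOffset + "," + t.endOffset + ")");
        System.out.println(source.substring(t.startOffset, t.endOffset));
    }
}
```

With offsets defined this way, source.substring(startOffset, endOffset) always gives back the token text, so a stored offset pair is enough to return to a given token later without re-tokenizing the file.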

Re: Standard Tokenizer Question

2007-06-26 Thread John Paul Sondag
That solves getting the actual text but Token. My other problem is that Token also has "startOffset" and "endOffset" fields. Standard Token has "startColumn/Line" and "endColumn/Line" but I was not exactly sure how to use these. Could you possibly give me a small example of using these? I th

Re: Lucene as primary object storage

2007-06-26 Thread karl wettin
On 27 Jun 2007 at 03:19, [EMAIL PROTECTED] wrote: Perhaps this reference http://www.mail-archive.com/[EMAIL PROTECTED]/msg06260.html will give you some help. It's about Hibernate Lucene integration. Thanks, interesting. Do you know more about this? Was the ORM implemented? Does Hibernate ha

RE: Standard Tokenizer Question

2007-06-26 Thread Liu_Andy2
Token.termText() is perhaps the same as st.getToken(y).image. Andy -Original Message- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of John Paul Sondag Sent: Wednesday, June 27, 2007 9:32 AM To: java-user@lucene.apache.org Subject: Standard Tokenizer Question Hey, I think th

Standard Tokenizer Question

2007-06-26 Thread John Paul Sondag
Hey, I think this is where I ask this. I'm pretty new to this so this is probably a dumb question. I'm using the StandardTokenizer class to turn a file into tokens. I then need to be able to later skip to a specific token in the file sent to me from another source. So say my StandardTokenizer

RE: Lucene as primary object storage

2007-06-26 Thread Liu_Andy2
Perhaps this reference http://www.mail-archive.com/[EMAIL PROTECTED]/msg06260.html will give you some help. It's about Hibernate Lucene integration. Andy -Original Message- From: karl wettin [mailto:[EMAIL PROTECTED] Sent: Wednesday, June 27, 2007 8:17 AM To: java-user@lucene.apache.org

Lucene as primary object storage

2007-06-26 Thread karl wettin
Until recently I have only used Lucene as an index. I'm very interested to hear what you people have to say about using Lucene as the primary and only persistence layer in an application. I would feel much safer if I had a transaction log for crash recovery. So I might get me one of those. A

Re: Question about search and analyzer

2007-06-26 Thread Erick Erickson
Have you used Luke to examine your index and try queries? This will tell you a LOT about what's *really* happening. Google 'lucene' 'luke' and try it. Also, try query.toString() (and include the output for others to see, please). Best Erick On 6/26/07, [EMAIL PROTECTED] <[EMAIL PROTECTED]>

Question about search and analyzer

2007-06-26 Thread tanya
Hi, I have indexed files using StandardAnalyzer. Now I am trying to search these indexes and use Highlighter to display the content. I have a weird problem with numbers: The line below is displayed when the following queries are issued 18? 183 +accept +18? globus-gatekeeper: accept_sec_context

Re: JavaCC Download

2007-06-26 Thread Steven Rowe
I don't think you need to register - I am not registered and I can download from there. My guess is that Mahdi Rahimi's browser doesn't know how to speak the HTTPS protocol. Here's an invocation of wget (I have version 1.10.2) that works for me to get the .zip archive (all on one line): wget --n

Re: Similarity algorithm

2007-06-26 Thread Grant Ingersoll
Lucene in Action is a great book, but you can also have a look at http://lucene.apache.org/java/docs/scoring.html for more info on scoring and how to change the similarity and other details of scoring. Also, search the archives for things you are interested in; there is a lot of information

RE: Similarity algorithm

2007-06-26 Thread Damien McCarthy
The PDF of Lucene in Action can be purchased from www.manning.com. I'd suggest reading and understanding Lucene in Action before you attempt anything else :) -Original Message- From: Mahdi Rahimi [mailto:[EMAIL PROTECTED] Sent: 26 June 2007 16:38 To: java-user@lucene.apache.org Subject: Si

Similarity algorithm

2007-06-26 Thread Mahdi Rahimi
In the name of God. Hi all. I'm looking for some resources about Lucene's Similarity algorithm. I want to change it for my own purposes. Can anyone help me and suggest some references? Would the book "Lucene in Action" be useful for me? How can I find a PDF or DOC file of this book? Also, I want to change