RE: Is it possible to add multiple keywords to a single field from one doc?

2008-05-25 Thread Tom Conlon
etween document table(pk docid) and docattribute table (pk docid,attributeid) TIA, Tom -Original Message- From: Mark Miller [mailto:[EMAIL PROTECTED] Sent: 24 May 2008 16:53 To: java-user@lucene.apache.org Subject: Re: Is it possible to add multiple keywords to a single field from one d

Is it possible to add multiple keywords to a single field from one doc?

2008-05-24 Thread Tom Conlon
Hi, I haven't been able to find the answer to this question easily so any help would be appreciated. Thanks, Tom - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

RE: C++ as token in StandardAnalyzer?

2008-03-05 Thread Tom Conlon
Hi Donna - See previous post below that may help. Tom Hi, In case this is of help to others: Crux of problem: I wanted numbers and characters such as # and + to be considered. Solution: implement a LowercaseWhitespaceAnalyzer and a Lowerc

RE: Hits.score mystery

2007-11-01 Thread Tom Conlon
please? Thanks, Tom -Original Message----- From: Tom Conlon [mailto:[EMAIL PROTECTED] Sent: 01 November 2007 09:18 To: java-user@lucene.apache.org Subject: RE: Hits.score mystery Thanks Daniel, I'm using Searcher.explain() & luke to try to understand the reasons for the score. --

RE: Hits.score mystery

2007-11-01 Thread Tom Conlon
ober 2007 19:14, Tom Conlon wrote: > 119.txt 17.865013    97%    (13 occurences) 45.txt  8.600986 > 47%  (18 occurences) 45.txt might be a document with more therms so that its score is lower although it contains more matches. Regards Daniel -- http://www

RE: Hits.score mystery

2007-10-31 Thread Tom Conlon
es -Grant On Oct 31, 2007, at 2:14 PM, Tom Conlon wrote: > Hi All, > > Query:systems AND 2000 > Results: 558 total matching documents > > I'm returning the document plus hits.score(i) * 100 but when the > relevance is examined in the User interface it doesn&

Hits.score mystery

2007-10-31 Thread Tom Conlon
Hi All, Query: systems AND 2000 Results:558 total matching documents I'm returning the document plus hits.score(i) * 100 but when the relevance is examined in the User interface it doesn't seem to be working. E.g. 'rough' feedback in terms of occurences 61.txt 18.356403 100%

Poor performance - 2/3 ORs

2007-09-30 Thread Tom Conlon
Hi, Don't get me wrong - I think lucene is great. However, the first site I am using it with has 15k docs and the performance for ORs seem longer than I'd expect. Any tips to improve this? Thanks, Tom -Original Message----- From: Tom Conlon [mailto:[EMAIL PROTECTED] Sent: 30

RE: Escaping special characters

2007-09-30 Thread Tom Conlon
Hi, In case this is of help to others: Crux of problem: I wanted numbers and characters such as # and + to be considered. Solution: implement a LowercaseWhitespaceAnalyzer and a LowercaseWhitespaceTokenizer. Tom === Diagnostic

RE: Escaping special characters

2007-09-23 Thread Tom Conlon
ke this, or possibly create your own analyzer that uses LowerCaseFilter and breaks on whitespace. I suspect that if you open your index with Luke, you'll find that what you actually indexed was "C" rather than "C++" etc. But that's only a guess. What does query.toSt

RE: Escaping special characters

2007-09-23 Thread Tom Conlon
---Original Message- From: Karl Wettin [mailto:[EMAIL PROTECTED] Sent: 23 September 2007 10:39 To: java-user@lucene.apache.org Subject: Re: Escaping special characters 23 sep 2007 kl. 10.53 skrev Tom Conlon: > > Unless I'm missing something, according to: > > http:/

Escaping special characters

2007-09-23 Thread Tom Conlon
Hi, Unless I'm missing something, according to: http://lucene.apache.org/java/docs/queryparsersyntax.html#Escaping%20Spe cial%20Characters I should be able to search for C++ and C# using something like: C\+\+ and C\#. This doesn't work. I apologise if this sort of question has been asked man