Rather than concentrating on the specific mechanisms, could you
elaborate a bit on the overall goals and features of the system? Who is
providing the documents? What are the threats you are guarding against?
How is the system going to be used?
Colin
-Original Message-
From: George Washin
The mock synonym code in LIA will work just fine with multiple words,
but you need to pass them to the synonym engine as a single string
containing multiple words, which means treating some phrases in the
input text as single tokens, and that's likely where you'll have to put
in the work.
I think
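To make the "phrases as single tokens" idea concrete, here is a minimal sketch in plain Java with no Lucene dependency. It assumes you already have a set of known multi-word phrases (for example, city names); the class name `PhraseGrouper`, the `maxWords` parameter, and the greedy longest-match strategy are illustrative assumptions, not code from LIA or Lucene. The resulting tokens are what you would then hand to the synonym engine.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper: groups known multi-word phrases into single tokens.
final class PhraseGrouper {

    /**
     * Splits text on whitespace, then greedily merges the longest run of
     * words (up to maxWords) that matches an entry in the phrase set.
     * Phrases are expected to be lower-case and single-space separated.
     */
    static List<String> group(String text, Set<String> phrases, int maxWords) {
        List<String> tokens = new ArrayList<>();
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return tokens;
        }
        String[] words = trimmed.toLowerCase(Locale.ROOT).split("\\s+");
        int i = 0;
        while (i < words.length) {
            String token = words[i];
            int taken = 1;
            // Try the longest candidate first so "new york city" beats "new york".
            for (int n = Math.min(maxWords, words.length - i); n > 1; n--) {
                String candidate = String.join(" ", Arrays.copyOfRange(words, i, i + n));
                if (phrases.contains(candidate)) {
                    token = candidate;
                    taken = n;
                    break;
                }
            }
            tokens.add(token);
            i += taken;
        }
        return tokens;
    }
}
```

With the phrase set {"new york", "new york city"} and `maxWords` of 3, the input "hotels in New York City" comes back as the three tokens "hotels", "in", and "new york city". Where the phrase list comes from is the real work, as noted above.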
I had the same problem (almost exactly, even down to searching city
names) and the way I solved it was to provide my own tokenizer that
doesn't tokenize -- I'm passing in stuff to the analyzer already
tokenized, so in your example I'd pass in "new york" as a single entity
to analyze, and the tokeni
I have thought about that. I couldn't figure out a way to make it work.
Fortunately, I have managed to solve the problem (except for prefix or
wildcard searches) with an approach very close to what Rajesh suggested
(see also my response to his response).
Thanks for taking a look.
Colin
-Original Mes
ence it won't be retrieved.
Query 2:
"ny united states of america"
Expanded query:
(primary:"ny"^1 AND secondary:"united states of america"^SLOPE 1) OR
(primary:"ny united"^2 AND secondary:"states of america"^SLOPE 1) OR
(
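The expansion pattern in Query 2 can be sketched as follows: split the query at every word boundary, send the leading words to `primary` with a boost equal to their word count, and send the remainder to `secondary`. This is only a guess at the scheme from the two clauses shown. The class name `FieldSplitExpander` is hypothetical, and I am assuming "^SLOPE 1" was meant as a phrase slop of 1, written here in Lucene query syntax as `~1`. The sketch does no escaping of quotes or special characters.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: builds the primary/secondary split query as a string.
final class FieldSplitExpander {

    /**
     * For a query of n words, emits n-1 clauses joined by OR. Clause k puts
     * the first k words in "primary" (boost k) and the rest in "secondary"
     * (phrase slop 1, an assumed reading of "^SLOPE 1").
     */
    static String expand(String query) {
        String[] words = query.trim().split("\\s+");
        if (words.length < 2) {
            // Assumption: a single-word query searches the primary field only.
            return "primary:\"" + words[0] + "\"";
        }
        List<String> clauses = new ArrayList<>();
        for (int k = 1; k < words.length; k++) {
            String primary = String.join(" ", Arrays.copyOfRange(words, 0, k));
            String secondary = String.join(" ", Arrays.copyOfRange(words, k, words.length));
            clauses.add("(primary:\"" + primary + "\"^" + k
                    + " AND secondary:\"" + secondary + "\"~1)");
        }
        return String.join(" OR ", clauses);
    }
}
```

For the five-word query "ny united states of america" this yields four clauses, the first two matching the ones listed in the expanded query above apart from the slop notation.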
NY, USA
For query "NY, USA" you should be able to retrieve 1, 2 and 3 even though
the primary information for Doc3 is "Albany".
--
Rajesh Munavalli
On 1/27/06, Colin Young <[EMAIL PROTECTED]> wrote:
>
> The reason I only want 2 hits is because [2] is more "speci
:28
To: java-user@lucene.apache.org
Subject: Re: Help with indexing and query strategy
Hi Colin,
Even assuming you came up with a good way of indexing, the
example query "Ontario, CA" should yield 3 hits. All 2, 3 and 4 are
valid retrievals. Could you please justify which 2 h
o, CA" where we don't know if the user
intended the "CA" to represent the state of California or the country of
Canada, while "London, UK" would be unambiguous.
The major problem as I see it is that at parse time I don't know if the user is
searching for a city,
I did manage to get things built with ant this morning (I had some time and
space on the train) including the DBDJE support, although I wasn't aware any of
it had been checked into contrib. I'll have to grab that tonight.
I had tried building a normal Eclipse project, but was having some proble
So far I've succeeded in grabbing the latest source (and I can build
with ant), but now I'm trying to open it in eclipse by creating a new
"Java Project from Existing Ant Buildfile," but when I attempt to open
build.xml, I get an alert telling me that the "Specified buildfile does
not contain a jav
Thanks to everyone for the help. It appears to be working now. I
probably should have mentioned that I already had TortoiseSVN installed,
and was having trouble with finding the correct URL. I'm sure now when I
go back to the Lucene home page it will be blindingly obvious...
Thanks
Colin
Notice:
oint me in the right direction I'd appreciate it (we
could do it offline to avoid polluting this list any further).
Thanks
Colin Young
Notice: This email message is for the sole use of the intended recipient(s) and
may contain confidential and privileged information. Any unauthorized re
ard to even say. It could be. No substantial
changes to the trunk before 1.9 is officially released are planned that
I know of.
Erik
On Jan 2, 2006, at 3:51 PM, Colin Young wrote:
> That would probably explain things. Is 1.9 close, or are we still
> talking months away? Unfortuna
check out 1.9 to see what the current issues are and see
how stable it is for my purposes.
Thanks
Colin Young
-Original Message-
From: Erik Hatcher [mailto:[EMAIL PROTECTED]
Sent: 2 January, 2006 05:12
To: java-user@lucene.apache.org
Subject: Re: Problems with sandbox - can
I'm attempting to compile Lucene with some sandbox code -- specifically
the Berkeley DB index storage -- and I'm running into an issue where the
code is attempting to import IndexInput (apparently located in
org.apache.lucene.store.IndexInput) but I can't find it in the source
anywhere. I'm not sur