nerate a list of terms and hits based on their
proximity to "foo" in all the documents. So I'll end up with something
like:
far : 4
time : 3
away : 2
Any help would be greatly appreciated.
Thanks much!
-Sven
a list of terms and hits based on their
proximity to "foo" in all the documents. So I'll end up with something
like:
far : 4
time : 3
away : 2
Any help would be greatly appreciated.
Thanks much!
-Sven
-
T
ut there, I'd prefer to peruse that first. My thought
being that I can simply generate my own kind of hash for each combined
term and write it out to a custom file structure similar to Lucene - but
the specifics of how to (optimally) do so are not
Hi Daniel,
>>von Willebrand<< is not the query but a document in the index The task
is to detect exact matches of phrases inside a query (large document) with
these phrases stored in the index.
Cheers,
Sven
> -Ursprüngliche Nachricht-
> Von: Daniel Naber [mailt
exact matches only on the one hand (and thats exactly what I
need) and furthermore yields matches even for concepts that are in plural
form in the query document.
So the "von Willebrands" will yield "von Willebrand".
Thanks for your efforts,
Sven
--- Ursprüngliche Nachricht ---
fields
Thanks,
Sven
Hello,
does Lucene provide a zero or one character wildcard (like ? in Perl RegEx)?
Example of what I mean:
"house%" finds "house" and "houses"
As far as I know in Lucene the ? wildcard is for exactly one character,
but I need a zero or one character wild
score 8. If I am
not wrong, the above would mean that the document from B with score 8
can be a better match than the document from A with score 10.
Is that right?
If so, what can I do to make the scores from multiple indices comparable?
Best regards,
Sven Teic
This works as long as no
document is deleted and the index optimized after it.
Is this a good solution or should we use Fields or DocValues for this?
What is the fastest solution?
Regards,
Sven Teichmann
-
To unsubscribe, e-
d how do I receive the
DocValues then?
Sven
Am 07.05.2014 16:09, schrieb Wouter Heijke:
Hey Mike,
That was a very useful response, also for long time Lucene users like
myself who were stuck in legacy ways of doing things!
I managed to easily change indexing of keys to DocValues and found m
Thank you, that helped me a lot.
Sven Teichmann
__
Software for Intellectual Property GmbH
Gewerbering 14a
83607 Holzkirchen (Germany)
Phone: +49 (0)8024 46699-00
Fax:+49 (0)8024 46699-02
E-Mail: s.teichm...@s4ip.de
Local Court of Munich
rs in this case (we are using MMapDirectory)?
Is there a way to cleanly abort a running search?
Sven Teichmann
tMergeScheduler.java:619)
We use DirectDocValuesFormat for two fields and I think that this is the
problem.
Any ideas how to upgrade the index to 5.2 anyway?
Thanks for your help!
Sven
-
To unsubscribe, e-mail: java-user-uns
lp on how to implement that feature.
can anyone help please?
Thanks
Sven
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
w we dont have the problems with Jgroups coz we dont have such
many Nodes (we handle currently 2 to 7 nodes) and the number of
messages per minute is very very low.
Yours,
--- Sven
Le jeudi 28 décembre 2006 à 08:33:37, vous écriviez :
AF> Hello,
AF> I saw that Doug Cutting had an inter
ycoprotein
(3) endothelial glycoprotein
(4) multimeric megakaryocytes
So the result should be:
(1) Score: 1
(2) Score: 1
(3) Score: less than 1
(4) Score: less than 1
Is there any way of doing this with lucene?
Thanks and best whishes,
Sven
Hello,
Or you can purchase the Book "Lucene in Action"
You will find the t.o.c and some sample chapters here http://lucenebook.com
Have a nice day
--- Sven
Le mercredi 31 mai 2006 à 13:32:45, vous écriviez :
AKY> http://lucene.apache.org/java/docs/gettingstarted.html
AKY&
Hi,
Or simply grab it online (paper or pdf eBook ) here :
http://www.manning.com/hatcher2/
--- Sven
Le mardi 6 juin 2006 à 13:05:45, vous écriviez :
MC> Try here..
MC> http://www.abebooks.co.uk
MC> Maybe they have one cheaper.
MC> Malcolm
Hello,
Nice job, thanks.
Question : When will it be available on Maven2 repositories ?
Thanks
--- Sven
Le samedi 27 mai 2006 à 13:37:54, vous écriviez :
DN> -- Forwarded Message --
DN> Subject: Lucene 2.0.0 release available
DN> Date: Samstag 27 Mai 2006 05:57
27;ll
just put the lib on a local one
Thanks and have a nice day.
--- Sven
Le vendredi 9 juin 2006 � 01:54:48, vous �criviez :
OG> Hi Sven,
OG> Somebody asked about the same thing a few weeks back and opened a
OG> JIRA issue. It looks like the push to ibiblio servers did not
OG>
Hello Erik,
Since wilcard queries are not analyzed, how can we deal with accents ?
For instance (in french) a query like "ingé*" will not match documents
containing
"ingénieur" but the query "inge*" will.
Thanks
---
sven
Le jeudi 31 mars 2005 à 17:51:25, vous
eFilter (in order to handle terms like with apostrophes,
for instance "l'expérience" will be decompozed into two tokens : "l"
"expérience"
EH> You will need to employ some form of character normalization on
EH> wildcard queries too.
thanks, it
n is selected
It force the first term to be required so the user will not
need to specify the "+" or "AND" operator
Maybe there is a more elegant way to do this ?
The code is following
Thanks
---
Sven (is not a bersek)
*/-
details).
Yes, that what i was first thinking of, but they (the end users) wanted
all clauses except the first to be handled by the 'OR' operator.
I'll try to convince them that it will make my (and their) life easier
if the default operator
instance the query (tomcat jsp) will be rewrited as
(tomcat (jsp OR "java server pages" OR "javaserver pages"))
Any thoughts ?
Thanks in advance
---
Sven
mercredi 13 avril 2005, 19:36:44, vous avez écrit:
CH> : Another approach would be to index this as:
CH> :
ome people are interested
Thanks again.
Sven
Le vendredi 29 avril 2005 à 21:58:54, vous écriviez :
PL> I knew there was a catch...
PL> I do think, however, that the point is a delicate one which would
PL> consideration
Hello Andrew,
There already is a plugin available for Nutch :
http://wiki.apache.org/nutch/GeoPosition
I think that one can easily integrate it into a lucene app (i'll make
some experimentations with it soon)
----
Sven
Le dimanche 8 mai 2005 à 18:26:14, vous écriviez :
AB> Hi All,
A
Hi,
yeah, i just added it into simpy when i read René post ;)
congrats for simpy
Sven
Le jeudi 12 mai 2005 à 09:59:18, vous écriviez :
OG> Somebody asked about this today, and I just found this through Simpy:
OG> http://www.unine.ch/info/clef/
OG> Scroll half-way through the page
hi,
this exception does not seems to be lucene related
concerning the way to look at index content, you could try luke
(http://www.getopt.org/luke/)
hope it will help
--- sven
Le jeudi 9 juin 2005 à 18:06:07, vous écriviez :
a> Hello,
a> I'm able to create index file fo
Hello,
As i also encountered the phrase query issue with the old highlighter
code, I added a fieldName argument in case of the analyser handle it in the
tokenStream() method.
I just submitted the patch on bugzilla
--- sven
jeudi 30 juin 2005, 22:12:51, vous avez écrit:
BD> FYI, I made
have the stack trace right now under my
hand). I think that it is because the search and indexation are made in two
different java processes. We will rework the routines to lock the search when
an indexation is running and vice versa
--- sven
lundi 11 juillet 2005, 03:03:29, vous avez écrit:
.
i thought it was a lucene user list, not a dbsight one
--- sven
Le mercredi 13 juillet 2005 à 17:47:14, vous écriviez :
CL> Hi, Klaus, thanks.
CL> You can simply use DBSight to create the index. It's in Lucene's
CL> standard fo
32 matches
Mail list logo