How to get the terms within 5 words of another term?

2008-11-12 Thread Sven
nerate a list of terms and hits based on their proximity to "foo" in all the documents. So I'll end up with something like: far : 4 time : 3 away : 2 Any help would be greatly appreciated. Thanks much! -Sven

How to get the terms within 5 words of another term?

2008-11-12 Thread Sven
a list of terms and hits based on their proximity to "foo" in all the documents. So I'll end up with something like: far : 4 time : 3 away : 2 Any help would be greatly appreciated. Thanks much! -Sven - T

constructing a mini-index with just the number of hits for a term

2008-11-13 Thread Sven
ut there, I'd prefer to peruse that first. My thought being that I can simply generate my own kind of hash for each combined term and write it out to a custom file structure similar to Lucene - but the specifics of how to (optimally) do so are not

AW: Part-Of Match

2006-01-15 Thread Sven
Hi Daniel, >>von Willebrand<< is not the query but a document in the index The task is to detect exact matches of phrases inside a query (large document) with these phrases stored in the index. Cheers, Sven > -Ursprüngliche Nachricht- > Von: Daniel Naber [mailt

AW: Part-Of Match

2006-01-16 Thread sven
exact matches only on the one hand (and thats exactly what I need) and furthermore yields matches even for concepts that are in plural form in the query document. So the "von Willebrands" will yield "von Willebrand". Thanks for your efforts, Sven --- Ursprüngliche Nachricht ---

Question about Boosting Fields in a Query

2005-04-27 Thread Sven
fields Thanks, Sven

Lucene Wildcard for zero or one character

2014-03-25 Thread Sven Teichmann
Hello, does Lucene provide a zero or one character wildcard (like ? in Perl RegEx)? Example of what I mean: "house%" finds "house" and "houses" As far as I know in Lucene the ? wildcard is for exactly one character, but I need a zero or one character wild

Compare scores from multiple indices

2014-04-16 Thread Sven Teichmann
score 8. If I am not wrong, the above would mean that the document from B with score 8 can be a better match than the document from A with score 10. Is that right? If so, what can I do to make the scores from multiple indices comparable? Best regards, Sven Teic

Best practice to map Lucene docids to real ids

2014-05-06 Thread Sven Teichmann
This works as long as no document is deleted and the index optimized after it. Is this a good solution or should we use Fields or DocValues for this? What is the fastest solution? Regards, Sven Teichmann - To unsubscribe, e-

Re: Best practice to map Lucene docids to real ids

2014-05-12 Thread Sven Teichmann
d how do I receive the DocValues then? Sven Am 07.05.2014 16:09, schrieb Wouter Heijke: Hey Mike, That was a very useful response, also for long time Lucene users like myself who were stuck in legacy ways of doing things! I managed to easily change indexing of keys to DocValues and found m

Re: Best practice to map Lucene docids to real ids

2014-05-18 Thread Sven Teichmann
Thank you, that helped me a lot. Sven Teichmann __ Software for Intellectual Property GmbH Gewerbering 14a 83607 Holzkirchen (Germany) Phone: +49 (0)8024 46699-00 Fax:+49 (0)8024 46699-02 E-Mail: s.teichm...@s4ip.de Local Court of Munich

Abort a search

2014-10-13 Thread Sven-Teichmann
rs in this case (we are using MMapDirectory)? Is there a way to cleanly abort a running search? Sven Teichmann

IndexFormatTooOldException while upgrading Lucene 4.10 index to 5.2

2015-06-16 Thread Sven Teichmann
tMergeScheduler.java:619) We use DirectDocValuesFormat for two fields and I think that this is the problem. Any ideas how to upgrade the index to 5.2 anyway? Thanks for your help! Sven - To unsubscribe, e-mail: java-user-uns

how to down-weight synonyms

2009-08-26 Thread Sven Fischer
lp on how to implement that feature. can anyone help please? Thanks Sven - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org

Re[2]: Clustering Lucene with 40 Servers

2006-12-28 Thread Sven Duzont
w we dont have the problems with Jgroups coz we dont have such many Nodes (we handle currently 2 to 7 nodes) and the number of messages per minute is very very low. Yours, --- Sven Le jeudi 28 décembre 2006 à 08:33:37, vous écriviez : AF> Hello, AF> I saw that Doug Cutting had an inter

Part-Of Match

2006-01-14 Thread Sven Schmeier
ycoprotein (3) endothelial glycoprotein (4) multimeric megakaryocytes So the result should be: (1) Score: 1 (2) Score: 1 (3) Score: less than 1 (4) Score: less than 1 Is there any way of doing this with lucene? Thanks and best whishes, Sven

Re[2]: Re : good link to start working on Lucene

2006-05-31 Thread Sven Duzont
Hello, Or you can purchase the Book "Lucene in Action" You will find the t.o.c and some sample chapters here http://lucenebook.com Have a nice day --- Sven Le mercredi 31 mai 2006 à 13:32:45, vous écriviez : AKY> http://lucene.apache.org/java/docs/gettingstarted.html AKY&

Re[2]: Lucene in Action

2006-06-06 Thread Sven Duzont
Hi, Or simply grab it online (paper or pdf eBook ) here : http://www.manning.com/hatcher2/ --- Sven Le mardi 6 juin 2006 à 13:05:45, vous écriviez : MC> Try here.. MC> http://www.abebooks.co.uk MC> Maybe they have one cheaper. MC> Malcolm

Re: Fwd: Lucene 2.0.0 release available

2006-06-08 Thread Sven Duzont
Hello, Nice job, thanks. Question : When will it be available on Maven2 repositories ? Thanks --- Sven Le samedi 27 mai 2006 à 13:37:54, vous écriviez : DN> -- Forwarded Message -- DN> Subject: Lucene 2.0.0 release available DN> Date: Samstag 27 Mai 2006 05:57

Re[2]: Fwd: Lucene 2.0.0 release available

2006-06-10 Thread Sven Duzont
27;ll just put the lib on a local one Thanks and have a nice day. --- Sven Le vendredi 9 juin 2006 � 01:54:48, vous �criviez : OG> Hi Sven, OG> Somebody asked about the same thing a few weeks back and opened a OG> JIRA issue. It looks like the push to ibiblio servers did not OG>

Re[2]: Analyzer don't work with wildcard queries, snowball analyzer.

2005-04-01 Thread Sven Duzont
Hello Erik, Since wilcard queries are not analyzed, how can we deal with accents ? For instance (in french) a query like "ingé*" will not match documents containing "ingénieur" but the query "inge*" will. Thanks --- sven Le jeudi 31 mars 2005 à 17:51:25, vous

Re[4]: Analyzer don't work with wildcard queries, snowball analyzer.

2005-04-01 Thread Sven Duzont
eFilter (in order to handle terms like with apostrophes, for instance "l'expérience" will be decompozed into two tokens : "l" "expérience" EH> You will need to employ some form of character normalization on EH> wildcard queries too. thanks, it

Re[6]: Analyzer don't work with wildcard queries, snowball analyzer.

2005-04-02 Thread Sven Duzont
n is selected It force the first term to be required so the user will not need to specify the "+" or "AND" operator Maybe there is a more elegant way to do this ? The code is following Thanks --- Sven (is not a bersek) */-

Re[8]: Analyzer don't work with wildcard queries, snowball analyzer.

2005-04-03 Thread Sven Duzont
details). Yes, that what i was first thinking of, but they (the end users) wanted all clauses except the first to be handled by the 'OR' operator. I'll try to convince them that it will make my (and their) life easier if the default operator

Re[2]: multi word synonym (was Hungarian notation analyzer and phrase queries)

2005-04-27 Thread Sven Duzont
instance the query (tomcat jsp) will be rewrited as (tomcat (jsp OR "java server pages" OR "javaserver pages")) Any thoughts ? Thanks in advance --- Sven mercredi 13 avril 2005, 19:36:44, vous avez écrit: CH> : Another approach would be to index this as: CH> :

Re[4]: multi word synonym (was Hungarian notation analyzer and phrase queries)

2005-04-30 Thread Sven Duzont
ome people are interested Thanks again. Sven Le vendredi 29 avril 2005 à 21:58:54, vous écriviez : PL> I knew there was a catch... PL> I do think, however, that the point is a delicate one which would PL> consideration

Re: Latitude/Longitude and Lexigraphical search

2005-05-08 Thread Sven Duzont
Hello Andrew, There already is a plugin available for Nutch : http://wiki.apache.org/nutch/GeoPosition I think that one can easily integrate it into a lucene app (i'll make some experimentations with it soon) ---- Sven Le dimanche 8 mai 2005 à 18:26:14, vous écriviez : AB> Hi All, A

Re: Top most frequent words

2005-05-12 Thread Sven Duzont
Hi, yeah, i just added it into simpy when i read René post ;) congrats for simpy Sven Le jeudi 12 mai 2005 à 09:59:18, vous écriviez : OG> Somebody asked about this today, and I just found this through Simpy: OG> http://www.unine.ch/info/clef/ OG> Scroll half-way through the page

Re: Exception in full text search

2005-06-09 Thread Sven Duzont
hi, this exception does not seems to be lucene related concerning the way to look at index content, you could try luke (http://www.getopt.org/luke/) hope it will help --- sven Le jeudi 9 juin 2005 à 18:06:07, vous écriviez : a> Hello, a> I'm able to create index file fo

Re[2]: Does highlighter highlight phrases only?

2005-07-04 Thread Sven Duzont
Hello, As i also encountered the phrase query issue with the old highlighter code, I added a fieldName argument in case of the analyser handle it in the tokenStream() method. I just submitted the patch on bugzilla --- sven jeudi 30 juin 2005, 22:12:51, vous avez écrit: BD> FYI, I made

Re[2]: Index Partitioning ( was Re: Search deadlocking under load)

2005-07-11 Thread Sven Duzont
have the stack trace right now under my hand). I think that it is because the search and indexation are made in two different java processes. We will rework the routines to lock the search when an indexation is running and vice versa --- sven lundi 11 juillet 2005, 03:03:29, vous avez écrit:

Re[2]: SIMPLE Lucene / MySQL Indexer

2005-07-13 Thread Sven Duzont
. i thought it was a lucene user list, not a dbsight one --- sven Le mercredi 13 juillet 2005 à 17:47:14, vous écriviez : CL> Hi, Klaus, thanks. CL> You can simply use DBSight to create the index. It's in Lucene's CL> standard fo