Get name of field where term was found

2006-01-14 Thread Mathias
Hello Everybody! Do you know if it is possible to get the name of the field in which Lucene found a term? I know that you can get the content of all fields with the document id. But I need to know in which field Lucene found my term, if I searched in several fields. I'm thankful for every kind of
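One way to picture the question is to test each field separately and collect the names of those that contain the term. The sketch below does that on a plain map of field names to text; it is not Lucene code, and `FieldMatchSketch`, `matchingFields`, and the sample field names are hypothetical. Within Lucene itself, a comparable approach would be to run the term as one query per field, or to inspect the output of `IndexSearcher.explain(...)` for a hit.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

// Hypothetical illustration: a "document" is just a map of field name -> text.
final class FieldMatchSketch {

    /** Returns the names of the fields whose text contains the term as a whole token. */
    static List<String> matchingFields(Map<String, String> document, String term) {
        String needle = term.toLowerCase(Locale.ROOT);
        List<String> hits = new ArrayList<String>();
        for (Map.Entry<String, String> field : document.entrySet()) {
            // Naive analysis: lowercase, then split on anything that is not a letter or digit.
            String[] tokens = field.getValue().toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+");
            for (String token : tokens) {
                if (token.equals(needle)) {
                    hits.add(field.getKey());
                    break; // one match per field is enough
                }
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        Map<String, String> doc = new LinkedHashMap<String, String>();
        doc.put("title", "Lucene in Action");
        doc.put("body", "A book about search with Lucene");
        doc.put("author", "Unknown");
        System.out.println(matchingFields(doc, "lucene"));
    }
}
```

Checking fields one at a time costs one pass per field, which is usually acceptable when only a handful of fields are searched.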

Lucene vs Glimpse

2013-02-04 Thread Mathias Dahl
point here and am I correct in that it would be hard to implement something as "free" as with Glimpse? I am not trying to criticize, just understand how Lucene (and Glimpse) work. Oh, yes, Glimpse has one big drawback: it onl

Re: Lucene vs Glimpse

2013-02-05 Thread Mathias Dahl
ough the files. I hope this made things at least a little bit clearer ;) Again, I am seeing it from the perspective of a Glimpse user where the searches most people use "just work" (but due to licensing I don't think we can continue to use it). Thanks! /Mathias On Mon, Feb 4, 2013

Re: Lucene vs Glimpse

2013-02-05 Thread Mathias Dahl
Thanks for the input! Seems I should give this another chance using the hints you all sent me. I'll report back my findings here. /Mathias On Mon, Feb 4, 2013 at 7:01 PM, Mathias Dahl wrote: > Hi, > > I have hacked together a small web front end to the Glimpse text > indexing

Stored fields: decompression slows down in my scenario ... any idea for a workaround?

2013-06-23 Thread Mathias Lux
dexing and local feature search are based on terms, ... and fast. But linear search makes up an important part of LIRE, so I'd be glad to get some suggestions how either to disable compression, or how to sneak in byte[] data with some textual data that is "fast as hell" to read. chee

Re: Stored fields: decompression slows down in my scenario ... any idea for a workaround?

2013-06-24 Thread Mathias Lux
Hi! Thanks!! I'll try the DocValues for sure, and of course the smaller chunk size. Just to add up on the number of bytes stored: it's for instance 72 bytes for CEDD, ~96 for JCD, 64 bytes for OpponentHistogram, etc. and there is 0 wrote: > Uwe, > I think Mathias was talking abo

Re: Stored fields: decompression slows down in my scenario ... any idea for a workaround?

2013-06-24 Thread Mathias Lux
ay stable. As I'm quite a lazy programmer I'd like to hear your opinion on how stable the APIs for BinaryDocValues and Codec might be? :) cheers, Mathias On Mon, Jun 24, 2013 at 9:23 AM, Adrien Grand wrote: > Hi, > > On Sun, Jun 23, 2013 at 9:08 PM, Savia Beson wrote: >>

Re: Stored fields: decompression slows down in my scenario ... any idea for a workaround?

2013-06-24 Thread Mathias Lux
on as a lot of re-coding was needed. cheers, Mathias On Mon, Jun 24, 2013 at 3:13 PM, Adrien Grand wrote: > Hi, > > On Mon, Jun 24, 2013 at 2:47 PM, Mathias Lux wrote: >> Still, I've read that all the BinaryDocValues go directly to memory. >> Am I right with this? > >

Counting search results

2009-09-15 Thread Mathias Bank
using a Weight object (order is not interesting), but I haven't seen a way to do this. I hope, someone has already solved this problem. Mathias - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For addi

Re: Counting search results

2009-09-15 Thread Mathias Bank
ch is - again - very slow. So, unfortunately, your solution does not help. Mathias 2009/9/15 Simon Willnauer : > Did you try: > int numDocs = 0; > TermDocs termDocs = reader.termDocs(new Term("myField", "myTerm")); > while (termDocs.next()) { numDocs++; } > >

Re: Counting search results

2009-09-17 Thread Mathias Bank
vide an iterator, no hash checks are possible. Are there any other possibilities to improve speed? Mathias Am 15.09.2009 17:13 schrieb Simon Willnauer : > Hmm, so if you wanna use the Filter to narrow down the search results > > you could use it in the while loop like this: > >
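The idea discussed in this thread, walking a term's postings and counting only the documents that also pass a filter, can be sketched without Lucene as follows. The posting list is assumed to be a plain `int[]` of document ids and the filter a `java.util.BitSet`; `FilteredCountSketch` and `countFiltered` are hypothetical names, and this is not the legacy `TermDocs` API quoted above.

```java
import java.util.BitSet;

// Hypothetical illustration: count the documents of one term that pass a filter.
final class FilteredCountSketch {

    /** Counts the document ids in the posting list whose bit is set in the filter. */
    static int countFiltered(int[] postings, BitSet filter) {
        int count = 0;
        for (int docId : postings) {
            // BitSet.get is a constant-time membership check, unlike an iterator-only filter.
            if (filter.get(docId)) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        int[] postings = {1, 4, 7, 9};   // documents containing the term
        BitSet filter = new BitSet();
        filter.set(4);
        filter.set(9);
        filter.set(12);                  // allowed, but does not contain the term
        System.out.println(countFiltered(postings, filter));
    }
}
```

The cost is linear in the length of the posting list, so very frequent terms remain expensive to count this way.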

Re: Creating tag clouds with lucene

2009-11-06 Thread Mathias Bank
nd then iterate all terms only counting documents in this filtered set? I cannot imagine that this is performant because I have more than 10 million documents (fast growing). Mathias 2009/11/6 Chris Lu : > Isn't the tag cloud just another facet search? Only difference is the tag is >

adapting lucene's practical scoring function

2010-03-25 Thread Mathias Silbermann
TF, IDF, Boost... put together? Regards, Mathias Silbermann

Re: Incremental updates / slow searches.

2006-10-09 Thread Mathias Lux
-> well ... I'd like to hear other opinions too ;) hope that helps a bit, Mathias -- Mathias Lux [EMAIL PROTECTED] skype://dermotte, icq # 1988617

Re: Lucene BOF @ apachecon.eu ?

2007-04-11 Thread Mathias Herberts
+1

RE: Use the lucene for searching in the Semantic Web.

2006-01-19 Thread Mathias Lux
t and their application to information retrieval on the web, in 'WIDM '05: Proceedings of the 7th annual ACM international workshop on Web information and data management', ACM Press, New York, NY, USA, pp. 10--16. regards, Mathias

Re: Use the lucene for searching in the Semantic Web.

2006-01-19 Thread Mathias Lux
/ classes / node types (whatever :) depends on the use case. hope this helps a bit, mathias > -Original Message- > From: xing jiang [mailto:[EMAIL PROTECTED] > Sent: Thursday, 19 January 2006 12:14 > To: java-user@lucene.apache.org > Subject: Re: Use the lu

Re: Use the lucene for searching in the Semantic Web.

2006-01-19 Thread Mathias Lux
(URIs per definitionem in RDF), a limited set of possible relations (limited by the number of nodes in RDF, but that should do also) and so on. - mathias

Re: Use the lucene for searching in the Semantic Web.

2006-01-19 Thread Mathias Lux
"^ 4 Of course you have to think about the whole thing, but I think with good boost values it would work. - mathias ps. instead of C+D I would use (1-l)*C + l*D, so l from [0,1] can be used to specify if concept or content has more influence. --
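The postscript describes a single parameter l in [0,1] that shifts influence between a concept score C and a content score D. A weighting of that kind only behaves as a blend when the two coefficients sum to 1, so the sketch below assumes the intended form is (1-l)*C + l*D. `ScoreBlendSketch` and `blend` are hypothetical names, and the scores are plain doubles rather than Lucene boosts.

```java
// Hypothetical illustration of a convex combination of two scores.
final class ScoreBlendSketch {

    /**
     * Blends a concept score and a content score.
     * l = 0 uses only the concept score, l = 1 uses only the content score.
     */
    static double blend(double conceptScore, double contentScore, double l) {
        if (l < 0.0 || l > 1.0) {
            throw new IllegalArgumentException("l must lie in [0,1], got " + l);
        }
        return (1.0 - l) * conceptScore + l * contentScore;
    }

    public static void main(String[] args) {
        System.out.println(blend(2.0, 4.0, 0.0));
        System.out.println(blend(2.0, 4.0, 0.5));
        System.out.println(blend(2.0, 4.0, 1.0));
    }
}
```

Because the weights sum to 1, the blended score always stays between the two input scores, which keeps results comparable as l changes.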

Request for feedback: CBIR for Lucene

2006-02-06 Thread Mathias Lux
www.semanticmetadata.net/lire, http://freshmeat.net/projects/lirecbir/ hoping for feedback - mathias

Re: Searching in paths

2006-03-15 Thread Mathias Lux
"World-Poland", "Poland-Abc") would result in a better relevance for the above example. regards, Mathias -- Mathias Lux http://www.semanticmetadata.net > -Original Message- > From: kieran [mailto:[EMAIL PROTECTED] > Sent: Wednesday, 15 Mar

internal Searching behavior or how to get a hit?

2006-05-03 Thread Mathias Keilbach
Hi! I have a question concerning the internal searching behavior of Lucene. How does Lucene get a hit? If I search for a term, will each index document be checked for this term or is there an internal relation between terms and Lucene documents? Thanks for any advice. Matt

Re: internal Searching behavior or how to get a hit?

2006-05-04 Thread Mathias Keilbach
(terms) to the documents that they appear in. Some background info on inverted indices: http://en.wikipedia.org/wiki/Inverted_index Miles -- On Wednesday 03 May 2006 14:56, Mathias Keilbach wrote: > I have a quest
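The mapping from terms to the documents they appear in can be illustrated with a toy inverted index. This is a minimal sketch of the general data structure, not of Lucene's actual on-disk format; `InvertedIndexSketch`, `add`, and `lookup` are hypothetical names, and tokenization is a naive lowercase split.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeSet;

// Hypothetical illustration: term -> sorted set of document ids.
final class InvertedIndexSketch {

    private final Map<String, SortedSet<Integer>> postings = new HashMap<String, SortedSet<Integer>>();

    /** Tokenizes the text and records the document id under every token. */
    void add(int docId, String text) {
        for (String token : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (token.isEmpty()) {
                continue;
            }
            SortedSet<Integer> docs = postings.get(token);
            if (docs == null) {
                docs = new TreeSet<Integer>();
                postings.put(token, docs);
            }
            docs.add(docId);
        }
    }

    /** Looks the term up directly; no document is scanned at search time. */
    SortedSet<Integer> lookup(String term) {
        SortedSet<Integer> docs = postings.get(term.toLowerCase(Locale.ROOT));
        return docs == null ? new TreeSet<Integer>() : docs;
    }

    public static void main(String[] args) {
        InvertedIndexSketch index = new InvertedIndexSketch();
        index.add(0, "the quick fox");
        index.add(1, "the lazy dog");
        System.out.println(index.lookup("the"));
        System.out.println(index.lookup("fox"));
    }
}
```

A search is therefore a map lookup followed by a walk over the matching ids, rather than a check of every indexed document.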

Analyzer which distributes tokens to many fields

2006-05-16 Thread Mathias Keilbach
Hi! I'm going to create a small application with Lucene, which analyzes different strings. While analyzing the strings, patterns (like emails or urls) shall be sorted out and saved in a separate index field. I'm not sure if I can handle this with a self-implemented Analyzer class. Afaik you can't
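One way to get pattern matches into their own fields is to do the routing before indexing, so that each field receives only its own values. The sketch below assumes that approach and uses plain `java.util.regex`; it is not an Analyzer implementation. `PatternFieldSketch`, `route`, the field names, and both regular expressions are hypothetical and deliberately simple.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical illustration: split an input string into per-field token lists.
final class PatternFieldSketch {

    // Simplified patterns for illustration; real email and URL grammars are far broader.
    private static final Pattern EMAIL = Pattern.compile("[\\w.+-]+@[\\w-]+(\\.[\\w-]+)+");
    private static final Pattern URL = Pattern.compile("https?://\\S+");

    /** Routes each whitespace-separated token to the "email", "url", or "text" field. */
    static Map<String, List<String>> route(String input) {
        Map<String, List<String>> fields = new LinkedHashMap<String, List<String>>();
        fields.put("email", new ArrayList<String>());
        fields.put("url", new ArrayList<String>());
        fields.put("text", new ArrayList<String>());
        for (String token : input.trim().split("\\s+")) {
            if (token.isEmpty()) {
                continue;
            }
            if (EMAIL.matcher(token).matches()) {
                fields.get("email").add(token);
            } else if (URL.matcher(token).matches()) {
                fields.get("url").add(token);
            } else {
                fields.get("text").add(token);
            }
        }
        return fields;
    }

    public static void main(String[] args) {
        System.out.println(route("mail bob@example.com or see http://example.org/docs now"));
    }
}
```

Each list could then be written to a separately named index field, so that searches can target emails or URLs without touching the remaining text.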

Linear search using reader vs. scorer implementation

2006-08-07 Thread Mathias Lux
f and how I could implement a linear search (L1/L2 distance on integer vectors) using scorers, so that filters and other features can be used? regards, Mathias ps. Yes I know that this is in general easy to implement within a database, which I have done for Oracle, MySQL and Derby but peop
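For reference, a linear search by L1 or L2 distance over integer vectors looks roughly like the sketch below. It operates on plain arrays and does not use Lucene scorers or readers; `LinearDistanceSketch`, `l1`, `l2`, and `nearest` are hypothetical names.

```java
// Hypothetical illustration: brute-force nearest neighbour over int vectors.
final class LinearDistanceSketch {

    /** L1 (Manhattan) distance: sum of absolute component differences. */
    static long l1(int[] a, int[] b) {
        requireSameLength(a, b);
        long sum = 0;
        for (int i = 0; i < a.length; i++) {
            sum += Math.abs((long) a[i] - b[i]); // widen to long to avoid int overflow
        }
        return sum;
    }

    /** L2 (Euclidean) distance: square root of the sum of squared differences. */
    static double l2(int[] a, int[] b) {
        requireSameLength(a, b);
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double diff = (double) a[i] - b[i];
            sum += diff * diff;
        }
        return Math.sqrt(sum);
    }

    /** Scans every candidate and returns the index of the one closest to the query by L1, or -1 if none. */
    static int nearest(int[][] candidates, int[] query) {
        int best = -1;
        long bestDistance = Long.MAX_VALUE;
        for (int i = 0; i < candidates.length; i++) {
            long distance = l1(candidates[i], query);
            if (distance < bestDistance) {
                bestDistance = distance;
                best = i;
            }
        }
        return best;
    }

    private static void requireSameLength(int[] a, int[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("vectors must have the same length");
        }
    }

    public static void main(String[] args) {
        int[][] candidates = {{10, 10}, {1, 1}, {5, 5}};
        System.out.println(nearest(candidates, new int[] {2, 2}));
    }
}
```

Every query touches every vector, so the cost grows linearly with the number of candidates; restricting the candidate set with a filter first reduces that cost proportionally.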

Exception raised when searching with sorting on an empty index.

2006-08-18 Thread Mathias Herberts
'm misunderstanding something it appears to me this is a bug. I am using Lucene 2.0.0 with JDK 1.4 on Mac OS X 10.4. Did anybody else notice this problem? Mathias.

Advise for Mediabase with Lucene

2008-10-02 Thread Mathias P.W Nilsson
Hi! I'm currently developing a mediabase for 20-100 customers. A customer can upload a file or folder via FTP, and a file grabber searches the file system and adds the new file to a MySQL database. It also creates thumbnails, adds search words etc. Now, this mediabase is pretty old and is developed

Re: Advise for Mediabase with Lucene

2008-10-02 Thread Mathias P.W Nilsson
Thanks Erick! I've just bought the book Lucene in Action and I will see where that leads me. I'm aware that Lucene doesn't do the other magic ;) just what it is made for: indexing and searching. -- View this message in context: http://www.nabble.com/Advise-for-Mediabase-with-Lucene-tp197878

Re: Advise for Mediabase with Lucene

2008-10-02 Thread Mathias P.W Nilsson
Oh, I forgot. Would you save the documents as index on the file system or use Hibernate search with lucene?

Re: Advise for Mediabase with Lucene

2008-10-02 Thread Mathias P.W Nilsson
I don't know if this is going to work. Let's say I have a root folder that is the start point for a client. The only thing I have in the database is the startPoint. When traversing the child folders I want to check if the folder has changed since the last time. Can I store this in a lucene index, a