Lucene Concurrent Search

2013-09-04 Thread David Miranda
Hi, I'm developing a web application, that contains a REST service in the Tomcat, that receives several requests per second. The REST requests do research in a Lucene index, to do this i use the IndexSearch. My questions are: - There are concurrency problems in multiple research? - What the best

RE: Lucene Text Similarity

2013-09-04 Thread Allison, Timothy B.
If MoreLikeThis doesn't work, you might want to look into Wikipedia Miner: http://www.nzdl.org/wikification/about.html http://www.wikipedia-miner.sourceforge.net/ or other wikifiers. Best, Tim From: David Miranda [david.b.mira...@gmail.com] Sent

Re: Lucene Text Similarity

2013-09-04 Thread David Miranda
Thanks to all, I will take into account your suggestions. But I think that should have given the concrete use case. Therefore, taking into account my first example given, I have the email received by a user and that email I extract topics of interest to associate the terms of DBpedia (basically DB

RE: Lucene Text Similarity

2013-09-04 Thread Allison, Timothy B.
I agree with Ivan and Koji. You also might want to look into MoreLikeThis, which should take care of finding the highest tf*idf terms for you to use in your query -- http://lucene.apache.org/core/4_4_0/queries/org/apache/lucene/queries/mlt/MoreLikeThis.html Best, Tim _

Re: Lucene Text Similarity

2013-09-04 Thread Ivan Krišto
On 09/03/2013 07:33 PM, David Miranda wrote: Is there any way to check the similarity of texts with Lucene? I have the DBpedia indexed and wanted to get the texts more similar between the abstract and DBpedia another text. If I do a search in the abstract field, with a particular text the result i