Re: [ANN] KEA-lucene (program pvlg extracts keyphrases from Lucene 2,5? 3. index)jģfgyrtrhhutlt

2016-07-15 Thread Muhammad Atif Qureshi
z

MmapDirectory and IndexReader reuse

2016-07-15 Thread Vladimir Kotal
Hi all, when trying to identify bottlenecks in our application, I found that each search which involves multiple indexes is performing lots of mmap()/open() syscalls. This is a natural consequence of using MmapDirectory. So even if file system caches are properly warmed, this might add coupl

RE: MmapDirectory and IndexReader reuse

2016-07-15 Thread Uwe Schindler
Hi, You should keep the IndexReader open for the whole time! Otherwise there are more bottlenecks and slowdowns. If you are updating the Index, you should use SearcherManager that reopens the index reader accordingly. After updating the index you should also not completely close and reopen the

Re: CustomAnalyzer and AttributeFactories

2016-07-15 Thread Terry Smith
Uwe, Thanks! I've created LUCENE-7382 for this issue. --Terry On Thu, Jul 14, 2016 at 3:54 PM, Uwe Schindler wrote: > Can you open issue? This is a bug because the wrong default is used. > > Uwe > > Am 14. Juli 2016 19:26:28 MESZ, schrieb Te

Re: "Point in polygon" search with Lucene / Spatial4j / JTS

2016-07-15 Thread Michael McCandless
FYI I just pushed an improvement (will be in Lucene 6.2) to Lucene's Polygon class, to make it easy to construct Polygons from a GeoJSON string without using an external spatial library: https://issues.apache.org/jira/browse/LUCENE-7380 That issue just adds a new Polygon.fromGeoJSON(String) static

Lucene with Semantic Vectors (LSA, LSI, LDA)?

2016-07-15 Thread Mila88
Hello, I have a project which indexes and scores documents using Lucene. However, I'd like to do that using semantic indexing (LSI, LSA, LDA or Semantic Vectors). I've read old posts and some people said that Semantic Vectors plays well with Lucene. However, I noticed that its classes are use