Search similar documents using dense vectors (alternative to MORELIKETHIS)

2016-02-24 Thread Jan Rygl
Hello, I would like to ask if has somebody tried/planned to implement indexing for dense vectors. The default scoring process is suitable only for text documents, but we would like to use/support/develop a plugin enabling to combine/replace default index by the dense vector index for non-textual d

Re: Spaces in regular expressions

2016-02-24 Thread Kudrettin Güleryüz
I appreciate the pointers Jack. More on that, where can I read more on enabling full regexp support on indexed source code documents using Lucene? Any suggestions regarding cases where developers implemented this kind of capability using Lucene/Solr/ElasticSearch/... would be more than welcome. T

Grouping on MultiValue Field

2016-02-24 Thread Taher Galal
Hi, I have a tree structure of documents, with a document having multiple parents. This is mapped in the Lucene index using a multivaluefield when indexing as so: Document doc = new Document(); doc.add(new Field("uuid","value",MYFIELDTYPE.NOTANALYZED); doc.add(new Field("parent","parent",MYFIELDT

RE: Weird ClassCastException running lucene 5.2.1 on Java 1.8.

2016-02-24 Thread Uwe Schindler
Hi Kannan, the errors you see look like JVM bugs. We have no reports about this, but as the 2nd occurrence of this bug also refers to Java 8u65, I would say: Broken JVM. This has nothing to do with Lucene and recompiling the sources with Java 8 will make no difference, as this is caused by brok

Re: Weird ClassCastException running lucene 5.2.1 on Java 1.8.

2016-02-24 Thread Torsten Krah
Hi, you've already asked something similar in January about index corruption and Uwe Schindler already told you to switch to a modern JVM and not that antique one. See: http://www.gossamer-threads.com/lists/lucene/java-user/296418?page=last Quote from him: ... This Java version is "antique".