Hi
According to
http://www.gossamer-threads.com/lists/lucene/java-dev/37421
one cannot overwrite the default write lock timeout of 1000ms once a
write.lock already exists (for example inside a multi-threaded
web-application), because in order to use the method
setWriteLockTimeout(long) one
Am 13.12.11 19:36, schrieb Michael Wechner:
Hi
According to
http://www.gossamer-threads.com/lists/lucene/java-dev/37421
one cannot overwrite the default write lock timeout of 1000ms once a
write.lock already exists (for example inside a multi-threaded
web-application), because in order to
Maybe Tika is also of help to you
http://tika.apache.org/
HTH
Michael
Am 11.01.12 20:13, schrieb Reyna Melara:
Hi, my name is Reyna Melara. I'm a PhD student from Mexico, and I have a set
of 11,051,447 files with txt extension but the content of each file is in
fact in wiki format, I want and
you also might like to consider
Jackrabbit: http://jackrabbit.apache.org/
or
Yarep: https://github.com/wyona/yarep
which are both using Lucene for indexing, but the actual data storage is
hidden by an abstraction layer and is configurable/customizable.
HTH
Michael
Am 29.03.13 02:24, schri
On 3/22/11 8:40 AM, shrinath.m wrote:
On Tue, Mar 22, 2011 at 12:39 PM, Anshum-2 [via Lucene]<
ml-node+2713899-1210341880-376...@n3.nabble.com> wrote:
No as of now, there's no way to do so.
Thank you Anshum-2,
how do you propose I do this ?
I have thought of a way like this :
- first get the
On 3/22/11 10:09 AM, shrinath.m wrote:
On Tue, Mar 22, 2011 at 1:37 PM, Michael Wechner [via Lucene]<
ml-node+2714008-984126374-376...@n3.nabble.com> wrote:
are you looking for something like
http://hrycan.com/2009/11/26/updating-document-fields-in-lucene/
?
Precisely that.
I am O
Am 18.06.11 19:05, schrieb Steven A Rowe:
Hi Hamada,
Do you know about the Lucene demo?:
http://lucene.apache.org/java/3_2_0/demo.html
also you might want to use
http://code.google.com/p/luke/
in order to view your search index and check what fields it actually
contains
HTH
Michael
Ste
Hi
I have cloned the github version of Lucene/Solr yesterday
https://github.com/apache/lucene-solr
and was running
ant compile
ant test
successfully. Also Jetty seems to startup fine, but when I access
http://localhost:8983/solr/
then I receive
HTTP ERROR: 503
Problem accessing /solr
thanks very much for your help. I will use the solr mailing list for
future solr related questions.
After running
ant example
ant run-example
inside the solr folder, I was able to access
http://localhost:8983/solr
without a problem.
I think it would make sense to change the main README and th
Hi
I would like to train "DeepPavlov FAQ"
http://docs.deeppavlov.ai/en/master/features/skills/faq.html
https://colab.research.google.com/github/deepmipt/dp_notebooks/blob/master/DP_autoFAQ.ipynb
https://medium.com/deeppavlov/simple-intent-recognition-and-question-answering-with-deeppavlov-c54ccf
Hi
The following FAQ might be a bit outdated, but nevertheless you should
find some answers there as well
https://cwiki.apache.org/confluence/display/lucene/LuceneFAQ
For example to answer your question 4) see
https://cwiki.apache.org/confluence/display/lucene/LuceneFAQ#LuceneFAQ-CanIuseLuce
ngine as a personal project .
On Mon, 5 Apr 2021, 10:57 Michael Wechner,
wrote:
Hi
The following FAQ might be a bit outdated, but nevertheless you should
find some answers there as well
https://cwiki.apache.org/confluence/display/lucene/LuceneFAQ
For example to answer your question 4) s
Hi
I recently found the following articles re Lucene/Solr and BERT
https://dmitry-kan.medium.com/neural-search-with-bert-and-solr-ea5ead060b28
https://medium.com/swlh/fun-with-apache-lucene-and-bert-embeddings-c2c496baa559
and would like to ask whether there might be more recent developments
w
e are some test
suites that index and search Glove vectors. My first impression was that
indexing seems surprisingly slow, but it's entirely possible I'm doing
something wrong.
On Wed, Apr 21, 2021 at 9:31 AM Michael Wechner
wrote:
Hi
I recently found the following articles re Lucen
Hi Amitesh
I don't have statistical proof, but I think it doesn't help on mailing
lists with volunteers to write "I badly need some help", because it
seems to me the contrary will happen, that people will not help at all.
I think there are various reasons for this behaviour, which is
inter
Hi Amitesh
Thanks for the more concrete examples.
Unfortunately I do not know how to solve this better with Lucene itself
in a more general context, but did you ever consider using BERT in
combination with Lucene/Solr
https://blog.google/products/search/search-language-understanding-bert/
ht
Yes, it would be great if you could share code snippets. Maybe it will
help others or maybe someone will have a suggestion to improve or an
alternative.
All the best
Michael
Am 29.04.21 um 14:35 schrieb amitesh116:
Thank you Michael!
I solved this requirement by setting the tokenStream at t
uites that index and search Glove vectors. My first impression was that
indexing seems surprisingly slow, but it's entirely possible I'm doing
something wrong.
On Wed, Apr 21, 2021 at 9:31 AM Michael Wechner
wrote:
Hi
I recently found the following articles re Lucene/Solr and BERT
sure the VectorFormat API (might still get
renamed due to confusion with other kinds of vectors existing in
Lucene) can support alternative KNN implementations.
On Wed, May 19, 2021 at 12:22 PM Michael Wechner
wrote:
Hi Alex
Just to make sure I understand better what the additions are about
Am
ds of vectors existing in
Lucene) can support alternative KNN implementations.
On Wed, May 19, 2021 at 12:22 PM Michael Wechner
wrote:
Hi Alex
Just to make sure I understand better what the additions are about
Am 21.04.21 um 17:21 schrieb Alex K:
There were a couple additions recently merge
Hi
I am using Lucene 8.8.2 in production and I am currently doing some
tests using 9.0.0-SNAPSHOT, whereas I have included
lucene-backward-codecs, because in the log files it was asking me
whether I have forgotten to include lucene-backward-codecs.jar
org.apache.lucene
Hi Alex
Thank you very much for your feedback and the various insights!
Am 26.05.21 um 04:41 schrieb Alex K:
Hi Michael and others,
Sorry just now getting back to you. For your three original questions:
- Yes, I was referring to the Lucene90Hnsw* classes. Michael S. had a
thorough response.
-
:
I think you need backward-codecs-9.0.0-SNAPSHOT there. It enables 9.0
to read 8.x indexes.
On Wed, May 26, 2021 at 9:27 AM Michael Wechner
wrote:
Hi
I am using Lucene 8.8.2 in production and I am currently doing some
tests using 9.0.0-SNAPSHOT, whereas I have included
lucene-backward-codecs
t
indexing, and searching, performance, you should generally index as
large a number of documents as possible before flushing.
-Mike
On Wed, May 26, 2021 at 9:43 AM Michael Wechner
wrote:
Hi Alex
Thank you very much for your feedback and the various insights!
Am 26.05.21 um 04:41 schrieb Alex K:
I have added a QnA
https://cwiki.apache.org/confluence/display/LUCENE/LuceneFAQ#LuceneFAQ-WhenIupradeLucene,forexamplefrom8.8.2to9.0.0,doIhavetoreindex?
Hope that makes sense, otherwise let me know and I can correct/update :-)
Am 26.05.21 um 23:56 schrieb Michael Wechner:
using lucene
possible you *should*
update because the 8.x index may not be able to be read by the
eventual 10 release.
On Thu, May 27, 2021 at 7:52 AM Michael Wechner
wrote:
I have added a QnA
https://cwiki.apache.org/confluence/display/LUCENE/LuceneFAQ#LuceneFAQ-WhenIupradeLucene,forexamplefrom8.8.2to9.0.0
Hi
According to the FAQ one can delete documents using the IndexReader
https://cwiki.apache.org/confluence/display/lucene/lucenefaq#LuceneFAQ-HowdoIdeletedocumentsfromtheindex?
but when I look at the javadoc of Lucene version 8_8_2
https://lucene.apache.org/core/8_8_2/core/org/apache/lucene/in
cool, thanks very much for your quick response and updating the FAQ!
Am 17.06.21 um 10:28 schrieb Adrien Grand:
Good catch Michael, deleting from IndexReader was actually removed a
long time ago. I just edited the FAQ to correct this.
On Thu, Jun 17, 2021 at 10:08 AM Michael Wechner
I don't know either, although I searched a little and found various good
explanations of what segments are, e.g.
https://www.alibabacloud.com/blog/analysis-of-lucene---basic-concepts_594672
but not in which order the segments are being read.
I am not sure where in the code the segments are being
Hi
I am trying to implement a search with Lucene similar to what, for
example, various "Note Apps" (e.g. "Google Keep" or "Samsung Notes")
offer, where with every new letter typed a new search is executed.
For example when I type "tes", then all documents are being returned
contain
really want to just search on prefixes and
jumble up the results (perhaps because you are gonna just sort by some
custom document feature instead of relevance), then you can do that if
you really want. You can use the n-gram/edge-ngram/shingle filters in
the analysis package for that.
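
To make the edge n-gram idea concrete, here is a minimal plain-Java sketch. It does not use Lucene's analysis classes; the class and method names (EdgeNGramSketch, edgeNGrams) are made up for illustration. It only shows which prefixes would be produced for a token, so that typed input like "tes" can match as an exact term:

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only: plain-Java edge n-grams, not a Lucene token filter. */
class EdgeNGramSketch {

    /** Returns the leading substrings of token with lengths minGram..maxGram, capped at the token length. */
    static List<String> edgeNGrams(String token, int minGram, int maxGram) {
        List<String> grams = new ArrayList<>();
        int upper = Math.min(maxGram, token.length());
        for (int len = minGram; len <= upper; len++) {
            grams.add(token.substring(0, len));
        }
        return grams;
    }

    public static void main(String[] args) {
        // Index-time view: "testing" is stored under each of its prefixes,
        // so the typed input "tes" can be looked up as an ordinary term.
        System.out.println(edgeNGrams("testing", 1, 5));
    }
}
```

In a real index the same expansion would be done by an analyzer at index time, while the query side is left unexpanded.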
On Wed, Oct 6, 2
Michael
On Thu, Oct 7, 2021 at 2:30 AM Michael Wechner
wrote:
Thanks very much for your feedback!
I will try it :-)
As I wrote I would like to add a summary to the Lucene FAQ
(https://cwiki.apache.org/confluence/display/lucene/lucenefaq)
Would the following questions make sense
according to Lucene
8.10.1 suggest API
If you know any simple, recent examples, please let me know
Thanks
Michael
Am 08.10.21 um 21:40 schrieb Michael Wechner:
Am 08.10.21 um 18:49 schrieb Michael Sokolov:
Thank you for offering to add to the FAQ! Indeed it should mention the
suggester
Hi
I recently started to use the Autosuggest/Autocomplete package as
suggested by Robert
https://www.mail-archive.com/java-user@lucene.apache.org/msg51403.html
which works very fine, thanks again for your help :-)
But it is not clear to me what the best practices are for building a
suggester us
("contract
search","",asList("a84581a3-302f-4b73-80d9-0e60da5238f9"),1));
entities.add(new Item("claims management
system","",asList("a84581a3-302f-4b73-80d9-0e60da5238f9"),1));
suggester.build(new ItemIterator(entities.iterator()));
)
I was
Hi Yuxin
Can you provide a concrete example of a query and a document/code snippet?
Thanks
Michael
Am 20.12.21 um 03:06 schrieb Yuxin Liu:
Dear development community of Lucene:
Hi from student research assistant Yuxin Liu. I'm using Lucene to build an
index search for source code indexes usi
Hi Together
I would be interested to submit a proposal/presentation re Lucene's
vector search, but would like to ask first whether somebody else wants
to do this as well or might be interested to do this together?
Thanks
Michael
Am 30.03.22 um 14:16 schrieb Rich Bowen:
[You are receiving
uld be reviewed independently and if there is
another proposal that clashes, the abstract would help the program
committee pick the one (or both) that's best suited for the audience.
Good luck!
-Anshum
On Wed, Mar 30, 2022 at 5:47 AM Michael Wechner
wrote:
Hi Together
I would be interested
for helping spread the word about Lucene's new vector
search capabilities!
On Thu, Mar 31, 2022 at 7:36 AM Michael Wechner
wrote:
ok :-) thanks!
Anyway, if somebody would like to join re a "vector search" proposal,
please let me know
Michael
Am 30.03.22 um 20:13 schrieb An
Hi Lokesh
IIUC each document (for example a shop description) has a longitude
and a latitude associated with it.
The user's search input consists of some keywords and the user's geo location.
The keywords you use to search for the documents and the user's geo
location you would like to use for
Hi Bridger
Inside
https://dlcdn.apache.org/lucene/java/9.1.0/lucene-9.1.0.tgz
you should find
modules/lucene-core-9.1.0.jar
modules/lucene-queryparser-9.1.0.jar
modules/lucene-analysis-common-9.1.0.jar
modules/lucene-demo-9.1.0.jar
I guess the documentation is not quite right.
Re your second
PR
Thanks
Michael
Am 25.04.22 um 23:37 schrieb Michael Wechner:
Hi Bridger
Inside
https://dlcdn.apache.org/lucene/java/9.1.0/lucene-9.1.0.tgz
you should find
modules/lucene-core-9.1.0.jar
modules/lucene-queryparser-9.1.0.jar
modules/lucene-analysis-common-9.1.0.jar
modules/lucene-demo-9.
great, thanks!
Am 26.04.22 um 21:48 schrieb Michael Sokolov:
thanks, I fixed the doc!
On Tue, Apr 26, 2022 at 9:13 AM Bridger Dyson-Smith
wrote:
Hi Michael -
On Mon, Apr 25, 2022 at 5:38 PM Michael Wechner
wrote:
Hi Bridger
Inside
https://dlcdn.apache.org/lucene/java/9.1.0/lucene-9.1.0
Hi
I noticed that Lucene 9.1.0 supports filtering in nearest-neighbor
vector search, which is great
I have found
https://issues.apache.org/jira/browse/SOLR-15947
https://issues.apache.org/jira/browse/LUCENE-10382
and
https://lucene.apache.org/core/9_1_0/demo/org/apache/lucene/demo/knn/packa
Hi
I noticed that Lucene 9.1.0 supports filtering in nearest-neighbor
vector search, which is great :-)
I have found
http://url7093.wyona.com/ls/click?upn=JOH5Fjdv9AA9sbvUyiP84WWONyl36e4Tdd3VZFG-2B7pcYPJTPhVT3xqtcUDjPgQX5jI0WYWlJZX8h9NDC6okDRg-3D-3DHvvY_UMWFA-2BOn91WS4mEQPCWI9gZNzEZlJPmWPGP2C
-summary.html
which I was not aware of, but disabled the tracking now and hope it will
be ok now.
Thanks
Michael
Am 09.05.22 um 15:12 schrieb Michael Wechner:
Hi
I noticed that Lucene 9.1.0 supports filtering in nearest-neighbor
vector search, which is great :-)
I have found
http://url7093
et us know if you run into any questions/ issues while trying it out!
Julie
On Mon, May 9, 2022 at 8:08 AM Michael Wechner
wrote:
sorry for the URLs below.
I have tested Twilio SendGrid as outgoing server and it just rewrote the
URLs
https://issues.apache.org/jira/browse/SOLR-15947
https://issues
rch(query, k);
Does that make sense to you?
Thanks
Michael
Am 11.05.22 um 07:59 schrieb Michael Wechner:
Hi Julie
Cool, thanks!
I will try to apply it and, if it works, I could create an example for the demo
package.
Will keep you posted :-)
Thanks
Michael
Am 11.05.22 um 02:13 schrieb Julie Tibshi
AM Michael Wechner
Hi Julie
I got it running and it seems to work fine so far :-)
Re an example for the demo package, I guess this would go here
https://lucene.apache.org/core/9_1_0/demo/org/apache/lucene/demo/knn/package-summary.html
and I thought of something like
we are using AnalyzingInfixSuggester but I would also be curious to
know whether this is the best way :-)
Thanks
Michael
Am 25.05.22 um 14:39 schrieb Anastasiya Tarasenko:
Hi All,
I have a question regarding auto-complete functionality in Lucene.
On the StackOverflow the suggestion regardin
Maybe I misunderstand the problem, but why don't you decouple showing
the results from the results of the query?
Am 23.06.22 um 14:03 schrieb Patrick Bernardina:
How to test if a value in a multi-value field matches a specific query?
Example of the problem:
I've created a query to return all
Hi
I am currently filtering a KnnVectorQuery as follows
Query filter = new TermQuery(new Term(CLASSIFICATION_FIELD, classification));
query = new KnnVectorQuery(VECTOR_FIELD, queryVector, k, filter);
but it is not clear to me how I can filter for multiple terms.
Should I subclass MultiTermQuery
BooleanQuery.Builder.
As noted in TermInSetQuery (
https://github.com/apache/lucene/blob/main/lucene/core/src/java/org/apache/lucene/search/TermInSetQuery.java#L62)
multiple terms could be represented as a boolean query with Occur.SHOULD.
~Matt
On Wed, Aug 31, 2022 at 11:15 AM Michael Wechner
wrote
u can also pass a BooleanQuery with
multiple terms or a combination of other queries, a numeric range,...
or a fulltext query out of Lucene's query parsers.
Uwe
Am 31.08.2022 um 22:19 schrieb Michael Wechner:
Hi Matt
Thanks very much for your feedback!
According to your links I will try
C
great, thank you very much!
Just in time for ApacheCon :-)
Am 01.10.22 um 00:09 schrieb Michael Sokolov:
The Lucene PMC is pleased to announce the release of Apache Lucene 9.4.0.
Apache Lucene is a high-performance, full-featured search engine
library written entirely in Java. It is a technolo
Hi
I have just upgraded from 9.1.0 to 9.4.0 and compiling works fine, but
when I run and re-index my data using KnnVectorField, then I receive the
following exception:
java.lang.UnsupportedOperationException: Old codecs may only be used for
reading
at
org.apache.lucene.backward_codecs.l
?
Thanks
Michael
Am 01.10.22 um 08:06 schrieb Adrien Grand:
I would guess that you are configuring your IndexWriterConfig with a
"Lucene91Codec" instance. You need to replace it with a "Lucene94Codec"
instance.
Le sam. 1 oct. 2022, 06:12, Michael Wechner a
écrit :
Hi
I hav
HNSW parameters? If so, there is no better way than what you are doing.
Le sam. 1 oct. 2022, 12:31, Michael Wechner a
écrit :
Hi Adrien
Thank you very much for your help!
That was it :-) I completely forgot that I set this somewhere hidden
inside my code.
I made a note in the pom file, such that I s
Hi Together
I just read the following article, where the author compares Lucene and
Vespa re HNSW
https://bergum.medium.com/will-new-vector-databases-dislodge-traditional-search-engines-b4fdb398fb43
What is your take on "comparing Lucene and Vespa re HNSW latency and
recall"?
Thanks
Micha
l comparison, but every choice
is a compromise. We've known for centuries that "Odyous of olde been
comparisonis, And of comparisonis engendyrd is haterede."
On Sat, Oct 1, 2022 at 7:18 AM Michael Wechner
wrote:
Hi Together
I just read the following article, where the author compares L
Hi
I just noticed that the ApacheCon Asia 2022 recordings have been published on YouTube
https://apachecon.com/
https://www.youtube.com/c/TheApacheFoundation/playlists
Will this also happen for ApacheCon North America 2022?
Thanks
Michael
Hi
In the Lucene FAQ there is no mention of tf-idf or BM25 and I would
like to add some notes, but to be sure I don't write anything wrong I
would like to ask
whether the current default similarity implementation of Lucene is
really BM25
as described at
https://opensourceconnec
cool, thanks!
Am 23.11.22 um 10:55 schrieb Adrien Grand:
This is correct. See IndexSearcher#getDefaultSimilarity().
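
For readers adding notes to the FAQ, the following is a rough plain-Java sketch of the textbook BM25 form, not a copy of Lucene's BM25Similarity. The class name Bm25Sketch is made up, the values k1 = 1.2 and b = 0.75 are the commonly used defaults, and Lucene's implementation may differ in details such as how document lengths are encoded and which constant factors are kept:

```java
/** Illustrative only: textbook-style BM25, not Lucene's BM25Similarity. */
class Bm25Sketch {

    /** Inverse document frequency: rarer terms get a higher weight. */
    static double idf(long docCount, long docFreq) {
        return Math.log(1.0 + (docCount - docFreq + 0.5) / (docFreq + 0.5));
    }

    /** Saturating term-frequency component with document length normalization. */
    static double tfNorm(double freq, double docLen, double avgDocLen, double k1, double b) {
        return freq / (freq + k1 * (1.0 - b + b * docLen / avgDocLen));
    }

    /** Score of one term for one document. */
    static double score(double freq, double docLen, double avgDocLen,
                        long docCount, long docFreq, double k1, double b) {
        return idf(docCount, docFreq) * tfNorm(freq, docLen, avgDocLen, k1, b);
    }

    public static void main(String[] args) {
        // Assumed toy statistics: 1000 documents, the term occurs in 10 of them.
        System.out.println(score(2, 120, 100, 1000, 10, 1.2, 0.75));
    }
}
```

Unlike classic tf-idf, the term-frequency part saturates: repeating a term many times raises the score less and less, and longer documents are penalized via b.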
On Wed, Nov 23, 2022 at 10:53 AM Michael Wechner
wrote:
Hi
In the Lucene FAQ there is no mention of tf-idf or BM25 and I would
like to add some notes, but to be sure I
I have enhanced the FAQ
https://cwiki.apache.org/confluence/display/LUCENE/LuceneFAQ#LuceneFAQ-Whatisthedefaultrelevance/similarityimplementationofLucene?
Hope it is ok like this :-)
Thanks
Michael
Am 23.11.22 um 10:58 schrieb Michael Wechner:
cool, thanks!
Am 23.11.22 um 10:55 schrieb
Hi
I am using Lucene 9.4.2 vector search and everything seems to work fine,
except that when I delete some documents from the index, then the method
https://lucene.apache.org/core/9_0_0/core/org/apache/lucene/index/IndexReader.html#numDeletedDocs()
always returns 0, whereas I would have expect
reader to see deletes from the indexwriter.
Am 08.12.2022 um 10:32 schrieb Hrvoje Lončar:
Did you call this method before or after commit method?
My wild guess would be that you can count deleted documents inside
transaction only.
On Thu, Dec 8, 2022 at 12:10 AM Michael Wechner
wrote:
Hi
I
is high.
Uwe
Am 08.12.2022 um 11:44 schrieb Michael Wechner:
My code at the moment is as follows:
Directory dir = FSDirectory.open(Paths.get(vectorIndexPath));
IndexReader reader =
DirectoryReader.open(FSDirectory.open(Paths.get(vectorIndexPath)));
int numberOfDocsBeforeDeleting
Hi Anh
The following Stackoverflow link might help
https://stackoverflow.com/questions/73240494/can-someone-assist-me-with-a-multi-word-synonym-problem-in-lucene
The following thread seems to confirm, that escaping the space with a
backslash does not help
https://lists.apache.org/list?java-u
r SynonymQuery; I have
just used the standard QueryParser. Instead the synonym processing occurs
in the indexing phase, which is not only simpler (one search pattern, one
query), but also I think you would also find it gives you superior
performance (because the synonym processing occurs once at indexing
Hi
IIUC Lucene currently supports
VectorSimilarityFunction.COSINE
VectorSimilarityFunction.DOT_PRODUCT
VectorSimilarityFunction.EUCLIDEAN
whereas some embedding models have been trained with other metrics.
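
As a reminder of what these measures compute, here is a small plain-Java sketch of the raw formulas. It does not reproduce how Lucene turns them into scores; the class name VectorSimilaritySketch is made up, and the Manhattan distance is included only as an example of a metric that is not in the list above:

```java
/** Illustrative only: raw similarity and distance formulas, not Lucene's score functions. */
class VectorSimilaritySketch {

    private static void checkSameLength(float[] a, float[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("vector dimensions differ: " + a.length + " vs " + b.length);
        }
    }

    static double dotProduct(float[] a, float[] b) {
        checkSameLength(a, b);
        double sum = 0;
        for (int i = 0; i < a.length; i++) {
            sum += (double) a[i] * b[i];
        }
        return sum;
    }

    /** Cosine of the angle between a and b; undefined (NaN) for zero vectors. */
    static double cosine(float[] a, float[] b) {
        return dotProduct(a, b) / Math.sqrt(dotProduct(a, a) * dotProduct(b, b));
    }

    static double euclideanDistance(float[] a, float[] b) {
        checkSameLength(a, b);
        double sum = 0;
        for (int i = 0; i < a.length; i++) {
            double d = (double) a[i] - b[i];
            sum += d * d;
        }
        return Math.sqrt(sum);
    }

    /** Example of another metric (L1), shown only for comparison. */
    static double manhattanDistance(float[] a, float[] b) {
        checkSameLength(a, b);
        double sum = 0;
        for (int i = 0; i < a.length; i++) {
            sum += Math.abs((double) a[i] - b[i]);
        }
        return sum;
    }

    public static void main(String[] args) {
        float[] a = {1f, 2f, 3f};
        float[] b = {4f, 5f, 6f};
        System.out.println(dotProduct(a, b) + " " + cosine(a, b) + " " + euclideanDistance(a, b));
    }
}
```

Note that for vectors normalized to unit length, cosine and dot product give the same ranking.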
Also see
https://docs.scipy.org/doc/scipy/reference/generated/scipy.spatial.distance.cdi
vectors format that ignores the vector
similarity configured on the field and uses its own.
Le sam. 14 janv. 2023, 21:33, Michael Wechner a
écrit :
Hi
IIUC Lucene currently supports
VectorSimilarityFunction.COSINE
VectorSimilarityFunction.DOT_PRODUCT
VectorSimilarityFunction.EUCLIDEAN
whereas s
aybe it is easier to just contribute another metric as part of the
source, than make it configurable dynamically with a custom implementation.
Thanks
Michael
On Sat, Jan 14, 2023 at 6:04 PM Michael Wechner
wrote:
Hi Adrien
Thanks for your feedback! Whereas I am not sure I fully understand wha
Hi
I use the vector search of Lucene, with embeddings that I get, for example, from
SentenceBERT.
According to
https://www.sbert.net/examples/applications/retrieve_rerank/README.html
a re-ranking with a cross-encoder after the vector search (bi-encoding)
can improve the ranking.
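
The retrieve-and-rerank pattern can be sketched in plain Java as follows. This is only an illustration under assumptions: the class RerankSketch and its methods are made up, and the word-overlap scorer is a toy stand-in for a real cross-encoder model, which would be called in the same place:

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.function.ToDoubleBiFunction;

/** Illustrative only: second-stage re-ranking of first-stage hits. */
class RerankSketch {

    /** Scores each candidate once with the (expensive) scorer and returns the topN best. */
    static List<String> rerank(String query, List<String> candidates,
                               ToDoubleBiFunction<String, String> scorer, int topN) {
        int n = candidates.size();
        double[] scores = new double[n];
        Integer[] order = new Integer[n];
        for (int i = 0; i < n; i++) {
            scores[i] = scorer.applyAsDouble(query, candidates.get(i));
            order[i] = i;
        }
        // Stable sort, highest score first: ties keep the first-stage order.
        Arrays.sort(order, (x, y) -> Double.compare(scores[y], scores[x]));
        List<String> result = new ArrayList<>();
        for (int i = 0; i < Math.min(topN, n); i++) {
            result.add(candidates.get(order[i]));
        }
        return result;
    }

    /** Toy stand-in for a cross-encoder: counts query words that occur in the text. */
    static double wordOverlap(String query, String text) {
        Set<String> textWords = new HashSet<>(Arrays.asList(text.toLowerCase().split("\\s+")));
        double count = 0;
        for (String word : query.toLowerCase().split("\\s+")) {
            if (textWords.contains(word)) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        // Pretend these are the hits returned by the vector search (bi-encoder stage).
        List<String> hits = List.of("lucene vector search", "apache tika parser",
                "vector search with lucene and bert");
        System.out.println(rerank("lucene bert", hits, RerankSketch::wordOverlap, 2));
    }
}
```

Because the second stage only sees the hits returned by the first stage, its cost grows with the number of candidates rather than with the index size.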
Would it
em with
your vectors, very fast, only 500 calculations required, no HNSW or
anything needed. Of course you could use a vector search instead of a
BM25 search as the initial search to pull the top 500 hits too.
So it could meet both use-cases and provide a really performant option
for users that want
Hi Marcos
The indexing looks kind of
Document doc = new Document();
float[] vector = getEmbedding(text);
FieldType vectorFieldType = KnnVectorField.createFieldType(vector.length,
VectorSimilarityFunction.COSINE);
KnnVectorField vectorField = new KnnVectorField("my_vector_field", vector,
vectorFi
Thank you very much for the release! Works very fine so far :-)
All the best
Michael
Am 10.05.23 um 09:49 schrieb Alan Woodward:
The Lucene PMC is pleased to announce the release of Apache Lucene 9.6.0.
Apache Lucene is a high-performance, full-featured search engine library
written entirely
Hi
I use Lucene 9.7.0 but experienced the same behaviour with Lucene 9.6.0
when doing vector search as follows:
I have indexed about 200 vectors (dimension 768)
I build the query as follows
Query query = new KnnFloatVectorQuery("vector-field-name",
queryVector, k);
and do the search as f
" KNN and can get caught in local minima
(maxima?). Increasing K has, indirectly, the effect of expanding the
search space because the minimum score in the priority queue (score of
the Kth item) is used as a threshold for deciding when to terminate
the search
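
To illustrate the role of the K-th score, here is a plain-Java sketch of an exhaustive (exact) top-k search with a min-heap. This is not the HNSW graph search used by Lucene, and the class name ExactKnnSketch is made up; for a small index of a few hundred vectors, such a brute-force pass can serve as a reference to check approximate results against:

```java
import java.util.PriorityQueue;

/** Illustrative only: exact brute-force top-k by dot product, not HNSW. */
class ExactKnnSketch {

    static double dot(float[] a, float[] b) {
        double sum = 0;
        for (int i = 0; i < a.length; i++) {
            sum += (double) a[i] * b[i];
        }
        return sum;
    }

    /** Returns the indices of the k highest-scoring vectors, best first. */
    static int[] topK(float[][] vectors, float[] query, int k) {
        if (k <= 0) {
            return new int[0];
        }
        // Min-heap of {score, index}: the head is the current k-th best hit,
        // whose score acts as the threshold a new candidate has to beat.
        PriorityQueue<double[]> heap = new PriorityQueue<>((x, y) -> Double.compare(x[0], y[0]));
        for (int i = 0; i < vectors.length; i++) {
            double score = dot(vectors[i], query);
            if (heap.size() < k) {
                heap.add(new double[] {score, i});
            } else if (score > heap.peek()[0]) {
                heap.poll();
                heap.add(new double[] {score, i});
            }
        }
        // Polling yields the worst remaining hit first, so fill the result backwards.
        int[] result = new int[heap.size()];
        for (int pos = result.length - 1; pos >= 0; pos--) {
            result[pos] = (int) heap.poll()[1];
        }
        return result;
    }

    public static void main(String[] args) {
        float[][] vectors = {{1f, 0f}, {0f, 1f}, {0.9f, 0.1f}, {-1f, 0f}};
        System.out.println(java.util.Arrays.toString(topK(vectors, new float[] {1f, 0f}, 2)));
    }
}
```

With a larger k the threshold (the heap head) is lower, so more candidates pass it; in an approximate graph search this is what indirectly widens the explored region.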
On Wed, Aug 2, 2023 at 5:19 PM Michael We
Hi Together
You might be interested in this paper / article
https://arxiv.org/abs/2308.14963
Thanks
Michael
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@luce
Hi
I recently noticed that
IndexReader.document(int)
is deprecated, whereas my code is currently
TopDocs topDocs = searcher.search(query, k);
for (ScoreDoc scoreDoc : topDocs.scoreDocs) {
Document doc = indexReader.document(scoreDoc.doc);
}
How do I best replace document(int)?
Thanks
M
)
- Shubham
On Mon, Sep 25, 2023 at 1:59 AM Michael Wechner
wrote:
Hi
I recently noticed that
IndexReader.document(int)
is deprecated, whereas my code is currently
TopDocs topDocs = searcher.search(query, k);
for (ScoreDoc scoreDoc : topDocs.scoreDocs) {
Document doc
);
}
```
Note that these StoredFields and TermVectors instances should only
be consumed in the thread where
they were acquired. For instance, it is illegal to share them across
threads.
Uwe
Am 25.09.2023 um 07:53 schrieb Michael Wechner:
Hi Shubham
Great, thank you very much
that
gives the missing information in 9.x Javadocs, too.
Uwe
Am 25.09.2023 um 11:02 schrieb Michael Wechner:
you mean once per search request?
I mean for example
GET https://localhost:8080/search?q=Lucene
and the following would be executed
IndexReader reader = DirectoryReader
Hi
I recently upgraded Lucene to 9.8.0 and was running tests with OpenAI's
embedding model, which has the vector dimension 1536 and received the
following error
Field[vector]vector's dimensions must be <= [1024]; got 1536
whereas this worked previously with the hack to override the vector
di
I forgot to mention that using the custom FieldType with a vector
dimension of 1536 does work with Lucene 9.7.0
Thanks
Michael
Am 19.10.23 um 10:39 schrieb Michael Wechner:
Hi
I recently upgraded Lucene to 9.8.0 and was running tests with
OpenAI's embedding model, which has the v
trees.
Uwe
Am 19.10.2023 um 10:53 schrieb Michael Wechner:
I forgot to mention that using the custom FieldType with a vector
dimension of 1536 does work with Lucene 9.7.0
Thanks
Michael
Am 19.10.23 um 10:39 schrieb Michael Wechner:
Hi
I recently upgraded Lucene to 9.8.0 and was r
Hi
I have found the following simple Facet Example
https://github.com/apache/lucene/blob/main/lucene/demo/src/java/org/apache/lucene/demo/facet/SimpleFacetsExample.java
whereas for a simple categorization of documents I currently use
StringField, e.g.
doc1.add(new StringField("category", "bo
less
http://blog.mikemccandless.com
On Fri, Oct 20, 2023 at 5:43 AM Michael Wechner
wrote:
Hi
I have found the following simple Facet Example
https://github.com/apache/lucene/blob/main/lucene/demo/src/java/org/apache/lucene/demo/facet/SimpleFacetsExample.java
whereas for a simple categorization of d
et by different
points/levels of your hierarchy.
Mike McCandless
http://blog.mikemccandless.com
On Fri, Oct 20, 2023 at 5:43 AM Michael Wechner
wrote:
> Hi
>
> I have found the following simple Facet Example
>
>
>
https://github
omyWriter).
It used to be that the "doc values" based faceting did not support
arbitrary hierarchy, but I think that was fixed at some point.
Mike McCandless
http://blog.mikemccandless.com
On Fri, Oct 20, 2023 at 9:03 AM Michael Wechner
wrote:
Hi Mike
Thanks for your feedback!
II
hich case, yes, you need to create a TaxonomyWriter).
It used to be that the "doc values" based faceting did not support
arbitrary hierarchy, but I think that was fixed at some point.
Mike McCandless
http://blog.mikemccandless.com
On Fri, Oct 20, 2023 at 9:03 AM Michael Wechner <
probably have too many ways to do the same
thing in the faceting module, and maybe our documentation could be a bit
more helpful.
Cheers,
-Greg
On Fri, Oct 20, 2023 at 2:54 PM Michael Wechner
wrote:
thanks very much for this additional information, Marc!
Am 20.10.23 um 20:30 schrieb Marc D
nt vectors format like a delegator as described
before.
The responsibility was shifted to the codec, because there may be
better alternatives to HNSW that have different limits especially with
regard to performance during merging and query response times, e.g.
BKD trees.
Uwe
Am 19.10.2023 um
* implement getKnnVectorsFormatForField() and return the wrapper with
other max dimension
Reading indexes still works with unmodified default codec, you only
need to set it for IndexWriter. When reading the actual codec is
looked up by name.
Uwe
Am 07.11.2023 um 17:03 schrieb Michael Wechner:
Hi Uwe
Hi
IIUC I can get all terms of a particular field of an index with
IndexReader reader = DirectoryReader.open(FSDirectory.open(Paths.get("index_directory")));
List<LeafReaderContext> list = reader.leaves();
for (LeafReaderContext lrc : list) {
Terms terms = lrc.reader().terms("field_name");
if (terms != null) {
TermsEnum termsEn
alyse it again, get all terms.
Good Luck
On Sun, Nov 12, 2023 at 7:47 PM Michael Wechner
wrote:
Hi
IIUC I can get all terms of a particular field of an index with
IndexReader reader = DirectoryReader.open(FSDirectory.open(Paths.get("index_directory")));
List<LeafReaderContext> list = reader.leaves();
for (LeafReaderContext
here
> https://github.com/apache/lucene/blob/4e2ce76b3e131ba92b7327a52460e6c4d92c5e33/lucene/highlighter/src/java/org/apache/lucene/search/highlight/Highlighter.java#L159
>
>
> On Sun, Nov 12, 2023 at 11:42 PM Michael Wechner
> wrote:
>
>> Hi Mikhail
>>
>> Thank you
/core/9_8_0/core/org/apache/lucene/analysis/TokenStream.html
correctly, then one should add it.
Thanks
Michael
Am 12.11.23 um 23:36 schrieb Michael Wechner:
Thanks again, whereas I think I have found now what I wanted (without needing
the Highlighter):
IndexReader reader = DirectoryReader.o
Hi Tamas
Can one download your plugin somewhere to test it?
Thanks
Michael
Am 13.11.23 um 10:07 schrieb Balog Tamás:
Hello everyone!
I've been working on a proof of concept of creating an IntelliJ plugin from the
Luke application and it reached a demoable state.
If anyone of the Lucene c