Re: [VOTE] Lucene logo contest, third time's a charm

2020-09-02 Thread Marcio Napoli
vote: A1, A2 Em qua., 2 de set. de 2020 às 16:06, Michal Hlavac escreveu: > vote: A1, A2 > > > > > - > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apac

Re: Updating specific fields of huge docs

2019-02-14 Thread Marcio Napoli
Hi Luís, If the contents of the files dont change one solution is to store the text parsed by tika in a compressed way, ~7% extracted text size. In updating the document, just search the old one with the contents ready (compressed) and update the other fields that you need. Best, Marcio http://w

Re: Facet

2016-06-20 Thread Marcio Napoli
e it's an inverted structure > but facets need a forward structure. > > Mike McCandless > > http://blog.mikemccandless.com > > On Wed, Jun 15, 2016 at 9:25 AM, Marcio Napoli > wrote: > > > Hey! > > > > The Lucene facets module uses integer encodin

Facet

2016-06-15 Thread Marcio Napoli
Hey! The Lucene facets module uses integer encoding using the method " FacetsConfig.dedupAndEncode " . It would be convenient to use the IntPoint ? Thanks! Marcio Napoli

Re: IndexWriter is not closing the FDs (deleted files)

2015-09-01 Thread Marcio Napoli
ccandless.com/2011/09/lucenes-searchermanager-simplifies.html > > Regards, > András > > On Tue, Sep 1, 2015 at 2:16 PM, Marcio Napoli > wrote: > > > Ian, > > > > Well remembered! > > > > Thanks, > > Napoli > > > > Em ter,

Re: IndexWriter is not closing the FDs (deleted files)

2015-09-01 Thread Marcio Napoli
toryReader.html#openIfChanged(org.apache.lucene.index.DirectoryReader) > . > You may wish to pay attention to the words about not closing readers > while they may still be in use. > > > -- > Ian. > > > On Tue, Sep 1, 2015 at 12:55 PM, Marcio Napoli > wrote: >

Re: IndexWriter is not closing the FDs (deleted files)

2015-09-01 Thread Marcio Napoli
ogger.error("Não foi possível inserir o Cidadao no índice Lucene \n" + cidadaoBean, e); e.printStackTrace(); } } Em seg, 31 de ago de 2015 às 21:13, Anton Zenkov escreveu: > Are you sure you are not holding open readers somewhere? > > On Mon, Aug 31, 2015 at 7:46 PM, Marcio Na

IndexWriter is not closing the FDs (deleted files)

2015-08-31 Thread Marcio Napoli
Hey! :) It seems IndexWriter is not closing the descriptors of the removed files, see the log below. Thanks, Napoli [root@server01 log]# ls -l /proc/59491/fd | grep index l-wx--. 1 wildfly wildfly 64 Ago 31 11:26 429 -> /usr/local/wildfly-2.0/standalone/data/index/cidadao/write.lock lr-x---

Re: indexing json

2014-09-04 Thread Marcio Napoli
Hey! Elasticsearch Is a good option and uses Lucene as core :) http://www.elasticsearch.org/overview/elasticsearch/ []s Napoli http://numere.stela.org.br 2014-09-04 7:46 GMT-03:00 Larry White : > Hi, > > Is there a way to index an entire json document automatically as one can do > with the

Re: MAX_TERM_LENGTH

2014-02-13 Thread Marcio Napoli
Thanks for note, Marcio Napoli Go beyond Apache Lucene(tm) features with Numere(R) http://numere.stela.org.br 2014-02-13 14:56 GMT-02:00 Michael McCandless : > You can use IndexReader.getBinaryDocValues(field). > > BTW your site should reference *Apache* Lucene, not just Lucene.

Re: MAX_TERM_LENGTH

2014-02-13 Thread Marcio Napoli
Hey Mike, I need quick access to values per document. The use of binary values is possible via doc FieldCache -> FieldCacheSource.getValues ()? Thanks, Marcio Napoli Go beyond Lucene(tm) features with Numere(R) http://numere.stela.org.br 2014-02-13 13:16 GMT-02:00 Michael McCandless : &g

MAX_TERM_LENGTH

2014-02-13 Thread Marcio Napoli
Hi All, I have a need to work with big terms. So the 32k is not enough. How can i increase the maximum size of a term? Found in the IndexWriter MAX_TERM_LENGTH constant, which refers to FieldCache and DocumentsWriterPerThread (BYTE_BLOCK_SIZE-2). Thanks, Marcio Napoli Go beyond Lucene(tm

Re: Result Group Ordering

2014-01-29 Thread Marcio Napoli
Hi Maturu, The Numere API allows you grouping, count distinct values, sort, and other operations SQL / OLAP. See: http://numere.stela.org.br/examples.html Good lock! Marcio Napoli 2014-01-29 srineal maturu > Hi,I have been working on lucene for past 2 months. I have been working

Re: Building term frequency matrix over 6 million documents...

2014-01-24 Thread Marcio Napoli
Hi! I believe the approach below can help you. http://svn.apache.org/repos/asf/lucene/dev/trunk/lucene/misc/src/java/org/apache/lucene/misc/HighFreqTerms.java Marcio http://numere.stela.org.br Go beyond Lucene™ features with Numere® 2014/1/24 Witdouck, Xavier > Hi all, > > We have over 6 m

Re: in-memory terms dictionary/Lucene-3069

2014-01-23 Thread Marcio Napoli
orField, to return FSTPostingFormat. > > Then set that Codec on the IndexWriterConfig you pass to IndexWriter. > > Mike McCandless > > http://blog.mikemccandless.com > > > On Wed, Jan 22, 2014 at 10:54 AM, Marcio Napoli > wrote: > > Thanks Mike! > > > &g

Re: in-memory terms dictionary/Lucene-3069

2014-01-22 Thread Marcio Napoli
gt; Mike McCandless > > http://blog.mikemccandless.com > > > On Wed, Jan 22, 2014 at 6:38 AM, Marcio Napoli > wrote: > > Hey! :) > > > > How could i use this new reader/writer ? > > > > https://issues.apache.org/jira/browse/LUCENE-3069 > &g

in-memory terms dictionary/Lucene-3069

2014-01-22 Thread Marcio Napoli
Hey! :) How could i use this new reader/writer ? https://issues.apache.org/jira/browse/LUCENE-3069 Thanks, Marcio Go beyond Lucene™ features with Numere® http://numere.stela.org.br

Removing/Adding term to one field by docId

2014-01-17 Thread Marcio Napoli
ter.removeTerm (docId, field, term); writer.addTerm (docId, field, term); Thanks, Marcio Napoli napoli.mar...@gmail.com http://numere.stela.org.br/

Removing/Adding term to one field by docId

2014-01-17 Thread Marcio Napoli
ter.removeTerm (docId, field, term); writer.addTerm (docId, field, term); Thanks, Marcio Napoli napoli.mar...@gmail.com http://numere.stela.org.br/