Re: Indexing documents using ExtractHandler returning metadata

2023-02-08 Thread Sergio García Maroto
Found the solution. We have to set captureAttr=true in solrconfig.xml, together with uprefix=_ignored:

DocContentS false true ignored_ -MM-dd

After that, in schema.xml, add:

On Fri, 3 Feb 2023 at 16:50, Sergio García Maroto wrote:
> Hi,
>
> I am indexing documents using Tika and the ExtractRequest handler.
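For readers who want to try these settings without editing solrconfig.xml, here is a minimal sketch that passes the same Solr Cell parameters on the request URL instead. The parameters `captureAttr`, `uprefix`, `fmap.content`, and `literal.id` are standard ExtractingRequestHandler parameters. Everything else is an assumption for illustration: the base URL, the core name `mycore`, the document id, the helper name `build_extract_url`, and the choice of `ignored_` as the prefix (taken from the value that survives in the message above, and assuming the schema has a matching `ignored_*` dynamic field that is neither indexed nor stored).

```python
from urllib.parse import urlencode


def build_extract_url(solr_base, core, doc_id):
    """Build a Solr Cell (/update/extract) URL using the settings from this thread.

    Hypothetical helper: solr_base, core and doc_id are placeholders.
    """
    params = {
        # Unique key for the uploaded document (placeholder value).
        "literal.id": doc_id,
        # Assumed mapping: send Tika's extracted body text to DocContentS.
        "fmap.content": "DocContentS",
        # Capture attributes of Tika XHTML elements into separate fields.
        "captureAttr": "true",
        # Prefix fields unknown to the schema so that an ignored_* dynamic
        # field (assumed to exist in schema.xml) can swallow them.
        "uprefix": "ignored_",
        "commit": "true",
    }
    return f"{solr_base}/{core}/update/extract?{urlencode(params)}"


if __name__ == "__main__":
    # Print the URL; the file itself would be sent as the POST body.
    print(build_extract_url("http://localhost:8983/solr", "mycore", "doc1"))
```

The file would then be POSTed to that URL, for example with `curl -F "myfile=@document.pdf"`. Whether this keeps the metadata out of DocContentS depends on the Solr and Tika versions in use, so it is worth checking the indexed field after a test upload.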

Indexing documents using ExtractHandler returning metadata

2023-02-03 Thread Sergio García Maroto
Hi,

I am indexing documents using Tika and the ExtractRequest handler.

DocContentS false -MM-dd

After indexing, I see that my field DocContentS contains not only the text of the documents but also metadata, like:

\n \n stream_size 204286 \n X-Parsed-By org.apache.tika.parser.DefaultParser \n X-P