This came up recently for me too. Same issue. I can maybe implement this later today
On Mon, Mar 7, 2022, 3:08 PM Tim Allison <[email protected]> wrote: > Yes please. We can add a limiting MetadataFilter. > > On Mon, Mar 7, 2022 at 8:39 AM Julien Massiera < > [email protected]> wrote: > > > Hi Tim, > > > > > > > > We identified cases where pdf files may contain abnormaly big metadata > > (several MB, be it for the metadata values, the metadata names, but also > > for > > the total amount of metadata). Some time ago, I proposed the creation of > a > > "writeLimit" header in Tika Server (and you accepted to implement it, > > thanks > > for that) on the /rmeta endpoint. We think it would make sense to have an > > equivalent for the metadata content, e.g. to avoid potential OOMs. > > > > Would you think it is worth it that I create a ticket for this feature ? > > > > > > > > Regards, > > > > Julien > > > > > > > > > > > > >
