Jon Stewart via Mastodon recently pointed me to thie: https://github.com/yobix-ai/extractous
It uses Tika compiled on graalvm via Rust for file formats not natively supported in rust. Cheers, Tim [0] https://mastodon.social/@codeslack@infosec.exchange/114212821913586576