Thanks to a recommendation from a user and the developer of datasette, I configured the proxy correctly so that this now works:
https://corpora.tika.apache.org/datasette/ Make sure to include the final /. https://corpora.tika.apache.org/datasette does not work. Cheers, Tim