Thank you so much, Maruan! @Andreas Lehmkuehler <andr...@lehmi.de>, yes, the files (or the URLs) were derived from govdocs1, commoncrawl, or some custom crawlers for bug trackers. All of it was publicly available and crawlable at some point. However, as you well know, there are still issues...
Thank you, all!

On Thu, Jan 9, 2025 at 1:52 AM Andreas Lehmkühler <andr...@lehmi.de.invalid> wrote:

> Hi,
>
> I agree with Maruan. :-(
>
> Just out of curiosity, the origin source of those files is some public
> webserver, isn't it?
>
> Andreas
>
> On 09.01.25 at 05:27, Maruan Sahyoun wrote:
> > Hi,
> >
> > this is unfortunate but as this is posing the risk of legal actions to
> > the ASF but also to me hosting the site I think we should stop that.
> >
> > BR
> > Maruan
> >
> >> On 09.01.2025 at 02:37, Tim Allison <talli...@apache.org> wrote:
> >>
> >> All,
> >> We've gotten a handful of takedown requests recently. I had initially
> >> envisioned public sharing of files as a key component of our server. We can
> >> still use the files and offer read access to fellow file researchers. I'm
> >> not sure I want to deal with further takedown requests.
> >> As an intermediate step, we could ask robots not to crawl the data, but
> >> that's not reliable.
> >> So, in lieu of that, with heavy heart, I ask if it is time to close off
> >> public access?
> >> WDYT?
> >>
> >> Best,
> >>
> >> Tim