> On Jun 22, 2024, at 6:59 AM, Marcus <marcus.m...@wtnet.de> wrote:
>
> On 22.06.24 at 14:53, Bidouille wrote:
>>> I remember from the old days that the QA team at Sun/Oracle really had a
>>> lot of documents for general and special testing.
>>>
>>> These were not part of the code repository and were loaded from their
>>> own test software. Maybe this is the link to the storage outside of
>>> the project.
>> If you have a URL, you can try to retrieve it with the Wayback Machine
>> https://wayback-api.archive.org/
>
> They were stored on an internal server.
The Apache Tika and Apache POI projects make use of Common Crawl to create a
large corpus for regression tests.
https://commoncrawl.org
Perhaps we can start to do the same? We can ask for help from Tika at
d...@tika.apache.org or POI at d...@poi.apache.org
Best,
Dave
>
> Marcus
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: dev-unsubscr...@openoffice.apache.org
> For additional commands, e-mail: dev-h...@openoffice.apache.org
>