Hi, On Sun, 2025-05-04 at 15:25 +0200, Matthias Urlichs wrote: >  On 04.05.25 14:27, Aigars Mahinovs wrote: >  > > The simple fact that none of the LLMs have been sued out of > > existence by any copyright owner is de facto proof that it does not > > work that way in the eyes of the judicial system. > > That may or may not be correct in the long run, IANAL and all that. > > However. Copyright is only one aspect of whether or not models should > end up in main. Plain old reproducibility is important to us too.
What is not reproducible (in the reproducible build sense Debian uses) about, say, the Tesseract OCR models? Compared to say a pre-processed photograph (using non-free in-camera firmware) of a building or landscape (which can't be shipped in main). You can change the models against a different one, just as you can replace a photo with a different one. But without the building readily available, it is hard to change perspective or other changes that would be possible if the "source" was available. Ansgar