Hello, Sorry I did not believe that the precise values were relevant but here they are. The average cumulative reward (score) of the agent for exactly the same script and using Guix for the environment is 1658.3733235021457 on Arch Laptop and 1820.325441905902 on the Ubuntu one. But I think due to the feedback loop of such simulator (if there is a small difference in the action at time t, it can imply a large difference at the end of the process) this could be due to a small difference in the computations.
Best, Timothée ----- Mail original ----- > De: "Rutherther" <ruthert...@ditigal.xyz> > À: "Timothee Mathieu" <timothee.math...@inria.fr>, "help-guix" > <help-guix@gnu.org> > Envoyé: Mercredi 30 Avril 2025 22:47:47 > Objet: Re: Reproducibility of guix shell container across different host OS > Hi Timothee > > Timothee Mathieu <timothee.math...@inria.fr> writes: > >> >> Doing so, we noticed that the results were indeed reproducible between two >> Ubuntu computer (one is a laptop, the other a server). However, when trying >> the >> exact same command with the exact same channels file (with fixed commit) on >> some Arch-Linux laptop, the result was different. We did the test on two Arch >> laptops and the results were are reproducible but with a different value from >> the Ubuntu ones. All the considered laptops and servers have different kernel >> but this doesn't seem to be the problem because Ubuntu is reproducible with >> Ubuntu and Arch reproducible with Arch. >> Moreover, the difference is not small, which is weird because in the script >> we >> fix the random seed. Do you have any idea why there is a difference? > > So how does the difference look like exactly? There is not much to go on > without > that. > > Regards > Rutherther