Re: ostree/fedora atomic and impact on the mirror network

Colin Walters Mon, 10 Mar 2014 07:16:11 -0700

On Mon, Mar 10, 2014 at 9:19 AM, Matthew Miller <mat...@fedoraproject.org> wrote:

So, I've been thinking about Colin Walters' ostree project (Fedora Atomic Initiative) -- see <http://sched.co/1eVhZ05>. One of the concerns I have is with requirements on the mirror network. Right now, the impact on mirrors of an update is a few metadata requests plus one per package. It seems like ostree could be significantly worse, with requests _per file_.

Yep, it's definitely true that OSTree's HTTP replication can be worse than yum/rpm/deltarpm in many scenarios. (There are scenarios where current OSTree is better too.) Static deltas are the ultimate solution here, and initial code already exists (see https://bugzilla.gnome.org/show_bug.cgi?id=721799 ).
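
To make the difference concrete, here is a rough back-of-the-envelope sketch of the two request models described above. The function names, the default of three metadata requests, and the package and file counts are all made-up illustrations, not measurements; "one request per changed file" is also a simplification of what OSTree actually fetches.

```python
def per_package_requests(changed_packages, metadata_requests=3):
    """yum/rpm-style model: a few metadata requests plus one per package."""
    return metadata_requests + changed_packages


def per_file_requests(changed_files, metadata_requests=3):
    """Per-file model: a few metadata requests plus one per changed file."""
    return metadata_requests + changed_files


# Hypothetical update: 20 packages, each touching 150 files.
packages = 20
files_per_package = 150

print(per_package_requests(packages))                   # 23
print(per_file_requests(packages * files_per_package))  # 3003
```

Under these assumed numbers the per-file model issues over a hundred times as many requests, which is the gap static deltas are meant to close.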

Now, a few things. First, the current goal of Fedora Atomic Initiative is just to track Rawhide - I was talking with Dennis Gilmore at devconf.cz and we felt this made the most sense rather than trying to jump all the way to releases. So the idea here is that it's for users who are already updating weekly or faster.

Tracking Rawhide plays into the other strengths of OSTree, such as the fact that after you upgrade, you still have the previous tree around to fall back on if things are broken.

Now, let's talk about space usage on the mirror network. A *very* interesting question is how much tree history we keep. A lot of this is a function of how many trees we generate (at the moment, I just made up some "baseline" products) as well as how often the packages in those trees change.

One model I'd like to aim for here is we say "the repository will take up at most N GB" (where e.g. N=100) and we keep an intelligently-scheduled series of snapshots, like backup systems do. We don't need to keep every change to every RPM, just interesting ones - keep only a few old snapshots from last year, plus a few from each month this year, plus many from this week. OSTree has some very simple support for pruning already (ostree prune --refs-only --depth=100); a max-size model would be harder but is doable.
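
As a minimal sketch of the backup-style schedule described above (many snapshots from this week, a few per month this year, a few per older year), assuming snapshots are identified only by their date: the function name select_snapshots and its parameters are hypothetical, this is not how ostree prune works, and it caps snapshot counts rather than enforcing the "at most N GB" size limit.

```python
from collections import defaultdict
from datetime import date, timedelta


def select_snapshots(snapshots, today, recent_days=7, per_month=2, per_year=2):
    """Return the set of snapshot dates to keep (hypothetical policy)."""
    keep = set()
    by_month = defaultdict(list)  # older snapshots from the current year
    by_year = defaultdict(list)   # snapshots from previous years

    for snap in snapshots:
        if (today - snap).days < recent_days:
            keep.add(snap)                      # keep everything recent
        elif snap.year == today.year:
            by_month[snap.month].append(snap)
        else:
            by_year[snap.year].append(snap)

    # Keep only the newest few snapshots in each bucket.
    for bucket in by_month.values():
        keep.update(sorted(bucket, reverse=True)[:per_month])
    for bucket in by_year.values():
        keep.update(sorted(bucket, reverse=True)[:per_year])
    return keep


# Example: one snapshot per day from 2013-01-01 through 2014-03-10.
start, today = date(2013, 1, 1), date(2014, 3, 10)
snapshots = [start + timedelta(days=i) for i in range((today - start).days + 1)]
kept = select_snapshots(snapshots, today)
print(len(snapshots), "snapshots, keeping", len(kept))
```

Everything not in the returned set would be a candidate for pruning. A size-capped variant would additionally need per-snapshot disk usage, which is complicated by objects being shared between trees.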

Another thing I've been thinking about is that there should likely be separate "development" and "release" repositories.

And the "release" repository would be synced out to more mirrors. This repo might contain just each "gold" release, plus the intermediate alpha/beta snapshots, plus, say, monthly update snapshots.

In this model, the release repository would be a separate composition from the development repo - it would reprocess the same RPM versions, and would require re-GPG-signing, etc.


So an offhand TODO list for production releases:

- Anaconda support (working on it)
- Move rpm-ostree into Koji
  - Requires RHEL7 or newer build host
  - Write Koji plugin
  - GPG signing (or TLS for metadata)
- Static deltas (initial code exists, needs HTTP/GPG plus optimization)
- Determine mirror impact
  - Space availability

- Determine whether some mirrors would want to opt out of higher HTTP load

-- 
devel mailing list
devel@lists.fedoraproject.org
https://admin.fedoraproject.org/mailman/listinfo/devel
Fedora Code of Conduct: http://fedoraproject.org/code-of-conduct
