There are likely simpler answers if you want to tier entire buckets, but
it sounds like you are hosting one or more filesystems on NetApp and
want to tier those. It would be nice to have NetApp running Ceph as a
block store, but I don't think CRUSH is sophisticated enough to migrate
parts of a filesystem pool based on the age of the files and directories
in it. For one thing, I'm not sure the PGs in the pool can (or should)
be aware of such details, and you could easily end up with fragments of
a file spread across different PGs while much of each PG holds un-aged
data. So I'm not optimistic about that concept.
What that suggests to me is that you might use an overlay filesystem,
where the different tiers are layered over one another to present a
unified filesystem image. This is precisely what containers do, although
much of their goal is simply optimising shared image layers. A variation
on this is Copy-on-Write (COW), but what you want is more like the
reverse: instead of copying changed data up into the writable layer,
you'd push aged data down into the lower one.
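To make the shape of that concrete, here's a hypothetical overlay mount
driven from Python, with the NetApp mount as the writable upper layer
and a CephFS mount as the lower layer. All paths are made up, and note
that Linux overlayfs is picky about what it accepts as an upper layer,
so an NFS-mounted NetApp share may not qualify as-is:

import subprocess

# Assumed mount points; overlayfs requires workdir to live on the
# same filesystem as upperdir.
subprocess.run([
    "mount", "-t", "overlay", "overlay",
    "-o", "lowerdir=/mnt/cephfs/archive,"
          "upperdir=/mnt/netapp/data,"
          "workdir=/mnt/netapp/.overlay-work",
    "/mnt/unified",
], check=True)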
At any rate, a frontend overlay filesystem, with NetApp overlaying a
secondary Ceph system, seems like a plausible solution. Then all you'd
need is a mechanism to move aged-out files down into the Ceph layer.
That might even be a good use of rsync.
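Here's a minimal sketch of such a mover in Python; the paths, the age
threshold, and the use of a symlink as the left-behind stub are all
assumptions for illustration, not a tested recipe:

#!/usr/bin/env python3
# Walk the NetApp mount, relocate files whose atime is older than the
# cutoff to the Ceph-backed mount, and leave a symlink behind as the
# stub so the directory structure stays intact.
import os
import shutil
import time

NETAPP_ROOT = "/mnt/netapp/data"   # fast tier (assumed mount point)
CEPH_ROOT = "/mnt/cephfs/archive"  # slow tier (assumed mount point)
MAX_AGE = 2 * 365 * 24 * 3600      # "older than X years"; here X = 2

cutoff = time.time() - MAX_AGE

for dirpath, dirnames, filenames in os.walk(NETAPP_ROOT):
    for name in filenames:
        src = os.path.join(dirpath, name)
        if os.path.islink(src):
            continue  # already a stub, skip it
        if os.stat(src).st_atime >= cutoff:
            continue  # still hot, leave it on flash
        dst = os.path.join(CEPH_ROOT, os.path.relpath(src, NETAPP_ROOT))
        os.makedirs(os.path.dirname(dst), exist_ok=True)
        shutil.copy2(src, dst)  # copy data and metadata to the Ceph tier
        os.remove(src)
        os.symlink(dst, src)    # the stub: transparent recall via the link

A real tool would verify each copy before deleting the original, cope
with files being written while it runs, and log every move. rsync's
--remove-source-files flag gets you part of the way, but it doesn't
leave stubs behind.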
Tim
On 5/4/25 10:20, sacawulu wrote:
Hi all,
We're exploring solutions to offload large volumes of data (on the
order of petabytes) from our NetApp all-flash storage to our more
cost-effective, HDD-based Ceph storage cluster, based on criteria such
as a last access time older than X years.
Ideally, we would like to leave behind a 'stub' or placeholder file on
the NetApp side to preserve the original directory structure and
potentially enable some sort of transparent access or recall if
needed. This kind of setup is commonly supported by solutions like
DataCore/FileFly, but as far as we can tell, FileFly doesn’t support
Ceph as a backend and instead favors its own Swarm object store.
Has anyone here implemented a similar tiering/archive/migration
solution involving NetApp and Ceph?
We’re specifically looking for:
* Enterprise-grade tooling
* Stub file support or similar metadata-preserving offload
* Support and reliability (given the scale, we can’t afford data
loss or inconsistency)
* Either commercial or well-supported open source solutions
Any do’s/don’ts, war stories, or product recommendations would be
greatly appreciated. We’re open to paying for software or services if
it brings us the reliability and integration we need.
Thanks in advance!
MJ
_______________________________________________
ceph-users mailing list -- ceph-users@ceph.io
To unsubscribe send an email to ceph-users-le...@ceph.io