Dear Hesham, Thanks for sharing this paper with us. It looks interesting! We are going to organize another Inter-DC AI side meeting in Dublin. I will check with the authors for their availabilities.
Kind regards David From: Hesham ElBakoury <helbako...@gmail.com> Sent: Thursday, October 10, 2024 6:40 PM To: Dirk Trossen <dirk.trossen=40huawei....@dmarc.ietf.org> Cc: rtgwg <rtgwg@ietf.org> Subject: Re: Sidemeeting on "Inter-DC AI: Requirements and Challenges" on July 25th 5pm to 6.16pm This paper [1] is published in ICDCS 2024. Its title "AdapCC: Making Collective Communication in Distributed Machine Learning Adaptive". It is relevant to the discussion of this sidemeeting. You may think of inviting the authors to present this paper in future Inter-DC AI sidemeeting. Hesham [1] https://i.cs.hku.hk/~cwu/papers/xyzhao-icdcs24.pdf On Fri, Jul 19, 2024, 2:55 AM Dirk Trossen <dirk.trossen=40huawei....@dmarc.ietf.org<mailto:40huawei....@dmarc.ietf.org>> wrote: Dear all, We are organizing a side meeting on “Inter-DC AI: Requirements and Challenges” on Thursday 25th from 5pm to 6.15pm in Prince of Wales/Oxford. The backdrop for this side meeting is the significant growth in demands for computing and networking resources for (Large data) AI, where boundaries for future growth are being set through power consumption (including location of the DC), space and cooling tech as well as complexity and cost. This has led to the emerging view that inter-DC AI computation is the way forward to overcome those local DC challenges. This side meeting at IETF120 will start the needed dialogue to identify and discuss key challenges on (1) congestion control to improve goodput across DCs, (2) efficient support for cross-DC collective communication primitives, (3) obtaining suitable knowledge of network topologies across underlays to segregate training tasks across DCs, (4) utilizing new congestion control mechanisms to optimize large-scale inferencing traffic from millions of clients to few Points of Presence (PoPs), as well as (5) providing suitable means to attest for secure and private transfer of needed training input data from customers to AI training providers. The side meeting will feature presentations to tease out key insights from various players in this field, leaving dedicated time for discussing possible next and concrete steps in the IETF to move forward, with the following agenda Room: Prince of Wales/Oxford Agenda: 17.00 - 17.05: Settling & Introduction (Luigi Iannone, Huawei) 17.05 - 17.15: Inter-DC AI: Requirements & Opportunities (Dirk Trossen, Huawei) 17.15 - 17.25: On Congestion Control (Michael Welzl) 17.25 - 17.35: On Attestation (Ramki Krishan, Intel) 17.35 - 17.45: On Collective Communication (Kehan Yao, China Mobile) 17.45 - 18.15: Discussion on Next Steps (Dirk Trossen, Huawei) Meeting link: https://ietf.webex.com/meet/ietfsidemeeting2 Github link (with presentation slides coming next week): https://github.com/dirk-trossen-huawei/ietf120_inter-dc_ai We hope to see you on 25th for a fruitful and engaging discussion! Best, Dirk Trossen, Luigi Iannone, David Lou _______________________________________________ rtgwg mailing list -- rtgwg@ietf.org<mailto:rtgwg@ietf.org> To unsubscribe send an email to rtgwg-le...@ietf.org<mailto:rtgwg-le...@ietf.org>
_______________________________________________ rtgwg mailing list -- rtgwg@ietf.org To unsubscribe send an email to rtgwg-le...@ietf.org