Award Abstract # 1911104
CNS Core: Small: Designing Efficient Cloud Datacenter Network Fabrics

NSF Org: CNS
Division Of Computer and Network Systems
Recipient: UNIVERSITY OF CALIFORNIA, SAN DIEGO
Initial Amendment Date: August 5, 2019
Latest Amendment Date: August 5, 2019
Award Number: 1911104
Award Instrument: Standard Grant
Program Manager: Darleen Fisher
dlfisher@nsf.gov
 (703)292-8950
CNS
 Division Of Computer and Network Systems
CSE
 Direct For Computer & Info Scie & Enginr
Start Date: September 1, 2019
End Date: August 31, 2023 (Estimated)
Total Intended Award Amount: $500,000.00
Total Awarded Amount to Date: $500,000.00
Funds Obligated to Date: FY 2019 = $500,000.00
History of Investigator:
  • Alex Snoeren (Principal Investigator)
    snoeren@cs.ucsd.edu
  • George Porter (Co-Principal Investigator)
Recipient Sponsored Research Office: University of California-San Diego
9500 GILMAN DR
LA JOLLA
CA  US  92093-0021
(858)534-4896
Sponsor Congressional District: 50
Primary Place of Performance: University of California-San Diego
CA  US  92093-0404
Primary Place of Performance
Congressional District:
50
Unique Entity Identifier (UEI): UYTTZT6G9DT1
Parent UEI:
NSF Program(s): Networking Technology and Syst
Primary Program Source: 01001920DB NSF RESEARCH & RELATED ACTIVIT
Program Reference Code(s): 7923
Program Element Code(s): 736300
Award Agency Code: 4900
Fund Agency Code: 4900
Assistance Listing Number(s): 47.070

ABSTRACT

Cloud datacenter networks are tasked with providing connectivity between an ever-increasing number of end hosts whose link rates improve by orders of magnitude every few years. What network operators would ideally like is a single, full-bandwidth switch that could connect every endpoint at full rate. Such an idealized network would enable them to place jobs and data where it is convenient, without worrying about bandwidth bottlenecks, hotspots, and other network-induced limitations. Unfortunately, preserving this ``big-switch'' illusion of a single network with full bandwidth is increasingly cost prohibitive and likely soon infeasible.

This project will explore an alternative method of constructing datacenter network fabrics based upon a provably optimal topological construct, an expander graph. If successful, the project will result in network fabrics that are more flexible, capable, and scalable than existing state-of-the-art approaches. This project will develop a family of cloud datacenter network topologies based on expander graphs that eliminate the capacity bottlenecks inherent in hierarchical Clos-based topologies while minimizing the bandwidth tax incurred due to indirect routing. A single, large expander-graph network topology can be constructed out of multiple, disjoint expander graphs; this project will show how judicious tenant placement can then provide both isolation and dynamic capacity while minimizing the bandwidth tax. Moreover, by employing reconfigurable network components (i.e., circuit switches), it is even possible to evolve the set of constituent expander graphs over various time scales, allowing cloud datacenter operators to better suit the needs of their current tenants. Indeed, if the timescales are sufficiently small (e.g., 100s of milliseconds) tenants may then choose to buffer traffic until a particularly favorable (set of) path(s) is available, further decreasing the overall bandwidth inefficiency or "tax". If the network topology evolves at a rapid rate, it is possible to choose, on a per-packet basis, whether to either (1) immediately send a packet over whatever static expander is currently instantiated, incurring a modest tax on this small fraction of traffic, or (2) buffer the packet and wait until a direct link is established to the ultimate destination, eliminating the bandwidth tax on the vast majority of bytes.

This project will engage graduate and undergraduate students through structured courses, intense mentorship, and hands-on research activities through participation in the NSF-funded UC San Diego Early Research Scholars Program (ERSP).

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

PUBLICATIONS PRODUCED AS A RESULT OF THIS RESEARCH

Note:  When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Guo, Yibo and Mellette, William M. and Snoeren, Alex C. and Porter, George "Scaling beyond packet switch limits with multiple dataplanes" CoNEXT '22: Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies , 2022 https://doi.org/10.1145/3555050.3569141 Citation Details
Grant, Stewart and Snoeren, Alex C. "In-network Contention Resolution for Disaggregated Memory" Proceedings of the Workshop on Resource Disaggregation and Serverless (WORDS) , 2021 Citation Details
William M. Mellette, Rajdeep Das "Expanding across time to deliver bandwidth efficiency and low latency" USENIX Symposium on Networked Systems Design and Implementation (NSDI) , 2020 Citation Details
Grant, Stewart and Yelam, Anil and Bland, Maxwell and Snoeren, Alex C. "SmartNIC Performance Isolation with FairNIC: Programmable Networking for the Cloud" Proceedings of the Annual conference of the ACM Special Interest Group on Data Communication on the applications, technologies, architectures, and protocols for computer communication , 2020 10.1145/3387514.3405895 Citation Details
Chen, Shawn Shuoshuo and Wang, Weiyang and Canel, Christopher and Seshan, Srinivasan and Snoeren, Alex C. and Steenkiste, Peter "Time-division TCP for reconfigurable data center networks" Proceedings of the ACM SIGCOMM 2022 Conference , 2022 https://doi.org/10.1145/3544216.3544254 Citation Details

PROJECT OUTCOMES REPORT

Disclaimer

This Project Outcomes Report for the General Public is displayed verbatim as submitted by the Principal Investigator (PI) for this award. Any opinions, findings, and conclusions or recommendations expressed in this Report are those of the PI and do not necessarily reflect the views of the National Science Foundation; NSF has not approved or endorsed its content.

Cloud datacenter networks are tasked with providing connectivity between an ever-increasing number of end hosts whose link rates improve by orders of magnitude every few years.  Unfortunately the speed of individual datacenter switches and routers has not kept pace with the increases in bandwidth at the end points, meaning that it isn't possible to directly upgrade the network as in the past.  This project sought to explore alternative network design paradigms that can improve the performance of future datacenter network fabrics.  Concretely, we considered the potential of topologies based on expander graphs that eliminate the capacity bottlenecks inherent in today's hierarchical Clos-based topologies.

Our efforts resulted in multiple major publications each containing significant research results.  Two of the major outcomes are described in two papers, one that appeared at CoNEXT 2022 and another at SIGCOMM 2022.

In our CoNEXT 2022 paper we observe that modern datacenter operators are deploying  parallel fabric designs in the core of the network that deliver 𝑁-times the bandwidth by simply forwarding traffic over any of 𝑁 parallel network fabrics. In our work, we considered extending this parallel network idea all the way to the end host, and leveraging the fact that there are now multiple networks to deploy heterogeneous parallel network fabrics that can deliver even higher bandwidth, lower latency, and improved resiliency than traditional designs constructed from the same constituent components.

Our initial studies found that direct application of existing path selection and forwarding techniques resulted in poor performance. Instead, we showed that appropriate path selection and forwarding protocols can not only improve the performance of existing, homogeneous parallel fabrics, but fully realize the potential of heterogenous fabrics.

Our SIGCOMM 2022 builds on the theme, but considers the challenges of an alternate approach where, rather than deploying multiple, heterogenous networks in parallel, an operator constructs a single network that is reconfigurable--i.e., the set of connects can vary over time.  Prior research has shown that providing multiple time-varying paths can improve network capacity and lower physical latency. However, our work shows that existing TCP variants are ill-suited to utilize available capacity because their congestion control cannot react quickly enough to drastic variations in bandwidth and latency. We developed Time-division TCP (TDTCP), a new TCP variant designed for reconfigurable data center networks. TDTCP recognizes that communication in these fabrics happens over a set of paths, each having its own physical characteristics and cross traffic. TDTCP multiplexes each connection across multiple independent congestion states--one for each distinct path--while managing connection-wide tasks in a shared fashion.  It leverages network support to receive timely notification of path changes and promptly matches its local view to the current path. We implementedTDTCP in the Linux kernel. Results on an emulated network show that TDTCP improves throughput over both traditional TCP variants, such as DCTCP and CUBIC, and multipath TCP by 24-41% without requiring significant in-network buffering to hide path variations.


Last Modified: 10/11/2023
Modified by: Alex C Snoeren

Please report errors in award information by writing to: awardsearch@nsf.gov.

Print this page

Back to Top of page