Conference paper
Exploiting spatial parallelism in Ethernet-based cluster interconnects
In this work we examine the implications of building a single logical link out of multiple physical links. We use MultiEdge to examine the throughput-CPU utilization tradeoffs and to study how overheads and performance scale with the number and speed of links. We use low-level instrumentation to understand the associated overheads, experiment with setups of one to eight 1-GBit/s links, and contrast our results with a single 10-GBit/s link.
We find that: (a) our base protocol achieves up to 65% of the nominal aggregate throughput; (b) replacing interrupts with polling significantly affects only the multiple-link configurations, which reach 80% of nominal throughput; (c) the impact of copying on CPU overhead is significant, and removing copying yields up to a 66% improvement in maximum throughput, reaching almost 100% of the nominal throughput; and (d) scheduling packets over heterogeneous links requires simple but dynamic scheduling to account for different link speeds and varying load.
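Finding (d) can be illustrated with a small sketch. The abstract does not describe the scheduler MultiEdge actually uses, so the policy below (send each packet on the link with the earliest estimated finish time) is only one plausible instance of "simple but dynamic" scheduling. The `Link` class, its fields, and the `pick_link`, `send`, and `drain` helpers are all hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass
class Link:
    """One physical link; all fields are illustrative, not from the paper."""
    name: str
    speed_bps: float          # nominal link speed in bits per second
    queued_bits: float = 0.0  # bits handed to this link and not yet sent

    def finish_time(self, packet_bits: float) -> float:
        """Estimated seconds until a new packet would finish on this link."""
        return (self.queued_bits + packet_bits) / self.speed_bps


def pick_link(links, packet_bits):
    """Choose the link with the earliest estimated finish time."""
    return min(links, key=lambda link: link.finish_time(packet_bits))


def send(links, packet_bytes):
    """Assign one packet to a link and account for its queued bits."""
    bits = packet_bytes * 8
    link = pick_link(links, bits)
    link.queued_bits += bits
    return link


def drain(links, elapsed_s):
    """Model transmission progress: each link sends speed * elapsed bits."""
    for link in links:
        sent = link.speed_bps * elapsed_s
        link.queued_bits = max(0.0, link.queued_bits - sent)


if __name__ == "__main__":
    # Assumed heterogeneous setup: one 10-GBit/s and one 1-GBit/s link.
    links = [Link("fast", 10e9), Link("slow", 1e9)]
    counts = {"fast": 0, "slow": 0}
    for _ in range(1100):
        counts[send(links, 1500).name] += 1
    # Packets end up split roughly in proportion to link speed.
    print(counts)
```

Because the decision consults the current queue of every link for each packet, the split adapts both to different link speeds and to load that changes over time, without any static per-link weights.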
Language: | English |
---|---|
Publisher: | IEEE Computer Society Press |
Year: | 2008 |
Pages: | 1-8 |
Proceedings: | 2008 IEEE International Symposium on Parallel and Distributed Processing |
ISBN: | 1424416930 , 1424416949 , 9781424416936 and 9781424416943 |
ISSN: | 23321237 and 15302075 |
Types: | Conference paper |
DOI: | 10.1109/IPDPS.2008.4536145 |
ORCIDs: | Karlsson, Sven |
Central Processing Unit, Computer science, Costs, Dynamic scheduling, Ethernet networks, Ethernet-based cluster interconnects, High performance computing, MultiEdge, Parallel processing, Power system interconnection, Protocols, Throughput, heterogeneous links, local area networks, multiple physical links, protocols, scheduling, scheduling packets, spatial parallelism, throughput-CPU utilization tradeoffs