An O(log2N) parallel algorithm for output queuing

INFOCOM(2002)

引用 14|浏览10
暂无评分
摘要
Output queued switches are appealing because they have better latency and throughput than input queued switches. However, they are difficult to build: a direct implementation of an N N output-queued switch requires the switching fabric and the packet memories at the outputs to run at N times the line rate. Attempts have been made to implement output queuing with slow components, e.g., by having memories at both inputs and outputs running at twice the line rate. In these approaches, even though the packet memory speed is reduced, the scheduler time complex- ity is high — at least ( N). We show that idealized output queuing can be simulated in a shared memory architecture with (3N 2) packet memories running at the line rate, using a scheduling algo- rithm whose time complexity is O(log2 N) on a parallel random access machine (PRAM). The number of processing elements and memory cells used by the PRAM are a small multiple of the size of the idealized switch.
更多
查看译文
关键词
parallel algorithm,time complexity,shared memory,parallel random access machine
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要