Improving Network Connection Locality on Multicore Systems

Abstract

Incoming and outgoing processing for a given TCP connection often execute on different cores: an incoming packet is typically processed on the core that receives the interrupt, while outgoing data processing occurs on the core running the relevant user code. As a result, accesses to read/write connection state (such as TCP control blocks) often involve cache invalidations and data movement between cores' caches. These can take hundreds of processor cycles, enough to significantly reduce performance. We present a new design, called Affinity-Accept, that causes all processing for a given TCP connection to occur on the same core. Affinity-Accept arranges for the network interface to determine the core on which application processing for each new connection occurs, in a lightweight way; it adjusts the card's choices only in response to imbalances in CPU scheduling. Measurements show that for the Apache web server serving static files on a 48-core AMD system, Affinity-Accept reduces time spent in the TCP stack by 30% and improves overall throughput by 24%.
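
The idea in the abstract can be illustrated with a small toy model. This is only a sketch under stated assumptions, not the authors' implementation: the class `ToySteering`, the flow-group table, the per-core accept queues, the CRC32 hash, and the queue-length rebalancing rule are all hypothetical names and choices made for illustration. It shows a "network card" that picks a core for each connection by hashing its 4-tuple, an accept step that runs on that same core, and a remapping step that is taken only when load is imbalanced.

```python
import zlib
from collections import deque

NUM_CORES = 4
NUM_FLOW_GROUPS = 16  # hypothetical: more groups than cores, so groups can be moved


class ToySteering:
    """Toy model: the NIC maps each connection to a core via a flow-group table."""

    def __init__(self, num_cores=NUM_CORES, num_groups=NUM_FLOW_GROUPS):
        self.num_cores = num_cores
        # Flow-group table: group index -> core, initially spread round-robin.
        self.group_to_core = [g % num_cores for g in range(num_groups)]
        # One accept queue per core (an assumption of this sketch).
        self.accept_queues = [deque() for _ in range(num_cores)]

    def flow_group(self, conn):
        # Hash the 4-tuple so every packet of a connection maps to one group.
        key = "{}:{}-{}:{}".format(*conn).encode()
        return zlib.crc32(key) % len(self.group_to_core)

    def core_for(self, conn):
        # The core the toy NIC would steer this connection's packets to.
        return self.group_to_core[self.flow_group(conn)]

    def on_new_connection(self, conn):
        # Queue the connection on the core that receives its packets.
        core = self.core_for(conn)
        self.accept_queues[core].append(conn)
        return core

    def accept(self, core):
        # Application code on `core` accepts only from its local queue.
        queue = self.accept_queues[core]
        return queue.popleft() if queue else None

    def rebalance(self):
        # Move one flow group from the core with the longest accept queue
        # to the core with the shortest one, and only if they differ by
        # at least two entries. Returns the moved group index, or None.
        lengths = [len(q) for q in self.accept_queues]
        busy = lengths.index(max(lengths))
        idle = lengths.index(min(lengths))
        if busy == idle or lengths[busy] - lengths[idle] < 2:
            return None
        for group, core in enumerate(self.group_to_core):
            if core == busy:
                self.group_to_core[group] = idle
                return group
        return None
```

In this model, `on_new_connection` followed by `accept` on the returned core keeps a connection on one core, and `rebalance` changes the table only when queue lengths diverge, which loosely mirrors the abstract's statement that the card's choices are adjusted only in response to imbalance.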


Bibliographic details

Source
ACM EuroSys Conference on Computer Systems, 2012, 14 pages
Authors
Aleksey Pesterev; Jacob Strauss; Nickolai Zeldovich; Robert T. Morris
Original format: PDF
Chinese Library Classification: Computing technology, computer technology
Keywords
Multi-core; Packet Processing; Cache Misses

