典型文献
Analyzing and Optimizing Packet Corruption in RDMA Network
文献摘要:
Remote direct memory access(RDMA)has become one of the state-of-the-art high-performance network technologies in datacenters.The reliable transport of RDMA is designed based on a lossless underlying network and cannot endure a high packet loss rate.However,except for switch buffer overflow,there is another kind of packet loss in the RDMA network,i.e.,packet corruption,which has not been discussed in depth.The packet corruption incurs long application tail latency by causing timeout retransmissions.The challenges to solving packet corruption in the RDMA network include:1)packet corruption is inevitable with any remedial mechanisms and 2)RDMA hardware is not programmable.This paper proposes some designs which can guarantee the expected tail latency of applications with the existence of packet corruption.The key idea is controlling the occurring probabilities of timeout events caused by packet corruption through transforming timeout retransmissions into out-of-order retransmissions.We build a probabilistic model to estimate the occurrence probabilities and real effects of the corruption patterns.We implement these two mechanisms with the help of programmable switches and the zero-byte message RDMA feature.We build an ns-3 simulation and implement optimization mechanisms on our testbed.The simulation and testbed experiments show that the optimizations can decrease the flow completion time by several orders of magnitudes with less than 3%bandwidth cost at different packet corruption rates.
文献关键词:
中图分类号:
作者姓名:
Yi-Xiao Gao;Chen Tian;Wei Chen;Duo-Xing Li;Jian Yan;Yuan-Yuan Gong;Bing-Quan Wang;Tao Wu;Lei Han;Fa-Zhi Qi;Shan Zeng;Wan-Chun Dou;Gui-Hai Chen
作者机构:
State Key Laboratory for Novel Software Technology,Nanjing University,Nanjing 210046,China;Huawei Technologies Co.Ltd,Nanjing 210012,China;Institute of High Energy Physics,Chinese Academy of Sciences,Beijing 100190,China
文献出处:
引用格式:
[1]Yi-Xiao Gao;Chen Tian;Wei Chen;Duo-Xing Li;Jian Yan;Yuan-Yuan Gong;Bing-Quan Wang;Tao Wu;Lei Han;Fa-Zhi Qi;Shan Zeng;Wan-Chun Dou;Gui-Hai Chen-.Analyzing and Optimizing Packet Corruption in RDMA Network)[J].计算机科学技术学报(英文版),2022(04):743-762
A类:
Corruption,datacenters,timeout,retransmissions
B类:
Analyzing,Optimizing,Packet,RDMA,Network,Remote,direct,memory,access,has,become,one,state,art,high,performance,network,technologies,reliable,transport,designed,lossless,underlying,cannot,endure,packet,However,except,buffer,overflow,there,another,kind,corruption,which,been,discussed,depth,incurs,long,tail,latency,causing,challenges,solving,include,inevitable,any,remedial,mechanisms,hardware,programmable,This,paper,proposes,some,designs,guarantee,expected,applications,existence,key,idea,controlling,occurring,probabilities,events,caused,through,transforming,into,We,build,probabilistic,model,estimate,occurrence,real,effects,patterns,implement,these,help,switches,zero,byte,message,feature,simulation,our,testbed,experiments,show,that,optimizations,decrease,completion,several,orders,magnitudes,than,bandwidth,cost,different,rates
AB值:
0.495193
相似文献
机标中图分类号,由域田数据科技根据网络公开资料自动分析生成,仅供学习研究参考。