典型文献
A bi-metric autoscaling approach for n-tier web applications on kubernetes
文献摘要:
Container-based virtualization techniques are beco-ming an alternative to traditional virtual machines,due to less overhead and better scaling.As one of the most widely used open-source container orchestration systems,Kubernetes provi-des a built-in mechanism,that is,horizontal pod autoscaler(HPA),for dynamic resource provisioning.By default,scaling pods only based on CPU utilization,a single performance metric,HPA may create more pods than actually needed.Thr-ough extensive measurements of a containerized n-tier appli-cation benchmark,RUBBoS,we find that excessive pods consume more CPU and memory and even deteriorate response times of applications,due to interference.Furthermore,a Kubernetes service does not balance incoming requests among old pods and new pods created by HPA,due to stateful HTTP.In this paper,we propose a bi-metric approach to scaling pods by taking into account both CPU utilization and utilization of a thread pool,which is a kind of important soft resource in Httpd and Tomcat.Our approach collects the utilization of CPU and memory of pods.Meanwhile,it makes use of ELBA,a milli-bottleneck detector,to calculate queue lengths of Httpd and Tomcat pods and then evaluate the utilization of their thread pools.Based on the utilization of both CPU and thread pools,our approach could scale up less replicas of Httpd and Tomcat pods,contributing to a reduction of hardware resource utilization.At the same time,our approach leverages preStop hook along with liveness and readiness probes to relieve load imbalance among old Tomcat pods and new ones.Based on the containerized RUBBoS,our experimental results show that the proposed approach could not only reduce the usage of CPU and memory by as much as 14%and 24%when compared with HPA,but also relieve the load imbalance to reduce average response time of requests by as much as 80%.Our approach also demonstrates that it is better to scale pods by multiple metrics rather than a single one.
文献关键词:
中图分类号:
作者姓名:
Changpeng ZHU;Bo HAN;Yinliang ZHAO
作者机构:
School of Artificial Intelligence,Chongqing University of Technology,Chongqing 401135,China;School of Computer Science and Engineering,Chongqing University of Technology,Chongqing 400054,China;School of Journalism and New Media,Xi'an Jiaotong University,Xi'an 710049,China;School of Computer Science and Technology,Xi'an Jiaotong University,Xi'an 710049,China
文献出处:
引用格式:
[1]Changpeng ZHU;Bo HAN;Yinliang ZHAO-.A bi-metric autoscaling approach for n-tier web applications on kubernetes)[J].计算机科学前沿,2022(03):9-20
A类:
autoscaling,beco,provi,autoscaler,RUBBoS,stateful,Httpd,ELBA,preStop
B类:
bi,approach,tier,web,applications,kubernetes,Container,virtualization,techniques,alternative,traditional,machines,due,less,overhead,better,most,widely,used,open,orchestration,systems,Kubernetes,des,built,mechanism,that,horizontal,HPA,dynamic,resource,provisioning,By,default,pods,only,CPU,utilization,single,performance,may,than,actually,needed,Thr,ough,extensive,measurements,containerized,benchmark,find,excessive,consume,memory,even,deteriorate,response,times,interference,Furthermore,service,does,not,incoming,requests,among,old,new,created,by,HTTP,In,this,paper,taking,into,account,both,thread,which,kind,important,soft,Tomcat,Our,collects,Meanwhile,makes,milli,bottleneck,detector,calculate,queue,lengths,then,evaluate,their,pools,Based,could,up,replicas,contributing,reduction,hardware,At,same,leverages,hook,along,liveness,readiness,probes,relieve,load,imbalance,ones,experimental,results,show,proposed,reduce,usage,much,when,compared,also,average,demonstrates,multiple,metrics,rather
AB值:
0.452245
相似文献
机标中图分类号,由域田数据科技根据网络公开资料自动分析生成,仅供学习研究参考。