簡易檢索 / 詳目顯示

研究生: 李建樺
Li, Jian-Hua
論文名稱: 適用於高可用性雲端計算之熱平衡虛擬機器遷移演算法
Thermal-Aware VM Migration for High-Availability Cloud Computing
指導教授: 鄭憲宗
Cheng, Sheng-Tzong
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 資訊工程學系
Department of Computer Science and Information Engineering
論文出版年: 2012
畢業學年度: 100
語文別: 英文
論文頁數: 50
中文關鍵詞: 雲端運算可用性虛擬機器轉移熱平衡
外文關鍵詞: Cloud computing, availability, VM migration, thermal balance
相關次數: 點閱:119下載:2
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 雲端基礎設施即服務(IaaS)的供應商藉由虛擬化技術將資源從實體機器中獨立出來出租給使用者做其所需的應用服務,已經是雲端運算發展到目前最常見的商業運作模式,然而使用者對於可用性仍舊有很大的顧慮。影響到雲端機房的可用性原因非常多,因為各種硬體元件的毀損影響服務正常運作非常常見,本研究探討機殼風扇毀損造成CPU過熱的熱緊急事件,以期能在發生當機之前將危險排除。
    我們設計出一套管理機制,能從溫度資訊與負載量判斷主機是否出錯並且預測到達危險所剩餘的時間。藉由虛擬機器搬遷技術,能夠有效的降低危險機器負載,進而降低危險主機溫度並脫離危險。我們提出轉移時間內送出的熱量作為挑選移出VM的指標,並且提出考量機器耐熱能力的負載平衡演算法作為挑選VM合適的目的地,提升整體系統可用性。根據模擬結果顯示,我們提出的管理機制與演算法能夠有效提升整體系統的可用性,降低VM的失效次數,達成確保服務品質的目標。

    Cloud computing, such as Infrastructure as a Service (IaaS), enables a vendor to use virtualization technology to rent computing resources on a physical machine for executing users’ desired applications. IaaS has been the most common business model of cloud computing, but its availability is still a concern among users. Many factors affect the availability of a cloud computing center, such as interruption of service caused by hardware component damage. In this study, we focus on the thermal emergency event of CPU overheating caused by chassis fan damage, and we find a way to resolve the crisis before a crash occurs. We design a Thermal-Aware VM Migration Manager that can determine the health of a physical machine from its temperature and resource utilization information. Through the leveraging of VM migration, the endangered physical machine can be removed from danger by transferring its load to a normal one and reducing the CPU temperature. We propose heat transfer in migration time as criteria for VM selection policy and load balance algorithm with concern for thermal tolerance as VM allocation policy. The simulation results show that a Thermal-Aware VM Migration Manager with our proposed VM selection and allocation policy can enhance system ability and reduce the number of VMs failure.

    摘 要 i Abstract ii ACKNOWLEDGEMENT iii TABLE OF CONTENTS iv LIST OF FIGURES vi LIST OF TABLES vii 1. INTRODUCTION AND MOTIVATION 1 1.1 Availability Issue in Cloud Computing 2 1.2 Thermal Emergencies in Cloud Centers 3 1.3 Thesis Overview 4 2. RELATED WORK 6 2.1 Monitors and Predictors 6 2.2 Action for Thermal Emergencies 7 2.3 Resource Reallocation by VM Migration 7 3. SYSTEM MODEL 9 3.1 System Architecture 9 3.1.1 System States 10 3.1.2 Thermal-Aware VM Migration Manager Framework 12 3.2 Thermal Model 13 3.2.1 CPU thermal model 13 3.2.2 Thermal balance with thermal emergency 14 3.2.3 RC thermal model 16 3.3 VM Live Migration 17 4. Thermal-Aware VM Migration Manager 20 4.1 Criteria of Thermal Aware VM Migration Manager 20 4.2 Thermal-Aware VM Migration Manager Algorithms 23 4.2.1 Danger Predictor 24 4.2.2 Migration Preparation 26 4.2.3 Migration Recommendation and Execution 29 5. PERFORMANCE EVALUATION 33 5.1 Simulation Settings 33 5.2 Results and Evaluation 36 5.2.1 TAVMM with different environment parameter setting 36 5.2.2 TAVMM with different VM selection policies 39 5.2.3 TAVMM with different VM allocation policies 44 6. CONCLUSIONS AND FUTURE WORK 47 REFERENCES 48

    [1] P. Mell and T. Grance, “The NIST Definition of Cloud Computing,” NIST Special Publication 800-145, 2011.
    [2] Nines (engineering), http://en.wikipedia.org/wiki/Nines_(engineering)
    [3] IT Cloud Services User Survey, pt.2: Top Benefits & Challenges, http://blogs.idc.com/ie/?p=210
    [4] Summary of the Amazon EC2 and Amazon RDS Service Disruption in the US East Region, http://aws.amazon.com/message/65648/
    [5] Amazon EC2 Service Level Agreement, http://aws.amazon.com/ec2-sla/
    [6] J. Moore, J. Chase, P. Ranganathan, and R. Sharma, “Making scheduling "cool": Temperature-aware workload placement in data centers,” In Proceedings of USENIX Annual Technical Conference, pages 61–75, 2005.
    [7] L. Ramos and R. Bianchini, “C-Oracle: Predictive thermal management for data centers,” In Proceedings of the Fourteenth International Symposium on High-Performance Computer Architecture (HPCA’08), 2008.
    [8] S. Fu, “Failure-aware resource management for high-availability computing clusters with distributed virtual machines,” In Proceedings of Journal of Parallel and Distributed Computing, 2010.
    [9] Q. Guan, Z. Zhang, and S. Fu, “Ensemble of Bayesian predictors and decision trees for proactive failure management in cloud computing systems,” In Proceedings of Journal of Communications, vol. 7, no. 1, 2012.
    [10] Y. Wang and M. Qiao, “Virtual machine auto-configuration for web application,” In Proceedings of Performance Computing and Communications Conference (IPCCC), 2010 IEEE 29th International, 2010.
    [11] H. Salami, H. Saadatfar, F. R. Fard, S. K. Shekofteh, and H. Deldari, “Improving cluster computing performance based on job futurity prediction,” In Proceedings of Advanced Computer Theory and Engineering (ICACTE), 2010 3rd International Conference on, 2010.
    [12] R.K. Sahoo, A. J. Oliner, I. Rish, M. Gupta, J.E. Moreira and S. Ma, “Critical event prediction for proactive management in large-scale computer clusters,” In Proceedings of ACM International Conference on Knowledge Discovery and Data Dining (SIGKDD), 2003.
    [13] J. W. Mickens and B. D. Noble, “Exploiting availability prediction in distributed systems,” In Proceedings of USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2006.
    [14] S. Fu and C. Xu, “Quantifying event correlations for proactive failure management in networked computing systems,” In Proceedings of Journal of Parallel and Distributed Computing, vol. 70, no. 11, pp. 1100–1109, 2010.
    [15] J. Gu, Z. Zheng, Z. Lan, J. White, E. Hocks, and B.H. Park, “Dynamic meta-learning for failure prediction in large-scale systems: A case study,” In Proceedings of IEEE International Conference on Parallel Processing (ICPP), 2008.
    [16] R. J. Frank, N. Davey and S. P. Hunt, “Time series prediction and neural networks,” http://www.smartquant.com/references/NeuralNetworks/neural30.pdf
    [17] R2012a documentation neural network toolbox: time series prediction, http://www.mathworks.com/help/toolbox/nnet/gs/f9-56659.html
    [18] R. Ivan, E. K. Lee, D. Pompili, M. Parashar, M. Gamell, and R. J. Figueiredo, “Towards energy-efficient reactive thermal management in instrumented datacenters,” In Proceedings of IEEE/ACM International Conference on Energy Efficient Grids, Clouds and Clusters Workshop (E2GC2), Brussels, Belguim, 2010.
    [19] J. Choi, Y. Kim, A. Sivasubramaniam, J. Srebric, Q. Wang, and J. Lee, "Modeling and managing thermal proles of rack-mounted servers with ThermoStat," In Proceedings of IEEE 13th Int'l Symp. High Performance Computer Architecture, 2007.
    [20] A. Beloglazov and R. Buyya, “Optimal online deterministic algorithms and adaptive heuristics for energy and performance efficient dynamic consolidation of virtual machines in cloud data centers,” In Proceedings of Concurrency Computat.: Pract. Exper, 2011
    [21] X. Wang and Y. Wang, “Coordinating power control and performance management for virtualized server clusters,” In Proceedings of Parallel and Distributed Systems, IEEE Transactions on, 2011.
    [22] Y. Zhao, “Adaptive Distributed Load Balancing Algorithm Based on Live Migration of Virtual Machines in Cloud,” In Proceedings of INC, IMS and IDC, 2009
    [23] N. J. Kansal and I. Chana, “Cloud Load Balancing Techniques: A Step Towards Green Computing,” in Proceedings of IJCSI, 2012
    [24] D. Epping and F. Denneman, “VMware vSphere 4.1 HA and DRS technical deepdive,” CreateSpace, USA, 2010.
    [25] L. Wang, G. von Laszewski, J. Dayal, X. He, A. Younge, and T. Furlani, “Towards thermal aware workload scheduling in a data center,” in Proceedings of the 2009 10th International Symposium on Pervasive Systems, Algorithms, and Networks, 2009.
    [26] E. Pakbaznia, M. Ghasemazar, and M. Pedram, “Temperature aware dynamic resource provisioning in a power optimized datacenter,” In Proceedings of Design Automation and Test in Europe, 2010.
    [27] Q. Tang, S. K. S. Gupta, D. Stanzione and P. Cayton, “Thermal-aware task scheduling to minimize energy usage of blade server based datacenters,” In Proceedings of IEEE International Symposium on Dependable, Autonomic and Secure Computing (DASC'06), 2006.
    [28] A. Weissel and F. Bellosa, “Dynamic thermal management for distributed systems,” In Proceedings of the first workshop on Temperature-aware Computer Systems, 2004.
    [29] A. Ferreira, D. Mosse, and J. Oh, “Thermal faults modeling using a RC model with an application to web farms,” In Proceedings of Real-Time Systems, 2007.
    [30] T. Heath, A. P. Centeno, P. George, L. Ramos, Y. Jaluria, and R. Bianchini, “Mercury and freon: temperature emulation and management for server systems,” In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), October 2006.
    [31] C. Clark, K. Fraser, S. Hand, J. G. Hansen, E. Jul, C. Limpach, I. Pratt, and A. Wareld, “Live migration of virtual machines,” In Proceedings of the 2nd ACM/USENIX Symposium on Networked Systems Design and Implementation (NSDI), 2005.
    [32] H. Liu, C. Z. Xu, H. Jin, J. Gong and X. Liao, “Performance and energy modeling for live migration of virtual machine,” In Proceedings of the 20th international symposium on High performance distributed computing, 2011
    [33] M. Zhao and R. J. Figueiredom, “Experimental study of virtual machine migration in support of reservation of cluster resources,” In Proceedings of the 2nd international workshop on Virtualization technology in distributed computing, 2007
    [34] Po-Hsiang Wang and Chien Chen, “Energy aware load-balancing for cloud computing,” Thesis paper, 2011.
    [35] Rodrigo N. Calheiros, Rajiv Ranjan, Anton Beloglazov, Cesar A. F. De Rose, and Rajkumar Buyya, “CloudSim: A toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms,” In Proceedings of Software: Practice and Experience (SPE), Volume 41, Number 1, Pages: 23-50, ISSN: 0038-0644, Wiley Press, New York, USA, January, 2011.

    無法下載圖示 校內:2017-08-17公開
    校外:不公開
    電子論文尚未授權公開,紙本請查館藏目錄
    QR CODE