| 研究生: |
曼亞倫 MANN, Allan |
|---|---|
| 論文名稱: |
應用開放計算語言及圖形處理器於結構有限元素分析 Static and Dynamic Finite Element Analysis Using Parallel Programming on GPU with OpenCL |
| 指導教授: |
崔兆棠
CHOI, Siu-Tong |
| 共同指導教授: |
李汶樺
Matthew SMITH |
| 學位類別: |
碩士 Master |
| 系所名稱: |
工學院 - 航空太空工程學系 Department of Aeronautics & Astronautics |
| 論文出版年: | 2014 |
| 畢業學年度: | 102 |
| 語文別: | 英文 |
| 論文頁數: | 128 |
| 外文關鍵詞: | GPU, OpenCL, Finite Element, Static analysis, Dynamic analysis |
| 相關次數: | 點閱:116 下載:2 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
Modern engineering systems are becoming increasingly complex, there is a need of more and more degrees of freedom to simulate their accurate behavior. Due to the limitation in computing capabilities, there is an urge of finding a new concept for simulating large problems. The Graphics Processing Units (GPU) were originally designed for offloading graphical display but with the improved capabilities of GPU, the use of GPU for General Purpose Programming (GPGPU) has been noticed. Due to the parallel architecture which allows the concurrency of the tasks, GPU can be used for solving large systems of equations. Implementation of the Finite Element Method on GPU architecture is quite straightforward because this method deals with linear equations or ordinary differential equations and when a problem has a large number of degrees of freedom, the use of GPU capabilities can significantly decrease the computation time. Two specific examples are the transient analysis and the static analysis which can reach higher performance using parallel programming. By using appropriate numerical methods and techniques for matrix-vectors operations such as the Conjugate Gradient method, the solution of a linear system using an AMD HD 7970 can be found 11 times faster than using an Intel i7. By using an appropriate precondition matrix in the Preconditioned Conjugate Gradient method (PCG), the solution can be found 14 times faster for a problem involving 50000 degrees of freedom. Transient analysis of 3D problems show that using the Newmark method of integration with about 50000 degrees of freedom, the solution can be found 18 times faster on the AMD HD 7970. Such high levels of performance are unable with current processors (CPU) even the most powerful ones.
[1] S. S. Rao, The Finite element method in engineering: Fifth Edition, Waltham MA: Else-
vier/Butterworth Heinemann, 2011
[2] L. J. Segerlind, Applied Finite element analysis, Michigan State University, 1976
[3] N. Kikuchi, Finite element methods in mechanics, Press syndicate of the university of Cambridge,
1986
[4] A. Munshi, B. Gaster, T. G. Mattson, J. Fung and D. Ginsburg, OpenCL Programming Guide,
Addison-Wesley professionnal,2012
[5] J. Sanders and E. Kandrot, CUDA BY EXAMPLE: An introduction to General-Purpose GPU pro-
gramming, Addison-Wesley professionnal, 16/07/2010
[6] D.Weber, J. Bender, M. Schnoes, A. Stork and D. Fellner, Efficient GPU data structures and methods
to solve sparse linear systems in dynamics applications, Computer Graphics Forum, Wiley Online
Library, 2012
[7] C. Cecka, A. Lew and E. Darve, Assembly of Finite element methods on graphics processors, Interna-
tional journal for numerical methods in engineering, vol. 85, no. 5, pp. 640-669, 2011
[8] R. Ma , S. Sirouspour, B. Mahdavikhah, B. Moody, K. Elizeh, A. Kinsman and N. Nicolici, A parallel
computing platform for real-time haptic interaction with deformable bodies, IEEE Transactions on
Haptics, vol. 3, no. 3, pp. 211-223, July/September 2010
[9] B. Mahdavikhah, R. Ma , S. Sirouspour and N. Nicolici, Haptic rendering of deformable objects
using a multiple FPGA parallel computing architecture, FPGA 10: Proceedings of the 18th annual
ACM/SIGDA international symposium on Field programmable gate arrays, 2010
[10] A. Cevahir, A. Nukada, and S.Matsuoka, High performance conjugate gradient solver on multi-GPU
clusters using hypergraph partitioning, Computer Science-Research and Development, vol. 25, no. 1/2,
pp. 83-91, 2010
[11] S. Georgescu and H. Okuda, Conjugate gradients on multiple GPUs, International Journal for Nu-
merical Methods in Fluids, vol. 64, no. 10-12, pp. 1254-1273, 2010
[12] M. Ament, G. Knittel, D. Weiskopf and W. Strasser, A parallel preconditioned conjugate gradient
solver for the poisson problem on a multi-GPU platform, Parallel, Distributed and Network-Based
Processing (PDP), 2010 18th Euromicro International Conference on IEEE, pp. 583-592, 2010
[13] Y. Liu, W. Zhou and Q. Yang, A distributed memory parallel element-by-element scheme based on
jacobi-conditioned conjugate gradient for 3D finite element analysis, Finite Elements in Analysis and
Design, vol. 43, no. 6, pp.494-503, 2007
[14] J. Allard, H. Courtecuisse and F. Faure, Implicit FEM solver on GPU for interactive deformation
simulation, GPU Computing Gems Jade Edition, Morgan Kaufmann Publishers, San Francisco, CA,
USA, ch. 21, pp. 281-294, 2012
[15] Origin 8 User Guide, OriginLab Corporation, 2007
[16] Introduction to Comsol Multiphysics, Version 4.4, December 2013
[17] K. J. Bathe and E. L. Wilson, Numerical methods in finite element analysis, US Edition, 1975
[18] N. Bell and M. Garland, Efficient Sparse Matrix- Vector Multiplication on CUDA NVIDIA Technical
Report Dec. 2008: NVR-2008- 004
[19] A.I. Khan and B.H.V. Topping, Parallel finite element analysis using Jacobi-conditioned conjugate
gradient algorithm, Advances in Engineering Software, Elsevier, March/April 1996
[20] J.S. Kowalik and S.P. Kumar, An efficient parallel block conjugate gradient method for linear equa-
tions, Proc. 1982 International Conference on Parallel Processing, ICPR 93, pp. 47-52, August 1982
[21] B. Gaster, Heterogeneous computing with OpenCL, Waltham MA : Morgan Kaufmann, 2012
[22] E. H. Dill, The Finite Element Method for Mechanics of Solids with ANSYS Applications, CRC
Press, Taylor and Francis Group, 2012