成功大學博碩士論文系統

簡易檢索 / 詳目顯示

回結果列表

研究生：	曾毓豪 Tseng, Yu-Hao
論文名稱：	藉由GPU之快速且節省記憶體空間的NFA字串比對 Fast and Memory Efficient NFA Pattern Matching using GPU
指導教授：	張燕光 Chang, Yeim-Kuan
學位類別：	碩士 Master
系所名稱：	電機資訊學院 - 資訊工程學系 Department of Computer Science and Information Engineering
論文出版年：	2014
畢業學年度：	102
語文別：	英文
論文頁數：	68
中文關鍵詞：	深度封包檢測、決定性有限狀態自動機、非決定性有限狀態自動機、GPU 、CUDA 、字串比對、正規表示式
外文關鍵詞：	Deep Packet Inspection, DFA, NFA, GPU, CUDA, Pattern Matching, regular expression
相關次數：	點閱：272 下載：0
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

近年來，隨著網路發展快速，大量的惡意封包在網際網路之間傳播。惡意封包包含有病毒(virus)、惡意郵件(malware)…等。網路入侵偵測系統則是一套被設計來監測網路上是否含有這些惡意攻擊。在此系統中，我們常使用字串比對技術去保護我們的系統，所以字串比對技術是不可或缺的。藉由事先定義的病毒碼，網路入侵偵測系統能找出封包內容中是否為包含已定義的病毒碼。有限狀態自動機是一種常被使用在網路入侵偵測系統的比對技術，例如：NFA及DFA。由於網路的迅速膨脹，使得單一執行緒演算法已經不能負荷我們的高需求，所以我們需要藉由硬體協助來加速字串比對的效能。 GPU繪圖處理器常被利用來執行平行運算，有效地加速了字串比對的速度。
在這篇論文中，我們提出以NFA比對的方式為基礎並且有效利用GPU上的有限的記憶體空間存放更多更複雜的正規表示式。我們基於相似於NFA轉DFA的子集構造法轉換原先的NFA變成我們提出的CNFA。相較於原先的NFA與DFA，新提出的CNFA強加了一個限制。這個限制是每個狀態自動機只能擁有最多兩條相同字元的路徑，分別是對self-loop的路徑與non-self-loop的路徑兩種去做轉換。根據我們的實驗結果，新提出的CNFA在GPU上最佳表現可達大約100 Gbps。還有，CNFA只花費相對於iNFAnt使用量的18%。甚至對於iNFAnt無法處理的複雜規則集，CNFA可以勝任。

With the rapid development of networks, a large number of malicious packets, such as virus and malware, are spreading in recent years. Network intrusion detection system (NIDS) is mainly designed for monitoring whether there are some malicious packets over the Internet. The technique of pattern matching plays an important role because we often use it to protect our systems. With pre-defined virus signatures called patterns, NIDS can find out whether these pre-defined patterns exist in the payload of packets. Finite state automata (FSA), such as NFA and DFA, are one of pattern matching techniques in NIDS. Because processing a large amount of network packets only on a single thread cannot satisfy our high-speed demand, we need hardware to accelerate the performance of pattern matching. GPU can be used to effectively accelerate the performance of pattern matching because abundant parallel hardware threads are supported.
In this thesis, we propose a new NFA-based method to store more complex regular expressions in limited memory of GPU effectively. The proposed constrained NFA (CNFA) is constructed from the original NFA based on the method used in the subset construction algorithm that converts NFA to DFA. Compared to original NFA and DFA, the proposed CNFA imposes a constraint that each state can only have at most two transitions for each character, namely self-loop and non-self-loop transitions. Based on our experimental results, the proposed CNFA can achieve the performance of about 100 Gbps at the best case on GPU. Also, the proposed CNFA only needs 18% of memory usage, used in iNFAnt. In addition, the proposed CNFA can be used for more complex rule sets that is not possible to be implemented in iNFAnt.

Abstract ...............................iv
TABLE OF CONTENTS ......................vi
LIST OF TABLES ........................vii
LIST OF FIGURES ......................viii
Chapter 1 Introduction ..................1
1.1 Motivation ..........................1
1.2 Organization of the Thesis ..........2
Chapter 2 Related Work ..................3
2.1 Pattern Matching Problem ............3
2.2 String Matching Algorithm ...........3
2.3 Regular Expression .................11
2.4 GPU-based Algorithm	................21
Chapter 3 CUDA .........................24
3.1 Introduction .......................24
3.2 CUDA Programming Model .............24
3.3 CUDA Hardware Model	................28
Chapter 4 Proposed Scheme ..............31
4.1 Motivation .........................31
4.2 Overview ...........................34
4.3 The proposed CNFA ..................36
4.4 Compression Scheme .................44
Chapter 5 GPU Implementation ...........53
5.1 Introduction .......................53
5.2 Tasks per block ....................54
5.3 Various Memory Utilization .........54
Chapter 6 Experiment Result ............58
6.1 Experimental Environment ...........58
6.2 Rule set and Input Trace ...........58
6.3 Memory consumption .................61
6.4 Evaluation of Different Rule Sets ..63
6.5 Evaluation of Tasks per Block ......65
Chapter 7 Conclusion ...................66
Reference ..............................67
                                    

[1] E. Knuth, J. H. M. Jr., and V. R. Pratt, “Fast pattern matching in strings,” SIAM Journal on Computing, vol. 6, no. 2, pp.323–350, 1977.
[2] R. S. Boyer and J. S. Moore, “A fast string searching algorithm,” Communications of the ACM, vol. 20, no 10, pp.762-772, 1977.
[3] V. Aho and M. J. Corasick, “Efficient string matching: An aid to bibliography search,” Communications of the ACM, vol. 18, no.6, pp.333-340, 1975.
[4] Z. K. Baker and V. K. Prasanna, “Time and Area Efficient Pattern Matching on FPGAs,” Proceedings of the 2004 ACM/SIGDA 12th international symposium on Field programmable gate arrays, pp.223-232, 2004.
[5] X. Zha and S. Sahni, “GPU-to-GPU and Host-to-Host Multipattern String Matching On A GPU,” IEEE Transactions on Computers, vol.62, pp.1156-1169, 2013.
[6] C.-H. Lin, C.-H. Liu, L.-S. Chien and S.C. Chang, “Accelerating Pattern Matching Using a Novel Parallel Algorithm on GPUs,” IEEE Transactions on Computers, vol.PP, p.1, 2012.
[7] R. Nigel Horspool, “Practical Fast Searching in Strings.” Software-Practice & Experience 10-6 (1980) pp.501-506.
[8] Andrew Hume & Daniel Sunday, “Fast String Searching.” Software-Practice & Experience 21-11 (1991) pp.1221-1248.
[9] Ken Thompson, “Programming Techniques: Regular expression search algorithm.” Communications of the ACM Volume 11 Issue 6 (1968) pp. 419-422.
[10] V-M. Glushkov, “The abstract theory of automata.” Russian Mathematical Surveys 16-5 (1961) pp.1–53.
[11] Martin, John, “Introduction to Languages and the Theory of Computation.” McGraw Hill (2010) pp.108.
[12] CUDA. [Online]. Available : http://www.nvidia.com/object/cuda_home_new.html
[13] Niccolo’Cascarano et al., “iNFAnt: NFA Pattern Matching on GPGPU Devices,” ACM SIGCOMM Computer Communication Review, vol. 40 Num. 5, pp. 21-26, 2010.
[14] Kepler. [Online]. Available : http://www.nvidia.com/object/nvidia-kepler.html
[15] NVIDIA Corporation, “Whitepaper NVIDIA’s Next Generation CUDA Compute Architecture: Fermi,” 2009
[16] M. Becchi, C. Wiseman, and P. Crowley, “Evaluating Regular Expression Matching Engines on Network and General Purpose Processors,” the 5th ACM/IEEE Symposium on Architectures for Networking and Communications Systems, 2009.
[17] L7-filter. [Online]. Available : http://l7-filter.sourceforge.net/
[18] Emerging Threats. [Online]. Available : http://emergingthreats.net/
[19] Suricata. [Online]. Available : http://suricata-ids.org/
[20] VRT. [Online]. Available : https://www.snort.org/vrt/
[21] Oinkmaster. [Online]. Available : http://oinkmaster.sourceforge.net/
[22] Suricata Rule Set. [Online]. Available:
https://rules.emergingthreats.net/open/suricata/emerging-all.rules
[23] PCRE. [Online]. Available: http://www.pcre.org/
[24] Defcon. [Online]. Available : http://www.defcon.org/
[25] ClamAV. [Online]. Available : http://www.clamav.net/

2024-12-31公開

簡易檢索 / 詳目顯示

相關論文