National Cheng Kung University Master's and Doctoral Thesis System


Graduate student:	Guan, Zhen-An (官振安)
Thesis title:	A Real-Time FPGA-Based Pedestrian Detection and Tracking System: A Low Memory Cost Approach
Advisor:	Chen, Chin-Hsing (陳進興)
Degree:	Master
Department:	College of Electrical Engineering and Computer Science - Institute of Computer & Communication Engineering
Year of publication:	2020
Academic year of graduation:	108 (ROC academic year)
Language:	English
Number of pages:	77
Keywords:	FPGA, real-time, moving object detection, moving object recognition, moving object tracking, background subtraction, K-Means Clustering, KBMOT, KRBMOT

This thesis proposes a real-time moving object detection and tracking system based on a field-programmable gate array (FPGA), with pedestrians as the main target. The input color video captured by the camera passes through a series of processing stages: color-to-greyscale transformation, background subtraction, morphological denoising, K-Rectangle-Balloons Moving Object Tracking (KRBMOT), and bounding box generation. The resulting output color video is displayed on a video graphics array (VGA) screen. The system processes video at super video graphics array (SVGA) resolution, 800×600, in real time, specifically at 60 frames per second (FPS).

The proposed algorithm, KRBMOT, is inspired by the K-Means Clustering (KMC) algorithm. KRBMOT is a moving object tracking (MOT) algorithm derived by adding parameters to KMC and making a few adjustments. It is particularly suited to objects whose shape is close to a rectangle; an important application is pedestrian tracking.

Like KMC, KRBMOT needs memory to store only a few parameters, which greatly reduces the memory requirement of the proposed system. In total, the system uses 210 kbit of on-chip memory and one off-chip synchronous dynamic random-access memory (SDRAM) with a bandwidth of 400 MB/s. Compared with other approaches, the proposed system is built on a much cheaper hardware architecture while maintaining good performance.
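The abstract's claim that a KMC-style tracker needs to store only a few parameters can be illustrated with a minimal sketch of one plain K-means pass over a binary foreground mask. This is not KRBMOT, whose extra parameters and adjustments are not described in this record, and it is not the thesis's hardware design; the function name `kmeans_step` and the list-of-lists mask format are assumptions made for illustration. The point is that pixels can be consumed in raster order while only per-cluster sums, counts, and box extents are kept.

```python
def kmeans_step(mask, centroids):
    """One streaming K-means pass over a binary foreground mask.

    Illustrative sketch only (plain K-means, not KRBMOT).
    mask: list of rows, each a list of 0/1 values (1 = foreground pixel).
    centroids: list of (x, y) tuples, one per cluster.
    Returns (new_centroids, boxes), where boxes[k] is
    (x_min, y_min, x_max, y_max), or None if cluster k received no pixel.
    """
    k = len(centroids)
    # Per-cluster accumulators [sum_x, sum_y, count]: the only state that
    # must be held while the pixels stream past.
    sums = [[0, 0, 0] for _ in range(k)]
    boxes = [None] * k

    def sq_dist(x, y, c):
        return (x - c[0]) ** 2 + (y - c[1]) ** 2

    for y, row in enumerate(mask):
        for x, value in enumerate(row):
            if not value:
                continue  # background pixel, nothing to update
            # Assign the pixel to the nearest centroid.
            nearest = min(range(k), key=lambda i: sq_dist(x, y, centroids[i]))
            acc = sums[nearest]
            acc[0] += x
            acc[1] += y
            acc[2] += 1
            # Grow the cluster's bounding box to include this pixel.
            box = boxes[nearest]
            if box is None:
                boxes[nearest] = (x, y, x, y)
            else:
                boxes[nearest] = (min(box[0], x), min(box[1], y),
                                  max(box[2], x), max(box[3], y))

    # New centroid = mean of assigned pixels; keep the old one if none.
    new_centroids = [
        (s[0] / s[2], s[1] / s[2]) if s[2] else c
        for s, c in zip(sums, centroids)
    ]
    return new_centroids, boxes
```

Calling `kmeans_step` once per frame, seeded with the previous frame's centroids, gives a simple tracker whose state is a handful of numbers per object rather than a frame buffer.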

Abstract (in Chinese)
Abstract
Acknowledgment (in Chinese)
Acknowledgment
Contents
List of Tables
List of Figures
Chapter 1	Introduction
1	Background
1.1	Field Programmable Gate Array (FPGA)
1.2	Hardware Architecture Design (HAD)
1.3	Moving Object Detection (MOD) and Moving Object Tracking (MOT)
2	Motivation
2.1	Choice of Background Subtraction Algorithm
2.2	The Proposed MOT Algorithm
3	Thesis Contribution
4	Thesis Outline
Chapter 2	Models and Algorithms
1	Overview
2	Tensor Formulation for Videos
3	Color to Greyscale Transformation
4	Background Subtraction (BS)
5	Morphological Denoising
Chapter 3	The Proposed Moving Object Tracking Algorithm
1	Overview
2	The K-Means Clustering Algorithm (KMC)
3	Problems in Extending KMC to MOT
4	The K-Balloons MOT Algorithm (KBMOT)
5	Object Rediscovering and Merging
6	The K-Rectangle-Balloons MOT Algorithm (KRBMOT)
7	Tensorization for KRBMOT
8	Bounding Boxes Generation
Chapter 4	Hardware Architecture Design of the Proposed System
1	Devices and Environments
1.1	The DE2-115 Board
1.2	The TRDB-D5M Camera Kit
1.3	Programming Interface
1.4	The Cyclone IV E FPGA
1.5	SDRAM
1.6	VGA DAC
1.7	RS232 Interface
2	The Top-Level Module and Data Transmission
2.1	The Top-Level Module
2.2	Tensor Transmission
2.3	Notations Explanation
3	The Color to Greyscale Transformation Module
4	The Parameter-Update Module
5	The Background Subtraction Module
6	The Morphological Denoising Module
7	The KRBMOT Module
Chapter 5	Experimental Results
Chapter 6	Conclusion
References


                                    


On campus: available from 2022-09-01
Off campus: available from 2024-09-01
