成功大學博碩士論文系統

簡易檢索 / 詳目顯示

回結果列表

研究生：	陳易萱 Chen, Yi-Hsuan
論文名稱：	基於卷積神經網路之移動物偵測 Moving Object Detection Based on Convolutional Neural Network
指導教授：	楊家輝 Yang, Jar-Ferr
學位類別：	碩士 Master
系所名稱：	電機資訊學院 - 電腦與通信工程研究所 Institute of Computer & Communication Engineering
論文出版年：	2018
畢業學年度：	106
語文別：	英文
論文頁數：	43
中文關鍵詞：	自駕車、先進輔助駕駛系統、移動物偵測、卷積神經網路、損失函數
外文關鍵詞：	Self-driving Car, Advanced Driving Assistance System, Moving Object Detection, Convolution Neural Network, Loss Function
相關次數：	點閱：58 下載：3
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

近年來，自駕車技術成為熱門的研究，無論是現在的先進輔助駕駛系統(ADAS)或是在未來的自駕車，都必須設計一套系統以適時輔助駕駛並避免危險。尤其行駛在的多變狀況的道路上，移動物的偵測又顯得格外困難與重要。在本論文中，我們針對先進輔助駕駛系統提出一套機於卷積神經網路(CNN)的移動物偵測方法。降低記憶體的使用量，使之可以運用到輕量級的裝置上亦是我們的另一考量。其中我們也引入了焦距損失函數(Loss Function)當作我們的目標函數之一，使模型訓練的效率提升。我們的移動物偵測主要可辨識一般市區道路上常見的人、騎士、汽車這三個物件。為了使模型更適合臺灣的一般道路，我們也針對台灣市區的道路情境進行資料的收集和建置，尤其是機車的資料建置，因為台灣的機車數量遠大於其他先進國家。在實驗結果中，我們的方法達到合理的準確度，並在偵測率和誤鳴率中取得良好的平衡。

In recent years, self-driving technologies have become a popular research trend. No matter for the current advanced assisted driving system (ADAS) or the future self-driving car, the smart detections are needed to assist drivers to avoid any accidents on the road. In particular, the detections of moving objects in various conditions of road driving are relatively difficult and important. In this thesis, we proposed a moving object detection system for ADAS based on convolutional neural networks. The reduction of the memory usage and computation load will be another concern such that the system can be appled in lightweight devices. We also proposed a focal loss as the objective function to improve the training efficiency. The designed detector mainly recognizes three targets: pedestrians, riders and cars that are common on the roads. In order to be more suitable for the circumstances in Taiwan, we also collected and constructed dataset on Taiwanese urban roads, especially the moto-riders data, because the number of motorcycles in Taiwan is much larger than those in other developed countries. The experimental results demonstrate that our proposed system achieves reasonable accuracy and keeps a good balance between detection rates and false alarm rates.

摘 要	I
Abstract	II
誌謝	III
Contents	IV
List of Tables	VII
List of Figures	VIII
Chapter 1: Introduction	1
1.1. Research Background	1
1.2. Motivations	3
1.3. Literature Reviews	3
1.4. Organization of Thesis	7
Chapter 2: Related Works	9
2.1. Convolution Neural Network	9
2.2. CNN Based Object Detection	12
2.2.1. RCNN, Fast RCNN and Faster RCNN	12
2.2.2. One-shot Detector	13
2.3. Anchor box	14
2.4. Objective Functions	16
2.4.1. Confidence Loss	16
2.4.2. Location Loss	17
Chapter 3: The Proposed MOD System	18
3.1. Single Shot Multibox Detector	19
3.1.1. SSD structure	20
3.1.2. Convolutional predictor	21
3.2. Feature extraction	22
3.2.1. Depthwise Separable Convolution	23
3.2.2. Network Structure	23
3.3. Loss Function	26
3.3.1. Total Loss	26
3.3.2. Location Loss	26
3.3.3. Confidence Loss	27
Chapter 4: Experimental Results	29
4.1. Experimental Environments	29
4.2. Fusion of Databases	30
4.2. Comparison with Different Methods	34
Chapter 5: Conclusions	39
Chapter 6: Future Works	40
References	41
                                    

[1] D. G. Lowe, "Object recognition from local scale-invariant features," in Computer vision, 1999. The proceedings of the seventh IEEE international conference on, 1999, pp. 1150-1157.
[2] H. Bay, A. Ess, T. Tuytelaars, and L. Van Gool, "Speeded-up robust features (SURF)," Computer vision and image understanding, vol. 110, pp. 346-359, 2008.
[3] Y. Freund and R. E. Schapire, "A decision-theoretic generalization of on-line learning and an application to boosting," Journal of computer and system sciences, vol. 55, pp. 119-139, 1997.
[4] N. Dalal and B. Triggs, "Histograms of oriented gradients for human detection," in Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, 2005, pp. 886-893.
[5] P. F. Felzenszwalb, R. B. Girshick, D. McAllester, and D. Ramanan, "Object detection with discriminatively trained part-based models," IEEE transactions on pattern analysis and machine intelligence, vol. 32, pp. 1627-1645, 2010.
[6] C.-W. Hsu, C.-C. Chang, and C.-J. Lin, "A practical guide to support vector classification," 2003.
[7] J. R. Quinlan, "Induction of decision trees," Machine learning, vol. 1, pp. 81-106, 1986.
[8] W. Zhang and J. Kosecka, "Localization based on building recognition," in Computer Vision and Pattern Recognition-Workshops, 2005. CVPR Workshops. IEEE Computer Society Conference on, 2005, pp. 21-21.
[9] C. N. Khac, J. H. Park, and H.-Y. Jung, "Face detection using variance based Haar-like feature and SVM," World Academy of Science, Engineering and Technology, vol. 60, pp. 165-168, 2009.
[10] H. Cho, P. E. Rybski, A. Bar-Hillel, and W. Zhang, "Real-time pedestrian detection with deformable part models," in Intelligent Vehicles Symposium (IV), 2012 IEEE, 2012, pp. 1035-1042.
[11] A. Krizhevsky, I. Sutskever, and G. E. Hinton, "Imagenet classification with deep convolutional neural networks," in Advances in neural information processing systems, 2012, pp. 1097-1105.
[12] Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, et al., "Backpropagation applied to handwritten zip code recognition," Neural computation, vol. 1, pp. 541-551, 1989.
[13] K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition," arXiv preprint arXiv:1409.1556, 2014.
[14] C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, et al., "Going deeper with convolutions," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2015, pp. 1-9.
[15] K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770-778.
[16] F. N. Iandola, S. Han, M. W. Moskewicz, K. Ashraf, W. J. Dally, and K. Keutzer, "Squeezenet: Alexnet-level accuracy with 50x fewer parameters and< 0.5 mb model size," arXiv preprint arXiv:1602.07360, 2016.
[17] A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, et al., "Mobilenets: Efficient convolutional neural networks for mobile vision applications," arXiv preprint arXiv:1704.04861, 2017.
[18] X. Zhang, X. Zhou, M. Lin, and J. Sun. Shufflenet: An extremely efficient convolutional neural network for mobile devices. arXiv:1707.01083, 2017.
[19] R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2014, pp. 580-587.
[20] R. Girshick, "Fast r-cnn." in Proceedings of the IEEE international conference on computer vision, 2015, pp. 1440-1448.
[21] S. Ren, K. He, R. Girshick, and J. Sun, "Faster r-cnn: Towards real-time object detection with region proposal networks," in Advances in neural information processing systems, 2015, pp. 91-99.
[22] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 779-788.
[23] W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, et al., "Ssd: Single shot multibox detector," in European conference on computer vision, 2016, pp. 21-37.

校內：2023-08-01公開
校外：2023-08-01公開

簡易檢索 / 詳目顯示

相關論文