| 研究生: |
劉冠甫 Liu, Kuan-Fu |
|---|---|
| 論文名稱: |
使用高斯濾波和多項式誤差嵌入改善立體視覺的3D重建 Improved 3D Reconstruction using Stereo Vision with Gaussian Filter and Polynomial Disparity Fitting |
| 指導教授: |
王大中
Wang, Ta-chung |
| 學位類別: |
碩士 Master |
| 系所名稱: |
工學院 - 民航研究所 Institute of Civil Aviation |
| 論文出版年: | 2016 |
| 畢業學年度: | 104 |
| 語文別: | 英文 |
| 論文頁數: | 81 |
| 中文關鍵詞: | 立體匹配 、銳利化 、高斯濾波 、輪廓 |
| 外文關鍵詞: | Stereo matching, Unsharp mask, Gaussian filter, Contour |
| 相關次數: | 點閱:88 下載:2 |
| 分享至: |
| 查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文研究目的主要是使用立體視覺去計算出物體在空間中和相機的距離,並利用反投影將影像中的物體在三維空間中重建目標物體的輪廓。在立體視覺中最重要的步驟就是立體匹配,而在立體匹配過程中有兩個關鍵是很重要的精確度和計算時間,所以本文針對這兩部分去進行探討。在演算法的地方,從FNCC匹配換成SSD匹配使計算時間減少了一大半,並且加入了disparity範圍限制除了再加速計算外也減少了誤差的產生。為了提升精確度,在方法中加入濾波和多項式誤差嵌入去提升精確度。本研究嘗試四種不同的濾波和數種不同銳利化程度以及兩種不同矩形的匹配窗口,來探討銳利化程度以及不同矩形的匹配窗口對濾波提升精度的影響。就結論而言,此方法可以大大提升精確度和降低計算時間。
In this thesis, it will be explained how the stereo vision can be used to calculate the distance between an object and a camera. By using a back projection method in 3D space, two existing images are recovered by which to analyze the contour of a target object. Stereo matching is the most important step in stereo vision as it will determine the accuracy and computational time of required to obtain experimental results. This thesis discusses both accuracy and computational time. In terms of an algorithm, the process is changed from FNCC matching to SSD matching. Then, a disparity range limitations is used to reduce both computational time and the number of mismatched points. To further improve the accuracy, a Gaussian filter and polynomial disparity fitting are added. An unsharp mask filter is used in the experiment to sharpen images, the window size is altered, and different filters are used to find the best way to improve the accuracy and speed of stereo matching. In conclusion, propose method reduced both the number of mismatching points and computational time.
[1] J. Zhao, S. Chang, and G.-Z. Men, "3D Reconstruction based on the matching method of GA," in 2010 International Conference on Machine Learning and Cybernetics, 2010, pp. 1798-1801.
[2] T. Li, G. Yuan, and Q. Duan, "Navigation technology of autonomous mobile robots in unknown environments," in Mechatronics and Embedded Systems and Applications (MESA), 2010 IEEE/ASME International Conference on, 2010, pp. 533-538.
[3] J. C. Andersen, N. A. Andersen, and O. Ravn, "Trinocular stereo vision for intelligent robot navigation," in 5th IFAC/EURON Symposium on Intelligent Autonomous Vehicles, 2004.
[4] A.-L. Hou, X. Cui, Y. Geng, W.-J. Yuan, and J. Hou, "Measurement of safe driving distance based on stereo vision," in Image and Graphics (ICIG), 2011 Sixth International Conference on, 2011, pp. 902-907.
[5] K. Muljowidodo, M. A. Rasyid, N. SaptoAdi, and A. Budiyono, "Vision based distance measurement system using single laser pointer design for underwater vehicle," Indian Journal of Marine Sciences, vol. 38, no. 3, pp. 324-331, Sep. 2009.
[6] J. Mrovlje and D. Vrancic, "Distance measuring based on stereoscopic pictures," in 9th International PhD Workshop on Systems and Control: Young Generation Viewpoint, 2008.
[7] H.-S. Baek, J.-M. Choi, and B.-S. Lee, "Improvement of distance measurement algorithm on stereo vision system (svs)," in Ubiquitous Information Technologies and Applications (CUTE), 2010 Proceedings of the 5th International Conference on, 2010, pp. 1-3.
[8] K. Yoshida and S. Hirose, "Real-time stereo vision with multiple arrayed camera," in Robotics and Automation, 1992. Proceedings., 1992 IEEE International Conference on, 1992, pp. 1765-1770.
[9] M. Okutomi and T. Kanade, "A multiple-baseline stereo," IEEE Transactions on pattern analysis and machine intelligence, vol. 15, no. 4, pp. 353-363, Mar. 1993.
[10] M. A. Muquit and T. Shibahara, "A high-accuracy passive 3D measurement system using phase-based image matching," IEICE Transactions on fundamentals of electronics, communications and computer sciences, vol. 89, no. 8, pp. 686-697, Mar. 2006.
[11] R. Hartley and A. Zisserman, Multiple view geometry in computer vision: Cambridge university press, 2003.
[12] Q.-T. Luong, R. Deriche, O. Faugeras, and T. Papadopoulo, "On determining the fundamental matrix: Analysis of different methods and experimental results," INRIA, 1993.
[13] Z. Zhang, "Determining the epipolar geometry and its uncertainty: A review," International journal of computer vision, vol. 27, no. 2, pp. 161-195, Mar. 1998.
[14] V. Nozick, "Multiple view image rectification," in Access Spaces (ISAS), 2011 1st International Symposium on, 2011, pp. 277-282.
[15] C. Loop and Z. Zhang, "Computing rectifying homographies for stereo vision," in Computer Vision and Pattern Recognition, 1999. IEEE Computer Society Conference on., 1999.
[16] R. I. Hartley, "Theory and practice of projective rectification," International Journal of Computer Vision, vol. 35, no. 2, pp. 115-127, Nov.-Dec. 1999.
[17] H. MORAVEC, "Toward automatic visual obstacle avoidance. 5 th Int. Joint Conf," Artificial Intelligence, p. 584, Aug. 1977.
[18] T. Lindeberg, "Feature detection with automatic scale selection," International journal of computer vision, vol. 30, no. 2, pp. 79-116, Nov. 1998.
[19] K. Mikolajczyk and C. Schmid, "Indexing based on scale invariant interest points," in Computer Vision, 2001. ICCV 2001. Proceedings. Eighth IEEE International Conference on, 2001, pp. 525-531.
[20] K. Mikolajczyk and C. Schmid, "An affine invariant interest point detector," in European conference on computer vision, 2002, pp. 128-142.
[21] T. Tuytelaars and L. Van Gool, "Matching widely separated views based on affine invariant regions," International journal of computer vision, vol. 59, no. 1, pp. 61-85, Aug. 2004.
[22] J. Banks, M. Bennamoun, and P. Corke, "Non-parametric techniques for fast and robust stereo matching," in TENCON'97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE, 1997, pp. 365-368.
[23] W. Cheng, J. Liu, and J. Zhang, "A binocular computer vision system for aerial image pairs," in Signal Processing, 1996., 3rd International Conference on, 1996, pp. 954-957.
[24] G. Thorpe and D. Fraser, "Problems in area-based image registration for stereo," in TENCON'97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE, 1997, pp. 347-352.
[25] M. Shimizu and M. Okutomi, "Precise subpixel estimation on area‐based matching," systems and computers in Japan, vol. 33, no. 7, pp. 1-10, Jun. 2002.
[26] E. Izquierdo, "Disparity/segmentation analysis: matching with an adaptive window and depth-driven segmentation," IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no. 4, pp. 589-607, Jun. 1999.
[27] T. Kanade and M. Okutomi, "A stereo matching algorithm with an adaptive window: Theory and experiment," IEEE transactions on Pattern analysis and Machine Intelligence, vol. 16, no. 9, pp. 920-932, Sep. 1994.
[28] Y.-F. Liao, "Object Geometry Reconstruction by Stereo Image Data," Department of Aeronautics and Astronautics, National Cheng Kung University, 2005.
[29] T.-S. Hsu and T.-C. Wang, "An Improvement Stereo Vision Images Processing for Object Distance Measurement," International Journal of Automation and Smart Technology, vol. 5, no. 2, pp. 85-90, Jun. 2015.
[30] C. i. Colour. (20 December). Cambridge in colour.
Available:http://www.cambridgeincolour.com/tutorials/imagesharpening.htm
[31] J. R. Parker, Algorithms for image processing and computer vision: John Wiley & Sons, 2010.
[32] M. Badamchizadeh and A. Aghagolzadeh, "Comparative study of unsharp masking methods for image enhancement," in Image and Graphics (ICIG'04), Third International Conference on, 2004, pp. 27-30.
[33] J. Weng, N. Ahuja, and T. S. Huang, "Two-view Matching," in ICCV, 1988, pp. 64-73.
[34] H. S. Lim and T. O. Binford, "Stereo correspondence: A hierarchical approach," in Proc. Image Understanding Workshop, 1987, pp. 234-241.
[35] R. Wang, "AdaBoost for feature selection, classification and its relation with SVM, a review," Physics Procedia, vol. 25, pp. 800-807, Apr. 2012.
[36] D. G. Lowe, "Object recognition from local scale-invariant features," in Computer vision, 1999. The proceedings of the seventh IEEE international conference on, 1999, pp. 1150-1157.
[37] D. G. Lowe, "Distinctive image features from scale-invariant keypoints," International journal of computer vision, vol. 60, no.2, pp. 91-110, Nov. 2004.
[38] D. Scharstein and R. Szeliski, "A taxonomy and evaluation of dense two-frame stereo correspondence algorithms," International journal of computer vision, vol. 47, no. 1-3, pp. 7-42, Apr.-Jun. 2002.
[39] V. Kolmogorov and R. Zabih, "Computing visual correspondence with occlusions using graph cuts," in Computer Vision, 2001. ICCV 2001. Proceedings. Eighth IEEE International Conference on, 2001, pp. 508-515.
[40] L. Di Stefano and S. Mattoccia, "Fast Stereo Matching for the VIDET System using a General Purpose Processor with Multimedia Extensions," in camp, 2000, pp. 356-.
[41] T. Kanade, H. Kano, S. Kimura, A. Yoshida, and K. Oda, "Development of a video-rate stereo machine," in Intelligent Robots and Systems 95.'Human Robot Interaction and Cooperative Robots', Proceedings. 1995 IEEE/RSJ International Conference on, 1995, pp. 95-100.
[42] Stanford Computer Graphics Laboratory, "Stanford Light Field Archive," 2008.