研究生: |
林邦昱 Lin, Pang-Yu |
---|---|
論文名稱: |
利用SIFT及三角化重建3D點雲圖的一硬體實現 3D Point Cloud Reconstruction via SIFT and Triangulation and Its FPGA Implementation |
指導教授: |
陳進興
Chen, Chin-Hsing |
學位類別: |
碩士 Master |
系所名稱: |
電機資訊學院 - 電腦與通信工程研究所 Institute of Computer & Communication Engineering |
論文出版年: | 2023 |
畢業學年度: | 111 |
語文別: | 英文 |
論文頁數: | 53 |
中文關鍵詞: | 現場可程式化邏輯閘陣列 、尺度不變特徵轉換 、特徵匹配 、運動恢復結構 、3D點雲重建 、相機標定 、三角測量 、影像處理 、點雲 |
外文關鍵詞: | FPGA, SIFT, feature matching, structure from motion, 3D point cloud reconstruction, camera calibration, triangulation, image processing, point cloud |
相關次數: | 點閱:108 下載:0 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
本論文提出一個以FPGA實現從兩張視角圖重建出物體3D點雲圖的系統。此一系統主要由雙模組構成。第一模塊使用RS232 將兩張圖像傳輸到 FPGA 中,並利用Scale-invariant feature transform (SIFT) 演算法計算出圖像中的特徵點座標和描述子。接下來,我們將對兩張圖像的特徵點進行匹配。最後將匹配點的座標通過 RS232 傳輸回 PC 端。
由於FPGA 記憶體的大小限制,我們的系統只能處理大小為256*256的兩張影像進行特徵點的匹配。為了增加匹配點的數量,從而獲得更準確的三角測量結果並進行點雲的可視化展示,本論文將大小為N*N的圖像分解成大小為(N/256)*(N/256)張256*256的子圖像,並對所有的子圖像進行特徵匹配。
第二個模組是三角測量模組。首先,我們運用相機標定演算法來計算相機的內部參數。接著,我們透過RS232將第一個模塊計算出的特徵點座標傳送至FPGA。同時,利用三角測量方法和內部參數,我們計算出了點雲的資訊。最後,將點雲通過 RS232 傳輸回 PC 端並在軟體中進行顯示。
由實驗解果顯示,我們成功的還原了圖像的3D點雲圖,並且由視覺化中顯示了我們的3D點雲圖的還原度是精準的,也成功證明了FPGA是可以做到3D點雲圖的重建。
This thesis presents a system for structure from motion (SFM) from two views via scale-invariant feature transform (SIFT) and triangulation algorithm with FPGA Implementation. The system consists of two modules. The first module first transmits two images into FPGA via RS232, finds the feature points and its descriptor via SIFT, and then implements feature matching, to find the coordinates of the matching point, finally transmits the coordinate with a matching point to the PC via RS232.
Owing to the limit of the memory on FPGA, our system can only implement feature matching on two 256*256 images. In order to get more matching points to get the better vision after triangulation. We divide N*N image to (N/256)*(N/256) subimages, and implement the feature matching for all subimages, so that we can get more matching points.
The second module is the triangulation module. We first get the intrinsic matrix for our camera via camera calibration, then transmit the coordinates from the first module to FPGA via RS232, then calculate the point clouds by triangulation algorithm with intrinsic matrix. Finally, the point cloud is transmitted back to the PC end via RS232 and displayed on the screen. The experimental results show that we have successfully reconstructed the 3D point cloud from two images. The accuracy of our reconstructed 3D point cloud was demonstrated through visualization, which affirms that FPGA can indeed achieve accurate 3D reconstruction.
[1] C. Freeman, "Multiple methods beyond triangulation: collage as a methodological framework in geography," Geografiska Annaler: Series B, Human Geography, 2020.
[2] C. H. Chien, C. J. Chien and C. C. Hsu, "Hardware-Software Co-Design of an Image Feature Extraction and Matching Algorithm," 2019 2nd International Conference on Intelligent Autonomous Systems (ICoIAS), Singapore, pp. 37-41, 2019.
[3] D. Lowe, "Distinctive image features from scale-invariant keypoints," International Journal of Computer Vision, 2004.
[4] F. Porikli and A. Divakaran, "Multi-camera calibration, object tracking and query generation," 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698), Baltimore, MD, USA, pp. I-653, 2003.
[5] H. Nejad and Z. Nasri, "Adaptive RANSAC and extended region-growing algorithm for object recognition over remote-sensing images," Multimed Tools Appl, 2022.
[6] J. J. Moré, "The Levenberg-Marquardt algorithm: Implementation and theory," Numerical Analysis. Lecture Notes in Mathematics, vol. 630. Springer, Berlin, Heidelberg, 1978.
[7] J. Xu, Z. P. Fang, A. A. Malcolm and H. Wang, "A robust close-range photogrammetric system for industrial metrology," 7th International Conference on Control, Automation, Robotics and Vision, 2002. ICARCV 2002, Singapore, pp. 114-119, vol.1, 2002.
[8] J. Zhang, Y. Xiu, "Image stitching based on human visual system and SIFT algorithm," Vis Comput, 2023.
[9] K. A. Neumann, "A Structure From Motion Pipeline for Orthographic Multi-View Images," 2022 IEEE International Conference on Image Processing (ICIP), Bordeaux, France, pp. 1181-1185, 2022.
[10] L. Yao, H. Feng, Y. Zhu, Z. Jiang, D. Zhao and W. Feng, "An architecture of optimised SIFT feature detection for an FPGA implementation of an image matcher," 2009 International Conference on Field-Programmable Technology, Sydney, NSW, Australia, pp. 30-37, 2009.
[11] M. Wang, X. Du, Z. Chang, K. Wang, "A Scene Perception Method Based on MobileNetV3 for Bionic Robotic Fish," Neural Computing for Advanced Applications. NCAA 2022. Communications in Computer and Information Science, vol. 1638, Springer, Singapore, 2022.
[12] P. Kreowsky and B. Stabernack, "A Full-Featured FPGA-Based Pipelined Architecture for SIFT Extraction," IEEE Access, vol. 9, pp. 128564-128573, 2021.
[13] S. Battiato, G. Gallo, G. Puglisi and S. Scellato, "SIFT Features Tracking for Video Stabilization," 14th International Conference on Image Analysis and Processing (ICIAP 2007), Modena, Italy, pp. 825-830, 2007.
[14] S. Gibson, J. Cook, T. Howard, R. Hubbold and D. Oram, "Accurate camera calibration for off-line, video-based augmented reality," Proceedings. International Symposium on Mixed and Augmented Reality, Darmstadt, Germany, pp. 37-46, 2002.
[15] S. Se and P. Jasiobedzki, "Photo-realistic 3D model reconstruction," Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006, Orlando, FL, USA, pp. 3076-3082, 2006.
[16] V. Abalakin, M. Chubey, G. Eroshkin, and I. Kopylov, "Triangulation Measurements in the Solar System," International Astronomical Union Colloquium, 2000.
[17] W. S. Lin, Y. L. Wu, W. C. Hung, and C. Y. Tang, "A Study of Real-Time Hand Gesture Recognition Using SIFT on Binary Images," Advances in Intelligent Systems and Applications - Volume 2. Smart Innovation, Systems and Technologies, vol. 21. Springer, Berlin, Heidelberg, 2013