
Graduate student: 顏伯丞
Yen, Po-Cheng
Thesis title: 結合U型卷積神經網路與變分自編碼器之雙權重動態線性插值策略於腦部核磁共振運動偽影校正
Motion Artifact Correction in Brain MRI Combining U-Net and VAE with Dynamic Dual-Weight Linear Interpolation
Advisor: 洪昌鈺
HORNG, MING-HUWI
Degree: Master's
Department: College of Electrical Engineering and Computer Science - Department of Computer Science and Information Engineering
Year of publication: 2025
Academic year of graduation: 113 (ROC calendar)
Language: Chinese
Number of pages: 62
Keywords (Chinese): 核磁共振, 運動偽影校正, 變分自編碼器, U型卷積神經網路, 2.5維, 深度學習, 影像重建, 雙權重線性插值動態調整
Keywords (English): MRI, Motion Artifact Correction, Variational Autoencoder, U-Net, 2.5D, Deep Learning, Image Reconstruction, Dynamic Dual-Weight Linear Interpolation
    Interpreting medical images is time-consuming and depends heavily on experience, especially when MRI scans are distorted. AI can correct artifacts and motion distortion automatically, which shortens interpretation and reduces repeat scans. About 10–42% of MRI artifacts stem from patient tremor, heartbeat, or respiration, and about 20% require a repeat examination, which adds 20–30 minutes on average, raises costs by roughly 15%, and causes an annual loss of more than US$140,000 per scanner. Traditional correction relies on laborious manual work, hardware investments of hundreds of thousands of dollars, and lengthy post-processing, so it is difficult to balance image quality with clinical efficiency.

    This study addresses artifact distortion caused by patient motion in brain MRI. Using a publicly available T1-weighted brain image dataset, it builds a deep-learning motion-artifact correction model based on a Variational Autoencoder (VAE) and integrated with a U-shaped Convolutional Neural Network (U-Net). To improve reconstruction quality, the study introduces a 2.5D multi-slice input strategy and designs a dynamic dual-weight linear-interpolation mechanism that balances the reconstruction and regularization loss terms while progressively increasing the weight of the VAE's KL divergence term.

    Experiments show that the proposed model significantly outperforms both the Motion Correction Network (MC-Net) and the Deep Density Prior Reconstruction VAE (DDP Recon VAE) on the Structural Similarity Index Measure (SSIM), Peak Signal-to-Noise Ratio (PSNR), and Normalized Root Mean Square Error (NRMSE). It restores fine detail and suppresses artifacts effectively, with the largest gains on high-motion correction tasks, which the study presents as evidence of the model's generalization capability and training stability.
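    The abstract describes the dual-weight mechanism only at a high level, so the following is a minimal sketch of one way such a schedule could look, not the thesis's actual rule. It assumes that both loss weights are linearly interpolated over training epochs; the function names (`lerp`, `dual_weights`, `vae_loss`) and the start and end values of each weight are hypothetical placeholders.

```python
def lerp(start, end, t):
    """Linearly interpolate between start and end for t in [0, 1]."""
    return start + (end - start) * t


def dual_weights(epoch, total_epochs, rec_range=(1.0, 0.5), kl_range=(0.0, 1.0)):
    """Return (reconstruction weight, KL weight) for the given epoch.

    rec_range and kl_range are hypothetical (start, end) pairs chosen for
    illustration; they are not values reported in the thesis.
    """
    if total_epochs < 2:
        raise ValueError("total_epochs must be at least 2")
    # Training progress, clamped to [0, 1].
    t = min(max(epoch / (total_epochs - 1), 0.0), 1.0)
    w_rec = lerp(rec_range[0], rec_range[1], t)
    w_kl = lerp(kl_range[0], kl_range[1], t)
    return w_rec, w_kl


def vae_loss(rec_loss, kl_loss, epoch, total_epochs):
    """Combine the two loss terms using the scheduled weights."""
    w_rec, w_kl = dual_weights(epoch, total_epochs)
    return w_rec * rec_loss + w_kl * kl_loss
```

    Under these assumed ranges the KL weight starts at zero and grows to one, so early epochs prioritize reconstruction and later epochs apply progressively stronger regularization.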

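    Two of the evaluation metrics named in the abstract, PSNR and NRMSE, can be sketched in a few lines. This is an illustrative sketch only: it assumes images flattened to equal-length sequences of intensities, a known intensity range for PSNR, and NRMSE normalized by the intensity range of the reference image, which is one of several common conventions and may differ from the one used in the thesis. SSIM is omitted because it requires local windowed statistics.

```python
import math


def mse(reference, estimate):
    """Mean squared error between two equal-length intensity sequences."""
    if len(reference) != len(estimate) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)


def psnr(reference, estimate, data_range=1.0):
    """Peak signal-to-noise ratio in dB; data_range is the assumed intensity span."""
    err = mse(reference, estimate)
    if err == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(data_range ** 2 / err)


def nrmse(reference, estimate):
    """RMSE normalized by the reference intensity range (an assumed convention)."""
    span = max(reference) - min(reference)
    if span == 0:
        raise ValueError("reference image has zero intensity range")
    return math.sqrt(mse(reference, estimate)) / span
```

    Higher PSNR and lower NRMSE both indicate a reconstruction closer to the motion-free reference.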
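    Abstract (Chinese)
    Extended Abstract
    Acknowledgements
    Table of Contents
    List of Tables
    List of Figures
    Chapter 1 Introduction
      1.1 Research Background and Motivation
      1.2 Research Objectives
    Chapter 2 Literature Review
      2.1 Variational Autoencoder
      2.2 U-Net
      2.3 2.5D
      2.4 Deep Density Prior Reconstruction VAE
      2.5 Motion Correction Network
    Chapter 3 Methodology
      3.1 Research Workflow
      3.2 Study Subjects
      3.3 Data Preprocessing
      3.4 Dataset Splitting
      3.5 Motion Artifact Correction
      3.6 Data Validation
      3.7 Experimental Equipment
      3.8 Experimental Design
        3.8.1 Comparison of Dynamic Dual-Weight Adjustment
        3.8.2 Comparison with Existing Models
        3.8.3 Validation Procedure
    Chapter 4 Analysis and Results
      4.1 Analysis
      4.2 Analysis of Dynamic Dual-Weight Adjustment
      4.3 Comparative Analysis with Existing Models
      4.4 Summary of Results
    Chapter 5 Conclusions and Future Directions
      5.1 Conclusions
      5.2 Future Research Directions
    References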


    Full-text access: on campus, available immediately; off campus, available immediately