簡易檢索 / 詳目顯示

研究生: 黃庭筠
Huang, Ting-Yun
論文名稱: 應用交叉禎注意力機制於遠程光體積變化描記圖法訊號細節保留之萃取網路
A Cross Frame Attention Network for Detail-Preserving Remote PPG Signal Extraction
指導教授: 李順裕
Lee, Shuenn-Yuh
共同指導教授: 黃春融
Huang, Chun-Rong
陳儒逸
Chen, Ju-Yi
學位類別: 碩士
Master
系所名稱: 電機資訊學院 - 電機工程學系
Department of Electrical Engineering
論文出版年: 2024
畢業學年度: 112
語文別: 英文
論文頁數: 65
中文關鍵詞: 遠程光體積變化描記圖法心率分析自注意力機制脈波表示
外文關鍵詞: Remote photoplethysmography (rPPG), heart rate estimation, self-attention mechanism, pulse representation
相關次數: 點閱:57下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 摘要 I Abstract III 誌謝 V Table of Contents VI Table Captions VIII Figure Captions IX Chapter 1 Introduction 1 1.1 Motivation 1 1.2 Research Background 3 1.3 Thesis Organization 6 Chapter 2 Related Works 7 2.1 Conventional methods 7 2.2 Deep learning-based methods 7 2.2.1 2D CNN-based methods 8 2.2.2 Spatial-temporal methods 10 2.2.3 Attention-based methods 11 Chapter 3 The Proposed Method 15 3.1 Overview 15 3.2 Preprocessing 16 3.3 The First Stage 18 3.3.1 Local Feature Extractor (LFE) 19 3.3.2 Global Feature Extractor (GFE) 21 3.4 The Second Stage 23 3.5 Loss Functions and The Two Step Training Process 24 3.5.1 Loss Functions 24 3.5.2 Training process 25 Chapter 4 Measurement and Results 27 4.1 Datasets and Settings 27 4.1.1 Datasets 27 4.1.2 Settings 28 4.2 Experiments 29 4.2.1 Cross-Dataset Validation of the 1st Stage 29 4.2.2 Intra-Dataset Validation of the whole model 34 Chapter 5 Conclusions 45 References 47

    [1] R. Stricker, S. Müller and H. -M. Gross, "Non-contact video-based pulse rate measurement on a mobile service robot," in Proc. The 23rd IEEE International Symposium on Robot and Human Interactive Communication, pp. 1056-1062, 2014.
    [2] S. Bobbia, R. Macwan, Y. Benezeth, A. Mansouri, J. Dubois, "Unsupervised skin tissue segmentation for remote photoplethysmography," Pattern Recognition Letters, vol. 124, pp. 82-90, 2019.
    [3] T. Tamura, Y. Maeda, M. Sekine, M. Yoshida, "Wearable Photoplethysmographic Sensors—Past and Present," Electronics, vol. 3, pp. 282-302, 2014.
    [4] Hanguang Xiao, Tianqi Liu, Yisha Sun, Yulin Li, Shiyi Zhao, Alberto Avolio, "Remote photoplethysmography for heart rate measurement: A review," Biomedical Signal Processing and Control, vol. 88, 2024.
    [5] Verkruysse W, Svaasand LO, Nelson JS, “Remote plethysmographic imaging using ambient light,” Opt Express, vol.16, pp. 21434-21445, Dec. 2008.
    [6] Ming-Zher Poh, Daniel J. McDuff, and Rosalind W. Picard, "Non-contact, automated cardiac pulse measurements using video imaging and blind source separation," Opt. Express, vol. 18, pp. 10762-10774, 2010.
    [7] M. Lewandowska, J. Rumiński, T. Kocejko and J. Nowak, "Measuring pulse rate with a webcam — A non-contact method for evaluating cardiac activity," in Proc. Federated Conference on Computer Science and Information Systems (FedCSIS), pp. 405-410, 2011.
    [8] G. de Haan and V. Jeanne, "Robust Pulse Rate From Chrominance-Based rPPG," IEEE Transactions on Biomedical Engineering, vol. 60, no. 10, pp. 2878-2886, Oct. 2013.
    [9] Wang, W., den Brinker, A. C., Stuijk, S., & de Haan, G., “Algorithmic principles of remote-PPG,” IEEE Transactions on Biomedical Engineering, vol. 64, pp. 1479-1491, 2017.
    [10] W. Zhang, “Shift-invariant pattern recognition neural network and its optical architecture,” in Proc. Annu. Conf. Jpn. Soc. Appl. Phys., pp. 734, 1988.
    [11] Špetlík, Radim, Vojtech Franc, and Jirí Matas, "Visual heart rate estimation with convolutional neural network," in Proc. British Machine Vision Conference, 2018.
    [12] Chen, Weixuan, and Daniel McDuff, "Deepphys: Video-based physiological measurement using convolutional attention networks." in Proc. European Conference on Computer Vision (ECCV) , pp. 356–373, 2018.
    [13] Zhicheng Ji, Hongkai Ji, “TSCAN: Pseudo-time reconstruction and evaluation in single-cell RNA-seq analysis ,” Nucleic Acids Research, vol. 44, no.13, pp. e117, July 2016.
    [14] Lin, Ji, Chuang Gan, and Song Han, "Tsm: Temporal shift module for efficient video understanding," in Proc. IEEE/CVF International Conference on Computer Vision, 2019, pp. 7083-7093.
    [15] Li, Jianwei, Zitong Yu, and Jingang Shi, "Learning motion-robust remote photoplethysmography through arbitrary resolution videos," in Proc. AAAI Conference on Artificial Intelligence, vol. 37, no. 1, 2023.
    [16] Niu, Xuesong, et al., "Rhythmnet: End-to-end heart rate estimation from face via spatial-temporal representation," IEEE Transactions on Image Processing, vol. 29, pp. 2409-2423, 2019.
    [17] Dong, Yuhang, Gongping Yang, and Yilong Yin. "Drnet: Decomposition and reconstruction network for remote physiological measurement." arXiv preprint, arXiv:2206.05687, 2022.
    [18] Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, and Manohar Paluri, “Learning spatiotemporal features with 3d convolutional networks,” in Proceedings of the IEEE International Conference on Computer Vision, pp. 4489–4497, 2015.
    [19] Sepp Hochreiter and Jürgen Schmidhuber, “Long short-term memory,” Neural computation, vol 9, no.8, pp.1735–1780, 1997.
    [20] Yu, Zitong, Xiaobai Li, and Guoying Zhao, "Remote photoplethysmograph signal measurement from facial videos using spatio-temporal networks," arXiv preprint, arXiv:1905.02419, 2019.
    [21] Zhao, Changchen, et al., "Learning spatio-temporal pulse representation with global-local interaction and supervision for remote prediction of heart rate," IEEE Journal of Biomedical and Health Informatics, vol. 28, no. 2, pp. 609-620, Feb. 2024.
    [22] M. Hu, F. Qian, D. Guo, X. Wang, L. He and F. Ren, "ETA-rPPGNet: Effective Time-Domain Attention Network for Remote Heart Rate Measurement," IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1-12, 2021.
    [23] M. Das, M. K. Bhuyan and L. N. Sharma, "Time–Frequency Learning Framework for rPPG Signal Estimation Using Scalogram-Based Feature Map of Facial Video Data," IEEE Transactions on Instrumentation and Measurement, vol. 72, pp. 1-10, 2023.
    [24] X. Liu, W. Wei, H. Kuang, X. Ma, “Heart Rate Measurement Based on 3D Central Difference Convolution with Attention Mechanism,” Sensors, vol. 22, no. 2, pp. 688, 2022.
    [25] Zhao, Changchen, et al., "JAMSNet: A remote pulse extraction network based on joint attention and multi-scale fusion," IEEE Transactions on Circuits and Systems for Video Technology, vol. 33, no. 6, pp. 2783-2797, June 2023.
    [26] Vaswani, Ashish, et al., "Attention is all you need," in Proc. Advances in Neural Information Processing Systems, vol. 30, 2017.
    [27] Dosovitskiy, Alexey, et al., "An image is worth 16x16 words: Transformers for image recognition at scale." arXiv preprint, arXiv:2010.11929, 2020.
    [28] Qian, Wei, et al., "Dual-path tokenlearner for remote photoplethysmography-based physiological measurement with facial videos," IEEE Transactions on Computational Social Systems, vol. 11, no. 3, pp. 4465-4477, June 2024.
    [29] Yu, Zitong, et al., "Physformer: Facial video-based physiological measurement with temporal difference transformer," in Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 4176-4186.
    [30] Yu, Zitong, et al. "Physformer++: Facial video-based physiological measurement with slowfast temporal difference transformer." International Journal of Computer Vision, vol. 131, no. 6, pp. 1307-1330, 2023.
    [31] Liu, Xin, et al., "Efficientphys: Enabling simple, fast and accurate camera-based cardiac measurement," in Proc. IEEE/CVF Winter Conference on Applications of Computer Vision, 2023, pp. 4997-5006.
    [32] Loshchilov, Ilya, and Frank Hutter, "Decoupled weight decay regularization," arXiv preprint, arXiv:1711.05101, 2017.
    [33] Liu, Xin, et al., "rppg-toolbox: Deep remote ppg toolbox," in Proc. Advances in Neural Information Processing Systems, vol. 36, 2024.

    無法下載圖示 校內:2029-07-24公開
    校外:2029-07-24公開
    電子論文尚未授權公開,紙本請查館藏目錄
    QR CODE