| Field | Value |
|---|---|
| Graduate Student | 陳怡君 (Chen, Yi-Chun) |
| Thesis Title | AICURE: Pre-training of Cross-Modality Transformer Encoder for ICU Electronic Health Records Prediction |
| Advisor | 高宏宇 (Kao, Hung-Yu) |
| Degree | Master |
| Department | Institute of Medical Informatics, College of Electrical Engineering and Computer Science |
| Year of Publication | 2021 |
| Academic Year of Graduation | 109 |
| Language | English |
| Number of Pages | 66 |
| Keywords | Prediction Tasks on Electronic Health Records; Cross-Modality Learning; Natural Language Processing; Pre-training and Fine-tuning Framework |
In recent years, deep learning has increasingly been applied in the medical field to help solve clinical problems. Although deep learning research on electronic health records has grown year by year, the methodologies and task definitions are highly divergent, and most studies rely on a single data type, medical codes such as diagnosis codes or drug codes, to learn a patient's health status and make predictions. These problems limit the applicability of deep learning models in the medical domain.
To address these problems, we propose an ICU record encoder (AICURE): an encoder pre-trained on individual visit records to learn good visit vector representations, and then fine-tuned on each electronic health record (EHR) prediction task. The dataset consists of visit records containing medical codes, clinical notes, and patient information. Within the record encoder we use a cross-modality learning mechanism to learn from these different data domains, and we introduce concepts from natural language processing to handle medical-history data (i.e., sequences of multiple visit records). We also design four prediction tasks, based on different clinical scenarios, that better match real-world conditions.
Because the pre-trained record encoder yields good visit vector representations, our model can be applied broadly to many types of EHR prediction tasks and achieves outstanding performance on all four tasks. In addition, by visually analyzing the model's inference process, the record encoder can provide interpretable prediction results. Finally, we present the performance and capabilities of the record encoder through case studies.
Recently, there has been a growing body of research applying deep learning to electronic health records (EHR). However, the methodologies and task definitions of these works are highly diverse, and most past works depended solely on medical codes to learn a patient's health status and make predictions. These problems limit the applicability of deep learning models in the medical domain.
To address these difficulties, we propose AICURE (a ICU record encoder), an encoder pre-trained on individual visit records to learn good visit vector representations and then fine-tuned on each EHR prediction task. Our dataset comprises visit records containing medical codes, clinical notes, and patient demographics. We adopt cross-modality learning to combine information from these different domains, and we introduce concepts from natural language processing to model a patient's medical history, i.e., the sequence of visit records. We also design four EHR tasks based on actual clinical scenarios for more realistic task definitions.
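To make the cross-modality setup above concrete, here is a minimal sketch, assuming PyTorch, of how an encoder over a single visit record could be structured: medical-code embeddings, clinical-note token embeddings, and a projected demographics vector are concatenated into one sequence, tagged with modality-type embeddings, and encoded by a Transformer. The names, vocabulary sizes, dimensions, and [CLS]-style pooling here (`VisitEncoder`, `n_codes`, `d_model`, etc.) are illustrative assumptions, not the thesis's actual implementation.

```python
# A minimal sketch (NOT the thesis implementation) of a cross-modality
# Transformer encoder over one ICU visit record. All sizes are assumptions.
import torch
import torch.nn as nn

class VisitEncoder(nn.Module):
    def __init__(self, n_codes=2000, n_note_tokens=30000, demo_dim=8,
                 d_model=128, n_heads=4, n_layers=2):
        super().__init__()
        self.code_emb = nn.Embedding(n_codes, d_model)        # diagnosis/drug codes
        self.note_emb = nn.Embedding(n_note_tokens, d_model)  # clinical-note tokens
        self.demo_proj = nn.Linear(demo_dim, d_model)         # patient demographics
        # modality-type embeddings distinguish the input sources
        self.type_emb = nn.Embedding(4, d_model)              # 0=CLS, 1=code, 2=note, 3=demo
        self.cls = nn.Parameter(torch.zeros(1, 1, d_model))   # [CLS]-style summary slot
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, n_layers)

    def forward(self, codes, note_tokens, demographics):
        # codes: (B, Lc) long, note_tokens: (B, Ln) long, demographics: (B, demo_dim)
        # (positional embeddings for note tokens are omitted for brevity)
        B = codes.size(0)
        parts = [
            self.cls.expand(B, -1, -1) + self.type_emb.weight[0],
            self.code_emb(codes) + self.type_emb.weight[1],
            self.note_emb(note_tokens) + self.type_emb.weight[2],
            self.demo_proj(demographics).unsqueeze(1) + self.type_emb.weight[3],
        ]
        h = self.encoder(torch.cat(parts, dim=1))  # one joint sequence
        return h[:, 0]  # the [CLS] position serves as the visit vector

# usage
enc = VisitEncoder()
visit_vec = enc(torch.randint(0, 2000, (2, 10)),
                torch.randint(0, 30000, (2, 50)),
                torch.randn(2, 8))
print(visit_vec.shape)  # torch.Size([2, 128])
```

Under this reading, pre-training would apply a BERT-style masked objective over the code and note positions so the visit vector absorbs information from all modalities before any fine-tuning.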
Because the pre-trained AICURE learns good visit vectors, it can be applied to many EHR tasks, and it achieves competitive performance on all four tasks. Moreover, our model can provide interpretable predictions by visualizing its inference procedure. Finally, we analyze the performance and capabilities of AICURE through case studies.
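As a sketch of the fine-tuning stage, under the same assumptions as above, the snippet below treats the visit vectors of a patient's history like a token sequence (the natural-language-processing analogy in the abstract) and attaches a task-specific head. The binary classification task and all hyperparameters here are assumptions for illustration.

```python
# A minimal fine-tuning sketch: a small Transformer over the sequence of
# visit vectors produced by the pre-trained encoder, plus a task head.
import torch
import torch.nn as nn

class HistoryClassifier(nn.Module):
    def __init__(self, d_model=128, n_heads=4, n_layers=2, n_classes=2):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.history_encoder = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, n_classes)  # e.g. a binary EHR prediction task

    def forward(self, visit_vectors):
        # visit_vectors: (B, n_visits, d_model), one vector per past visit
        h = self.history_encoder(visit_vectors)
        return self.head(h.mean(dim=1))  # pool over visits, then classify

logits = HistoryClassifier()(torch.randn(2, 5, 128))
print(logits.shape)  # torch.Size([2, 2])
```

Because the task-specific part is only this small head, swapping in a different EHR prediction task only changes `n_classes` and the labels, which is what makes the pre-trained visit vectors reusable across tasks.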