| Graduate student: | 陳志豪 CHEN, JHIH-HAO |
|---|---|
| Thesis title: | Causal mediation analysis of immune cell composition in various cancers based on deconvolution model |
| Advisors: | 馬瀰嘉 Ma, Mi-Chia; 戴安順 Tai, An-Shun |
| Degree: | Master |
| Department: | College of Management - Department of Statistics |
| Year of publication: | 2025 |
| Academic year of graduation: | 113 |
| Language: | Chinese |
| Number of pages: | 105 |
| Keywords: | CIBERSORT algorithm, ESTIMATE algorithm, Cox proportional hazards model, mediation analysis, Wilcoxon rank-sum test, paired t test |
Cancer is one of the leading causes of death worldwide, and immune cells play an important role in the tumor microenvironment, influencing tumor growth, metastasis, and treatment response. Understanding how immune cells relate to clinical characteristics and patient survival is essential for optimizing treatment strategies. This study uses RNA sequencing data from The Cancer Genome Atlas (TCGA) database and applies the CIBERSORT algorithm to estimate the proportions of immune cell types in samples from different cancers. CIBERSORT resolves the proportions of the various immune cell types within a tumor sample, providing detailed information about the tumor immune environment. After the immune cell proportions are estimated, the ESTIMATE algorithm is used to estimate tumor purity, and the proportions are adjusted for purity so that they more accurately reflect the immune status of the tumor microenvironment.
First, the Wilcoxon rank-sum test and the paired t test are used to compare the proportions of each immune cell type across cancer samples and to identify the immune cell characteristics of different cancer types. For survival analysis, the Cox proportional hazards model is used to evaluate the association between immune cell proportions and patient survival time, clarifying how the presence and proportion of specific immune cell types relate to patient survival and providing a basis for clinical decision-making. Finally, mediation analysis is used to infer how immune cell proportions act as mediating variables in the relationship between patients' clinical characteristics and their survival status and survival time.
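As an illustration of the group-comparison step named in the abstract, the following is a minimal sketch of a two-sided Wilcoxon rank-sum test written in plain Python. It is not the thesis's implementation: the record does not state which software was used, the function name `rank_sum_test` is hypothetical, the p-value relies on the normal approximation with a tie correction (an assumption that is reasonable only for moderately large samples), and the cell fractions in the usage lines are made-up numbers.

```python
import math
from statistics import NormalDist


def rank_sum_test(x, y):
    """Two-sided Wilcoxon rank-sum test (normal approximation, tie-corrected).

    Returns (W, z, p), where W is the sum of the ranks of sample x
    in the pooled, sorted data.
    """
    n1, n2 = len(x), len(y)
    n = n1 + n2
    # Pool both samples, remembering which group each value came from.
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])

    ranks = [0.0] * n
    tie_term = 0.0  # sum of (t^3 - t) over groups of tied values
    i = 0
    while i < n:
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        # Positions i..j-1 hold ranks i+1..j; tied values share the midrank.
        midrank = (i + 1 + j) / 2
        for k in range(i, j):
            ranks[k] = midrank
        t = j - i
        tie_term += t ** 3 - t
        i = j

    w = sum(r for r, (_, g) in zip(ranks, pooled) if g == 0)
    mean_w = n1 * (n + 1) / 2
    var_w = n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1)))
    z = (w - mean_w) / math.sqrt(var_w)
    p = 2 * (1 - NormalDist().cdf(abs(z)))
    return w, z, p


# Hypothetical fractions of one immune cell type in two cancer types.
cancer_a = [0.10, 0.12, 0.15, 0.18, 0.20]
cancer_b = [0.02, 0.03, 0.05, 0.06, 0.08]
w, z, p = rank_sum_test(cancer_a, cancer_b)
print(f"W = {w}, z = {z:.3f}, p = {p:.4f}")
```

For samples as small as the hypothetical ones above, an exact permutation p-value would normally be preferable to the normal approximation; the sketch only shows the mechanics of ranking the pooled values and comparing the rank sum with its null expectation.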
On campus: publicly available from 2030-02-07