簡易檢索 / 詳目顯示

研究生: 曾冠傑
Tseng, Kuan-Chieh
論文名稱: 植物長鏈非編碼RNA資料庫建構與其調控特性之探討
Construction of a database and investigation of regulatory properties for plant long non-coding RNA
指導教授: 黃浩仁
Huang, Hao-Jen
張文綺
Chang, Wen-Chi
學位類別: 博士
Doctor
系所名稱: 生物科學與科技學院 - 生命科學系
Department of Life Sciences
論文出版年: 2023
畢業學年度: 111
語文別: 英文
論文頁數: 63
中文關鍵詞: 長鏈非編碼RNA表現譜功能網路高溫效應熱逆境
外文關鍵詞: long non-coding RNA, expression profile, functional network, high-temperature effect, heat stress
相關次數: 點閱:122下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 長鏈非編碼RNA (long non-coding RNA,lncRNA)在許多生物過程中都有重要調控功能,雖然目前已有許多植物lncRNA被發掘,但大部分的調控機制仍然不清楚。由於探討lncRNA與編碼基因之間的共表現關係有助於了解lncRNA的調控功能,因此在本研究的第一部分,我們建構了一個植物lncRNA的表現譜資料庫JustRNA。目前收錄了來自80種植物的1,024,354 條lncRNAs,表現量資料的部分則是針對六個模式植物收錄了825種情況的3,692組次世代定序資料。除了基本的表現量查詢,JustRNA也提供功能性分析幫助了解lncRNA的調控機制。透過基因組相關性分析和微小RNA標的分析可以建構lncRNA的調控網路,找出可能會和lncRNA有交互作用的鄰近基因和微小RNA。接著進行共表現分析並建構這群基因的共表現網路,將能提高交互作用的可信度。另一方面,JustRNA還整合了轉錄因子和組蛋白修飾的染色質免疫沉澱定序資料,藉此探討lncRNA上游的轉錄調控機制。對於植物lncRNA有興趣的學者將能利用JustRNA獲得調控關係的重要資訊。目前JustRNA資料庫可由以下網址免費取得: http://JustRNA.itps.ncku.edu.tw。本研究的第二部分我們提出了探討lncRNA的分析流程,並實際應用在熱逆境的研究上。由於全球暖化可能會對農作物造成熱逆境的傷害,因此了解lncRNA在長期高溫效應中的調控機制是很重要的課題。為模擬長期高溫效應,我們將阿拉伯芥種植在28 ℃與其最適生長溫度23 ℃進行次世代轉錄體定序的比較分析。根據定序資料的分析結果,lncRNA和編碼基因之間的基因特徵存在明顯差異,可惜這些基因特徵無法用於區別與長期熱效應相關或是沒有改變的lncRNA。為了進一步了解與長期熱效應相關的lncRNA可能的調控功能,我們利用JustRNA調控網路的功能分析這些lncRNA。在共表現網路中,擁有最多交互作用的lncRNA AT2G05215.1 有三個在基因組上鄰近的基因同時也參與在長期熱反應,因此這四個基因很適合做進一步的研究。總結來說,本研究建立了一個植物專屬的lncRNA資料庫,並提出一個探討lncRNA調控特性的分析流程,這將有助於科學家更有效的研究植物lncRNA。

    Long noncoding RNAs (lncRNAs) are regulatory RNAs involved in numerous biological processes. Many plant lncRNAs have been identified, but their regulatory mechanisms remain largely unknown. A resource that enables the investigation of lncRNA activity under various conditions is required because the co-expression between lncRNAs and protein-coding genes may reveal the effects of lncRNAs. In the first part of this study, the JustRNA, an expression profiling resource for plant lncRNAs was developed. The platform currently contains 1,024,354 lncRNA annotations for 80 plant species. In addition, it includes 3,692 RNA-seq samples derived from 825 conditions in six model plants. Functional network reconstruction provides insight into the regulatory roles of lncRNAs. Genomic association analysis and microRNA target prediction can be employed to depict potential interactions with nearby genes and microRNAs, respectively. Subsequent co-expression analysis can be employed to strengthen confidence in the interactions among genes. Chromatin immunoprecipitation sequencing data of transcription factors and histone modifications were integrated into the JustRNA platform to identify the transcriptional regulation of lncRNAs in several plant species. The JustRNA platform provides researchers with valuable insight into the regulatory mechanisms of plant lncRNAs. JustRNA is a free platform that can be accessed at http://JustRNA.itps.ncku.edu.tw. In the second part of this study, analysis pipelines for investigation of lncRNAs were proposed and applied to heat stress research. Global warming could cause heat stress on agricultural crops and thus it is necessary to explore the regulatory mechanisms of lncRNAs involved in long-term high-temperature effects. To simulate these effects, we conducted a comparative analysis of the next-generation transcriptome sequencing of Arabidopsis thaliana grown at 28°C and its optimal growth temperature of 23°C. Results of gene feature comparison demonstrated significant different between lncRNAs and protein-coding genes. However, the features could not be used to distinguish between long-term high-temperature effects and no change lncRNAs. Further functional analyses were carried out for long-term high-temperature effects related lncRNAs with “Network Search” of JustRNA. A lncRNAs AT2G05215.1 with the most interactions was observed to co-expressed with three nearby genes which were also involved in long-term high-temperature effects. Therefore, they could be good candidate for further investigation. Overall, this study developed a database for plant lncRNAs and proposed analysis pipelines for discovering regulatory properties. We believe that these will be helpful for scientists to efficiently study insight into plant lncRNAs.

    中文摘要 I Abstract II 誌謝 IV Index of contents V List of Tables VII List of Figures VIII List of Appendixes IX 1. Introduction 1 1.1 Current resources for plant lncRNAs 2 1.2 The lncRNAs involved in heat stress response 3 1.3 The specific aims of this study 4 2. Material and Methods 6 2.1 Construction of a database of plant lncRNA expression profiles and functional networks 6 2.1.1 Integration of gene annotation data and reference sequences 6 2.1.2 RNA-seq data collection and analysis 7 2.1.3 Differentially expressed gene analysis 8 2.1.4 Z-score transformation of gene expression level 9 2.1.5 Integration of ChIP-seq occupancy of regulatory factors 9 2.1.6 Functional network analysis 10 2.1.7 Integration of genetic markers and quantitative trait loci 10 2.1.8 Web interface construction 11 2.2 The lncRNA landscape of Arabidopsis under heat stress 11 2.2.1 Plant materials preparation 11 2.2.2 RNA-seq library preparation 12 2.2.3 RNA-seq data analysis 12 2.2.4 Small RNA-seq data analysis 13 2.2.5 Gene groups identification 14 2.2.6 Gene features characterization and comparison 14 2.2.7 Functional network reconstruction of heat stress related lncRNAs 15 3. Results and Discussion 16 3.1 Construction of a database of plant lncRNA expression profiles and functional networks 16 3.1.1 Database construction 16 3.1.2 Gene information and genomic location of protein-coding and non-coding genes 17 3.1.3 Investigation of gene expression profiles 17 3.1.4 Identification of regulatory factors involved in gene expression regulation 18 3.1.5 Construction of functional networks 19 3.1.6 Association of genetic markers and quantitative trait loci 21 3.2 The lncRNA landscape of Arabidopsis under heat stress 21 3.2.1 The phenotypes of Arabidopsis used in this study 21 3.2.2 Arabidopsis transcriptome data processing 22 3.2.3 Identification and target prediction of Arabidopsis miRNAs 23 3.2.4 Expression comparison and gene groups definition 23 3.2.5 Gene features comparison between heat stress related and no change genes 24 3.2.6 Regulatory properties of heat stress related lncRNAs 25 4. Conclusion 27 5. References 29

    [1]J. B. Heo and S. Sung, "Vernalization-mediated epigenetic silencing by a long intronic noncoding RNA," Science, vol. 331, no. 6013, pp. 76-9, Jan 7 2011.
    [2] T. Csorba, J. I. Questa, Q. Sun, and C. Dean, "Antisense COOLAIR mediates the coordinated switching of chromatin states at FLC during vernalization," Proc Natl Acad Sci U S A, vol. 111, no. 45, pp. 16160-5, Nov 11 2014.
    [3] J. Ding et al., "A long noncoding RNA regulates photoperiod-sensitive male sterility, an essential component of hybrid rice," Proc Natl Acad Sci U S A, vol. 109, no. 7, pp. 2654-9, Feb 14 2012.
    [4] B. Zhu et al., "RNA sequencing and functional analysis implicate the regulatory role of long non-coding RNAs in tomato fruit ripening," J Exp Bot, vol. 66, no. 15, pp. 4483-95, Aug 2015.
    [5] J. E. Wilusz, H. Sunwoo, and D. L. Spector, "Long noncoding RNAs: functional surprises from the RNA world," (in English), Gene Dev, vol. 23, no. 13, pp. 1494-1504, Jul 1 2009.
    [6] H. Y. Tian et al., "Discovery, identification, and functional characterization of long noncoding RNAs inArachis hypogaeaL.," (in English), BMC Plant Biology, vol. 20, no. 1, Jul 2 2020.
    [7] H. Hezroni, D. Koppstein, M. G. Schwartz, A. Avrutin, D. P. Bartel, and I. Ulitsky, "Principles of Long Noncoding RNA Evolution Derived from Direct Comparison of Transcriptomes in 17 Species," (in English), Cell Rep, vol. 11, no. 7, pp. 1110-1122, May 19 2015.
    [8] C. Di et al., "Characterization of stress-responsive lncRNAs in Arabidopsis thaliana by integrating expression, epigenetic and structural features," Plant J, vol. 80, no. 5, pp. 848-61, Dec 2014.
    [9] C. P. Morice et al., "An Updated Assessment of Near-Surface Temperature Change From 1850: The HadCRUT5 Data Set," (in English), J Geophys Res-Atmos, vol. 126, no. 3, Feb 16 2021.
    [10] Z. Ding et al., "Strand-specific RNA-seq based identification and functional prediction of drought-responsive lncRNAs in cassava," BMC Genomics, vol. 20, no. 1, p. 214, Mar 13 2019.
    [11] S. Guil and M. Esteller, "Cis-acting noncoding RNAs: friends and foes," Nat Struct Mol Biol, vol. 19, no. 11, pp. 1068-75, Nov 2012.
    [12] Y. Yu, H. Zhang, Y. Long, Y. Shu, and J. Zhai, "Plant Public RNA-seq Database: a comprehensive online database for expression analysis of ~45 000 plant public RNA-Seq libraries," Plant Biotechnol J, vol. 20, no. 5, pp. 806-808, May 2022.
    [13] R. N. Consortium, "RNAcentral 2021: secondary structure integration, improved sequence search and new member databases," Nucleic Acids Res, vol. 49, no. D1, pp. D212-D220, Jan 8 2021.
    [14] M. Di Marsico, A. Paytuvi Gallart, W. Sanseverino, and R. Aiese Cigliano, "GreeNC 2.0: a comprehensive database of plant long non-coding RNAs," Nucleic Acids Res, vol. 50, no. D1, pp. D1442-D1447, Jan 7 2022.
    [15] M. W. Szczesniak, O. Bryzghalov, J. Ciomborowska-Basheer, and I. Makalowska, "CANTATAdb 2.0: Expanding the Collection of Plant Long Noncoding RNAs," Methods Mol Biol, vol. 1933, pp. 415-429, 2019.
    [16] L. Zhao et al., "NONCODEV6: an updated database dedicated to long non-coding RNA annotation in both animals and plants," Nucleic Acids Res, vol. 49, no. D1, pp. D165-D171, Jan 8 2021.
    [17] J. Jin et al., "PLncDB V2.0: a comprehensive encyclopedia of plant long noncoding RNAs," Nucleic Acids Res, vol. 49, no. D1, pp. D1489-D1495, Jan 8 2021.
    [18] B. Zhang et al., "RiceNCexp: a rice non-coding RNA co-expression atlas based on massive RNA-seq and small-RNA seq data," J Exp Bot, vol. 73, no. 18, pp. 6068-6077, Oct 18 2022.
    [19] J. Liu, L. Feng, J. Li, and Z. He, "Genetic and epigenetic control of plant heat responses," Front Plant Sci, vol. 6, p. 267, 2015.
    [20] D. S. Battisti and R. L. Naylor, "Historical warnings of future food insecurity with unprecedented seasonal heat," Science, vol. 323, no. 5911, pp. 240-4, Jan 9 2009.
    [21] D. B. Lobell, M. B. Burke, C. Tebaldi, M. D. Mastrandrea, W. P. Falcon, and R. L. Naylor, "Prioritizing climate change adaptation needs for food security in 2030," Science, vol. 319, no. 5863, pp. 607-10, Feb 1 2008.
    [22] S. Peng et al., "Rice yields decline with higher night temperature from global warming," Proc Natl Acad Sci U S A, vol. 101, no. 27, pp. 9971-5, Jul 6 2004.
    [23] M. Wunderlich, R. Gross-Hardt, and F. Schoffl, "Heat shock factor HSFB2a involved in gametophyte development of Arabidopsis thaliana and its expression is controlled by a heat-inducible long non-coding antisense RNA," Plant Mol Biol, vol. 85, no. 6, pp. 541-50, Aug 2014.
    [24] Y. Wang et al., "Overexpressing lncRNA LAIR increases grain yield and regulates neighbouring gene cluster expression in rice," Nat Commun, vol. 9, no. 1, p. 3516, Aug 29 2018.
    [25] M. Jens and N. Rajewsky, "Competition between target sites of regulators shapes post-transcriptional gene regulation," (in English), Nat Rev Genet, vol. 16, no. 2, pp. 113-126, Feb 2015.
    [26] F. Bardou et al., "Long Noncoding RNA Modulates Alternative Splicing Regulators in Arabidopsis," (in English), Dev Cell, vol. 30, no. 2, pp. 166-176, Jul 28 2014.
    [27] P. Kindgren, R. Ard, M. Ivanov, and S. Marquardt, "Transcriptional read-through of the long non-coding RNA SVALKA governs plant cold acclimation," Nat Commun, vol. 9, no. 1, p. 4561, Nov 1 2018.
    [28] S. Subhash, K. Mishra, V. S. Akhade, M. Kanduri, T. Mondal, and C. Kanduri, "H3K4me2 and WDR5 enriched chromatin interacting long non-coding RNAs maintain transcriptionally competent chromatin at divergent transcriptional units," (in English), Nucleic Acids Research, vol. 46, no. 18, pp. 9384-9400, Oct 12 2018.
    [29] Y. J. Li, X. R. Li, J. Yang, and Y. K. He, "Natural antisense transcripts of MIR398 genes suppress microR398 processing and attenuate plant thermotolerance," (in English), Nature Communications, vol. 11, no. 1, Oct 22 2020.
    [30] P. Lamesch et al., "The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools," Nucleic Acids Res, vol. 40, no. Database issue, pp. D1202-10, Jan 2012.
    [31] H. Sakai et al., "Rice Annotation Project Database (RAP-DB): an integrative and interactive database for rice genomics," Plant Cell Physiol, vol. 54, no. 2, p. e6, Feb 2013.
    [32] L. Harper, J. Gardiner, C. Andorf, and C. J. Lawrence, "MaizeGDB: The Maize Genetics and Genomics Database," Methods Mol Biol, vol. 1374, pp. 187-202, 2016.
    [33] D. M. Bolser, D. M. Staines, E. Perry, and P. J. Kersey, "Ensembl Plants: Integrating Tools for Visualizing, Mining, and Analyzing Plant Genomic Data," Methods Mol Biol, vol. 1533, pp. 1-31, 2017.
    [34] D. Grant, R. T. Nelson, S. B. Cannon, and R. C. Shoemaker, "SoyBase, the USDA-ARS soybean genetics and genomics database," Nucleic Acids Res, vol. 38, no. Database issue, pp. D843-6, Jan 2010.
    [35] A. Bombarely et al., "The Sol Genomics Network (solgenomics.net): growing tomatoes using Perl," Nucleic Acids Res, vol. 39, no. Database issue, pp. D1149-55, Jan 2011.
    [36] B. L. Zhou et al., "EVLncRNAs 2.0: an updated database of manually curated functional long non-coding RNAs validated by low-throughput experiments," (in English), Nucleic Acids Research, vol. 49, no. D1, pp. D86-D91, Jan 8 2021.
    [37] T. Barrett et al., "NCBI GEO: archive for functional genomics data sets--update," Nucleic Acids Res, vol. 41, no. Database issue, pp. D991-5, Jan 2013.
    [38] M. A. Reynoso, P. Juntawong, M. Lancia, F. A. Blanco, J. Bailey-Serres, and M. E. Zanetti, "Translating Ribosome Affinity Purification (TRAP) followed by RNA sequencing technology (TRAP-SEQ) for quantitative assessment of plant translatomes," Methods Mol Biol, vol. 1284, pp. 185-207, 2015.
    [39] M. A. Schon, M. J. Kellner, A. Plotnikova, F. Hofmann, and M. D. Nodine, "NanoPARE: parallel analysis of RNA 5' ends from low-input RNA," Genome Res, vol. 28, no. 12, pp. 1931-1942, Dec 2018.
    [40] D. Kim, J. M. Paggi, C. Park, C. Bennett, and S. L. Salzberg, "Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype," Nat Biotechnol, vol. 37, no. 8, pp. 907-915, Aug 2019.
    [41] M. Pertea, G. M. Pertea, C. M. Antonescu, T. C. Chang, J. T. Mendell, and S. L. Salzberg, "StringTie enables improved reconstruction of a transcriptome from RNA-seq reads," Nat Biotechnol, vol. 33, no. 3, pp. 290-5, Mar 2015.
    [42] K. Sun, H. Wang, and H. Sun, "NAMS webserver: coding potential assessment and functional annotation of plant transcripts," Brief Bioinform, vol. 22, no. 3, May 20 2021.
    [43] B. Li and C. N. Dewey, "RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome," BMC Bioinformatics, vol. 12, p. 323, 2011.
    [44] B. Langmead and S. L. Salzberg, "Fast gapped-read alignment with Bowtie 2," Nat Methods, vol. 9, no. 4, pp. 357-9, Mar 4 2012.
    [45] R. C. Team, "R: A Language and Environment for Statistical Computing," ed, 2019.
    [46] C. N. Chow et al., "PlantPAN3.0: a new and updated resource for reconstructing transcriptional regulatory networks from ChIP-seq experiments in plants," Nucleic Acids Res, vol. 47, no. D1, pp. D1155-D1163, Jan 8 2019.
    [47] A. R. Quinlan, "BEDTools: The Swiss-Army Tool for Genome Feature Analysis," Curr Protoc Bioinformatics, vol. 47, pp. 11 12 1-34, Sep 8 2014.
    [48] K. C. Tseng et al., "sRIS: A Small RNA Illustration System for Plant Next-Generation Sequencing Data Analysis," Plant Cell Physiol, vol. 61, no. 6, pp. 1204-1212, Jun 1 2020.
    [49] C. T. Lopes, M. Franz, F. Kazi, S. L. Donaldson, Q. Morris, and G. D. Bader, "Cytoscape Web: an interactive web-based network browser," Bioinformatics, vol. 26, no. 18, pp. 2347-8, Sep 15 2010.
    [50] K. Youens-Clark et al., "Gramene database in 2010: updates and extensions," Nucleic Acids Res, vol. 39, no. Database issue, pp. D1085-94, Jan 2011.
    [51] C. Camacho et al., "BLAST+: architecture and applications," BMC Bioinformatics, vol. 10, p. 421, Dec 15 2009.
    [52] R. Buels et al., "JBrowse: a dynamic web platform for genome visualization and analysis," Genome Biol, vol. 17, p. 66, Apr 12 2016.
    [53] G. Sablok, K. Sun, and H. Sun, "NAMS: Noncoding Assessment of long RNAs in Magnoliophyta Species," (in English), Plant Long Non-Coding Rnas, vol. 1933, pp. 257-264, 2019.
    [54] K. Palos et al., "Identification and functional annotation of long intergenic non-coding RNAs in Brassicaceae," Plant Cell, vol. 34, no. 9, pp. 3233-3260, Aug 25 2022.
    [55] S. Zhao, Y. Zhang, R. Gamini, B. Zhang, and D. von Schack, "Evaluation of two main RNA-seq approaches for gene quantification in clinical RNA sequencing: polyA+ selection versus rRNA depletion," Sci Rep, vol. 8, no. 1, p. 4781, Mar 19 2018.
    [56] Q. Sang et al., "MicroRNA156 conditions auxin sensitivity to enable growth plasticity in response to environmental changes in Arabidopsis," Nat Commun, vol. 14, no. 1, p. 1449, Mar 22 2023.
    [57] H. Matsuura, Y. Ishibashi, A. Shinmyo, S. Kanaya, and K. Kato, "Genome-wide analyses of early translational responses to elevated temperature and high salinity in Arabidopsis thaliana," Plant Cell Physiol, vol. 51, no. 3, pp. 448-62, Mar 2010.

    無法下載圖示 校內:2028-08-28公開
    校外:2028-08-28公開
    電子論文尚未授權公開,紙本請查館藏目錄
    QR CODE