成功大學博碩士論文系統

簡易檢索 / 詳目顯示

回結果列表

研究生：	鄭吉宏 Cheng, Ji-Hong
論文名稱：	在病毒次世代定序資料中偵測缺失片段的精確位點與變異性之研究 A Study of Detecting Exact Breakpoints of Deletions with Diversity in Viral Next-Generation Sequencing Data
指導教授：	謝孫源 Hsieh, Sun-Yuan
共同指導:	曾新穆 Tseng, Vincent S.
學位類別：	博士 Doctor
系所名稱：	電機資訊學院 - 資訊工程學系 Department of Computer Science and Information Engineering
論文出版年：	2021
畢業學年度：	109
語文別：	英文
論文頁數：	74
中文關鍵詞：	次世代定序技術、病毒基因體的缺失片段、B型肝炎病毒、新型冠狀病毒、圖形處理器、大數據
外文關鍵詞：	Next-generation sequencing, Viral genomic deletion, Hepatitis B Virus, SARS-CoV-2, Graphics Processing Unit, Big data
相關次數：	點閱：125 下載：0
分享至:	分享至facebook 分享至twitter

查詢本校圖書館目錄查詢臺灣博碩士論文知識加值系統勘誤回報

COVID-19的大流行已造成百萬人的死亡，並對於全世界人類的健康、生活與經濟產生嚴重的損害。根據最新研究的指出SARS-CoV-2基因體中特定區域的序列缺失片段與病毒的感染性有關聯。在B型肝炎病毒(hepatitis B virus, HBV)中也發現特定區域的序列缺失片段與肝癌(Hepatocellular carcinoma, HCC)有關聯。透過次世代定序技術(Next-generation sequencing, NGS)可以快速獲取生物基因體序列的資料，然而NGS技術產生短序列(read)的數量，經常是高達百萬條以上的大數據；除此之外，病毒的基因體序列資料又具有高演化率與變異性，因此從NGS的大數據資料中發掘病毒的序列缺失片段是一個挑戰。雖然有一些研究已致力於開發從NGS資料中發掘基因體的缺失片段的方法，但仍存有以下不足之處: (1)很少研究方法和實驗設計是在不同變異性的病毒NGS資料中偵測缺失片段的精確位點；(2)另外有一些方法使用近似計算的方式減少比對時間，但是找到位點不夠精確；(3)有一些方法藉由NGS的pair-end資料的特性縮小比對的範圍減少偵測缺失片段精確位點的時間，但是在single-end資料中就無法有效的減少時間。

為了解決上述議題，本論文探討了三個主題：(1)在病毒的NGS資料中偵測缺失片段的精確位點；(2)提升在病毒的NGS資料中偵測缺失片段位點的正確性；(3)提升在病毒的NGS資料中偵測缺失片段位點的效率。

在第一個研究主題中，本論文提出在病毒的NGS資料中偵測缺失片段精確位點的方法，VirDelect (Virus Deletion Detect)。我們證明VirDelect可以有效減少比對的次數，並且找到精確的遺失片段位點。實驗結果顯示VirDelect可以在不同變異度的模擬資料和真實資料獲得較高的分數。在第二個研究主題中，本論文提出增強型單鹼基比對演算法。我們提出新的相似度計算方式和新的資料結構來提升比對的正確性。實驗結果顯示，使用增強型單鹼基比對演算法的VirDelect其正確性在不同變異度的模擬資料和真實資料都高於第一個研究主題的結果。除此之外，增強型單鹼基比對演算法可偵測同一條參考序列中兩條缺失片段的精確位置。第三個研究主題中，本研究提出可並行化的缺失片段偵測演算法。我們讓並行化的VirDelect可以應用中央處理器(Central Processing Unit , CPU)和圖形處理器(Graphics Processing Unit, GPU)的多核心資源提升運算的效率。在CPU中，VirDelect使用多執行緒非同步地偵測多條NGS短序列的缺失片段。在GPU中，VirDelect的One-base alignment Plus使用陣列運算的方式並行比對序列產生分數，它可有效的應用GPU在陣列運算超高效能。實驗結果顯示，並行化的VirDelect確實在可以有效減少運算時間提升效率。

上述的三個主題之共同目標是基於現今方法與研究不足之處，提出一系列可以在病毒的NGS資料中有效果並且有效率的偵測缺片段的精確位點之方法。我們將這三個主題有系統地整合在本文論當中，並經由實驗驗證我們提出的方法比代表性的方法有更好的表現。期盼本論文的方法可被實際應用在生物與醫學資訊領域，並且對人類的生活與健康有所貢獻。

The COVID-19 pandemic has caused millions of deaths and has seriously impacted humans worldwide. According to the latest research, deletions in specific regions of the SARS-CoV-2 genome are related to viral infectivity. In hepatitis B virus (HBV), deletions in specific regions are associated with hepatocellular carcinoma (HCC). Next-generation sequencing (NGS) technology generates short read data at high speed; however, these are large datasets. In addition, viral genomes have a high mutation rate and diversity, it is therefore challenging to detect deletions in viral NGS data. Although approaches for the detection of deletions from NGS data have been developed, they have the following limitations: first, few methods have been designed for the detection of viral deletions with diversity in NGS data; second, some methods use approximate calculations to reduce time-cost, but the position of deletions are not exact; third, some methods are able to reduce the time-cost due to the characteristics of NGS paired-end data, but cannot effectively reduce the time cost in NGS single-end data.

To resolve the aforementioned issues, this dissertation discusses three topics: (1) detecting the exact breakpoint of sequence deletions in viral NGS data, (2) improving the correctness of detecting deletions in viral NGS data, and (3) improving the efficiency of detecting sequence deletions in viral NGS data.

To address the first research topic, we proposed a method, VirDelect, to detect the exact breakpoint of deletions in viral NGS data and proved that VirDelect effectively reduced the number of alignments and detected exact breakpoints. The results showed that VirDelect obtained higher scores in both simulated and real data. To address the second research topic, we proposed a method, One-base alignment plus (OAP), and defined a new similarity formula. Experiments showed that the correctness of VirDelect with OAP was higher than that of VirDelect without OAP in both simulated and real data. In addition, VirDelect with OAP can detect two deletions in one reference. To address the third research topic, we implemented VirDelect with multi-core Central Processing Unit (CPU) and Graphics Processing Unit (GPU) resources to improve computing efficiency based on a concurrent algorithm. VirDelect implements multiple threads of the CPU to detect deletions. OAP uses array operations that apply the GPU at high performance. The results showed that concurrent VirDelect can effectively reduce execution time-cost and improve computing efficiency.

The aforementioned research topics are related to the major aim of developing a series of methods that can effectively and efficiently detect the exact breakpoints of deletions from viral NGS data based on current deficiencies in approach and research. Experimental results show that our proposed algorithm outperforms state-of-the-art algorithms. We hope that the approach described in this dissertation can be applied in the fields of biology and medical information, and will help to save lives and improve public health in general.

摘  要	I
ABSTRACT	III
誌 謝	V
Contents	VI
List of Figures	IX
List of Tables	X
Chapter 1 Introduction	1
1 Motivation	1
2 Overview of the Dissertation	4
2.1 A Base Approach to Detect Exact Breakpoint of Deletions with Diversity in Viral NGS Data	5
2.2 An Enhanced Algorithm to Detect Exact Breakpoints of Deletions in Viral NGS Data	5
2.3 Efficient Algorithms to Detect Exact Breakpoints of Deletions in Viral NGS Data	6
3 Organization of the Dissertation	7
Chapter 2 Background and Related Work	8
1 Diversity of SARS-CoV-2	8
2 Diversity of Hepatitis B Virus	8
3 Hepatitis B Viral Whole Genome Cloning	11
4 Graphics Processing Unit	12
5 Next-generation sequencing	13
Chapter 3 A Base Approach to Detect Exact Breakpoint of Deletions with Diversity in Viral NGS Data	16
1 Introduction	16
2 Proposed Method	17
2.1 Definition of Diversity and Split Read Alignment	19
2.1.1 Definition of Diversity:	19
2.1.2 Split Read Alignment	20
2.2 Reducing Phases	22
2.2.1 Phase 1: Parallel Alignment	22
2.2.2 Phase 2: One-base Alignment	23
2.2.3 Phase 3: Vertical-checking and Cross-checking	23
2.2.4 Vertical-checking and Max_Global_Similarity	27
2.2.5 Cross-checking	27
3 Result and Discussion	28
3.1 Datasets and Environment	29
3.2 Scoring Function	30
3.3 Experimental results of VirDelect and Pindel	31
3.4 Analyzing Simulated Data for Different Lengths of Deletion and Higher Diversity	33
3.5 Analysis Results of Simulated and Real Data	34
4 Summary	36
Chapter 4 An Enhanced Algorithm to Detect Exact Breakpoints of Deletions in Viral NGS Data	37
1 Introduction	37
2 Proposed Method	38
2.1 Definition of One-base Alignment Plus	38
2.1.1 Similarity Score	38
2.1.2 Score Table and Hit Table	38
2.1.3 Initialize Table	39
2.2 Prefix of One-base Alignment Plus	40
2.2.1 Initial Table	40
2.2.2 Generate New Score Table	40
2.3 Suffix of One-base Alignment Plus	42
2.3.1 Initial Table	42
2.3.2 Generate New Score Table	43
2.3.3 Generate New Score Table in Zero Position	44
3 Result and Discussion	44
3.1 Dataset and Environment	44
3.1.1 Real Dataset and Environment:	44
3.1.2 Simulated Dataset	45
3.2 Scoring Function	46
3.3 Experimental Results of VirDelect with OAP	46
3.4 Discussion	49
4 Summary	50
Chapter 5 Efficient Algorithms to Detect Exact Breakpoints of Deletions in Viral NGS Data	51
1 Introduction	51
2 Proposed Method	53
2.1 VirDelect with Multiple Threads of CPU	53
2.2 Definition of GPU Function and Award Table	55
2.3 Prefix of One-base Alignment Plus on GPU	57
2.4 Suffix of One-base Alignment Plus on GPU	58
3 Result and Discussion	60
3.1 Dataset and Environment	60
3.2 Result and Discussion	61
4 Summary	62
Chapter 6 Conclusion and Future Works	63
1 Conclusion	63
2 Future Works	64
Bibliography	66
                                    

[1] S.Cleemput, W.Dumon, V.Fonseca, W.Abdool Karim, M.Giovanetti, L. C.Alcantara, K.Deforche, andT.deOliveira, “Genome Detective Coronavirus Typing Tool for rapid identification and characterization of novel coronavirus genomes,” Bioinformatics, 2020.
[2] R.Kong, G.Yang, R.Xue, M.Liu, F.Wang, J.Hu, X.Guo, andS.Chang, “COVID-19 Docking Server: A meta server for docking small molecules, peptides and antibodies against potential targets of COVID-19,” Bioinformatics, 2020.
[3] D.Domingo-Fernández, S.Baksi, B.Schultz, Y.Gadiya, R.Karki, T.Raschka, C.Ebeling, M.Hofmann-Apitius, andA. T.Kodamullil, “COVID-19 Knowledge Graph: a computable, multi-modal, cause-and-effect knowledge model of COVID-19 pathophysiology,” Bioinformatics, 2020.
[4] D.Korn, T.Bobrowski, M.Li, Y.Kebede, P.Wang, P.Owen, G.Vaidya, E.Muratov, R.Chirkova, C.Bizon, andA.Tropsha, “COVID-KOP: integrating emerging COVID-19 data with the ROBOKOP database,” Bioinformatics, 2020.
[5] S.Liu, Q.Zheng, andZ.Wang, “Potential covalent drugs targeting the main protease of the SARS-CoV-2 coronavirus,” Bioinformatics, 2020.
[6] F.Hufsky, K.Lamkiewicz, A.Almeida, A.Aouacheria, C.Arighi, A.Bateman, J.Baumbach, N.Beerenwinkel, C.Brandt, M.Cacciabue, S.Chuguransky, M.Marz, et al., “Computational strategies to combat COVID-19: useful tools to accelerate SARS-CoV-2 and coronavirus research,” Brief. Bioinform., 2020.
[7] T.Phan, “Genetic diversity and evolution of SARS-CoV-2,” Infect. Genet. Evol., 2020.
[8] Y. C. F.Su, D. E.Anderson, B. E.Young, M.Linster, F.Zhu, J.Jayakumar, Y.Zhuang, S.Kalimuddin, J. G. H.Low, C. W.Tan, W. N.Chia, T. M.Mak, S.Octavia, J. M.Chavatte, R. T. C.Lee, S.Pada, S. Y.Tan, L.Sun, G. Z.Yan, S.Maurer-Stroh, I. H.Mendenhall, Y. S.Leo, D. C.Lye, L. F.Wang, andG. J. D.Smith, “Discovery and genomic characterization of a 382-nucleotide deletion in ORF7B and orf8 during the early evolution of SARS-CoV-2,” MBio, 2020.
[9] M. R.Islam, M. N.Hoque, M. S.Rahman, A. S. M. R. U.Alam, M.Akther, J. A.Puspo, S.Akter, M.Sultana, K. A.Crandall, andM. A.Hossain, “Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity,” Sci. Rep., vol. 10, no. 1, pp. 1–9, 2020.
[10] W. C.Liu, I. C.Wu, Y. C.Lee, C. P.Lin, J. H.Cheng, Y. J.Lin, C. J.Yen, P. N.Cheng, P. F.Li, Y. T.Cheng, P. W.Cheng, K. T.Sun, S. L.Yan, J. J.Lin, J. C.Yang, K. C.Chang, C. H.Ho, V. S.Tseng, B. C. H.Chang, J. C.Wu, andT. T.Chang, “Hepatocellular carcinoma-associated single-nucleotide variants and deletions identified by the use of genome-wide high-throughput analysis of hepatitis B virus,” J. Pathol., 2017.
[11] J.Shendure, andH.Ji, “Next-generation DNA sequencing,” Nature Biotechnology. 2008.
[12] J.Jia, X.Liang, S.Chen, H.Wang, H.Li, M.Fang, X.Bai, Z.Wang, M.Wang, S.Zhu, F.Sun, andC.Gao, “Next-generation sequencing revealed divergence in deletions of the pres region in the HBV genome between different HBV-related liver diseases,” J. Gen. Virol., 2017.
[13] C. F.Teng, T. C.Li, H. Y.Huang, J. H.Lin, W. S.Chen, W. C.Shyu, H. C.Wu, C. Y.Peng, I. J.Su, andL.BinJeng, “Next-generation sequencing-based quantitative detection of hepatitis B virus Pre-S mutants in plasma predicts hepatocellular carcinoma recurrence,” Viruses, 2020.
[14] A.Jary, V.Leducq, I.Malet, S.Marot, E.Klement-Frutos, E.Teyssou, C.Soulié, B.Abdi, M.Wirden, V.Pourcher, E.Caumes, V.Calvez, S.Burrel, A. G.Marcelin, andD.Boutolleau, “Evolution of viral quasispecies during SARS-CoV-2 infection,” Clin. Microbiol. Infect., no. xxxx, 2020.
[15] W.Zhang, J.Chen, Y.Yang, Y.Tang, J.Shang, andB.Shen, “A practical comparison of De Novo genome assembly software tools for next-generation sequencing technologies,” PLoS One, 2011.
[16] D.Yorukoglu, Y. W.Yu, J.Peng, andB.Berger, “Compressive mapping for next-generation sequencing,” Nature Biotechnology. 2016.
[17] K.Chen, J. W.Wallis, M. D.McLellan, D. E.Larson, J. M.Kalicki, C. S.Pohl, S. D.McGrath, M. C.Wendl, Q.Zhang, D. P.Locke, X.Shi, R. S.Fulton, T. J.Ley, R. K.Wilson, L.Ding, andE. R.Mardis, “BreakDancer: An algorithm for high-resolution mapping of genomic structural variation,” Nat. Methods, 2009.
[18] C.Bartenhagen, andM.Dugas, “Robust and exact structural variation detection with paired-end and soft-clipped alignments: SoftSV compared with eight algorithms,” Brief. Bioinform., 2016.
[19] J. O.Korbel, A.Abyzov, X. J.Mu, N.Carriero, P.Cayting, Z.Zhang, M.Snyder, andM. B.Gerstein, “PEMer: A computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data,” Genome Biol., 2009.
[20] C.Alkan, J. M.Kidd, T.Marques-Bonet, G.Aksay, F.Antonacci, F.Hormozdiari, J. O.Kitzman, C.Baker, M.Malig, O.Mutlu, S. C.Sahinalp, R. A.Gibbs, andE. E.Eichler, “Personalized copy number and segmental duplication maps using next-generation sequencing,” Nat. Genet., 2009.
[21] J.Zhang, J.Wang, andY.Wu, “An improved approach for accurate and efficient calling of structural variations with low-coverage sequence data.,” BMC Bioinformatics, 2012.
[22] S. S.Sindi, S.Önal, L. C.Peng, H. T.Wu, andB. J.Raphael, “An integrative probabilistic model for identification of structural variation in sequencing data,” Genome Biol., 2012.
[23] Y.Jiang, Y.Wang, andM.Brudno, “PRISM: Pair-read informed split-read mapping for base-pair level detection of insertion, deletion and structural variants,” Bioinformatics, 2012.
[24] K.Ye, M. H.Schulz, Q.Long, R.Apweiler, andZ.Ning, “Pindel: A pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads,” Bioinformatics, 2009.
[25] P.Guan, andW. K.Sung, “Structural variation detection using next-generation sequencing data: A comparative technical review,” Methods. 2016.
[26] P.Medvedev, M.Stanciu, andM.Brudno, “Computational methods for discovering structural variation with next-generation sequencing,” Nat. Methods, 2009.
[27] N. F.Lahens, E.Ricciotti, O.Smirnova, E.Toorens, E. J.Kim, G.Baruzzo, K. E.Hayer, T.Ganguly, J.Schug, andG. R.Grant, “A comparison of Illumina and Ion Torrent sequencing platforms in the context of differential gene expression,” BMC Genomics, 2017.
[28] J. H.Cheng, W. C.Liu, T. T.Chang, S. Y.Hsieh, andV. S.Tseng, “Detecting exact breakpoints of deletions with diversity in hepatitis B viral genomic DNA from next-generation sequencing data,” Methods, 2017.
[29] S. F.Ahmed, A. A.Quadeer, andM. R.McKay, “Preliminary identification of potential vaccine targets for the COVID-19 Coronavirus (SARS-CoV-2) Based on SARS-CoV Immunological Studies,” Viruses, 2020.
[30] T.Phan, “Novel coronavirus: From discovery to clinical diagnostics,” Infection, Genetics and Evolution. 2020.
[31] A. C.Walls, Y. J.Park, M. A.Tortorici, A.Wall, A. T.McGuire, andD.Veesler, “Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein,” Cell, 2020.
[32] C.Wang, Z.Liu, Z.Chen, X.Huang, M.Xu, T.He, andZ.Zhang, “The establishment of reference sequence for SARS-CoV-2 and variation analysis.,” J. Med. Virol., vol. 92, no. 6, pp. 667–674, Jun.2020.
[33] M.Pachetti, B.Marini, F.Benedetti, F.Giudici, E.Mauro, P.Storici, C.Masciovecchio, S.Angeletti, M.Ciccozzi, R. C.Gallo, D.Zella, andR.Ippodrino, “Emerging SARS-CoV-2 mutation hot spots include a novel RNA-dependent-RNA polymerase variant,” J. Transl. Med., vol. 18, no. 1, pp. 1–9, 2020.
[34] D.Ganem, andA. M.Prince, “Hepatitis B Virus Infection — Natural History and Clinical Consequences,” N. Engl. J. Med., 2004.
[35] D.Lavanchy, “Hepatitis B virus epidemiology, disease burden, treatment, arid current and emerging prevention and control measures,” Journal of Viral Hepatitis. 2004.
[36] A. S. F.Lok, andB. J.McMahon, “Chronic hepatitis B,” Hepatology. 2007.
[37] C.Seeger, andW. S.Mason, “Molecular biology of hepatitis B virus infection,” Virology. 2015.
[38] F.Rodriguez-Frias, M.Buti, D.Tabernero, andM.Homs, “Quasispecies structure, cornerstone of hepatitis B virus infection: Mass sequencing approach,” World Journal of Gastroenterology. 2013.
[39] X.Li, Y.Qin, Y.Liu, F.Li, H.Liao, S.Lu, Y.Qiao, D.Xu, andJ.Li, “PreS deletion profiles of hepatitis B virus (HBV) are associated with clinical presentations of chronic HBV infection,” J. Clin. Virol., 2016.
[40] I. C.Wu, W. C.Liu, andT. T.Chang, “Applications of next-generation sequencing analysis for the detection of hepatocellular carcinoma-associated hepatitis B virus mutations,” Journal of Biomedical Science. 2018.
[41] H. J.Lin, C. L.Lai, I. J.Lauder, P. C.Wu, T. K.Lau, andM. W.Fong, “Application of hepatitis B virus (HBV) DNA sequence polymorphisms to the study of HBV transmission,” J. Infect. Dis., 1991.
[42] M.Sterneck, S.Günther, J.Gerlach, N.V.Naoumov, T.Santantonio, L.Fischer, X.Rogiers, H.Greten, R.Williams, andH.Will, “Hepatitis B virus sequence changes evolving in liver transplant recipients with fulminant hepatitis,” J. Hepatol., 1997.
[43] H.Bozkaya, U. S.Akarca, B.Ayola, andA. S. F.Lok, “High degree of conservation in the hepatitis B virus core gene during the immune tolerant phase in perinatally acquired chronic hepatitis B virus infection,” J. Hepatol., 1997.
[44] S. A.Whalley, J. M.Murray, D.Brown, G. J. M.Webster, V. C.Emery, G. M.Dusheiko, andA. S.Perelson, “Kinetics of acute hepatitis B virus infection in humans,” J. Exp. Med., 2001.
[45] H.Okamoto, F.Tsuda, H.Sakugawa, R. I.Sastrosoewignjo, M.Imai, Y.Miyakawa, andM.Mayumi, “Typing hepatitis B virus by homology in nucleotide sequence: Comparison of surface antigen subtypes,” J. Gen. Virol., 1988.
[46] A.Cassidy, S.Mossman, A.Olivieri, M.DeRidder, andG.Leroux-Roels, “Hepatitis B vaccine effectiveness in the face of global HBV genotype diversity,” Expert Review of Vaccines. 2011.
[47] W. C.Liu, M.Mizokami, M.Buti, M.Lindh, K. C.Young, K. T.Sun, Y. C.Chi, H. H.Li, andT. T.Chang, “Simultaneous quantification and genotyping of hepatitis B virus for genotypes A to G by real-time PCR and two-step melting curve analysis,” J. Clin. Microbiol., 2006.
[48] A.Kramvis, K.Arakawa, M. C.Yu, R.Nogueira, D. O.Stram, andM. C.Kew, “Relationship of serological subtype, basic core promoter and precore mutations to genotypes/subgenotypes of hepatitis B virus.,” J. Med. Virol., vol. 80, no. 1, pp. 27–46, Jan.2008.
[49] M.Sunbul, “Hepatitis B virus genotypes: global distribution and clinical importance.,” World J. Gastroenterol., vol. 20, no. 18, pp. 5427–5434, May2014.
[50] D.Zhang, P.Dong, K.Zhang, L.Deng, C.Bach, W.Chen, F.Li, U.Protzer, H.Ding, andC.Zeng, “Whole genome HBV deletion profiles and the accumulation of preS deletion mutant during antiviral treatment.,” BMC Microbiol., 2012.
[51] W. C.Liu, P. H.Phiet, T. Y.Chiang, K. T.Sun, K. H.Hung, K. C.Young, I. C.Wu, P. N.Cheng, andT. T.Chang, “Five subgenotypes of hepatitis B virus genotype B with distinct geographic and virological characteristics,” Virus Res., 2007.
[52] W. C.Liu, C. P.Lin, C. P.Cheng, C. H.Ho, K. L.Lan, J. H.Cheng, C. J.Yen, P. N.Cheng, I. C.Wu, I. C.Li, B. C. H.Chang, V. S.Tseng, Y. C.Chiu, andT. T.Chang, “Aligning to the sample-specific reference sequence to optimize the accuracy of next-generation sequencing analysis for hepatitis B virus,” Hepatol. Int., 2016.
[53] M.Jamshidi, A.Lalbakhsh, J.Talla, Z.Peroutka, F.Hadjilooei, P.Lalbakhsh, M.Jamshidi, L.LaSpada, M.Mirmozafari, M.Dehghani, A.Sabet, S.Roshani, S.Roshani, N.Bayat-Makou, B.Mohamadzade, Z.Malek, A.Jamshidi, S.Kiani, H.Hashemi-Dezaki, andW.Mohyuddin, “Artificial Intelligence and COVID-19: Deep Learning Approaches for Diagnosis and Treatment,” IEEE Access, 2020.
[54] X.Mei, H. C.Lee, K. yueDiao, M.Huang, B.Lin, C.Liu, Z.Xie, Y.Ma, P. M.Robson, M.Chung, A.Bernheim, V.Mani, C.Calcagno, K.Li, S.Li, H.Shan, J.Lv, T.Zhao, J.Xia, Q.Long, S.Steinberger, A.Jacobi, T.Deyer, M.Luksza, F.Liu, B. P.Little, Z. A.Fayad, andY.Yang, “Artificial intelligence–enabled rapid diagnosis of patients with COVID-19,” Nat. Med., 2020.
[55] Q. V.Pham, D. C.Nguyen, T.Huynh-The, W. J.Hwang, andP. N.Pathirana, “Artificial Intelligence (AI) and Big Data for Coronavirus (COVID-19) Pandemic: A Survey on the State-of-the-Arts,” IEEE Access. 2020.
[56] Y.Oh, S.Park, andJ. C.Ye, “Deep Learning COVID-19 Features on CXR Using Limited Training Data Sets,” IEEE Trans. Med. Imaging, 2020.
[57] R.Vaishya, M.Javaid, I. H.Khan, andA.Haleem, “Artificial Intelligence (AI) applications for COVID-19 pandemic,” Diabetes Metab. Syndr. Clin. Res. Rev., 2020.
[58] L.Li, L.Qin, Z.Xu, Y.Yin, X.Wang, B.Kong, J.Bai, Y.Lu, Z.Fang, Q.Song, K.Cao, D.Liu, G.Wang, Q.Xu, X.Fang, S.Zhang, J.Xia, andJ.Xia, “Artificial Intelligence Distinguishes COVID-19 from Community Acquired Pneumonia on Chest CT.,” Radiology, 2020.
[59] F.Shi, J.Wang, J.Shi, Z.Wu, Q.Wang, Z.Tang, K.He, Y.Shi, andD.Shen, “Review of Artificial Intelligence Techniques in Imaging Data Acquisition, Segmentation, and Diagnosis for COVID-19,” IEEE Reviews in Biomedical Engineering. 2021.
[60] J.Laguarta, F.Hueto, andB.Subirana, “COVID-19 Artificial Intelligence Diagnosis Using Only Cough Recordings,” IEEE Open J. Eng. Med. Biol., 2020.
[61] M. S.Nobile, P.Cazzaniga, A.Tangherloni, andD.Besozzi, “Graphics processing units in bioinformatics, computational biology and systems biology,” Brief. Bioinform., 2017.
[62] L.Dematté, andD.Prandi, “GPU computing for systems biology,” Brief. Bioinform., 2010.
[63] J.Zhou, X.Liu, D. S.Stones, Q.Xie, andG.Wang, “MrBayes on a graphics processing unit,” Bioinformatics, 2011.
[64] J.Zhang, H.Wang, andW. C.Feng, “CuBLASTP: Fine-Grained Parallelization of Protein Sequence Search on CPU+GPU,” IEEE/ACM Trans. Comput. Biol. Bioinforma., 2017.
[65] E.Rucci, C.Garcia, G.Botella, A.DeGiusti, M.Naiouf, andM.Prieto-Matias, “SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences,” BMC Syst. Biol., 2018.
[66] P. D.Vouzis, andN.V.Sahinidis, “GPU-BLAST: Using graphics processors to accelerate protein sequence alignment,” Bioinformatics, 2011.
[67] C. L.Hung, Y. S.Lin, C. Y.Lin, Y. C.Chung, andY. F.Chung, “CUDA ClustalW: An efficient parallel algorithm for progressive multiple sequence alignment on Multi-GPUs,” Comput. Biol. Chem., 2015.
[68] A.Haldane, andR. M.Levy, “Mi3-GPU: MCMC-based inverse Ising inference on GPUs for protein covariation analysis,” Comput. Phys. Commun., 2020.
[69] S.Wang, J.Kim, X.Jiang, S. F.Brunner, andL.Ohno-Machado, “GAMUT: GPU accelerated microRNA analysis to uncover target genes through CUDA-miRanda,” BMC Med. Genomics, 2014.
[70] S. A.Manavski, andG.Valle, “CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment,” BMC Bioinformatics, 2008.
[71] A.Bakhoda, G. L.Yuan, W. W. L.Fung, H.Wong, andT. M.Aamodt, “Analyzing CUDA workloads using a detailed GPU simulator,” 2009.
[72] J.Besser, H. A.Carleton, P.Gerner-Smidt, R. L.Lindsey, andE.Trees, “Next-generation sequencing technologies and their application to the study and control of bacterial infections,” Clinical Microbiology and Infection. 2018.
[73] L. J. S.Williams, D. G.Tabbaa, N.Li, A. M.Berlin, T. P.Shea, I.MacCallum, M. S.Lawrence, Y.Drier, G.Getz, S. K.Young, D. B.Jaffe, C.Nusbaum, andA.Gnirke, “Paired-end sequencing of Fosmid libraries by Illumina,” Genome Res., 2012.
[74] G.Tan, L.Opitz, R.Schlapbach, andH.Rehrauer, “Long fragments achieve lower base quality in Illumina paired-end sequencing,” Sci. Rep., 2019.
[75] B.Langmead, C.Trapnell, M.Pop, andS. L.Salzberg, “Ultrafast and memory-efficient alignment of short DNA sequences to the human genome,” Genome Biol., 2009.
[76] H.Li, andR.Durbin, “Fast and accurate long-read alignment with Burrows-Wheeler transform,” Bioinformatics, 2010.
[77] H.Li, andR.Durbin, “Fast and accurate short read alignment with Burrows-Wheeler transform,” Bioinformatics, 2009.
[78] G.Cuccuru, M.Orsini, A.Pinna, A.Sbardellati, N.Soranzo, A.Travaglione, P.Uva, G.Zanetti, andG.Fotia, “Orione, a web-based framework for NGS analysis in microbiology,” Bioinformatics, 2014.
[79] V.Kisand, andT.Lettieri, “Genome sequencing of bacteria: Sequencing, de novo assembly and rapid analysis using open source tools,” BMC Genomics, 2013.
[80] N.Dierckxsens, P.Mardulyn, andG.Smits, “NOVOPlasty: De novo assembly of organelle genomes from whole genome data,” Nucleic Acids Res., 2017.
[81] J. A.Martin, andZ.Wang, “Next-generation transcriptome assembly,” Nature Reviews Genetics. 2011.
[82] M. J. P.Chaisson, R. K.Wilson, andE. E.Eichler, “Genetic variation and the de novo assembly of human genomes,” Nature Reviews Genetics. 2015.
[83] J. R.Miller, S.Koren, andG.Sutton, “Assembly algorithms for next-generation sequencing data,” Genomics. 2010.
[84] R.Li, W.Fan, G.Tian, H.Zhu, L.He, J.Cai, Q.Huang, Q.Cai, B.Li, Y.Bai, Z.Zhang, J.Wang, et al., “The sequence and de novo assembly of the giant panda genome,” Nature, 2010.
[85] G.Robertson, J.Schein, R.Chiu, R.Corbett, M.Field, S. D.Jackman, K.Mungall, S.Lee, H. M.Okada, J. Q.Qian, M.Griffith, A.Raymond, N.Thiessen, T.Cezard, Y. S.Butterfield, R.Newsome, S. K.Chan, R.She, R.Varhol, B.Kamoh, A. L.Prabhu, A.Tam, Y.Zhao, R. A.Moore, M.Hirst, M. A.Marra, S. J. M.Jones, P. A.Hoodless, andI.Birol, “De novo assembly and analysis of RNA-seq data,” Nat. Methods, 2010.
[86] R.Li, H.Zhu, J.Ruan, W.Qian, X.Fang, Z.Shi, Y.Li, S.Li, G.Shan, K.Kristiansen, S.Li, H.Yang, J.Wang, andJ.Wang, “De novo assembly of human genomes with massively parallel short read sequencing,” Genome Res., 2010.
[87] M. G.Grabherr, B. J.Haas, M.Yassour, J. Z.Levin, D. A.Thompson, I.Amit, X.Adiconis, L.Fan, R.Raychowdhury, Q.Zeng, Z.Chen, E.Mauceli, N.Hacohen, A.Gnirke, N.Rhind, F.DiPalma, B. W.Birren, C.Nusbaum, K.Lindblad-Toh, N.Friedman, andA.Regev, “Full-length transcriptome assembly from RNA-Seq data without a reference genome,” Nat. Biotechnol., 2011.
[88] O.Pös, J.Budis, Z.Kubiritova, M.Kucharik, F.Duris, J.Radvanszky, andT.Szemes, “Identification of structural variation from NGS-based non-invasive prenatal testing,” Int. J. Mol. Sci., 2019.
[89] B.Liu, J. M.Conroy, C. D.Morrison, A. O.Odunsi, M.Qin, L.Wei, D. L.Trump, C. S.Johnson, S.Liu, andJ.Wang, “Structural variation discovery in the cancer genome using next generation sequencing: Computational solutions and perspectives,” Oncotarget, 2015.
[90] W.Mu, B.Li, S.Wu, J.Chen, D.Sain, D.Xu, M. H.Black, R.Karam, K.Gillespie, K. D.Farwell Hagman, L.Guidugli, M.Pronold, A.Elliott, andH. M.Lu, “Detection of structural variation using target captured next-generation sequencing data for genetic diagnostic testing,” Genet. Med., 2019.
[91] H. J.Abel, andE. J.Duncavage, “Detection of structural DNA variation from next generation sequencing data: A review of informatic approaches,” Cancer Genetics. 2013.
[92] B. E.Shie, J. H.Cheng, K. T.Chuang, andV. S.Tseng, “A one-phase method for mining high utility mobile sequential patterns in mobile commerce environments,” 2012.
[93] H.Li, B.Handsaker, A.Wysoker, T.Fennell, J.Ruan, N.Homer, G.Marth, G.Abecasis, andR.Durbin, “The Sequence Alignment/Map format and SAMtools,” Bioinformatics, 2009.
[94] B.Langmead, andS. L.Salzberg, “Fast gapped-read alignment with Bowtie 2,” Nat. Methods, 2012.
[95] C. P.Cheng, K. L.Lan, W. C.Liu, T. T.Chang, andV. S.Tseng, “DeF-GPU: Efficient and effective deletions finding in hepatitis B viral genomic DNA using a GPU architecture,” Methods, 2016.
[96] E.Yildirim, E.Arslan, J.Kim, andT.Kosar, “Application-level optimization of big data transfers through pipelining, parallelism and concurrency,” IEEE Trans. Cloud Comput., 2016.
[97] Y.Ji, S.Chen, H.Yao, H.Fang, K.Li, S.Liu, Z.Xie, andK.Wang, “Multi-thread concurrent compression algorithm for genomic big data,” 2019.
[98] C. M.Liu, T.Wong, E.Wu, R.Luo, S. M.Yiu, Y.Li, B.Wang, C.Yu, X.Chu, K.Zhao, R.Li, andT. W.Lam, “SOAP3: Ultra-fast GPU-based parallel alignment tool for short reads,” Bioinformatics, 2012.
[99] P.Klus, S.Lam, D.Lyberg, M.Cheung, G.Pullan, I.McFarlane, G. S. H.Yeo, andB. Y. H.Lam, “BarraCUDA - A fast short read sequence aligner using graphics processing units,” BMC Res. Notes, 2012.
[100] N.Ahmed, J.Lévy, S.Ren, H.Mushtaq, K.Bertels, andZ.Al-Ars, “GASAL2: A GPU accelerated sequence alignment library for high-throughput NGS data,” BMC Bioinformatics, 2019.
[101] D.Li, R.Luo, C. M.Liu, C. M.Leung, H. F.Ting, K.Sadakane, H.Yamashita, andT. W.Lam, “MEGAHIT v1.0: A fast and scalable metagenome assembler driven by advanced methodologies and community practices,” Methods. 2016.
[102] S.Ren, K.Bertels, andZ.Al-Ars, “GPU-accelerated GATK haplotype caller with load-balanced multi-process optimization,” 2017.

校外：不公開電子論文及紙本論文均尚未授權公開

簡易檢索 / 詳目顯示

相關論文