MARC보기
LDR00000nam u2200205 4500
001000000433688
00520200225152637
008200131s2019 ||||||||||||||||| ||eng d
020 ▼a 9781085696005
035 ▼a (MiAaPQ)AAI22618715
040 ▼a MiAaPQ ▼c MiAaPQ ▼d 247004
0820 ▼a 004
1001 ▼a Hao, Yuning.
24510 ▼a Statistical and Computational Methods for Biological Data.
260 ▼a [S.l.]: ▼b Michigan State University., ▼c 2019.
260 1 ▼a Ann Arbor: ▼b ProQuest Dissertations & Theses, ▼c 2019.
300 ▼a 101 p.
500 ▼a Source: Dissertations Abstracts International, Volume: 81-03, Section: B.
500 ▼a Includes supplementary digital materials.
500 ▼a Advisor: Xie, Yuying.
5021 ▼a Thesis (Ph.D.)--Michigan State University, 2019.
506 ▼a This item must not be sold to any third party vendors.
506 ▼a This item must not be added to any third party search indexes.
520 ▼a The development of biological data focuses on machine learning and statistical methods. In immunotherapy, gene-expression deconvolution is used to quantify different types of cells in a mixed population. It provides a highly promising solution to rapidly characterize the tumor-infiltrating immune landscape and identify cold cancers. However, a major challenge is that gene-expression data are frequently contaminated by many outliers that decrease the estimation accuracy. Thus, it is imperative to develop a robust deconvolution method that automatically decontaminates data by reliably detecting and removing outliers. Our development of an algorithm called adaptive Least Trimmed Square (aLTS) identifies outliers in regression models, allows us to effectively detect and omit the outliers, and provides us robust estimations of the coefficients. For the guarantees of the convergence property and parameters recovery, we also included certain theoretical results.Another interesting topic is the investigation of the association of phenotype responses with the identified intricate patterns in transcription factor binding sites for DNA sequences. To address these concerns, we pushed forward with a deep learning-based framework. On one hand, to capture regulatory motifs, we utilized convolution and pooling layers. On the other hand, to understand the long-term dependencies among motifs, we used position embedding and multi-head self-attention layers. We pursued the improvement of our model's overall efficacy through the integration of transfer learning and multi-task learning. To ascertain confirmed and novel transcription factor binding motifs (TFBMs), along with their relationships internally, we provided interpretations of our DNA quantification model.
590 ▼a School code: 0128.
650 4 ▼a Statistics.
650 4 ▼a Computer science.
690 ▼a 0463
690 ▼a 0984
71020 ▼a Michigan State University. ▼b Statistics - Doctor of Philosophy.
7730 ▼t Dissertations Abstracts International ▼g 81-03B.
773 ▼t Dissertation Abstract International
790 ▼a 0128
791 ▼a Ph.D.
792 ▼a 2019
793 ▼a English
85640 ▼u http://www.riss.kr/pdu/ddodLink.do?id=T15493561 ▼n KERIS ▼z 이 자료의 원문은 한국교육학술정보원에서 제공합니다.
980 ▼a 202002 ▼f 2020
990 ▼a ***1008102
991 ▼a E-BOOK