자료유형 | 학위논문 |
---|---|
서명/저자사항 | Gaussian Regression Model Selection in High Dimension. |
개인저자 | Yang, Zikun. |
단체저자명 | Indiana University. Statistics. |
발행사항 | [S.l.]: Indiana University., 2019. |
발행사항 | Ann Arbor: ProQuest Dissertations & Theses, 2019. |
형태사항 | 139 p. |
기본자료 저록 | Dissertations Abstracts International 81-04B. Dissertation Abstract International |
ISBN | 9781687905277 |
학위논문주기 | Thesis (Ph.D.)--Indiana University, 2019. |
일반주기 |
Source: Dissertations Abstracts International, Volume: 81-04, Section: B.
Advisor: Womack, Andrew. |
이용제한사항 | This item must not be sold to any third party vendors. |
요약 | Statistics is a discipline focused on experimental design, data collection, inference, and presentation. Numerous statistical methodologies have become standard tools for offering insight into investigations from applied disciplines. Arguably the most fundamental and widely applied method is Gaussian linear regression model, which is a probabilistic method for modeling the relationship between a randomly distributed response and one or more explanatory covariates. One of the main challenges of building a regression model is to select a suitable model based on different criteria, and researchers oftentimes tend to choose an economic model with as few covariates possible while still maintaining good estimation or prediction properties. As the ability of collecting massive amounts of data has dramatically increased in recent years (e.g., in genomic studies, online data collection, and smart devices) a new difficulty has arisen in performing model selection in the high-dimensional regime. Here, the number of explanatory covariates is large (or even increasing) with respect to the sample size. In this thesis, I address the model section from two directions, discrete variable selection and continuous variable shrinkage. The discrete variable selection is to seek the model with the highest posterior probability conditioning on the observed data. To overcome the dilemma imposed by the growing dimensions of the model space, a truncated Poisson prior and the mixture of g[Special character(s) omitted]prior have been deployed to attain model selection consistency. For continuous variable shrinkage, the model includes all of the covariates and achieves model selection behavior by shrinking the coefficients of covariates with no or negligible effect on the outcome variable. An innovative log-Cauchy prior distribution on the local shrinkage parameter has been discovered, which yields superior theoretical advantages over competing methods while not creating a more costly computational framework. These methods should contribute to the solution of new challenges received from applied disciplines. |
일반주제명 | Statistics. Mathematics. |
언어 | 영어 |
바로가기 |
: 이 자료의 원문은 한국교육학술정보원에서 제공합니다. |