LDR | | 00000nam u2200205 4500 |
001 | | 000000435208 |
005 | | 20200227163238 |
008 | | 200131s2019 ||||||||||||||||| ||eng d |
020 | |
▼a 9781687927699 |
035 | |
▼a (MiAaPQ)AAI27536114 |
035 | |
▼a (MiAaPQ)umichrackham002162 |
040 | |
▼a MiAaPQ
▼c MiAaPQ
▼d 247004 |
082 | 0 |
▼a 621.3 |
100 | 1 |
▼a Deshmukh, Aniket Anand. |
245 | 10 |
▼a Kernel Methods for Learning with Limited Labeled Data. |
260 | |
▼a [S.l.]:
▼b University of Michigan.,
▼c 2019. |
260 | 1 |
▼a Ann Arbor:
▼b ProQuest Dissertations & Theses,
▼c 2019. |
300 | |
▼a 169 p. |
500 | |
▼a Source: Dissertations Abstracts International, Volume: 81-04, Section: B. |
500 | |
▼a Advisor: Scott, Clayton D. |
502 | 1 |
▼a Thesis (Ph.D.)--University of Michigan, 2019. |
506 | |
▼a This item must not be sold to any third party vendors. |
506 | |
▼a This item must not be added to any third party search indexes. |
520 | |
▼a Machine learning is a rapidly developing technology that enables a system to automatically learn and improve from experience. Modern machine learning algorithms have achieved state-of-the-art performances on a variety of tasks such as speech recognition, image classification, machine translation, playing games like Go, Dota 2, etc. However, one of the biggest challenges in applying these machine learning algorithms in the real world is that they require huge amount of labeled data for the training. In the real world, the amount of labeled training data is often limited.In this thesis, we address three challenges in learning with limited labeled data using kernel methods. In our first contribution, we provide an efficient way to solve an existing domain generalization algorithm and extend the theoretical analysis to multiclass classification. As a second contribution, we propose a multi-task learning framework for contextual bandit problems. We propose an upper confidence bound-based multi-task learning algorithm for contextual bandits, establish a corresponding regret bound, and interpret this bound to quantify the advantages of learning in the presence of high task (arm) similarity. Our third contribution is to provide a simple regret guarantee (best policy identification) in a contextual bandits setup. Our experiments examine a novel application to adaptive sensor selection for magnetic field estimation in interplanetary spacecraft and demonstrate considerable improvements of our algorithm over algorithms designed to minimize the cumulative regret. |
590 | |
▼a School code: 0127. |
650 | 4 |
▼a Engineering. |
650 | 4 |
▼a Computer science. |
650 | 4 |
▼a Electrical engineering. |
690 | |
▼a 0537 |
690 | |
▼a 0544 |
690 | |
▼a 0984 |
710 | 20 |
▼a University of Michigan.
▼b Electrical and Computer Engineering. |
773 | 0 |
▼t Dissertations Abstracts International
▼g 81-04B. |
773 | |
▼t Dissertation Abstract International |
790 | |
▼a 0127 |
791 | |
▼a Ph.D. |
792 | |
▼a 2019 |
793 | |
▼a English |
856 | 40 |
▼u http://www.riss.kr/pdu/ddodLink.do?id=T15494193
▼n KERIS
▼z 이 자료의 원문은 한국교육학술정보원에서 제공합니다. |
980 | |
▼a 202002
▼f 2020 |
990 | |
▼a ***1008102 |
991 | |
▼a E-BOOK |