대구한의대학교 향산도서관

상세정보

부가기능

Lifelong Reinforcement Learning on Mobile Robots

상세 프로파일

상세정보
자료유형학위논문
서명/저자사항Lifelong Reinforcement Learning on Mobile Robots.
개인저자Isele, David.
단체저자명University of Pennsylvania. Computer and Information Science.
발행사항[S.l.]: University of Pennsylvania., 2018.
발행사항Ann Arbor: ProQuest Dissertations & Theses, 2018.
형태사항196 p.
기본자료 저록Dissertations Abstracts International 81-02B.
Dissertation Abstract International
ISBN9781085588065
학위논문주기Thesis (Ph.D.)--University of Pennsylvania, 2018.
일반주기 Source: Dissertations Abstracts International, Volume: 81-02, Section: B.
Advisor: Eaton, Eric
이용제한사항This item must not be sold to any third party vendors.
요약Machine learning has shown tremendous growth in the past decades, unlocking new capabilities in a variety of fields including computer vision, natural language processing, and robotic control. While the sophistication of individual problems a learning system can handle has greatly advanced, the ability of a system to extend beyond an individual problem to adapt and solve new problems has progressed more slowly. This thesis explores the problem of progressive learning. The goal is to develop methodologies that accumulate, transfer, and adapt knowledge in applied settings where the system is faced with the ambiguity and resource limitations of operating in the physical world.There are undoubtedly many challenges to designing such a system, my thesis looks at the component of this problem related to how knowledge from previous tasks can be a benefit in the domain of reinforcement learning where the agent receives rewards for positive actions. Reinforcement learning is particularly difficult when training on physical systems, like mobile robots, where repeated trials can damage the system and unrestricted exploration is often associated with safety risks. I investigate how knowledge can be efficiently accumulated and applied to future reinforcement learning problems on mobile robots in order to reduce sample complexity and enable systems to adapt to novel settings. Doing this involves mathematical models which can combine knowledge from multiple tasks, methods for restructuring optimizations and data collection to handle sequential updates, and data selection strategies that can be used to address resource limitations.
일반주제명Artificial intelligence.
Robotics.
언어영어
바로가기URL : 이 자료의 원문은 한국교육학술정보원에서 제공합니다.

서평(리뷰)

  • 서평(리뷰)

태그

  • 태그

나의 태그

나의 태그 (0)

모든 이용자 태그

모든 이용자 태그 (0) 태그 목록형 보기 태그 구름형 보기
 
로그인폼