MARC보기
LDR00000nam u2200205 4500
001000000435167
00520200227162055
008200131s2019 ||||||||||||||||| ||eng d
020 ▼a 9781687987136
035 ▼a (MiAaPQ)AAI27606798
035 ▼a (MiAaPQ)0098vireo4384Mahendran
040 ▼a MiAaPQ ▼c MiAaPQ ▼d 247004
0820 ▼a 629.8
1001 ▼a Mahendran, Siddharth.
24510 ▼a Geometric Deep Learning for Monocular Object Orientation Estimation.
260 ▼a [S.l.]: ▼b The Johns Hopkins University., ▼c 2019.
260 1 ▼a Ann Arbor: ▼b ProQuest Dissertations & Theses, ▼c 2019.
300 ▼a 238 p.
500 ▼a Source: Dissertations Abstracts International, Volume: 81-06, Section: B.
500 ▼a Advisor: Vidal, Rene.
5021 ▼a Thesis (Ph.D.)--The Johns Hopkins University, 2019.
506 ▼a This item must not be sold to any third party vendors.
520 ▼a Monocular object orientation estimation or estimating the 3D orientation of an object given a single 2D image of the object, is an important component of traditional computer vision problems like scene understanding and 3D reconstruction as well as modern vision challenges like autonomous driving, augmented reality and robot manipulation. A main challenge of the object orientation estimation problem is that the task of estimating 3D orientation from a single 2D image is ill-posed. It requires a 3D object model in the loop and a key disadvantage of prior work using geometric models is that they are hand-crafted. Recent work proposes the use of powerful deep learning features via Convolutional Neural Networks (CNNs) that learn appropriate features and models from the data. Prior to the work described in this thesis, deep learning models for object orientation estimation formulated the problem as one of pose classification on the Euler angles. However, this ignores the geometry of the problem and the inherent structure in the orientation space, the set of all rotation matrices, SO(3).This thesis uses Geometric Deep Learning models for the orientation estimation task, which incorporate geometry of the orientation space into the deep learning pipeline by carefully choosing and designing representations, loss functions and network architectures well suited for this application. We first consider the problem of estimating the orientation of an object in an image assuming known object category and a bounding box containing the object in the image. We show that modeling the orientation space correctly by designing Riemannian CNNs i.e. regression and classification CNNs that use axis-angle or quaternion representations of rotation matrices and geodesic loss functions, leads to good performance on a challenging benchmarking dataset. We also propose a family of Bin & Delta models that combine pose classification CNNs (bin model) to get a coarse estimate of the object orientation and pose regression CNNs (delta model) that refine the coarse orientation estimate. Such models achieve state-of-the-art performance in benchmark datasets. Additionally, we have extended these models to the scenarios of unknown categorization and unknown localization by designing novel Integrated Networks to solve these multi-task problems.
590 ▼a School code: 0098.
650 4 ▼a Computer science.
650 4 ▼a Artificial intelligence.
650 4 ▼a Robotics.
690 ▼a 0984
690 ▼a 0800
690 ▼a 0771
71020 ▼a The Johns Hopkins University. ▼b Electrical and Computer Engineering.
7730 ▼t Dissertations Abstracts International ▼g 81-06B.
773 ▼t Dissertation Abstract International
790 ▼a 0098
791 ▼a Ph.D.
792 ▼a 2019
793 ▼a English
85640 ▼u http://www.riss.kr/pdu/ddodLink.do?id=T15494595 ▼n KERIS ▼z 이 자료의 원문은 한국교육학술정보원에서 제공합니다.
980 ▼a 202002 ▼f 2020
990 ▼a ***1008102
991 ▼a E-BOOK