Daegu Haany University Hyangsan Library

Detailed information

Transparent Checkpointing over RDMA-based Networks

Material type: Thesis/Dissertation
Title/Author: Transparent Checkpointing over RDMA-based Networks.
Personal author: Cao, Jiajun.
Corporate author: Northeastern University. Computer Science.
Publication: [S.l.]: Northeastern University., 2017.
Publication: Ann Arbor: ProQuest Dissertations & Theses, 2017.
Physical description: 147 p.
Host item entry: Dissertation Abstracts International 79-12B(E).
Dissertation Abstracts International
ISBN: 9780438195042
Dissertation note: Thesis (Ph.D.)--Northeastern University, 2017.
General note: Source: Dissertation Abstracts International, Volume: 79-12(E), Section: B.
Adviser: Gene Cooperman.
Abstract: Fault tolerance for large-scale applications has long been an area of active research, as the size of the computation keeps growing. One of the components of a fault-tolerance strategy is checkpointing. However, no explicit checkpoint-restart so
Abstract: In this dissertation, we present the first transparent, system-initiated checkpoint-restart solution that directly supports RDMA networks. This new approach does not depend on a specific parallel programming model, and does not require any modif
Abstract: Conceptually, this dissertation can be divided into three parts. First, we introduce a new, generic model for RDMA networks, by extracting the key components for checkpointing an RDMA network. These components are the essential states that need
Abstract: Second, we demonstrate the performance of the proposed approach. Moving from a medium-sized academic computer cluster to a petascale supercomputer, we show what issues are exposed as the application scales up, and how these issues are addressed.
Abstract: Third, we show how to retrofit transparent checkpointing into the Cloud, as RDMA networks are also becoming more popular in the Cloud. A Checkpointing as a Service approach is presented, which employs checkpointing to provide fault tolerance as
Subject: Computer science.
Language: English
Link: URL: The full text of this item is provided by the Korea Education and Research Information Service (KERIS).
