Daegu Haany University Hyangsan Library

Heterogeneous Monolithic 3D and FinFET Architectures for Energy-Efficient Computing

Detailed Information
Material type: Thesis (dissertation)
Title/Author: Heterogeneous Monolithic 3D and FinFET Architectures for Energy-Efficient Computing.
Personal author: Yu, Ye.
Corporate author: Princeton University. Electrical Engineering.
Publication: [S.l.]: Princeton University, 2019.
Publication: Ann Arbor: ProQuest Dissertations & Theses, 2019.
Physical description: 210 p.
Source record: Dissertations Abstracts International 81-04B.
ISBN: 9781085774390
Dissertation note: Thesis (Ph.D.)--Princeton University, 2019.
General note: Source: Dissertations Abstracts International, Volume: 81-04, Section: B.
Advisor: Jha, Niraj K.
Restrictions on use: This item must not be sold to any third party vendors.
Abstract: More transistors are integrated within the same footprint area as the technology node shrinks, delivering higher performance. However, this is accompanied by a higher power density that usually exceeds the coping capability of inexpensive cooling techniques. This Power Wall prevents the chip from running at full speed with all devices powered on. Another major bottleneck in chip design is the imbalance between processor clock rate and memory access speed. This Memory Wall keeps the processor from fully utilizing its compute power. To address both the Power and Memory Walls, we propose several approaches and architectures.

To tackle the Memory Wall, we develop an efficient memory interface for monolithic 3D-stacked non-volatile RAMs (NVRAMs). It takes advantage of the tremendous bandwidth made available by monolithic inter-tier vias (MIVs) to implement an on-chip memory bus that hides the latency of large data transfers. To tackle the Power Wall, we add a fine-grain dynamically reconfigurable (FDR) field-programmable gate array (FPGA) to our monolithic 3D architecture. It uses the concept of temporal logic folding to localize on-chip communication. We show that the architecture significantly reduces both power and energy while delivering better performance for both memory- and compute-intensive applications.

The second problem targeted in this work is the development of energy-efficient architectures for convolutional neural networks (CNNs). CNNs have been shown to outperform conventional machine-learning algorithms across a wide range of applications, e.g., object detection, image classification, and image segmentation. However, the high computational complexity of CNNs often necessitates extremely fast and efficient hardware, and the problem is getting worse as the size of neural networks grows exponentially. As a result, customized hardware accelerators have been developed to accelerate CNN processing without sacrificing model accuracy.
However, previous accelerator design studies have not fully considered the characteristics of the target applications, which may lead to sub-optimal architecture designs. On the other hand, new CNN models have been developed for better accuracy, but their compatibility with the underlying hardware accelerator is overlooked most of the time. We propose an application-driven framework for architectural design space exploration of CNN accelerators. This framework is based on a hardware analytical model for individual CNN operations, and it models the accelerator design task as a multi-dimensional optimization problem. We demonstrate that it can be used efficaciously in application-driven accelerator architecture design. In addition, it is capable of improving neural network models to best fit the underlying hardware resources.

Most existing CNN accelerators focus on exploring various dataflow styles and computational parallelism designs, while the potential performance improvement from sparsity (in activations and weights) remains underexploited. The amount of computation and the memory footprint of CNNs can be significantly reduced if sparsity is exploited during network evaluation. With the design space exploration method discussed above, we develop SPRING, a sparsity-aware reduced-precision CNN accelerator architecture for both training and inference. We use a binary-mask scheme to encode the sparsity of activations and weights, and adopt the stochastic rounding algorithm to train CNNs with reduced precision without accuracy loss. We use the efficient monolithic 3D non-volatile memory interface to alleviate the memory bottleneck of CNN evaluation, especially during training.

The last research direction of this thesis focuses on analyzing the timing, leakage power, and dynamic power of FinFET architectures under process, supply voltage, and temperature (PVT) variations.
We propose a statistical optimization framework, based on dual device-type assignment at the architecture level, that accounts for spatial correlations under PVT variations and leverages circuit-level statistical analysis techniques.
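The binary-mask sparsity encoding and stochastic rounding that the abstract attributes to SPRING can be illustrated in software. The following is a minimal Python sketch under stated assumptions, not SPRING's hardware implementation: the function names, the flat-list tensor representation, and the quantization step are illustrative choices, not details from the dissertation.

```python
import math
import random

def encode_binary_mask(values):
    """Split a flat list into a bit mask (1 = nonzero) plus the packed
    nonzero values; zeros cost only one mask bit each, so sparse
    activations and weights shrink considerably."""
    mask = [1 if v != 0 else 0 for v in values]
    nonzeros = [v for v in values if v != 0]
    return mask, nonzeros

def decode_binary_mask(mask, nonzeros):
    """Reconstruct the original flat list from (mask, nonzeros)."""
    it = iter(nonzeros)
    return [next(it) if bit else 0 for bit in mask]

def stochastic_round(x, step, rng=random):
    """Quantize x to a multiple of `step`, rounding up with probability
    equal to the fractional remainder, so the result is unbiased:
    E[stochastic_round(x)] == x."""
    q = x / step
    lo = math.floor(q)
    p_up = q - lo  # distance past the lower grid point
    return (lo + 1) * step if rng.random() < p_up else lo * step

# Round-trip a sparse activation vector through the mask encoding.
acts = [0.0, 1.3, 0.0, 0.0, -0.7, 2.1]
mask, nz = encode_binary_mask(acts)
assert mask == [0, 1, 0, 0, 1, 1]
assert decode_binary_mask(mask, nz) == acts
```

Averaged over many draws, `stochastic_round(0.3, 0.25)` yields 0.25 about 80% of the time and 0.5 about 20%, so its expectation is 0.3; this unbiasedness is what lets reduced-precision training avoid the systematic error that round-to-nearest accumulates across many updates.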
Subject headings: Computer engineering.
Electrical engineering.
Language: English
Quick link: URL: The full text of this item is provided by the Korea Education and Research Information Service (KERIS).
