자료유형 | 학위논문 |
---|---|
서명/저자사항 | Go with the Flow: Graphs, Streaming and Relational Computations over Distributed Dataflow. |
개인저자 | Xin, Reynold Shi. |
단체저자명 | University of California, Berkeley. Electrical Engineering & Computer Sciences. |
발행사항 | [S.l.]: University of California, Berkeley., 2018. |
발행사항 | Ann Arbor: ProQuest Dissertations & Theses, 2018. |
형태사항 | 126 p. |
기본자료 저록 | Dissertation Abstracts International 80-01B(E). Dissertation Abstract International |
ISBN | 9780438324428 |
학위논문주기 | Thesis (Ph.D.)--University of California, Berkeley, 2018. |
일반주기 |
Source: Dissertation Abstracts International, Volume: 80-01(E), Section: B.
Advisers: Michael Franklin |
요약 | Modern data analysis is undergoing a "Big Data" transformation: organizations are generating and gathering more data than ever before, in a variety of formats covering both structured and unstructured data, and employing increasingly sophisticat |
요약 | This dissertation builds on Apache Spark, a distributed dataflow engine, and creates three related systems: Spark SQL, Structured Streaming, and GraphX. Spark SQL combines relational and procedural processing through a new API called DataFrame. |
요약 | The three systems have enjoyed wide adoption in industry and academia, and together they laid the foundation for Spark's 2.0 release. They demonstrate the feasibility and advantages of unifying disparate, specialized data systems on top of distr |
일반주제명 | Computer science. |
언어 | 영어 |
바로가기 |
: 이 자료의 원문은 한국교육학술정보원에서 제공합니다. |