대구한의대학교 향산도서관

상세정보

부가기능

A Graph-Based Approach to Change-Point Detection for Multivariate and Non-Euclidean Data

상세 프로파일

상세정보
자료유형학위논문
서명/저자사항A Graph-Based Approach to Change-Point Detection for Multivariate and Non-Euclidean Data.
개인저자Chu, Lynna.
단체저자명University of California, Davis. Biostatistics.
발행사항[S.l.]: University of California, Davis., 2019.
발행사항Ann Arbor: ProQuest Dissertations & Theses, 2019.
형태사항156 p.
기본자료 저록Dissertations Abstracts International 81-04B.
Dissertation Abstract International
ISBN9781085796194
학위논문주기Thesis (Ph.D.)--University of California, Davis, 2019.
일반주기 Source: Dissertations Abstracts International, Volume: 81-04, Section: B.
Advisor: Chen, Hao.
이용제한사항This item must not be sold to any third party vendors.This item must not be added to any third party search indexes.
요약We consider the testing and estimation of change-points, locations where the distribution abruptly changes, in a sequence of multivariate or non-Euclidean observations. While the change-point problem has been extensively studied for low-dimensional data, advances in data collection technology have produced data sequences of increasing volume and complexity. Motivated by the challenges of modern data, we study a non-parametric framework that can be effectively applied to various data types as long as an informative similarity measure on the sample space can be defined. We first consider the change-point problem in the offline setting, where the sequence of observations has been completely observed. The existing approach along this line has low power and/or biased estimates for change-points under some common scenarios. To address these problems, we present new tests based on similarity information that exhibit substantial improvements in detecting and estimating change-points. In addition, under some mild conditions, the new test statistics are asymptotically distribution free under the null hypothesis of no change. Analytic p-value approximation formulas to the significance of the new test statistics are derived, making the new approaches easy off-the-shelf tools for large datasets.Moreover, in many applications it is of scientific significance to detect anomaly events as data are being collected. We extend the new test statistics to the related, but distinct, online setting where change-points are detected sequentially as data is being generated. The approach utilizes nearest neighbor information and can be applied to ongoing sequences of multivariate data or non-Euclidean data. Analytic formulas for approximating the average run lengths of the new approaches are derived to make them fast applicable for large datasets. The effectiveness of the new approaches are illustrated in an analysis of New York taxi data.
일반주제명Biostatistics.
Statistics.
언어영어
바로가기URL : 이 자료의 원문은 한국교육학술정보원에서 제공합니다.

서평(리뷰)

  • 서평(리뷰)

태그

  • 태그

나의 태그

나의 태그 (0)

모든 이용자 태그

모든 이용자 태그 (0) 태그 목록형 보기 태그 구름형 보기
 
로그인폼