MANIFOLD VALUED DATA ANALYSIS OF SAMPLES OF NETWORKS, WITH APPLICATIONS IN CORPUS LINGUISTICS

成果类型:
Article
署名作者:
Severn, Katie E.; Dryden, Ian L.; Preston, Simon P.
署名单位:
University of Nottingham; State University System of Florida; Florida International University
刊物名称:
ANNALS OF APPLIED STATISTICS
ISSN/ISSBN:
1932-6157
DOI:
10.1214/21-AOAS1480
发表日期:
2022
页码:
368-390
关键词:
STATISTICS
摘要:
Networks arise in many applications, such as in the analysis of text documents, social interactions and brain activity. We develop a general framework for extrinsic statistical analysis of samples of networks, motivated by networks representing text documents in corpus linguistics. We identify networks with their graph Laplacian matrices for which we define metrics, embeddings, tangent spaces and a projection from Euclidean space to the space of graph Laplacians. This framework provides a way of computing means, performing principal component analysis, regression, and carrying out hypothesis tests, such as for testing for equality of means between two samples of networks. We apply the methodology to the set of novels by Jane Austen and Charles Dickens.
来源URL: