-
Authors: Duan, Chenyang; Jiang, Yuan
Affiliations: AbbVie; Oregon State University
Abstract: To classify biological roles of different species in an ecological system, modern studies collect longitudinal and compositional counts of DNA sequences of taxonomically diagnostic genetic markers to measure the abundance of species over time. The major challenges of conducting this analysis are twofold: how to accommodate the complex dependence in this data type and how to model the longitudinal trajectories of the species' abundances. In this paper we propose a novel method named COMPARING t...
-
Authors: Jiang, Bei; Raftery, Adrian E.; Steele, Russell J.; Wang, Naisyin
Affiliations: University of Alberta; University of Washington; University of Washington Seattle; McGill University; University of Michigan System; University of Michigan
Abstract: Responsible data sharing anchors research reproducibility and promotes the integrity of scientific research. Motivated by Canadian Scleroderma Research Group (CSRG) patient registry data, we present a risk-based method to produce privacy-preserved and high-utility synthetic datasets, which also simultaneously imputes missing data of mixed continuous and categorical types in the original dataset. This method divides all individuals into different subgroups, based on their reidentification risks...
-
Authors: Shi-Jun, Samantha; Shand, Lyndsay; Li, Bo
Affiliations: University of Illinois System; University of Illinois Urbana-Champaign; United States Department of Energy (DOE); Sandia National Laboratories
Abstract: Significant events, such as volcanic eruptions, can have global and long-lasting impacts on climate. These global impacts, however, are not uniform across space and time. Understanding how the Mt. Pinatubo eruption affected global and regional climate is of great interest for predicting the climate impact of similar events as well as for understanding the possible effect of the stratospheric aerosol injections proposed to combat climate change. While many studies illustrated the impact of the...
-
Authors: Boxer, Kate S.; Hong, Boyeong; Kontokosta, Constantine E.; Neill, Daniel B.
Affiliations: New York University
Abstract: Systems such as 311 enable residents of a community to report on their environments and to request nonemergency municipal services. While such systems provide an important link between community and government, resident-generated data suffer from reporting bias, with some subpopulations reporting at lower rates than others. Our research focuses on defining the underreporting of heating and hot water problems to New York City's 311 system and developing methods to estimate underreporting. Firs...
-
Authors: Li, Dayi; Stringer, Alex; Brown, Patrick E.; Eadie, Gwendolyn M.; Abraham, Roberto G.
Affiliations: University of Toronto; University of Waterloo; University of Toronto
Abstract: We propose a novel set of Poisson cluster process (PCP) models to detect ultra-diffuse galaxies (UDGs), a class of extremely faint, enigmatic galaxies of substantial interest in modern astrophysics. We model the unobserved UDG locations as parent points in a PCP and infer their positions based on the observed spatial point patterns of their old star cluster systems. Many UDGs have anywhere from a few to hundreds of these old star clusters, which we treat as offspring points in our models. We ...
-
Authors: Zou, Haotian; Xiao, Luo; Zeng, Donglin; Luo, Sheng
Affiliations: Duke University; North Carolina State University; University of Michigan System; University of Michigan
Abstract: Alzheimer's Disease (AD) is a common neurodegenerative disorder impairing multiple domains. Recent AD studies, for example, the Alzheimer's [...] to better understand AD severity and progression. To facilitate precision medicine for high-risk individuals, it is essential to develop an AD predictive model that leverages multimodal data and provides accurate personalized predictions of dementia occurrences. In this article we propose a multivariate functional mixed model with longitudinal magnetic res...
-
Authors: Cabello, Esteban; Morales, Domingo; Perez, Agustin
Affiliations: Universidad Miguel Hernandez de Elche
Abstract: Exposure indices measure the degree of contact between two groups and are used to quantify occupational discrepancies between genders in a set of occupational sectors. This paper presents a novel methodology for predicting area-level proportions of employed men and women across various occupational sectors, along with estimating exposure indices. The challenge arises from the compositional nature of the direct estimators of proportions, which tend to be imprecise when sample sizes are small. To ...
-
Authors: Ma, Yingying; Lan, Wei; Leng, Chenlei; Li, Ting; Wang, Hansheng
Affiliations: Beihang University; Southwestern University of Finance & Economics - China; University of Warwick; Hong Kong Polytechnic University; Peking University
Abstract: The social characteristics of players in a social network are closely associated with their network positions and relational importance. Identifying those influential players in a network is of great importance, as it helps to understand how ties are formed, how information is propagated, and, in turn, can guide the dissemination of new information. Motivated by a Sina Weibo social network analysis of the 2021 Henan Floods, where response variables for each Sina Weibo user are available, we pr...
-
Authors: MacBride, Cara; Davies, Vinny; Lee, Duncan
Affiliations: University of Glasgow
Abstract: In spatial areal unit data with missing or suppressed values, it is desirable to create models that are able to predict observations that are not available. Typically, statistical spatial smoothing models fitted in a Bayesian hierarchical framework are used for this purpose, which capture any unexplained residual spatial autocorrelation in the data through conditional autoregressive (CAR) or spatial autoregressive (SAR) priors applied to a set of random effects. In contrast, typical machine le...
-
Authors: Wang, Jiping; Li, Rong; Chang, Wei-Shan; Hsiao, Kai-Yuan; Shia, Ben-Chang; Ma, Shuangge
Affiliations: Yale University; Fu Jen Catholic University
Abstract: The analysis of clinical treatment measures has been extensively conducted and can facilitate more effective resource management and assist in better understanding diseases. Most of the existing analyses have focused on a single disease or many diseases combined. Partly motivated by the successes of gene-centric and phenotypic human disease network (HDN) research, there has been growing interest in network analysis of clinical treatment measures. However, existing studies have been limite...