-
作者:Mclaughlin, Katherine R.; Johnston, Lisa G.; Jakupi, Xhevat; Gexha-bunjaku, Dafina; Deva, Edona; Handcock, Mark S.
作者单位:Oregon State University; University of California System; University of California Los Angeles
摘要:Respondent -driven sampling (RDS) is used throughout the world to estimate prevalence and population size for hidden populations. Although RDS is an effective method for enrolling people from key populations in studies, it relies on a partially unknown sampling mechanism, and thus each individual's inclusion probability is unknown. Current estimators for population prevalence, population size, and other outcomes rely on a participant's network size (degree) to approximate their inclusion proba...
-
作者:Manole, Tudor; Bryant, Patrick; Alison, John; Kuusela, Mikael; Wasserman, Larry
作者单位:Massachusetts Institute of Technology (MIT); Carnegie Mellon University; Carnegie Mellon University; Carnegie Mellon University
摘要:We study the problem of data-driven background estimation, arising in the search of physics signals predicted by the Standard Model at the Large Hadron Collider. Our work is motivated by the search for the production of pairs of Higgs bosons decaying into four bottom quarks. A number of other physical processes, known as background, also share the same final state. The data arising in this problem is, therefore, a mixture of unlabeled background and signal events, and the primary aim of the an...
-
作者:Wrobel, Julia; Sauerbrei, Britton; Kirk, Eric A.; Guo, Jian-Zhon; Hantman, Adam; Goldsmith, Jeff
作者单位:Emory University; University System of Ohio; Case Western Reserve University; University of North Carolina; University of North Carolina Chapel Hill; University of North Carolina School of Medicine; Columbia University
摘要:We are motivated by a study that seeks to better understand the dynamic relationship between muscle activation and paw position during locomotion. For each gait cycle in this experiment, activation in the biceps and triceps is measured continuously and in parallel with paw position as a mouse trotted on a treadmill. We propose an innovative general regression method that draws from both ordinary differential equations and functional data analysis to model the relationship between these functio...
-
作者:Hadj-Amar, Beniamino; Jewson, Jack; Vannucci, Marina
作者单位:Rice University; Pompeu Fabra University
摘要:We propose a sparse vector autoregressive ( VAR ) hidden semi-Markov model ( HSMM ) for modeling temporal and contemporaneous (e.g., spatial) dependencies in multivariate nonstationary time series. The HSMM's 's generic state distribution is embedded in a special transition matrix structure, facilitating efficient likelihood evaluations and arbitrary approximation accuracy. To promote sparsity of the VAR coefficients, we deploy an l1 1-ball projection prior, which combines differentiability wi...
-
作者:Jeon, Minjeong; Schweinberger, Michael
作者单位:University of California System; University of California Los Angeles; Pennsylvania Commonwealth System of Higher Education (PCSHE); Pennsylvania State University; Pennsylvania State University - University Park
摘要:The recent shift to remote learning and work has aggravated longstanding problems, such as the problem of monitoring the mental health of individuals and the progress of students toward learning targets. We introduce a novel latent process model with a view to monitoring the progress of individuals toward a hard-to-measure target of interest and measured by a set of variables. The latent process model is based on the idea of embedding both individuals and variables measuring progress toward th...
-
作者:Han, Larry; Li, Yige; Niknam, Bijan; Zubizarreta, Jost
作者单位:Northeastern University; Harvard University; Harvard T.H. Chan School of Public Health; Harvard University; Harvard Medical School
摘要:Accurate hospital performance measurement is important to both patients and providers but is challenging due to case -mix heterogeneity, differences in treatment guidelines, and data privacy regulations that preclude the sharing of individual patient data. Motivated to overcome these issues in the setting of hospital quality measurement, we develop a federated causal inference framework. We devise a doubly robust estimator of the mean potential outcome in a target population and show that it i...
-
作者:Liu, Zihuan; Lee, Cheuk Yin; Zhang, Heping
作者单位:Yale University; Chinese University of Hong Kong
摘要:Neuroimaging studies often involve predicting a scalar outcome from an array of images collectively called tensor. The use of magnetic resonance imaging (MRI) provides a unique opportunity to investigate the structures of the brain. To learn the association between MRI images and human intelligence, we formulate a scalar-on-image quantile regression framework. However, the high dimensionality of the tensor makes estimating the coefficients for all elements computationally challenging. To addre...
-
作者:Patel, Ashish; Ditraglia, Francis J.; Zuber, Verena; Burgess, Stephen
作者单位:MRC Biostatistics Unit; University of Cambridge; University of Oxford; Imperial College London
摘要:Mendelian randomization (MR) is a widely-used method to estimate the causal relationship between a risk factor and disease. A fundamental part of any MR analysis is to choose appropriate genetic variants as instrumental variables. Genome-wide association studies often reveal that hundreds of genetic variants may be robustly associated with a risk factor, but in some situations investigators may have greater confidence in the instrument validity of only a smaller subset of variants. Nevertheles...
-
作者:Peterson, Emily N.; Nethery, Rachel C.; Padellini, Tullia; Chen, Jarvis T.; Coull, Brent A.; Piel, Frederic B.; Wakefield, Jon; Blangiardo, Marta; Waller, Lance A.
作者单位:Emory University; Rollins School Public Health; Harvard University; Harvard T.H. Chan School of Public Health; Imperial College London; University of Washington; University of Washington Seattle
摘要:Small area population counts are necessary for many epidemiological studies, yet their quality and accuracy are often not assessed. In the United States, small area population counts are published by the United States Census Bureau (USCB) in the form of the decennial census counts, intercensal population projections (PEP), and American Community Survey (ACS) estimates. Although there are significant relationships between these three data sources, there are important contrasts in data collectio...
-
作者:Wang, Zhong; Paters, Andrew D.; Sun, Lei
作者单位:National University of Singapore; University of Toronto; Hospital for Sick Children (SickKids); University of Toronto
摘要:Sex difference in allele frequency is an emerging topic that is crucial to our understanding of data quality and features, particularly when it comes to the largely overlooked X chromosome. To detect sex differences in allele frequency for both X chromosomal and autosomal variants, the existing method is conservative when applied to samples from multiple ancestral populations. Additionally, it remains unexplored whether the sex difference in allele frequency varies between populations, which i...