-
作者:Shen, Ronglai; Wang, Sijian; Mo, Qianxing
作者单位:Memorial Sloan Kettering Cancer Center; University of Wisconsin System; University of Wisconsin Madison; University of Wisconsin System; University of Wisconsin Madison; Baylor College of Medicine
摘要:High resolution microarrays and second-generation sequencing platforms are powerful tools to investigate genome-wide alterations in DNA copy number, methylation and gene expression associated with a disease. An integrated genomic profiling approach measures multiple omics data types simultaneously in the same set of biological samples. Such approach renders an integrated data resolution that would not be available with any single data type. In this study, we use penalized latent variable regre...
-
作者:Airoldi, Edoardo M.; Wang, Xiaopei; Lin, Xiaodong
作者单位:Harvard University; University System of Ohio; University of Cincinnati; Rutgers University System; Rutgers University New Brunswick
摘要:We consider the problem of quantifying temporal coordination between multiple high-dimensional responses. We introduce a family of multi-way stochastic blockmodels suited for this problem, which avoids preprocessing steps such as binning and thresholding commonly adopted for this type of data, in biology. We develop two inference procedures based on collapsed Gibbs sampling and variational methods. We provide a thorough evaluation of the proposed methods on simulated data, in terms of membersh...
-
作者:Crossett, Andrew; Lee, Ann B.; Klei, Lambertus; Devlin, Bernie; Roeder, Kathryn
作者单位:Pennsylvania State System of Higher Education (PASSHE); West Chester University of Pennsylvania; Carnegie Mellon University; Pennsylvania Commonwealth System of Higher Education (PCSHE); University of Pittsburgh
摘要:Recent technological advances coupled with large sample sets have uncovered many factors underlying the genetic basis of traits and the predis-position to complex disease, but much is left to discover. A common thread to most genetic investigations is familial relationships. Close relatives can be identified from family records, and more distant relatives can be inferred from large panels of genetic markers. Unfortunately these empirical estimates can be noisy, especially regarding distant rel...
-
作者:Handorf, Elizabeth A.; Bekelman, Justin E.; Heitjan, Daniel F.; Mitra, Nandita
作者单位:Pennsylvania Commonwealth System of Higher Education (PCSHE); Temple University; Fox Chase Cancer Center; University of Pennsylvania; University of Pennsylvania
摘要:Estimates of the effects of treatment on cost from observational studies are subject to bias if there are unmeasured confounders. It is therefore advisable in practice to assess the potential magnitude of such biases. We derive a general adjustment formula for loglinear models of mean cost and explore special cases under plausible assumptions about the distribution of the unmeasured confounder. We assess the performance of the adjustment by simulation, in particular, examining robustness to a ...
-
作者:Pannekoek, Jeroen; Shlomo, Natalie; De Waal, Ton
作者单位:University of Manchester
摘要:A common problem faced by statistical institutes is that data may be missing from collected data sets. The typical way to overcome this problem is to impute the missing data. The problem of imputing missing data is complicated by the fact that statistical data often have to satisfy certain edit rules and that values of variables across units sometimes have to sum up to known totals. For numerical data, edit rules are most often formulated as linear restrictions on the variables. For example, f...
-
作者:Konomi, Bledar A.; Dhavala, Soma S.; Huang, Jianhua Z.; Kundu, Subrata; Huitink, David; Liang, Hong; Ding, Yu; Mallick, Bani K.
作者单位:Texas A&M University System; Texas A&M University College Station; Texas A&M University System; Texas A&M University College Station; Texas A&M University System; Texas A&M University College Station
摘要:The properties of materials synthesized with nanoparticles (NPs) are highly correlated to the sizes and shapes of the nanoparticles. The transmission electron microscopy (TEM) imaging technique can be used to measure the morphological characteristics of NPs, which can be simple circles or more complex irregular polygons with varying degrees of scales and sizes. A major difficulty in analyzing the TEM images is the overlapping of objects, having different morphological properties with no specif...
-
作者:Quick, Harrison; Banerjee, Sudipto; Carlin, Bradley P.
作者单位:University of Minnesota System; University of Minnesota Twin Cities
摘要:Advances in Geographical Information Systems (GIS) have led to the enormous recent burgeoning of spatial-temporal databases and associated statistical modeling. Here we depart from the rather rich literature in space-time modeling by considering the setting where space is discrete (e. g., aggregated data over regions), but time is continuous. Our major objective in this application is to carry out inference on gradients of a temporal process in our data set of monthly county level asthma hospi...
-
作者:Ye, Zhi-Sheng; Hong, Yili; Xie, Yimeng
作者单位:Hong Kong Polytechnic University; Virginia Polytechnic Institute & State University
摘要:The main objective of accelerated life tests (ALTs) is to predict fraction failings of products in the field. However, there are often discrepancies between the predicted fraction failing from the lab testing data and that from the field failure data, due to the yet unobserved heterogeneities in usage and operating conditions. Most previous research on ALT planning and data analysis ignores the discrepancies, resulting in inferior test plans and biased predictions. In this paper we model the h...
-
作者:Lin, Winston
作者单位:University of California System; University of California Berkeley
摘要:Freedman [Adv. in Appl. Math. 40 (2008) 180-193; Ann. Appl. Stat. 2 (2008) 176-196] critiqued ordinary least squares regression adjustment of estimated treatment effects in randomized experiments, using Neyman's model for randomization inference. Contrary to conventional wisdom, he argued that adjustment can lead to worsened asymptotic precision, invalid measures of precision, and small-sample bias. This paper shows that in sufficiently large samples, those problems are either minor or easily ...
-
作者:Rusch, Thomas; Hofmarcher, Paul; Hatzinger, Reinhold; Hornik, Kurt
作者单位:Vienna University of Economics & Business; Johannes Kepler University Linz; Vienna University of Economics & Business
摘要:The WikiLeaks Afghanistan war logs contain nearly 77,000 reports of incidents in the US-led Afghanistan war, covering the period from January 2004 to December 2009. The recent growth of data on complex social systems and the potential to derive stories from them has shifted the focus of journalistic and scientific attention increasingly toward data-driven journalism and computational social science. In this paper we advocate the usage of modern statistical methods for problems of data journali...