Scan statistics with weighted observations

成果类型:
Article
署名作者:
Chan, Hock Peng; Zhang, Nancy Ruonan
署名单位:
National University of Singapore; Stanford University
刊物名称:
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
ISSN/ISSBN:
0162-1459
DOI:
10.1198/016214506000001392
发表日期:
2007
页码:
595-602
关键词:
P-VALUES sequence alignments approximations dna prediction drosophila clusters maximum protein bounds
摘要:
We examine scan statistics for one-dimensional marked Poisson processes. Such statistics tabulate the maximum weighted count of event occurrences within a window of predetermined width over all windows within an observed interval. We derive analytical formulas and also give an importance sampling method for approximating the tail probabilities of scan statistics. Because high-throughput genomic sequencing has led to the availability of massive amounts of biomolecular sequence data, it is often of interest to search long DNA or protein sequences for local regions that are enriched for a certain characteristic. Thus scan statistics have become it useful tool in modern computational biology. We illustrate the application of our p value approximations with such examples.