r-Scan statistics of a marker array in multiple sequences derived from a common progenitor
成果类型:
Article
署名作者:
Karlin, S; Chen, CF
署名单位:
Stanford University
刊物名称:
ANNALS OF APPLIED PROBABILITY
ISSN/ISSBN:
1050-5164
发表日期:
2000
页码:
709-725
关键词:
poisson
approximations
protein
摘要:
This study is motivated by problems of molecular sequence comparisons for biological traits conserved or lost over evolution time. A marker of interest is distributed in the genome of the ancestor and inherited among I offspring species which descend from this common ancestor. Each marker will be retained or lost during the evolution of the descendent species. The objective of the analysis here is to ascertain probabilities of clustering or overdispersion of the marker array among the sequences of the descendent species. Limiting distributions for the extremal r-scan statistics (defined in text) of the trait distributed among the I dependent offspring processes are derived by adapting the Chen-Stein Poisson approximation method. Results that accommodate new occurrences of the trait (gene) arising from duplications and transposition occurrences are also described. The r-scan statistical analysis is further applied to a multi sequence combined Poisson model where {B-1,...,B-l} are generated from m independent Poisson processes {A(1),...,A(m)} such that B-k = U(i is an element of Zk)A(i), where {Z(k)}(1 less than or equal tok less than or equal tol) are subsets of {1,2,...,m}.