Genetics. Published Articles Ahead of Print: August 20, 2008, Copyright © 2008
doi:10.1534/genetics.108.090860


A more recent version of this article appeared on September 1, 2008.


REGULAR RESEARCH PAPERS

The complementary neighborhood patterns and methylation-to-mutation likelihood structures of 15,110 single nucleotide polymorphisms in the bovine genome

1 Washington State University
2 University of Wisconsin-Madison

* To whom correspondence should be addressed. E-mail: jiangz{at}wsu.edu.

Submitted on April 30, 2008
Revised on June 9, 2008
Accepted on 19 June 2008


Abstract

The Bayesian analysis was performed to examine the single nucleotide polymorphisms (SNPs) neighborhood patterns in cattle using 15,110 SNPs, each with a flanking sequence of 500 bp. Our analysis confirmed three well-known features reported in plants and/or other animals: 1) the transition is the most abundant type of SNPs, accounting for 69.8% in cattle; 2) the transversion occurs most frequently (38.56%) in cattle when the A+T content equals two at their immediate adjacent sites and 3) C{Leftrightarrow}T and A{Leftrightarrow}G transitions have reverse complementary neighborhood patterns and so do A{Leftrightarrow}C and G{Leftrightarrow}T transversions. Our present study also revealed several novel SNP neighborhood patterns that have not been reported previously. First, cattle and humans share an overall SNP pattern, indicating a common mutation system in mammals. Second, unlike C{Leftrightarrow}T/A{Leftrightarrow}G and A{Leftrightarrow}C/G{Leftrightarrow}T, the true neighborhood patterns for A{Leftrightarrow}T and C{Leftrightarrow}G might remain mysterious because the sense and antisense sequences flanking these mutations are not actually recognizable. Third, among the reclassified four types of SNPs, the neighborhood ratio between A+T and G+C was quite different. The ratio was lowest for C{Leftrightarrow}G, but increased for C{Leftrightarrow}T/A{Leftrightarrow}G, further for A{Leftrightarrow}C/G{Leftrightarrow}T and at most for A{Leftrightarrow}T. Fourth, when two immediate adjacent sites provide structures for CpG, it significantly increased transitions compared to the structures without the CpG. Last, unequal occurrence between A{Leftrightarrow}G and C{Leftrightarrow}T in five paired neighboring structures indicates that the methylation induced deamination reactions were responsible for ~20% of total transitions. In addition, the conversion can occur at both CpG sites and non-CpG sites. Our present study provides new insights into understanding molecular mechanisms of mutations and genome evolution.

Key Words: SNPs, methylation induced transitions, naturally occurring transitions, neighborhood patterns, reverse complement