SNP, SV, CNV...does the substrate matter?

Paired-End Mapping Reveals Extensive Structural Variation in the Human Genome:

Structural variation of the genome involves kilobase- to megabase-sized deletions, duplications, insertions, inversions, and complex combinations of rearrangements. We introduce high-throughput and massive paired-end mapping (PEM), a large-scale genome sequencing method to identify structural variants (SVs) ~3 kb or larger that combines the rescue and capture of paired-ends of 3 kb fragments, massive 454 Sequencing, and a computational approach to map DNA reads onto a reference genome. PEM was used to map SVs in an African and putatively European individual and identified shared and divergent SVs relative to the reference genome. Overall, we fine-mapped more than 1000 SVs and documented that the number of SVs among humans is much larger than initially hypothesized; many of the SVs potentially affect gene function. The breakpoint junction sequences of more than 200 SVs were determined with a novel pooling strategy and computational analysis. Our analysis provided insights into the mechanisms of SV formation in humans.

ScienceDaily has a much more intelligible lede, Individual Differences Caused By Shuffled Chunks Of DNA In The Human Genome:

"The focus for identifying genetic differences has traditionally been on point mutations or SNPs -- changes in single bases in individual genes," said Michael Snyder, the Cullman Professor of Molecular, Cellular & Developmental Biology and senior author of the study, which was published in Science Express. "Our study shows that a considerably greater amount of variation between individuals is due to rearrangement of big chunks of DNA."

SNP, that's single nucleotide polymoprhism, variation of base pairs on the DNA sequence which are the discrete units of the genome. These have been the traditional "gold standard" in terms of identifying how genetic variation may (or may not!) result in functional variation or modeling the dynamics of molecular evolution. But now there are plenty of papers trying to trumpet copy number variation, and then there are the internicine conflicts over evo-devo & cis regulatory elements.

To all this I say: a pox on all your houses! There is so much more in nature's toolkit than are imagined in your philosophies! There is a specific issue which I think needs to be addressed: perhaps the rise of evo-devo and its cohorts are a necessary correction to the hegemony of the infinite small effect models in population genetics and the focus on SNPs from the molecular angle. But at the end of the day evolution is substrate neutral. Variation is variation, and as long as it is heritable then that's all you need. I tend to find the sequence level variation vs. regulation type of arguments to be epiphenomenal; the sort of scientific debates necessary for the process of science to work itself out, but not consequential in the long run in relation to their genuine impact on evolutionary biology. In short, they're like the arguments about the term "gene," they tell you more about what the specific scientists are interested in and the focus of their research project than they do about what a gene might or might not be (I'm pretty instrumentalist about what a "gene" is).

Tags

More like this

A few weeks back, we published a review about the development and role of the human reference genome. A key point of the reference genome is that it is not a single sequence. Instead it is an assembly of consensus sequences that are designed to deal with variation in the human population and…
Where the variation comes from. Evolution proceeds by the action of many different evolutionary forces on heritable variation. Natural selection leads to the increase in frequency of variation that allows individuals to produce more offspring who, themselves, produce offspring. Genetic drift…
For the past few months, the shake-up that began with Next Generation DNA Sequencing has been forcing me to adjust to a whole new view of things going on inside of a cell. We've been learning things these past two years that are completely changing our understanding of the genome and how it works…
There's an old paradigm in human population genetics that we each differ from each other by less than one percent at the DNA sequence level. While that may be true for our DNA sequences, recent work indicates that there's also quite a bit of variation amongst individuals in the actual content of…