About Gazelles, Elephants, Blue Whales and Dodos: A Microbial Perspective

At least if we're talking about microbes.

Nick Loman has a good piece summarizing the state of genome sequencing technologies. It's pretty accurate, although I'm always skeptical about capabilities until I've seen it function. But from my point of view, which is focused on microbial genomics, the actual sequencing--determining the nucleotides on a piece of DNA--is already incredibly cheap. In that context, there are two models that seem relevant:

1) Really rapid sequencing. Here, you could have a bacterial genome in a matter of hours, even if it's not that efficient in terms of cost per nucleotide.

2) High throughput. Here, it might take days to generate your sequence, but you generate hundreds of bacterial genomes at once (e.g., Illumina).

With both models, the cost of sequencing is already very small. DNA extraction (getting the DNA from a bacterial cell) and creating a 'DNA library' (preparing the DNA for sequencing) are already as, if not more, expensive (we'll ignore for now the cost of acquiring strains with appropriate metadata which is also expensive). If you add in other factors, such as having the infrastructure to track thousands of samples, uploading data to various databases, assembling and annotating genomes, maintaining the IT support for the computational side of sequencing (on an annual basis, we're talking files that, in total, are many, many petabytes).

What seems interesting to me is that we've reached the point where advances in sequencing capacity are yielding decreasing marginal returns.

Now we just have to figure out all of the other stuff...

Caveat: For charismatic (and even non-charismatic) megafauna, we're not quite there yet, but we probably will be in a year, even with second-generation technologies.

More like this

A couple of weeks ago I attended the Human Microbiome Research Conference. At that meeting, one talk by Bruce Birren (and covered by Jonathan Eisen) mentioned something that was completely overlooked by the attendees. Now, I don't blame them, since what Birren mentioned was about bacterial…
A couple of weeks ago, I came across this discussion thread "Will you stop using 454?" It's a pretty good thread--not much to disagree with there, although, from my perspective, it missed a key point (I'll get to that). But my answer is simple: I already have. My work focuses primarily on…
The main theme of this year's Advances in Genome Biology and Technology meeting should come as no surprise to regular readers: sequencing. Generating as many bases of DNA sequence as quickly, cheaply and accurately as possible is the goal of the moment, and the number of companies jostling to…
The big news from the JP Morgan investment conference today is the announcement of a brand new shiny sequencing machine from Illumina, the HiSeq 2000. The new machine boasts an impressive set of statistics, and looks likely to gradually replace Illumina's GAIIx as the workhorse of most modern…