Insulin, sugar, and evolution

By sporte on December 22, 2014.

In my last post, I wrote about insulin and interesting features of the insulin structure. Some of the things I learned were really surprising. For example, I was surprised to learn how similar pig and human insulin are. I hadn't considered this before, but this made me wonder about the human insulin we used to give to one of our cats. How do cat and human insulin compare?

It turns out that all vertebrates produce insulin, even frogs and zebrafish. Human preproinsulin is only 110 amino acids long, and even human and fish insulin are pretty similar. Of course, this observation only leads to more questions. Like why? Why would fish insulin and human insulin be similar at all?

One clue comes from insulin's function. Many cells require insulin for growth. Another clue comes from the insulin structure. A key feature of the insulin protein is a pair of disulfide bonds that hold the two chains (A and B) together.

Disulfide bonds between chains A and B in human insulin, PDB ID 1TRZ

When insulin is made, it's made as one long protein (preproinsulin). Afterwards, a small part (the signal peptide) gets cut off at the amino end when the protein is transported through the membrane. Later, another chunk (the C peptide) gets cut out of the middle, leaving the two disulfide bonds between cysteine residues to hold everything (chains A and B) together.
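The processing steps above can be sketched in a few lines of Python. This is only an illustration: the sequence is human preproinsulin as I understand it from the UniProt record P01308, and the segment boundaries (a 24-residue signal peptide, the B chain, the C peptide flanked by pairs of basic residues, and the A chain) are assumptions that should be checked against that record. The names `SEGMENTS` and `mature_chains` are made up for this example.

```python
# Human preproinsulin written out segment by segment (amino end first).
# Sequences and boundaries are assumptions to verify against UniProt P01308.
SEGMENTS = [
    ("signal peptide", "MALWMRLLPLLALLALWGPDPAAA"),
    ("B chain", "FVNQHLCGSHLVEALYLVCGERGFFYTPKT"),
    ("cleavage site 1", "RR"),
    ("C peptide", "EAEDLQVGQVELGGGPGAGSLQPLALEGSLQ"),
    ("cleavage site 2", "KR"),
    ("A chain", "GIVEQCCTSICSLYQLENYCN"),
]

# The full-length protein is just the segments joined in order.
preproinsulin = "".join(seq for _, seq in SEGMENTS)


def mature_chains(segments):
    """Return only the pieces that remain in mature insulin."""
    return {name: seq for name, seq in segments if name in ("A chain", "B chain")}


if __name__ == "__main__":
    print(len(preproinsulin), "residues in preproinsulin")
    for name, seq in mature_chains(SEGMENTS).items():
        print(name, len(seq), "residues,", seq.count("C"), "cysteines")
```

If the sequence is right, all of the cysteines sit in the A and B chains, and none are in the parts that get removed.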

Those four cysteines look like they must play a pretty important role since they're charged with the task of holding it all together. This made me wonder: Do all the creatures that make insulin have cysteine bonds in the same positions?

To test this idea, I needed a way to identify those four cysteines within the insulin protein.

I opened 1TRZ (the human insulin monomer) in Molecule World* and applied molecule coloring to identify chains A and B. Then, I hid all the protein chains, and one by one, touched the C's in the two sequences to highlight the cysteines.

Once I found all the cysteines, I touched some of the C's again to deselect the ones that formed disulfide bonds within a chain and used the "Hide unselected" button to hide them. Now, we only see the cysteines that hold the A (pink) and B (blue) chains together.

The protein sequences are a little dim, but I can see the one letter abbreviations for the amino acids around each cysteine. These sequences help me spot where these cysteines were located in each chain of the protein.
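The same bookkeeping can be done without a structure viewer. The sketch below is not how Molecule World works; it simply lists each cysteine in chains A and B along with a few neighboring residues, which is the information I was reading off the screen. The chain sequences are assumptions to check against the 1TRZ record, and `cysteine_contexts` is a hypothetical helper name.

```python
# Mature human insulin chains; sequences are assumptions to check against 1TRZ.
CHAINS = {
    "A": "GIVEQCCTSICSLYQLENYCN",
    "B": "FVNQHLCGSHLVEALYLVCGERGFFYTPKT",
}


def cysteine_contexts(seq, flank=3):
    """List (position, context) for each cysteine; positions are 1-based."""
    hits = []
    for i, residue in enumerate(seq):
        if residue == "C":
            start = max(0, i - flank)
            # Slice up to `flank` residues on either side of the cysteine.
            hits.append((i + 1, seq[start:i + flank + 1]))
    return hits


if __name__ == "__main__":
    for chain, seq in CHAINS.items():
        for pos, context in cysteine_contexts(seq):
            print(f"chain {chain} Cys{pos}: {context}")
```

In the usual insulin numbering, the bonds between the chains are generally given as A7 to B7 and A20 to B19, and the bond within chain A as A6 to A11. The short contexts make it easier to find the matching cysteines later in an alignment.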

Assembling the data set

The next step was to put together a set of sequences. I picked protein sequences since I wanted to include some distant relatives (worm & fly). To find the protein sequence for the human insulin gene (INS), I searched the Gene database at the NCBI. The INS gene record contained a link to HomoloGene, a database that I used to get similar insulin protein sequences from other organisms. Curiously, I found that mice and rats have two insulin genes! That was a surprise! Do rodents really consume that much sugar? I decided to include the proteins from both the rat insulin 1 and rat insulin 2 genes, since I didn't know which one was more important. As it turned out, they're pretty similar to each other.

I also used the NCBI Gene database to get sequences from C. elegans (a nematode, a type of small worm) and Drosophila ananassae (a type of fly, related to Drosophila melanogaster).
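I collected everything through the NCBI web pages, but the same records can also be requested from a script through the NCBI E-utilities. The sketch below is an alternative under stated assumptions, not the method used for this post: it only builds an `efetch` URL for protein records in FASTA format, and the accession `NP_000198` is my assumption for the human insulin RefSeq protein. The function names are hypothetical.

```python
from urllib.parse import urlencode
from urllib.request import urlopen

EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_fasta_url(accessions):
    """Build an E-utilities URL that returns protein records as FASTA text."""
    params = {
        "db": "protein",
        "id": ",".join(accessions),
        "rettype": "fasta",
        "retmode": "text",
    }
    return f"{EFETCH}?{urlencode(params)}"


def fetch_fasta(accessions):
    """Download the records (needs network access; never called on import)."""
    with urlopen(efetch_fasta_url(accessions)) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    # NP_000198 is an assumed accession for human insulin; replace it with
    # the accessions from your own HomoloGene and Gene searches.
    print(efetch_fasta_url(["NP_000198"]))
```

Keeping the URL builder separate from the download makes it easy to check the request in a browser before fetching a whole list.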

Time to BLAST!

After compiling my list of accession numbers, it was time to run BLAST. I chose blastp from the BLAST home page at the NCBI and checked the "Align two or more sequences" box to compare my human insulin sequence to a set of other sequences.

Then, I pasted the accession numbers for my data set in the subject field and clicked BLAST.

BLAST results

All the sequences matched and had significant E values, even those from the fly and worm proteins.
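For anyone wondering what makes an E value "significant", here is a rough sketch. As I understand the standard BLAST statistics, the expected number of chance hits is approximately the search space (query length times subject length) multiplied by 2 raised to the negative bit score. BLAST itself uses corrected "effective" lengths, so the numbers it reports will differ somewhat. The function name and the example numbers are made up for illustration.

```python
def expect_value(bit_score, query_len, subject_len):
    """Approximate E value: search space times 2 ** -bit_score.

    Ignores the effective-length corrections that BLAST applies.
    """
    return query_len * subject_len * 2.0 ** (-bit_score)


if __name__ == "__main__":
    # Hypothetical bit scores for two 110-residue proteins.
    for score in (20, 50, 100):
        print(score, expect_value(score, 110, 110))
```

The smaller the E value, the less likely it is that a match with that score turned up by chance, which is why even a modest score can be convincing when both sequences are as short as insulin.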

But what about the cysteines?

Conveniently, NCBI protein BLAST has a feature (new to me, anyway) for making multiple alignments with an algorithm called Cobalt.

To create a multiple alignment, you just click the "Multiple Alignment" link on the blastp results page.

Voila! You get a multiple alignment from Cobalt!

I think this alignment could be improved by a bit of editing, but the general idea is pretty clear. Even flies and worms keep those cysteines in the right place.
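Checking an alignment by eye works for a handful of sequences, but the same test is easy to script. The sketch below assumes the alignment has already been exported as equal-length strings with `-` for gaps, and reports the columns where every sequence has a cysteine. The function name is hypothetical, and the toy alignment is invented for illustration; it is not the Cobalt output.

```python
def conserved_columns(aligned, residue="C"):
    """Return 1-based alignment columns where every sequence has `residue`."""
    lengths = {len(seq) for seq in aligned.values()}
    if len(lengths) != 1:
        raise ValueError("aligned sequences must all have the same length")
    return [
        col + 1
        for col, column in enumerate(zip(*aligned.values()))
        if all(aa == residue for aa in column)
    ]


if __name__ == "__main__":
    # Toy alignment: the first row is the human A chain (an assumption to
    # verify); the other rows are made up and are NOT real Cobalt output.
    toy = {
        "human_A_chain": "GIVEQCCTSICSLYQLENYCN",
        "made_up_1": "GIVDQCCASVCSLYQLENYCN",
        "made_up_2": "G-VEECCKSICTLQELQTYCN",
    }
    print(conserved_columns(toy))
```

Note that the column numbers refer to the alignment, not to positions in any one protein, so gaps upstream will shift them.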

NCBI's Cobalt results will even let you make a phylogenetic tree. This was a little bit flaky, though. Sometimes, I would click the link and see an error message saying the page wasn't there.

Nevertheless, sometimes I could click the Phylogenetic Tree link and get a tree. And it even makes sense.

I can see that the cysteines that participate in our key disulfide bonds are conserved through evolution from fish to humans. This was true for the cysteines in both the A and B chains.

Students might like to investigate how far this goes. How many organisms have genes for insulin? Are they found in insects or starfish, or worms? Are insulin proteins from all organisms held together by disulfide bonds?

Check out "While visions of sugarplums danced through their heads" to see other interesting features of insulin molecular models.

In the meantime, have a sweet holiday! And give thanks for insulin. This holiday, when many of us are consuming too much sucrose, it's nice to remember that insulin is busy handling the consequences.

Images & Bioinformatics software: All the images in this article were made with the new version of the Molecule World™ iPad app (Digital World Biology). Many of the things we do with Molecule World can also be done with Cn3D; it's just a bit more complicated.

The 1TRZ structure was obtained from the NCBI's Molecular Modeling Database. The protein sequences also came from the NCBI, and both blastp and Cobalt were run on the NCBI website.
