25.08.2015 Views

In the Beginning was Information

6KezkB

6KezkB

SHOW MORE
SHOW LESS
  • No tags were found...

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

6.2 The Genetic CodeWe now discuss <strong>the</strong> question of devising a suitable coding system.For instance, how many different letters are required and how longshould <strong>the</strong> words be for optimal performance? If a certain codingsystem has been adopted, it should be strictly adhered to (Theorem8, par 4.2), since it must be in tune with extremely complextranslation and implementation processes. The table in Figure 19comprises only <strong>the</strong> most interesting 25 fields, but it can be extendedindefinitely downwards and to <strong>the</strong> right. Each field represents aspecific method of encoding, for example, if n = 3 and L = 4,we have a ternary code with 3 different letters. <strong>In</strong> that case aword for identifying an amino acid, would have a length of L = 4,meaning that quartets of 4 letters represent one word. If we nowwant to select <strong>the</strong> best code, <strong>the</strong> following requirements shouldbe met:– The storage space in a cell must be a minimum, so that <strong>the</strong> codeshould economise on <strong>the</strong> required material. The more lettersrequired for each amino acid, <strong>the</strong> more material is required, aswell as more storage space.– The copying mechanism described above, requires n to be aneven number. The replication of each of <strong>the</strong> two strands of DNAinto complementary strands thus needs an alphabet having aneven number of letters. For <strong>the</strong> purpose of limiting copyingerrors during <strong>the</strong> very many replication events, some redundancemust be provided for (see Appendix A 1.4).– The longer <strong>the</strong> employed alphabet, <strong>the</strong> more complex <strong>the</strong> implementingmechanisms have to be. It would also require morematerial for storage, and <strong>the</strong> incidence of copying errors wouldincrease.<strong>In</strong> each field of Figure 19 <strong>the</strong> number of possible combinations for<strong>the</strong> different words appears in <strong>the</strong> top left corner. The 20 aminoacids require at least 20 different possibilities, and according toShannon’s <strong>the</strong>ory <strong>the</strong> required information content of each aminoacid could be calculated as follows: For 20 amino acids <strong>the</strong> averageinformation content would be i A ≡ i W ≡ ld 20 = log 20/log 2 =4.32 bits per amino acid (ld is <strong>the</strong> logarithm with base 2).94

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!