27.12.2012 Views

l - People

l - People

l - People

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

!<br />

!<br />

Method for analyzing relative frequency of occurrence<br />

Throughout this study, we will be comparing how often a particular entity (be it a certain<br />

amino acid, a certain pair of amino acids, a certain class of amino acids, a certain<br />

secondary structural element etc.) occurs in hinges versus everywhere in the Hinge Atlas<br />

or another of the datasets described above. The statistical analysis will be the same<br />

regardless of the particulars, so we will here present the general approach and later only<br />

mention adjustments particular to the specific question addressed.<br />

First we defined the following variables:<br />

D = total number of residues in the dataset<br />

H = total number of residues in hinges in the dataset<br />

C = classification scheme used to create groups of residue positions. For example, C<br />

could be secondary structure, degree of conservation, etc.<br />

c = a particular grouping of residues, where<br />

!<br />

c " C . For instance, if C = secondary<br />

structure, then c = helix is the class of all residues in helices, c = strand is the class of all<br />

residues in strands, etc. Another example might be C = evolutionary conservation, with c<br />

= cons1 = top 20% most conserved residues, c = cons2 = second 20% most conserved,<br />

etc.<br />

a c= set of all residues of class c in the dataset.<br />

d c = number of times residues of class c occurred anywhere in the dataset.<br />

60

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!