27.12.2012 Views

l - People

l - People

l - People

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

We also sought to determine whether there existed a bias towards particular protein<br />

classes, in either the Hinge Atlas or the nonredundant set of MolMovDB morphs from<br />

which it was compiled. To do this, we first counted the number of times each top-level<br />

Gene Ontology (GO) term under the “molecular function” ontology was associated with a<br />

protein in the Hinge Atlas. Where the annotation was given for deeper levels, we traced<br />

up the hierarchical tree to retrieve the corresponding top level term in the ontology. Thus<br />

we found, for example, that 14 proteins in the Hinge Atlas were associated with the term<br />

“nucleic acid binding.” We repeated this procedure for the PDB as a whole as well as for<br />

the non-redundant set of 1508 morphs in MolMovDB from which the Hinge Atlas was<br />

compiled. The results for the 10 most frequently encountered GO terms are shown in<br />

Table 2.9.<br />

To compare the Hinge Atlas counts to the PDB counts in an overall fashion, we used the<br />

chi-square distribution with 162 degrees of freedom (from 163 GO terms and 2 datasets)<br />

and obtained a chi-square value of 121.1. This corresponds to a p-value of 0.9931, so<br />

there is no statistically significant difference in the distribution of these terms in the<br />

Hinge Atlas vs. the entire Protein Data Bank.<br />

Statistical comparison of datasets<br />

The Hinge Atlas and computer annotated sets were compiled differently, therefore one<br />

might suspect that the hinges from one set might comprise a statistically different<br />

population from the hinges of the other set. If this were the case, then one of the two sets<br />

78

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!