27.12.2012 Views

l - People

l - People

l - People

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

the predictor by means of ROC (Receiver Operating Characteristic) curves. We need to<br />

define a few terms in order to use these:<br />

Test positives: Residues with<br />

Test negatives: Residues with<br />

HS(i)greater than or equal to a certain threshold.<br />

HS(i)less than a certain threshold.<br />

Gold standard positives: ! Residues annotated as hinges in the Hinge Atlas.<br />

Gold standard negatives: ! Residues which are not in hinges according to the Hinge Atlas<br />

annotation.<br />

True positives (TP): Those residues that are both test positives and gold standard<br />

positives.<br />

True negatives (TN): Residues that are both test negatives and gold standard negatives.<br />

False positives (FP): Residues that are test positives and gold standard negatives.<br />

False negatives (FN): Residues that are test negatives and gold standard positives.<br />

sensitivity<br />

specificity<br />

1 !<br />

specificity<br />

TP<br />

=<br />

TP + FN<br />

TN<br />

=<br />

FP + TN<br />

FP<br />

=<br />

FP + TN<br />

The ROC curve is simply a plot of the true positive rate (same as sensitivity) vs. false<br />

positive rate (1-specificity), for each value of the threshold, as the threshold is varied<br />

from +1 to -1, a range which included all possible values of<br />

76<br />

HS(i)<br />

. For a good predictor,<br />

the true positive rate will increase faster than the false positive rate as the threshold is

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!