02.08.2013 Views

Sample A: Cover Page of Thesis, Project, or Dissertation Proposal

Sample A: Cover Page of Thesis, Project, or Dissertation Proposal

Sample A: Cover Page of Thesis, Project, or Dissertation Proposal

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

A Pri<strong>or</strong>i Prediction<br />

We demonstrated that a pri<strong>or</strong>i probe prediction is a feasible approach based upon the cleansing<br />

results provided by the BaFL pipeline. The probe prediction implemented in this study included<br />

the inc<strong>or</strong>p<strong>or</strong>ation <strong>of</strong> the linear range filter. The linear range filter is the only filter which affects<br />

samples differently based upon the biological and lab<strong>or</strong>at<strong>or</strong>y variation. Theref<strong>or</strong>e it is paramount<br />

that the cross experiment probe extraction is perf<strong>or</strong>med upon similar disease studies and similar<br />

tissues. This is apparent with the squamous cell cancer data models which generated the lowest<br />

probe true positive rates (probes predicted to be thereafter cleansing and actually were). While<br />

the false positive rates f<strong>or</strong> ProbeSet prediction f<strong>or</strong> the adenocarcinoma models seem high, we can<br />

partially explain some <strong>of</strong> the false positives by considering that ~5600 probes were removed due<br />

to the batch 10 localized bare spot effect, presented in Figure 2.1. These probes would likely not<br />

have been removed via the BaFL pipeline f<strong>or</strong> this third dataset. Their presence would theref<strong>or</strong>e<br />

be accounted f<strong>or</strong> in the false positive rate (10%, data not shown) observed at the probe level.<br />

Removal <strong>of</strong> these probes may have additionally eliminated what would otherwise have been<br />

reliable ProbeSets since the affected ProbeSets may not have met the requirement <strong>of</strong> 4 constituent<br />

probes f<strong>or</strong> the Bhattacharjee dataset. This effect again would be observed via the 25.2% ProbeSet<br />

presence false positive rate as indicated in Table 2.7.<br />

Modified CDFs f<strong>or</strong> Computational Efficiency<br />

Since the probe characteristics are universal to an array design, one can easily construct a<br />

modified CDF, which decreases the total number <strong>of</strong> probes that must be considered in an analysis<br />

to those that are usable, and thus improves the computational requirements f<strong>or</strong> an analysis. We<br />

expect that different investigat<strong>or</strong>s will have preferred CDFs: f<strong>or</strong> example, the cross-hybridization<br />

63

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!