Sample A: Cover Page of Thesis, Project, or Dissertation Proposal
Sample A: Cover Page of Thesis, Project, or Dissertation Proposal
Sample A: Cover Page of Thesis, Project, or Dissertation Proposal
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
A Pri<strong>or</strong>i Prediction<br />
We demonstrated that a pri<strong>or</strong>i probe prediction is a feasible approach based upon the cleansing<br />
results provided by the BaFL pipeline. The probe prediction implemented in this study included<br />
the inc<strong>or</strong>p<strong>or</strong>ation <strong>of</strong> the linear range filter. The linear range filter is the only filter which affects<br />
samples differently based upon the biological and lab<strong>or</strong>at<strong>or</strong>y variation. Theref<strong>or</strong>e it is paramount<br />
that the cross experiment probe extraction is perf<strong>or</strong>med upon similar disease studies and similar<br />
tissues. This is apparent with the squamous cell cancer data models which generated the lowest<br />
probe true positive rates (probes predicted to be thereafter cleansing and actually were). While<br />
the false positive rates f<strong>or</strong> ProbeSet prediction f<strong>or</strong> the adenocarcinoma models seem high, we can<br />
partially explain some <strong>of</strong> the false positives by considering that ~5600 probes were removed due<br />
to the batch 10 localized bare spot effect, presented in Figure 2.1. These probes would likely not<br />
have been removed via the BaFL pipeline f<strong>or</strong> this third dataset. Their presence would theref<strong>or</strong>e<br />
be accounted f<strong>or</strong> in the false positive rate (10%, data not shown) observed at the probe level.<br />
Removal <strong>of</strong> these probes may have additionally eliminated what would otherwise have been<br />
reliable ProbeSets since the affected ProbeSets may not have met the requirement <strong>of</strong> 4 constituent<br />
probes f<strong>or</strong> the Bhattacharjee dataset. This effect again would be observed via the 25.2% ProbeSet<br />
presence false positive rate as indicated in Table 2.7.<br />
Modified CDFs f<strong>or</strong> Computational Efficiency<br />
Since the probe characteristics are universal to an array design, one can easily construct a<br />
modified CDF, which decreases the total number <strong>of</strong> probes that must be considered in an analysis<br />
to those that are usable, and thus improves the computational requirements f<strong>or</strong> an analysis. We<br />
expect that different investigat<strong>or</strong>s will have preferred CDFs: f<strong>or</strong> example, the cross-hybridization<br />
63