Sample A: Cover Page of Thesis, Project, or Dissertation Proposal
Sample A: Cover Page of Thesis, Project, or Dissertation Proposal
Sample A: Cover Page of Thesis, Project, or Dissertation Proposal
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
f<strong>or</strong> the 24 samples in the Lu, et al. raw data files [33] (filtered as described above), and the probe<br />
and ProbeSet presence were then predicted.<br />
Results<br />
The results rep<strong>or</strong>ted here are divided into three sections: (1) providing evidence f<strong>or</strong> the BaFl<br />
cleansing process, including details on the number <strong>of</strong> probes removed f<strong>or</strong> each probe sequence<br />
filter, evidence <strong>of</strong> sample batch processing variability, and exemplar ProbeSets affected by such<br />
confounding fact<strong>or</strong>s. (2) the stages and effects <strong>of</strong> the BaFL probe cleansing pipeline, including<br />
graphic representation <strong>of</strong> each filtering step and the final pr<strong>of</strong>ile consistency across the two<br />
datasets. (3) the research enrichment the BaFL pipeline facilitates, including elucidating potential<br />
transcript regions <strong>of</strong> interest and a pri<strong>or</strong>i predictions <strong>of</strong> independent datasets. F<strong>or</strong> the established<br />
alg<strong>or</strong>ithms we have used the validated samples from the datasets as input, and validated<br />
ProbeSets from the auth<strong>or</strong>’s lists to compare their classification perf<strong>or</strong>mance to each other and to<br />
the results <strong>of</strong> our method. The classification perf<strong>or</strong>mance <strong>of</strong> the auth<strong>or</strong>’s <strong>or</strong>iginal gene lists is<br />
used to see whether the sample cleansing protocol alone has a significant effect. We show that<br />
while previous eff<strong>or</strong>ts have identified inf<strong>or</strong>mative genes, the methods are over-tuned to technical<br />
properties (lab specific) <strong>or</strong> non-primary fact<strong>or</strong> biological properties (such as SNPs), rather than<br />
the desired biological response to the principle fact<strong>or</strong> (here, disease state specific).<br />
41