02.08.2013 Views

Sample A: Cover Page of Thesis, Project, or Dissertation Proposal

Sample A: Cover Page of Thesis, Project, or Dissertation Proposal

Sample A: Cover Page of Thesis, Project, or Dissertation Proposal

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Discussion<br />

A striking result from this series <strong>of</strong> experiments is the significant improvement in the power <strong>of</strong><br />

the t-test f<strong>or</strong> the Bhattacharjee dataset, when the ProbeSets to be considered and the value <strong>of</strong><br />

those ProbeSets are produced using the BaFL pipeline [39, 40]. This increase in power was not<br />

observed f<strong>or</strong> any <strong>of</strong> the cleansing methodologies with the smaller Stearman dataset, which, is<br />

much smaller, although it is completely replicated. From Figure 3.1 it can be seen that there is a<br />

significant improvement in the unif<strong>or</strong>mity <strong>of</strong> the p value kernel distributions <strong>of</strong> the BaFL-<br />

generated ProbeSet values tested f<strong>or</strong> significant differential expression f<strong>or</strong> the Bhattacharjee<br />

dataset, compared to those <strong>of</strong> RMA <strong>or</strong> dCHIP, particularly f<strong>or</strong> the null p values [39]. Since both<br />

datasets show similar variance after BaFL processing (see Chapter 2), and the Bhattacharjee<br />

dataset is both m<strong>or</strong>e heterogeneous (disease stage) and less precise (less replication) than the<br />

Stearman dataset, the lack <strong>of</strong> power comes down to the difference in sample size <strong>of</strong> the<br />

experiments [5].<br />

Models and Class Predictions<br />

The impact <strong>of</strong> the lack <strong>of</strong> power becomes apparent in the perf<strong>or</strong>mance <strong>of</strong> the down-selected RMA<br />

and dCHIP-based classification models, where the resulting datasets have little improvement <strong>or</strong><br />

even a loss <strong>of</strong> perf<strong>or</strong>mance when the t-test down-selection is used (compare the ALL set to the<br />

other three Sets in the top two rows <strong>of</strong> graphs in Figures 3.2 and 3.3). A meaningful outcome<br />

would show a gain in inf<strong>or</strong>mation when going from 12,000 to ~6,000 genes, where the method<br />

has allowed the genes with an impact on the phenotype to be retrieved [6, 41]. This did not occur<br />

when starting with the RMA and dCHIP cleansing methods, in spite <strong>of</strong> trying three types <strong>of</strong><br />

classification models in the search. In fact, there is a consistent increase in the variation across the<br />

83

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!