02.08.2013 Views

Sample A: Cover Page of Thesis, Project, or Dissertation Proposal

Sample A: Cover Page of Thesis, Project, or Dissertation Proposal

Sample A: Cover Page of Thesis, Project, or Dissertation Proposal

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

are significant assume such a data structure [7]. This still leaves several hundred values to<br />

compare per sample. F<strong>or</strong> classification purposes a much smaller diagnostic set is desired, so<br />

analysts typically employ a down-selection method to their dataset [12, 19-29]. The rationale f<strong>or</strong><br />

such an approach is two fold: it should improve the statistical power <strong>of</strong> subsequent analysis, by<br />

minimizing the N>>P issue and it will enrich the impact <strong>of</strong> the final candidates, by eliminating<br />

weak classifiers within the candidate list [11, 12, 20, 21, 27]. Weak classifiers are features within<br />

a solution subset which are not critical to the subset’s classification perf<strong>or</strong>mance [12, 30, 31].<br />

Common approaches to down selection in Microarray analysis pipelines have included:<br />

significance analysis <strong>of</strong> Microarray (SAM), t-tests, fold changes, expression differences, signal to<br />

noise ratios, etc. [12, 21, 23, 26, 27, 32]. F<strong>or</strong> the following analyses we employed Welch’s t-test<br />

on the ProbeSet values, rather than the individual probes across the ProbeSet since this is the only<br />

comparison possible with the RMA and dCHIP methods (see Chapter 2 f<strong>or</strong> details). Welch’s t-<br />

test first requires perf<strong>or</strong>ming the F test f<strong>or</strong> equality <strong>of</strong> variance, as given by:<br />

Welch’s t-test is f<strong>or</strong>mulated as:<br />

1<br />

s<br />

F " , (3)<br />

s<br />

2<br />

2<br />

1<br />

2<br />

2<br />

t " X ) X / s / n ( s / n , (4)<br />

where s1 = s2 given equal variance, which must be tested f<strong>or</strong>. The variable X represents the<br />

sample population mean, and n is the sample size [5]. The t-test is usually considered to be a<br />

2<br />

1<br />

weak diagnostic and eff<strong>or</strong>ts to control the family wise err<strong>or</strong> rate through Bonferroni c<strong>or</strong>rections <strong>or</strong><br />

71<br />

1<br />

2<br />

2<br />

2

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!