On the Analysis of Optical Mapping Data - University of Wisconsin ...
On the Analysis of Optical Mapping Data - University of Wisconsin ...
On the Analysis of Optical Mapping Data - University of Wisconsin ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
33<br />
by rejecting maps that do not align, as well as assuming that <strong>the</strong> significant alignments are<br />
completely correct, is uncertain. Often it is instructive instead to assess a model by some<br />
diagnostic plots, as described next.<br />
2.3 Diagnostics<br />
Due to <strong>the</strong> complexity <strong>of</strong> <strong>the</strong> model and <strong>the</strong> interplay between its various aspects, it is<br />
next to impossible to estimate all <strong>the</strong> parameters separately. However, given a particular<br />
set <strong>of</strong> parameter values, maps simulated from that model can be used to indirectly test<br />
goodness <strong>of</strong> fit. Specifically, simulated maps should have characteristics that are similar to<br />
observed maps, be <strong>the</strong>y numerical summaries or graphical diagnostics. In Figures 2.6, 2.7<br />
and 2.8, we present three diagnostic plots based on <strong>the</strong> marginal distributions <strong>of</strong> observed<br />
restriction fragment lengths, and <strong>the</strong> number <strong>of</strong> fragments in a map. The data being modeled<br />
is <strong>the</strong> set <strong>of</strong> GM07535 optical maps; maps are simulated from <strong>the</strong> in silico reference map<br />
with a combination <strong>of</strong> values for p (0.70, 0.75 and 0.80) and ζ (0.001, 0.003 and 0.005),<br />
keeping all o<strong>the</strong>r components fixed. The rate <strong>of</strong> desorption is determined by <strong>the</strong> function<br />
π α (t) = 1 −e −αt . The first plot suggests that <strong>the</strong> effect <strong>of</strong> desorption has been well modeled.<br />
Considered toge<strong>the</strong>r, <strong>the</strong> three plots suggest that p = 0.7 and ζ = 0.005 come closest to<br />
modeling <strong>the</strong> observed data. It is important to note that <strong>the</strong>se are only a few examples,<br />
and o<strong>the</strong>r similar diagnostic plots could be useful for similar purposes. None <strong>of</strong> <strong>the</strong>se plots<br />
require alignments, but plots analogous to Figure 2.4 that do depend on alignment may also<br />
be useful.