On the Analysis of Optical Mapping Data - University of Wisconsin ...
On the Analysis of Optical Mapping Data - University of Wisconsin ...
On the Analysis of Optical Mapping Data - University of Wisconsin ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
11<br />
direct glimpse at <strong>the</strong> underlying structure <strong>of</strong> <strong>the</strong> genome. Unfortunately, this also means<br />
that raw optical maps can be fairly noisy. In particular,<br />
• not all true restriction sites are observed, i.e. some cuts are missing, due to imperfect<br />
digestion by <strong>the</strong> restriction enzyme<br />
• breakage <strong>of</strong> DNA may cause spurious cuts to appear in a map<br />
• measurement <strong>of</strong> fluorescent intensities and conversion to base pairs is inaccurate, causing<br />
sizing errors in fragment lengths<br />
• relatively small fragments (say 5 Kb or less) may lose adhesion to <strong>the</strong> surface and desorb,<br />
in which case <strong>the</strong>y are not included in <strong>the</strong> final map. Some <strong>of</strong> <strong>the</strong>se fragments may<br />
re-attach <strong>the</strong>mselves near o<strong>the</strong>r fragments, potentially causing length overestimation<br />
in <strong>the</strong> latter.<br />
All <strong>the</strong>se noises are confounded with image processing errors. Mistakes in image processing<br />
may also cause optical chimeras, where unrelated maps are marked up as one because <strong>the</strong>y<br />
overlap on <strong>the</strong> image. O<strong>the</strong>r less systematic errors are also present. These errors, along<br />
with <strong>the</strong> choice <strong>of</strong> restriction enzyme and genome, affect <strong>the</strong> typical size <strong>of</strong> an optical map<br />
fragment. The average fragment size, <strong>of</strong>ten used to summarize an optical map data set, is<br />
usually between 5 and 40 Kb.<br />
1.3.3 Goals and challenges<br />
Goals: A typical optical mapping experiment begins with <strong>the</strong> collection <strong>of</strong> data followed by<br />
image processing to identify individual optical maps. The goal <strong>of</strong> subsequent analysis depends<br />
partly on <strong>the</strong> genome being mapped. Although <strong>the</strong> goal <strong>of</strong> optical mapping is always to make<br />
inferences about <strong>the</strong> underlying restriction map, it is important to distinguish between cases<br />
where a draft reference sequence <strong>of</strong> <strong>the</strong> organism is available and ones where it is not. In <strong>the</strong><br />
latter case, <strong>the</strong> goal <strong>of</strong> optical mapping is de novo assembly, i.e. to reconstruct <strong>the</strong> underlying<br />
restriction map, <strong>of</strong>ten to assist in sequencing efforts. In <strong>the</strong> former case, a possible candidate