29.07.2014 Views

On the Analysis of Optical Mapping Data - University of Wisconsin ...

On the Analysis of Optical Mapping Data - University of Wisconsin ...

On the Analysis of Optical Mapping Data - University of Wisconsin ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

11<br />

direct glimpse at <strong>the</strong> underlying structure <strong>of</strong> <strong>the</strong> genome. Unfortunately, this also means<br />

that raw optical maps can be fairly noisy. In particular,<br />

• not all true restriction sites are observed, i.e. some cuts are missing, due to imperfect<br />

digestion by <strong>the</strong> restriction enzyme<br />

• breakage <strong>of</strong> DNA may cause spurious cuts to appear in a map<br />

• measurement <strong>of</strong> fluorescent intensities and conversion to base pairs is inaccurate, causing<br />

sizing errors in fragment lengths<br />

• relatively small fragments (say 5 Kb or less) may lose adhesion to <strong>the</strong> surface and desorb,<br />

in which case <strong>the</strong>y are not included in <strong>the</strong> final map. Some <strong>of</strong> <strong>the</strong>se fragments may<br />

re-attach <strong>the</strong>mselves near o<strong>the</strong>r fragments, potentially causing length overestimation<br />

in <strong>the</strong> latter.<br />

All <strong>the</strong>se noises are confounded with image processing errors. Mistakes in image processing<br />

may also cause optical chimeras, where unrelated maps are marked up as one because <strong>the</strong>y<br />

overlap on <strong>the</strong> image. O<strong>the</strong>r less systematic errors are also present. These errors, along<br />

with <strong>the</strong> choice <strong>of</strong> restriction enzyme and genome, affect <strong>the</strong> typical size <strong>of</strong> an optical map<br />

fragment. The average fragment size, <strong>of</strong>ten used to summarize an optical map data set, is<br />

usually between 5 and 40 Kb.<br />

1.3.3 Goals and challenges<br />

Goals: A typical optical mapping experiment begins with <strong>the</strong> collection <strong>of</strong> data followed by<br />

image processing to identify individual optical maps. The goal <strong>of</strong> subsequent analysis depends<br />

partly on <strong>the</strong> genome being mapped. Although <strong>the</strong> goal <strong>of</strong> optical mapping is always to make<br />

inferences about <strong>the</strong> underlying restriction map, it is important to distinguish between cases<br />

where a draft reference sequence <strong>of</strong> <strong>the</strong> organism is available and ones where it is not. In <strong>the</strong><br />

latter case, <strong>the</strong> goal <strong>of</strong> optical mapping is de novo assembly, i.e. to reconstruct <strong>the</strong> underlying<br />

restriction map, <strong>of</strong>ten to assist in sequencing efforts. In <strong>the</strong> former case, a possible candidate

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!