On the Analysis of Optical Mapping Data - University of Wisconsin ...
On the Analysis of Optical Mapping Data - University of Wisconsin ...
On the Analysis of Optical Mapping Data - University of Wisconsin ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
20<br />
1.4 Outline<br />
<strong>Optical</strong> mapping is a fast, low-cost, single-molecule system for producing whole genome<br />
restriction maps. Its potential applications for studies <strong>of</strong> normal and disease biology are<br />
manifold, but computational and statistical challenges created by large genomes must be<br />
met in order for optical mapping to achieve this potential. Existing algorithms have been<br />
effective on optical map data from small genomes. These algorithms do not easily extend<br />
to <strong>the</strong> much larger data sets that are now being collected from larger genomes, and we are<br />
as yet unable to completely mine <strong>the</strong> wealth <strong>of</strong> information contained in <strong>the</strong>m. In part, this<br />
is due to unavoidable computational bottlenecks. However, new avenues <strong>of</strong> analysis have<br />
opened up with <strong>the</strong> availability <strong>of</strong> more and more sequence information. In <strong>the</strong> following<br />
chapters, we present some new ideas on how to deal with optical map data. These ideas share<br />
a common <strong>the</strong>me in that <strong>the</strong>y all take advantage <strong>of</strong> <strong>the</strong> availability <strong>of</strong> in silico reference maps<br />
derived from sequence. They do not, by any means, resolve all outstanding questions, but<br />
hopefully <strong>the</strong>y contribute to <strong>the</strong> understanding <strong>of</strong> optical map data and provide a reference<br />
for future work in this area. In Chapter 2, we discuss stochastic models for optical map<br />
errors and present some new approaches to parameter estimation in that setting. In Chapter<br />
3, we propose a new method to determine significance <strong>of</strong> alignments <strong>of</strong> optical maps to a<br />
reference, which is an important prerequisite in many analyses. In Chapter 4, we use <strong>the</strong>se<br />
alignments as <strong>the</strong> basis for an assembly-free method to detect copy number polymorphisms.<br />
Especially in cancer biology, <strong>the</strong> ability to detect gains and losses <strong>of</strong> DNA is critical, as<br />
frequently deleted sites may harbor tumor suppressor genes, and frequently amplified regions<br />
may harbor oncogenes.