29.07.2014 Views

On the Analysis of Optical Mapping Data - University of Wisconsin ...

On the Analysis of Optical Mapping Data - University of Wisconsin ...

On the Analysis of Optical Mapping Data - University of Wisconsin ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

20<br />

1.4 Outline<br />

<strong>Optical</strong> mapping is a fast, low-cost, single-molecule system for producing whole genome<br />

restriction maps. Its potential applications for studies <strong>of</strong> normal and disease biology are<br />

manifold, but computational and statistical challenges created by large genomes must be<br />

met in order for optical mapping to achieve this potential. Existing algorithms have been<br />

effective on optical map data from small genomes. These algorithms do not easily extend<br />

to <strong>the</strong> much larger data sets that are now being collected from larger genomes, and we are<br />

as yet unable to completely mine <strong>the</strong> wealth <strong>of</strong> information contained in <strong>the</strong>m. In part, this<br />

is due to unavoidable computational bottlenecks. However, new avenues <strong>of</strong> analysis have<br />

opened up with <strong>the</strong> availability <strong>of</strong> more and more sequence information. In <strong>the</strong> following<br />

chapters, we present some new ideas on how to deal with optical map data. These ideas share<br />

a common <strong>the</strong>me in that <strong>the</strong>y all take advantage <strong>of</strong> <strong>the</strong> availability <strong>of</strong> in silico reference maps<br />

derived from sequence. They do not, by any means, resolve all outstanding questions, but<br />

hopefully <strong>the</strong>y contribute to <strong>the</strong> understanding <strong>of</strong> optical map data and provide a reference<br />

for future work in this area. In Chapter 2, we discuss stochastic models for optical map<br />

errors and present some new approaches to parameter estimation in that setting. In Chapter<br />

3, we propose a new method to determine significance <strong>of</strong> alignments <strong>of</strong> optical maps to a<br />

reference, which is an important prerequisite in many analyses. In Chapter 4, we use <strong>the</strong>se<br />

alignments as <strong>the</strong> basis for an assembly-free method to detect copy number polymorphisms.<br />

Especially in cancer biology, <strong>the</strong> ability to detect gains and losses <strong>of</strong> DNA is critical, as<br />

frequently deleted sites may harbor tumor suppressor genes, and frequently amplified regions<br />

may harbor oncogenes.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!