29.07.2014 Views

On the Analysis of Optical Mapping Data - University of Wisconsin ...

On the Analysis of Optical Mapping Data - University of Wisconsin ...

On the Analysis of Optical Mapping Data - University of Wisconsin ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

13<br />

which is not surprising since well designed graphs can usually convey complex information<br />

more effectively than numerical summaries.<br />

1.3.4 Algorithms<br />

Problems in optical mapping are <strong>of</strong>ten approached indirectly by trying to answer simpler,<br />

more specific ones. This is not uncommon in computational biology, where <strong>the</strong> complexity<br />

<strong>of</strong> a problem may make a holistic solution difficult. Two algorithmic questions that play a<br />

recurrent role in many <strong>of</strong> <strong>the</strong>se approaches are alignment and assembly. Each tries to answer<br />

a particular problem; however, it is <strong>of</strong>ten more useful to think <strong>of</strong> <strong>the</strong>se as tools ra<strong>the</strong>r than<br />

solutions. Here, we give an overview <strong>of</strong> <strong>the</strong>se two fundamental computational tasks.<br />

Alignment<br />

The problem <strong>of</strong> alignment is to detect association or overlap between two or more restriction<br />

maps. Such association is measured by a score function which assigns a numerical<br />

measure <strong>of</strong> goodness to any potential alignment. Of course different score functions may<br />

be used and much rests on choosing a suitable score function. Waterman et al. (1984) presented<br />

a score function for restriction map comparison, which was subsequently extended by<br />

Huang and Waterman (1992). Valouev et al. (2006) have developed scores functions for <strong>the</strong><br />

comparison problem specifically in <strong>the</strong> context <strong>of</strong> optical mapping. These score functions<br />

have been derived as model-based likelihood ratio test statistics, although this is not strictly<br />

necessary (Appendix A).<br />

Given a suitable score function, dynamic programming is used to efficiently search for<br />

optimal alignments. In <strong>the</strong> context <strong>of</strong> alignment against a reference, for example, every individual<br />

optical map must be scored across <strong>the</strong> genome. Alignment algorithms for nucleotide<br />

sequence data, such as <strong>the</strong> Needleman-Wunsch and Smith-Waterman algorithms, can be<br />

adapted to work with restriction maps. Certain modifications are required to enable such<br />

use; <strong>the</strong>se are described by Valouev et al. (2006).

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!