On the Analysis of Optical Mapping Data - University of Wisconsin ...
On the Analysis of Optical Mapping Data - University of Wisconsin ...
On the Analysis of Optical Mapping Data - University of Wisconsin ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
70<br />
States d.f. AIC BIC<br />
1 2 9775.9 9786.7<br />
2 5 9731.3 9758.1<br />
3 10 9729.9 9783.5<br />
Table 4.1 In comparing <strong>the</strong> GM07535 and CHM genomes, AIC and BIC values for HMM<br />
fits with 1, 2 and 3 states serve as a guide to choose <strong>the</strong> number <strong>of</strong> states. In this case, we<br />
choose <strong>the</strong> model with 2 states.<br />
optical maps represent a population that has 6 known sites each <strong>of</strong> decreased (1:2) and<br />
elevated (3:2) copy number.<br />
Figure 4.2 shows <strong>the</strong> results <strong>of</strong> fitting a 3-state negative binomial HMM to one such<br />
data set. Figure 4.3 summarizes <strong>the</strong> results from multiple instances <strong>of</strong> this simulation. For<br />
each simulation, models were fitted using <strong>the</strong> full data sets as well as smaller subsets. Copy<br />
number changes larger that 1 Mb were detected more or less consistently, while smaller<br />
regions were not. Surprisingly, <strong>the</strong> results obtained from <strong>the</strong> smaller data sets were not<br />
much different, except in a couple <strong>of</strong> runs which had more false positives. Presumably, <strong>the</strong>se<br />
details will vary depending on <strong>the</strong> size <strong>of</strong> <strong>the</strong> genome and <strong>the</strong> number and magnitude <strong>of</strong> <strong>the</strong><br />
copy number alterations.<br />
Example: GM07535 and CHM: We next considered optical map data from two human<br />
genomes, GM07535 and CHM (Section 1.2), restricting our attention to alignments to<br />
<strong>the</strong> first 6 chromosomes. 23424 GM07535 and 43502 CHM optical maps aligned to <strong>the</strong>se<br />
chromosomes. Alignments <strong>of</strong> CHM maps were used as <strong>the</strong> normalizing genome to detect relative<br />
copy number changes in GM07535. Both genomes are believed to be “normal”, so not<br />
much variation is expected. AIC and BIC values given in Table 4.1 provide some guidance<br />
in selecting an appropriate number <strong>of</strong> states, and a model with 2 states seems appropriate.<br />
Some regions <strong>of</strong> mild copy number differences are indicated by <strong>the</strong> graph <strong>of</strong> posterior probabilities<br />
in Figure 4.4. The reason for <strong>the</strong> differences is not clear, but could in part reflect<br />
heterozygous differences in GM07535.