Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
chapter 7 | Analysis Workflow 207<br />
We have chosen a confidence score value of 0.33 as the default because<br />
it gives a good compromise between accuracy and Call Rate, especially<br />
when the 85% per sample filter is applied. In cases where you see a<br />
lower Call Rate, you may wish to consider reanalyzing the data at the<br />
more stringent confidence score (e.g., 0.26), which will decrease the<br />
Call Rate for the sample, but may increase the accuracy of the<br />
genotypes.<br />
Data Clean Up<br />
For many genotyping applications, loss of accuracy can result in a<br />
significant decrease in genetic power. Genotyping errors can be caused<br />
by <strong>SNP</strong>s that give systematic errors, or by sporadic errors that occur<br />
due to stochastic, sample or experimental factors. <strong>SNP</strong>s that<br />
systematically give errors (e.g., out of Hardy Weinberg equilibrium)<br />
have been removed through a <strong>SNP</strong> selection process 1 . Overall the<br />
accuracy has been shown to be approximately 99.5% on the <strong>Mapping</strong><br />
<strong>500K</strong> Array Set based upon internal data.<br />
Prior to downstream analysis, the user should consider taking steps to<br />
identify and eliminate sporadic genotype errors. Steps may include<br />
eliminating <strong>SNP</strong>s out of Hardy Weinberg equilibrium in control<br />
samples from the population, or eliminating genotypes showing<br />
Mendel inconsistency or unlikely genotypes. GTYPE has<br />
functionality to identify <strong>SNP</strong>s showing Mendelian or Hardy<br />
Weinberg errors in user defined samples. These genotyping errors can<br />
be filtered out upon data export. Please consult the GeneChip ®<br />
Genotyping Analysis Software User’s Guide for more information.<br />
1 Matsuzaki H., Dong S., Loi H., Di X., Liu G., Hubbell E., Law J., Berntsen T., Chadha M., Hui H., Yang G., Kennedy<br />
G., Webster T., Cawley S., Walsh P., Jones K., Fodor S., Mei R. Genotyping over 100,000 <strong>SNP</strong>s on a pair of oligonucleotide<br />
arrays. Nat Methods 1:109-111 (2004).<br />
2 Di X., Matsuzaki H., Webster T.A., Hubbell E., Liu G., Dong S., Bartell D., Huang J., Chiles R., Yang G., Shen M.M.,<br />
Kulp D., Kennedy G.C., Mei R., Jones K.W., Cawley S. Dynamic model based algorithms for screening and genotyping<br />
over 100K <strong>SNP</strong>s on oligonucleotide microarrays. Bioinformatics 21:1958-63 (2005).<br />
1 See <strong>SNP</strong> Selection Criteria for the GeneChip Human <strong>Mapping</strong> 10K Array Xba 131 Technical Note. A similar process<br />
was used for the <strong>Mapping</strong> <strong>500K</strong> <strong>SNP</strong> selection.