11.07.2015 Views

Bioinformatics for DNA Sequence Analysis.pdf - Index of

Bioinformatics for DNA Sequence Analysis.pdf - Index of

Bioinformatics for DNA Sequence Analysis.pdf - Index of

SHOW MORE
SHOW LESS
  • No tags were found...

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Chapter 1Similarity Searching Using BLASTKit J. Menlove, Mark Clement, and Keith A. CrandallAbstractSimilarity searches are an essential component <strong>of</strong> most bioin<strong>for</strong>matic applications. They <strong>for</strong>m the bases <strong>of</strong>structural motif identification, gene identification, and insights into functional associations. With the rapidincrease in the available genetic data through a wide variety <strong>of</strong> databases, similarity searches are an essentialtool <strong>for</strong> accessing these data in an in<strong>for</strong>mative and productive way. In this chapter, we provide an overview<strong>of</strong> similarity searching approaches, related databases, and parameter options to achieve the best results <strong>for</strong> avariety <strong>of</strong> applications. We then provide a worked example and some notes <strong>for</strong> consideration.Key words: BLAST, sequence alignment, similarity search.1. Introduction1.1. An Introduction toNucleotide DatabasesPerhaps the central goal <strong>of</strong> genetics is to articulate the associations<strong>of</strong> phenotypes <strong>of</strong> interest with their underlying genetic componentsand then to understand the relationship between geneticvariation and variation in the phenotype. This goal has beenbuoyed by the tremendous increase in our ability to obtain moleculargenetic data, across both populations and species. As methods<strong>of</strong> gathering in<strong>for</strong>mation about various aspects <strong>of</strong> biologicalmacromolecules arose, biological in<strong>for</strong>mation became abundantand the need to consolidate and make this in<strong>for</strong>mation accessiblebecame increasingly apparent. In the early 1960s, Margaret Dayh<strong>of</strong>fand colleagues at the National Biomedical Research Foundation(NBRF) began collecting in<strong>for</strong>mation on protein sequencesand structure into a volume entitled Atlas <strong>of</strong> Protein <strong>Sequence</strong> andStructure (1). Since that beginning, databases have been an importantand essential part <strong>of</strong> biological and biochemical research.David Posada (ed.), <strong>Bioin<strong>for</strong>matics</strong> <strong>for</strong> <strong>DNA</strong> <strong>Sequence</strong> <strong>Analysis</strong>, Methods in Molecular Biology 537ª Humana Press, a part <strong>of</strong> Springer ScienceþBusiness Media, LLC 2009DOI 10.1007/978-1-59745-251-9_11

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!