21.01.2013 Views

note - FIZ Karlsruhe

note - FIZ Karlsruhe

note - FIZ Karlsruhe

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Introduction to similarity searching<br />

Similarity Searching and FSORT<br />

DGENE answer sets may be grouped by underlying source publications using Family SORT<br />

(FSORT). FSORT gathers multiple sequence hits together via publication, application, and/or<br />

priority numbers. FSORT organizes answers into two subsets: multiple sequence hit (multi-record)<br />

families and single sequence hit (individual-record) families. When FSORT is used on an answer<br />

set previously sorted by SCORE, the two FSORT subsets each separately retain their SCORE sort<br />

order. FSORT makes it possible to review, e.g. just the most similar sequence answer for each<br />

family group, or all the sequence hits from a single family group.<br />

Recall the Search Question (page 20)…<br />

Page 40 | GENESEQ on STN (DGENE) Workshop Manual<br />

Find patent sequences with similarity to the Banana Bunchy Top Virus<br />

(BBTV) Replication Initiation Protein (NCBI: AAG44003):<br />

MSSFKWCFTLNYSSAAEREDFLALLKEEELNYAVVGDEVAPSSGQKHLQG<br />

YLSLKKSIKLGGLKKKYSSRAHWERARGSDEDNAKYCSKETLILELGFPA<br />

SQGSNRRKLSEMVSRSPERMRIEQPEIYHRYTSVKKLKKFKEEFVHPCLD<br />

RPWQIQLTEAIDEEPDDRSIIWVYGPNGNEGKSTYAKSLMKKDWFYTRGG<br />

KKENILFSYVDEGSEKHIVFDIPRCNQDYLNYDVIEALKDRVIESTKYKP<br />

IKLVELINIHVIVMANFMPEFCKISEDRIKIIYC<br />

Search Strategy<br />

To conduct a BLAST similarity search, and display answers sorted<br />

by patent family…<br />

Step 1 Upload the sequence and conduct a BLAST search<br />

Step 2 Sort the answers by descending similarity score<br />

Step 3 Sort by patent family with FSORT<br />

Step 4 Display selected answers using DISPLAY PFAM

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!