note - FIZ Karlsruhe
note - FIZ Karlsruhe
note - FIZ Karlsruhe
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Sequence Code Match Searching<br />
Exact Sequence Search of Proteins (/SQEP) retrieves protein sequences that exactly match the<br />
search query. The search query must be completely defined. Variability symbols are not allowed.<br />
Subsequence Search of Proteins (/SQSP) retrieves exact answers plus sequences in which the<br />
query sequence is embedded. Variability symbols are allowed.<br />
Exact Sequence Search of Nucleic Acids (/SQEN) retrieves nucleic acid sequences that exactly<br />
match the search query. The search query must be completely defined. Variability symbols are not<br />
allowed. Ambiguity codes are allowed.<br />
Subsequence Search of Nucleic Acids (/SQSN) retrieves exact answers plus sequences in which<br />
the query sequence is embedded. Variability symbols are allowed. Ambiguity codes are allowed.<br />
Exact Family Sequence Search of Proteins (/SQEFP) retrieves protein sequences that exactly<br />
match the query and answers in which family-equivalent substitution of the query amino acids<br />
occurs (see table below). Variability symbols are not allowed.<br />
Subsequence Family Search of Proteins (/SQSFP) retrieves exact sequences, subsequences, and<br />
answers in which family-equivalent substitution of the query amino acids occurs (see table below).<br />
Variability symbols are allowed.<br />
Amino acid families for SQEFP and SQSFP<br />
The SQEFP and SQSFP family code match options for GETSEQ searching, allow narrow<br />
variability searching based upon highly conservative substitutions of chemically similar amino<br />
acids. The amino acid families are shown in the table below.<br />
Group Amino Acids<br />
Neutral - weakly hydrophobic P, A, G, S, T<br />
Acid amine - hydrophilic Q, N, E, D, B, Z<br />
Basic - hydrophilic H, K, R<br />
Hydrophobic I, M, L, V<br />
Aromatic F, W, Y<br />
Cross-linking C<br />
Neutral - weakly hydrophobic P, A, G, S, T<br />
Acid amine - hydrophilic Q, N, E, D, B, Z<br />
Basic - hydrophilic H, K, R<br />
Hydrophobic I, M, L, V<br />
Aromatic F, W, Y<br />
GENESEQ on STN (DGENE) Workshop Manual | Page 103