note - FIZ Karlsruhe
note - FIZ Karlsruhe
note - FIZ Karlsruhe
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
VII: Variability Symbols for GETSEQ<br />
Appendices<br />
There are many symbols which can be used in DGENE RUN GETSEQ subsequence<br />
searches /SQSP,/SQSFP and /SQSN (page 102), to allow for variability in residues. These<br />
symbols facilitate the searching of motifs, domains or other conserved regions, in either<br />
peptide or nucleotide sequences. The full table of options is given below.<br />
Symbol(s) Function Search Example<br />
[ ] Specify alternate residues<br />
[-]<br />
{ }<br />
with a<br />
number or<br />
range<br />
?<br />
*<br />
+<br />
Exclude a specific residue or<br />
alternate residues<br />
Repeat the preceding symbol,<br />
sequence, or an L-number, Enumber<br />
or saved name for a<br />
sequence query<br />
Repeat the preceding symbol,<br />
sequence, or sequence query<br />
zero or one time<br />
Repeat the preceding symbol,<br />
sequence, or sequence query<br />
zero or more times<br />
Repeat the preceding symbol,<br />
sequence, or sequence query<br />
one or more times<br />
LGP[VL]/SQSP:<br />
LGP followed by either V or L<br />
ATTGC[-A]GAAG/SQSN:<br />
ATTGC followed by any<br />
nucleotide except A followed<br />
by GAAG<br />
GG(FL){1-3}/SQSP<br />
(or GG(FL){1,3}/SQSP):<br />
GGFL, or GGFLFL, or<br />
GGFLFLFL.<br />
CAT(CTG){1,}TATT/SQSN:<br />
CAT followed by one or more<br />
repetitions of CTG followed by<br />
TATT, e.g. CATCTGTATT,<br />
CATCTGCTGTATT,<br />
CATCTGCTGCTGTATT etc.<br />
FLRRI(RP)?K/SQSP is<br />
equivalent to<br />
FLRRI(RP){0,1}K/SQSP:<br />
FLRRIK or FLRRIRPK<br />
CAT(CTG)*TATT/SQSN is<br />
the same as<br />
CAT(CTG){0,}TATT/SQSN:<br />
CATTATT or CAT followed<br />
by any number of repetitions of<br />
CTG followed by TATT, e.g.<br />
CATCTGTATT,<br />
CATCTGCTGTATT etc.<br />
CAT(CTG)+TATT/SQSN is<br />
equivalent to<br />
CAT(CTG){1,}TATT/SQSN:<br />
CAT followed by one or more<br />
repetitions of CTG followed by<br />
TATT, e.g. CATCTGTATT,<br />
CATCTGCTGTATT,<br />
CATCTGCTGCTGTATT etc.<br />
GENESEQ on STN (DGENE) Workshop Manual | Page 135