21.01.2013 Views

note - FIZ Karlsruhe

note - FIZ Karlsruhe

note - FIZ Karlsruhe

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Additional search indexes<br />

Text and Bibliographic searching<br />

Feature Table (FEAT)<br />

The Feature Table (FEAT) comprises indexed annotations which describe specific features of the<br />

sequence, e.g. coding regions, domains, introns/exons, etc. The features are organised in the table<br />

by the Thomson Reuters analyst, using a controlled list of feature keys and qualifiers (listed on<br />

pages 128 and 129). They can be a useful way to find sequences with reported characteristics.<br />

Unlike the Basic Index, the FEAT field supports left and simultaneous left and right truncation.<br />

Molecule Type (MTY)<br />

Molecule type is a controlled term field for different broad classes of sequences. This can often be<br />

a useful field for narrowing a sequence search into more precise subsets, e.g. to distinguish<br />

between mRNA and cDNA results. The full list of DGENE molecule types is given on page 132.<br />

=> S L1 AND CDNA/MTY<br />

Patent Sequence Location (PSL)<br />

The Patent Sequence Location (PSL) field indicates where a given sequence is located in the<br />

original document. The field shows if the sequence was referred to in a CLAIM, an EXAMPLE,<br />

or in other parts of the patent DISCLOSURE, and typically also includes a table, page, drawing,<br />

and/or sequence identity number 1 , if available. PSL can therefore be a useful way to narrow an<br />

answer set to sequences that may have been, e.g. claimed by the patent applicant.<br />

=> S L1 AND CLAIM/PSL<br />

Sequence Length (SQL)<br />

DGENE has a fully numerically range searchable Sequence Length (SQL) field. The SQL field<br />

can be searched with standard STN numeric operators or ranges. The field provides the ability, for<br />

example, to limit search results to desired sequence length or range of sequence lengths.<br />

=> S L1 AND 100-200/SQL<br />

Amino acid (AA) and Nucleic acid (NA)<br />

Individual amino acid residues and nucleotides are numerically range searchable in the amino<br />

acid (AA) and nucleic acid fields (NA) respectively. The AA and NA fields can be searched with<br />

standard STN numeric operators or ranges 2 , in combination with the single letter code of interest.<br />

=> S L1 AND 100-200 A/NA<br />

1 Sequence Identity Numbers (SEQ ID NOs) are indexed in the DGENE PSL field from 2004 onwards.<br />

2 For greater than (>) or less than ( S NA>10 (S) A/NA.<br />

GENESEQ on STN (DGENE) Workshop Manual | Page 89

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!