07.01.2015 Views

75 Integrating Membrane Transport with Male Gametophyte ... - TAIR

75 Integrating Membrane Transport with Male Gametophyte ... - TAIR

75 Integrating Membrane Transport with Male Gametophyte ... - TAIR

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

103 Proteomic services at the European Bioinformatics Institute<br />

D Thorneycroft, J Khadake, S Orchard, L Montecchi-Palazzi, S Kerrin, C Leroy, B Aranda, M Kleen, P Jones, R Cote,<br />

L Martens, A Quinn, W Derache, C Taylor, A Fleischmann, M Darsow, K Degtyarenko, W Fleischmann, S Boyce, K<br />

Axelsen, J Risse, M Ennis, N Vinod, P de Matos, S Klie, H Hemjakob, R Apweiler<br />

European Bioinformatics Institute<br />

Proteome analysis is becoming a powerful tool in the functional characterization of plants and has been applied to processes such<br />

as photomorphogenesis, flower development, protein degradation systems, signal transduction and plant-microbe protein interactions.<br />

These studies have expanded and refined understanding of the plant proteome and interactome in both model and crop species.<br />

The European Bioinformatics Institute (EBI) is the European node for globally coordinated efforts to collect and disseminate<br />

biological data. The Proteomics Services Team (EBI) provides databases and tools for the deposition, distribution and analysis of<br />

proteomics and proteomics-related data. PRIDE, the 'PRoteomics IDEntifications database' (http://www.ebi.ac.uk/pride) is a database<br />

of protein and peptide identifications that have been described in the scientific literature. These identifications may be annotated<br />

<strong>with</strong> supporting mass spectra. PRIDE is a web application, so submission, searching and data retrieval can all be performed using an<br />

internet browser. The Intact database (http://www.ebi.ac.uk/intact/index.jsp) provides a freely available, open source database system<br />

and tools for the storage and analysis of protein interaction data. The IntAct toolkit provides textural and graphical representation of<br />

the data and allows the user to explore protein interaction networks using a web-based interface. The database contains an increasing<br />

amount of protein interaction data derived from model and crop plants.<br />

The Proteomics Services team coordinates HUPO’s Proteomics Standards Initiative (PSI). , which is developing data standards<br />

for protein-protein interactions, mass spectrometry and general proteomics. The proteomics team also provides biochemical reference<br />

databases such as IntEnz, the Integrated relational Enzyme database which is the most up-to-date version of the Enzyme Nomenclature<br />

and ChEBI, the Chemical Entities of Biological Interest database, a freely available dictionary of molecular entities focused on ‘small’<br />

chemical compounds. The Proteomics Services Team seeks to constantly evolve to meet the needs of its users and play its part in the<br />

development and adoption of proteomic standards by the scientific community. For more information visit http://www.ebi.ac.uk.<br />

PRIDE: a public repository of protein and peptide identifications for the proteomics community. P Jones et al. Nucleic Acids Research Vol 1<br />

Issue 34 (Database issue) D659-D663. IntAct - an open source molecular interaction database. H. Hermjakob et al. Nucl. Acids. Res. 2004 32:<br />

D452-D455. IntEnz, the integrated relational enzyme database. Fleischmann A et al. Nucleic Acids Res. 2004 Jan 1;32(Database issue)D434-<br />

7. Autumn 2005 Workshop of the Human Proteome Organisation Proteomics Standards Initiative (HUPO-PSI) Geneva, September, 4-6,<br />

2005. S. Orchard et al. Proteomics. 2006 Feb;6(3):738-41. The European Bioinformatics Institute's data resources: towards systems biology.<br />

Brooksbank C et al. Nucleic Acids Res. 2005 Jan 1;33(Database issue):D46-53.<br />

104 New Gene Discovery in Unannotated Regions of the Arabidopsis Genome<br />

Yongli Xiao, William Moskal, Hank Wu, Beverly Underwood, Wei Wang, Erin Monaghan, Christopher Town<br />

The Institute for Genomic Research, Rockville MD, USA.<br />

The whole genome sequence of Arabidopsis <strong>with</strong> its annotation was completed at the end of 2000. Since then,<br />

four versions of genome re-annotation were released sequentially based on new experimental evidence and improved<br />

computational analysis. Evidence based mainly on sequence conservation indicated the existence of yet more genes<br />

in un-annotated regions of the genome. TwinScan and EuGene are two relatively new gene prediction programs that<br />

incorporate comparative genomic information. Compared <strong>with</strong> the TIGRv5 annotation, EuGene predicts 1,559 and<br />

TwinScan predicts 1,440 genes in intergenic regions of the Arabidopsis genome <strong>with</strong> 365 predictions in common. In order<br />

to verify the novel genes predicted by EuGene and TwinScan, a high throughput method of rapid amplification of cDNA<br />

ends (RACE) using cDNA from 11 diverse RNA populations was applied to 918 predictions in intergenic regions. We<br />

recovered transcripts from 429 predictions that yielded 323 novel full-length cDNAs. In addition, a comparative study<br />

of Arabidopsis and Brassica yielded a number of Conserved Arabidopsis Genome Sequences (CAGS), 28% of which<br />

aligned to un-annotated regions suggesting the presence of un-annotated novel genes. We similarly targeted 192 intergenic<br />

CAGS by the RACE pipeline and found an additional 25 un-annotated genes. Thus, a total 454 of novel genes in the<br />

intergenic regions of Arabidopsis has been recovered. We have implemented a streamlined Gateway® cloning protocol<br />

that permits simultaneous generation of ORF clones <strong>with</strong> and <strong>with</strong>out stop codons <strong>with</strong> little additional cost. To date, we<br />

have generated ORF clones in Gateway® entry vectors for over 2,000 Arabidopsis genes, including the novel intergenic<br />

structures described above, that are expressed at very low levels and thus absent from current EST/cDNA collections.<br />

Research supported by the NSF 2010 Project.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!