75 Integrating Membrane Transport with Male Gametophyte ... - TAIR
75 Integrating Membrane Transport with Male Gametophyte ... - TAIR
75 Integrating Membrane Transport with Male Gametophyte ... - TAIR
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
103 Proteomic services at the European Bioinformatics Institute<br />
D Thorneycroft, J Khadake, S Orchard, L Montecchi-Palazzi, S Kerrin, C Leroy, B Aranda, M Kleen, P Jones, R Cote,<br />
L Martens, A Quinn, W Derache, C Taylor, A Fleischmann, M Darsow, K Degtyarenko, W Fleischmann, S Boyce, K<br />
Axelsen, J Risse, M Ennis, N Vinod, P de Matos, S Klie, H Hemjakob, R Apweiler<br />
European Bioinformatics Institute<br />
Proteome analysis is becoming a powerful tool in the functional characterization of plants and has been applied to processes such<br />
as photomorphogenesis, flower development, protein degradation systems, signal transduction and plant-microbe protein interactions.<br />
These studies have expanded and refined understanding of the plant proteome and interactome in both model and crop species.<br />
The European Bioinformatics Institute (EBI) is the European node for globally coordinated efforts to collect and disseminate<br />
biological data. The Proteomics Services Team (EBI) provides databases and tools for the deposition, distribution and analysis of<br />
proteomics and proteomics-related data. PRIDE, the 'PRoteomics IDEntifications database' (http://www.ebi.ac.uk/pride) is a database<br />
of protein and peptide identifications that have been described in the scientific literature. These identifications may be annotated<br />
<strong>with</strong> supporting mass spectra. PRIDE is a web application, so submission, searching and data retrieval can all be performed using an<br />
internet browser. The Intact database (http://www.ebi.ac.uk/intact/index.jsp) provides a freely available, open source database system<br />
and tools for the storage and analysis of protein interaction data. The IntAct toolkit provides textural and graphical representation of<br />
the data and allows the user to explore protein interaction networks using a web-based interface. The database contains an increasing<br />
amount of protein interaction data derived from model and crop plants.<br />
The Proteomics Services team coordinates HUPO’s Proteomics Standards Initiative (PSI). , which is developing data standards<br />
for protein-protein interactions, mass spectrometry and general proteomics. The proteomics team also provides biochemical reference<br />
databases such as IntEnz, the Integrated relational Enzyme database which is the most up-to-date version of the Enzyme Nomenclature<br />
and ChEBI, the Chemical Entities of Biological Interest database, a freely available dictionary of molecular entities focused on ‘small’<br />
chemical compounds. The Proteomics Services Team seeks to constantly evolve to meet the needs of its users and play its part in the<br />
development and adoption of proteomic standards by the scientific community. For more information visit http://www.ebi.ac.uk.<br />
PRIDE: a public repository of protein and peptide identifications for the proteomics community. P Jones et al. Nucleic Acids Research Vol 1<br />
Issue 34 (Database issue) D659-D663. IntAct - an open source molecular interaction database. H. Hermjakob et al. Nucl. Acids. Res. 2004 32:<br />
D452-D455. IntEnz, the integrated relational enzyme database. Fleischmann A et al. Nucleic Acids Res. 2004 Jan 1;32(Database issue)D434-<br />
7. Autumn 2005 Workshop of the Human Proteome Organisation Proteomics Standards Initiative (HUPO-PSI) Geneva, September, 4-6,<br />
2005. S. Orchard et al. Proteomics. 2006 Feb;6(3):738-41. The European Bioinformatics Institute's data resources: towards systems biology.<br />
Brooksbank C et al. Nucleic Acids Res. 2005 Jan 1;33(Database issue):D46-53.<br />
104 New Gene Discovery in Unannotated Regions of the Arabidopsis Genome<br />
Yongli Xiao, William Moskal, Hank Wu, Beverly Underwood, Wei Wang, Erin Monaghan, Christopher Town<br />
The Institute for Genomic Research, Rockville MD, USA.<br />
The whole genome sequence of Arabidopsis <strong>with</strong> its annotation was completed at the end of 2000. Since then,<br />
four versions of genome re-annotation were released sequentially based on new experimental evidence and improved<br />
computational analysis. Evidence based mainly on sequence conservation indicated the existence of yet more genes<br />
in un-annotated regions of the genome. TwinScan and EuGene are two relatively new gene prediction programs that<br />
incorporate comparative genomic information. Compared <strong>with</strong> the TIGRv5 annotation, EuGene predicts 1,559 and<br />
TwinScan predicts 1,440 genes in intergenic regions of the Arabidopsis genome <strong>with</strong> 365 predictions in common. In order<br />
to verify the novel genes predicted by EuGene and TwinScan, a high throughput method of rapid amplification of cDNA<br />
ends (RACE) using cDNA from 11 diverse RNA populations was applied to 918 predictions in intergenic regions. We<br />
recovered transcripts from 429 predictions that yielded 323 novel full-length cDNAs. In addition, a comparative study<br />
of Arabidopsis and Brassica yielded a number of Conserved Arabidopsis Genome Sequences (CAGS), 28% of which<br />
aligned to un-annotated regions suggesting the presence of un-annotated novel genes. We similarly targeted 192 intergenic<br />
CAGS by the RACE pipeline and found an additional 25 un-annotated genes. Thus, a total 454 of novel genes in the<br />
intergenic regions of Arabidopsis has been recovered. We have implemented a streamlined Gateway® cloning protocol<br />
that permits simultaneous generation of ORF clones <strong>with</strong> and <strong>with</strong>out stop codons <strong>with</strong> little additional cost. To date, we<br />
have generated ORF clones in Gateway® entry vectors for over 2,000 Arabidopsis genes, including the novel intergenic<br />
structures described above, that are expressed at very low levels and thus absent from current EST/cDNA collections.<br />
Research supported by the NSF 2010 Project.