POSTERS - BLAST X - University of Utah
POSTERS - BLAST X - University of Utah
POSTERS - BLAST X - University of Utah
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>BLAST</strong> X Mon. Morning Session<br />
MESSAGE PASSING: PROTEIN STRUCTURE ASSEMBLY FROM SEQUENCE DATA FOR<br />
TWO-COMPONENT SIGNALING PROTEINS<br />
Hendrik Szurmant 1 , Martin Weigt 2,3 , Robert White 1,3 , Terry Hwa 3 and James A. Hoch 1<br />
1<br />
Division <strong>of</strong> Cellular Biology, Department <strong>of</strong> Molecular and Experimental Medicine, The Scripps<br />
Research Institute, La Jolla, CA 92037<br />
2<br />
Institute for Scientific Interchange, I-10133 Torino, Italy<br />
3<br />
Center for Theoretical Biological Physics and Department <strong>of</strong> Physics, <strong>University</strong> <strong>of</strong> California at<br />
San Diego, La Jolla, CA 92093<br />
The crystal structure <strong>of</strong> the Bacillus subtilis response regulator Spo0F in complex with<br />
the histidine kinase structural homologue Spo0B defined the active site <strong>of</strong> phosphotransfer and<br />
the spatial interactions <strong>of</strong> two-component systems <strong>of</strong> microbes and plants. The limited<br />
bioinformatic data available at the time was sufficient to deduce and understand the molecular<br />
basis for recognition specificity between histidine kinases and response regulators (J. A. Hoch<br />
and K. I. Varughese. 2001. J. Bacteriol. 183:4941-4949). Today, the availability <strong>of</strong> large protein<br />
databases generated from sequences <strong>of</strong> hundreds <strong>of</strong> bacterial genomes enables more<br />
sophisticated statistical approaches to extract interacting and specificity determining positions<br />
between proteins from protein databases. The goal <strong>of</strong> such studies is to identify protein<br />
interaction surfaces from sequencing data alone, without previous structural knowledge, i.e. cocrystal<br />
data. A number <strong>of</strong> co-variance based approaches producing nearly identical results have<br />
been applied to the highly amplified two-component systems as a means to verify mathematical<br />
data with structural knowledge <strong>of</strong> sensor kinase/response regulator interaction; including one<br />
presented by us at the <strong>BLAST</strong> IX in 2007 (R. A. White et al. 2007. Methods Enzymol. 422:75-<br />
101). While producing potential specificity determining position information, these methods have<br />
an important shortcoming in predicting spatial proximity. They cannot distinguish between<br />
directly correlated (interacting) and indirectly correlated (non-interacting) residue positions. To<br />
address this issue we developed a novel method that combines co-variance analysis with global<br />
inference analysis, adopted from use in statistical physics.<br />
When applied to a set <strong>of</strong> over 2500 representatives <strong>of</strong> the bacterial two-component<br />
signal transduction system, the combination <strong>of</strong> covariance with global inference methods<br />
successfully and robustly identified residue pairs that are proximal in space without resorting to<br />
ad hoc tuning parameters, both for hetero-interactions between sensor kinase (SK) and<br />
response regulator (RR) proteins and for homo-interactions between RR proteins. The<br />
spectacular success <strong>of</strong> this approach illustrates the effectiveness <strong>of</strong> this combination approach<br />
in identifying direct interaction positions based on sequence information alone. We expect this<br />
method to be applicable for predicting interaction surfaces between proteins present in only one<br />
copy per genome as the number <strong>of</strong> sequenced genomes continues to expand, and for<br />
assembling multi-protein structures.<br />
5