12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

5 <strong>Structure</strong> and <strong>Function</strong> of Intrinsically Disordered <strong>Protein</strong>s 119Fig. 5.2 Foldindex plot of the disorder of p53. Disorder of the tumor suppressor p53 has beenpredicted by the FoldIndex algorithm (Prilusky et al. 2005). The plot is colour coded, <strong>with</strong> reddenoting predicted disorder and green denoting order, in agreement <strong>with</strong> biophysical data thatsuggest disorder <strong>with</strong>in the N-terminal trans-activa<strong>to</strong>r domain and the C-terminal tetramerizationand regula<strong>to</strong>ry domains (Bell et al. 2002; Dawson et al. 2003) (Reprinted from Prilusky et al.2005 by permission of Oxford University Press)of predicted regular secondary structure are structurally disordered (Liu and Rost2003). Whereas the predic<strong>to</strong>r, NORSp, performs comparably <strong>to</strong> other disorder predic<strong>to</strong>rs,it should be noted that there are well-ordered proteins composed entirely ofnon-repetitive local structural elements (termed loopy proteins (Liu et al. 2002) ),and IDPs, which contain transient local structural elements (Fuxreiter et al. 2004).The tendency of these latter <strong>to</strong> form structure is well-predictable, which sets a conceptuallimit <strong>to</strong> predictions based on the above principle.5.3.5 Machine Learning AlgorithmsArguably the most advanced approaches <strong>to</strong> disorder prediction are machine learning(ML) algorithms, i.e. predic<strong>to</strong>rs trained <strong>to</strong> distinguished sequences that encodeordered or disordered structures. Compared <strong>to</strong> the previous simpler approaches,these incorporate non-trivial amino acid features and hidden sequence properties,which probably explains their superior performance. At the same time, their correctprediction often does not rely on known principles, and thus they do not add <strong>to</strong> ourunderstanding of what defines disorder.The classical ML algorithm is PONDR (predic<strong>to</strong>r of natural disorderedregions), a neural network (NN) algorithm, which is based on local amino acidcomposition, flexibility and other sequence features (Romero et al. 1998). It has

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!