12.07.2015 Views

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

From Protein Structure to Function with Bioinformatics.pdf

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

18 J. Lee et al.potentials, each residue is represented either by a single a<strong>to</strong>m or by a few a<strong>to</strong>ms,e.g., C α-based potentials (Melo et al. 2002), C β-based potentials (Hendlich et al.1990), side chain centre-based potentials (Bryant and Lawrence 1993; Kocheret al. 1994; Thomas and Dill 1996; Skolnick et al. 1997; Zhang and Kim 2000;Zhang et al. 2004), side chain and C α-based potentials (Berrera et al. 2003). One ofthe most widely-used knowledge-based potentials is a residue-specific, all-a<strong>to</strong>m,distance-dependent potential, which was first formulated by Samudrala and Moult(RAPDF) (Samudrala and Moult 1998); it counts the distances between 167 aminoacid specific pseudo-a<strong>to</strong>ms. Following this, several a<strong>to</strong>mic potentials <strong>with</strong> variousreference states have been proposed, including those by Lu and Skolnick (KBP)(Lu and Skolnick 2001), Zhou and Zhou (DFIRE) (Zhou and Zhou 2002), Wanget al. (self-RAPDF) (Wang et al. 2004), Tost<strong>to</strong> (vic<strong>to</strong>r/FRST) (Tosat<strong>to</strong> 2005), andShen and Sali (DOPE) (Shen and Sali 2006). All these potentials claimed thatnative structures can be distinguished from decoy structures in their tests. However,the task of selecting the near native models out of many decoys remains as a challengefor these potentials (Skolnick 2006); this is actually more important thannative structure recognition because in reality there are no native structures availablefrom computer simulations. Based on the CAFASP4-MQAP experiment in2004 (Fischer 2006), the best-performing energy functions are Vic<strong>to</strong>r/FRST(Tosat<strong>to</strong> 2005) which incorporates an all-a<strong>to</strong>m pair wise interaction potential, solvationpotential and hydrogen bond potential, and MODCHECK (Pettitt et al.2005) which includes C βa<strong>to</strong>m interaction potential and solvation potential. <strong>From</strong>CASP7-MQAP in 2006, Pcons developed by Elofsson group based on structureconsensus performed best (Wallner and Elofsson 2007).1.4.3 Sequence-<strong>Structure</strong> Compatibility <strong>Function</strong>In the third type of MQAPs, best models are selected not purely based on energyfunctions. They are selected based on the compatibility of target sequences <strong>to</strong> modelstructures. The earliest and still successful example is that by Luthy et al. (1992),who used threading scores <strong>to</strong> evaluate structures. Colovos and Yeates (1993) laterused a quadratic error function <strong>to</strong> describe the non-covalently bonded interactionsamong CC, CN, CO, NN, NO and OO, where near-native structures have fewererrors than other decoys. Verify3D (Eisenberg et al. 1997) improves the method ofLuthy et al. (1992) by considering local threading scores in a 21-residue window.Jones developed GenThreader (Jones 1999) and used neural networks <strong>to</strong> classifynative and non-native structures. The inputs of GenThreader include pairwise contactenergy, solvation energy, alignment score, alignment length, and sequence andstructure lengths. Similarly, based on neural networks, Wallner and Ellofsson builtProQ (Wallner and Elofsson 2003) for quality prediction of decoy structures. Theinputs of ProQ include contacts, solvent accessible area, protein shape, secondarystructure, structural alignment score between decoys and templates, and the fractionof protein regions <strong>to</strong> be modeled from templates. Recently, McGuffin developed a

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!