Optimization Online





 

A new formulation of protein evolutionary models that account for structural constraints

Andrew J. Bordner (bordner.andrew***at***mayo.edu)
Hans D. Mittelmann (mittelmann***at***asu.edu)

Abstract: Despite the importance of a thermodynamically stable structure with a conserved fold for protein function, almost all evolutionary models neglect the site-site correlations that arise from physical interactions between neighboring amino acid sites. This is mainly due to the difficulty of formulating a computationally tractable model, because rate matrices can no longer be used when sites are interdependent. Here we introduce a general framework, based on factor graphs, for constructing probabilistic models of protein evolution with site interdependence. Conveniently, efficient approximate inference algorithms, such as Belief Propagation, can be used to calculate likelihoods for these models. We fit an amino acid substitution model of this type that accounts for both solvent accessibility and site-site correlations. Comparisons of the new model with rate matrix models and with a model accounting only for solvent accessibility demonstrate that it fits the sequence data better. We also examine evolution within a family of homohexameric enzymes and find that site-site correlations between most contacting subunits contribute to a higher likelihood. In addition, we show that the new substitution model has a mathematical form similar to the one introduced by Rodrigue et al. (2005), although with different parameter interpretations and values. We also perform a statistical analysis of the effects of amino acids at neighboring sites on substitution probabilities and find a significant perturbation of most probabilities. This further supports the important role of site-site interactions in protein evolution and motivates the development of new evolutionary models like the one described here. Finally, we discuss possible extensions and applications of the new substitution models.

Keywords: protein evolution, phylogenetic models, pseudolikelihood, nonconvex nonlinear optimization, factor graphs

Category 1: Applications -- Science and Engineering (Biomedical Applications)

Category 2: Nonlinear Optimization (Constrained Nonlinear Optimization)


Entry Submitted: 08/29/2013
Entry Accepted: 08/29/2013
Entry Last Modified: 12/05/2013
