New Collaborative Research Article in Molecular Biology And Evolution (IF 10.7)

Excited to share our recent collaboration study with the group of Prof. Miguel Arenas at Univ Vigo, Spain published in Molecular Biology and Evolution (IF 10.7). In this study we present a substitution model of protein evolution that includes structural and enzymatic activity.

The substitution model used the binding affinity of the enzyme–substrate complex as well as structural constraints that include the flexibility of structural flaps, hydrogen bonds, amino acids backbone radius of gyration, and solvent-accessible surface area that are quantified through molecular dynamics simulations. HIV-1 Protease was used as a test study. We found that accounting for selection on the protein activity improves the fitting of the modeled functional regions with the real observations, especially in data with high molecular identity, which recommends considering constraints on the protein activity in the development of substitution models of evolution.

Substitution Models of Protein Evolution with Selection on
Enzymatic Activity

David Ferreiro, Ruqaiya Khalil, Sérgio F. Sousa, Miguel Arenas

Molecular Biology and Evolution (2024) | DOI: 10.1093/molbev/msae026

Abstract:

Substitution models of evolution are necessary for diverse evolutionary analyses including phylogenetic tree and ancestral sequence reconstructions. At the protein level, empirical substitution models are traditionally used due to their simplicity, but they ignore the variability of substitution patterns among protein sites. Next, in order to improve the realism of the modeling of protein evolution, a series of structurally constrained substitution models were presented, but still they usually ignore constraints on the protein activity. Here, we present a substitution model of protein evolution with selection on both protein structure and enzymatic activity, and that can be applied to phylogenetics. In particular, the model considers the binding affinity of the enzyme–substrate complex as well as structural constraints that include the flexibility of structural flaps, hydrogen bonds, amino acids backbone radius of gyration, and solvent-accessible surface area that are quantified through molecular dynamics simulations. We applied the model to the HIV-1 protease and evaluated it by phylogenetic likelihood in comparison with the best-fitting empirical substitution model and a structurally constrained substitution model that ignores the enzymatic activity. We found that accounting for selection on the protein activity improves the fitting of the modeled functional regions with the real observations, especially in data with high molecular identity, which recommends considering constraints on the protein activity in the development of substitution models of evolution.