Predicting amino acid substitution probabilities using single nucleotide polymorphisms

Francesca Rizzato, Alex Rodriguez, Xevi Biarnés, Alessandro Laio

Producció científica: Article en revista indexadaArticleAvaluat per experts

2 Cites (Scopus)


Fast genome sequencing offers invaluable opportunities for building updated and improved models of protein sequence evolution. We here show that Single Nucleotide Polymorphisms (SNPs) can be used to build a model capable of predicting the probability of substitution between amino acids in variants of the same protein in different species. The model is based on a substitution matrix inferred from the frequency of codon interchanges observed in a suitably selected subset of human SNPs, and predicts the substitution probabilities observed in alignments between Homo sapiens and related species at 85–100% of sequence identity better than any other approach we are aware of. The model gradually loses its predictive power at lower sequence identity. Our results suggest that SNPs can be employed, together with multiple sequence alignment data, to model protein sequence evolution. The SNP-based substitution matrix developed in this work can be exploited to better align protein sequences of related organisms, to refine the estimate of the evolutionary distance between protein variants from related species in phylogenetic trees and, in perspective, might become a useful tool for population analysis.

Idioma originalAnglès
Pàgines (de-a)643-652
Nombre de pàgines10
Estat de la publicacióPublicada - d’oct. 2017


Navegar pels temes de recerca de 'Predicting amino acid substitution probabilities using single nucleotide polymorphisms'. Junts formen un fingerprint únic.

Com citar-ho