Ontology highlight
ABSTRACT:
SUBMITTER: McGee F
PROVIDER: S-EPMC8563988 | biostudies-literature | 2021 Nov
REPOSITORIES: biostudies-literature
McGee Francisco F Hauri Sandro S Novinger Quentin Q Vucetic Slobodan S Levy Ronald M RM Carnevale Vincenzo V Haldane Allan A
Nature communications 20211102 1
Potts models and variational autoencoders (VAEs) have recently gained popularity as generative protein sequence models (GPSMs) to explore fitness landscapes and predict mutation effects. Despite encouraging results, current model evaluation metrics leave unclear whether GPSMs faithfully reproduce the complex multi-residue mutational patterns observed in natural sequences due to epistasis. Here, we develop a set of sequence statistics to assess the "generative capacity" of three current GPSMs: th ...[more]