cover of episode Zero-Shot Transfer of Protein Sequence Likelihood Models to Thermostability Prediction

Zero-Shot Transfer of Protein Sequence Likelihood Models to Thermostability Prediction

2023/7/19
logo of podcast PaperPlayer biorxiv bioinformatics

PaperPlayer biorxiv bioinformatics

Frequently requested episodes will be transcribed first

Shownotes Transcript

Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2023.07.17.549396v1?rss=1

Authors: Reeves, S., Kalyaanamoorthy, S.

Abstract: Protein sequence likelihood models (PSLMs) are an emerging class of self-supervised deep learning algorithms which learn distributions over amino acid identities in structural and evolutionary contexts. Recently, PSLMs have demonstrated impressive performance in predicting the relative fitness of variant sequences without any task-specific training. In this work, we comprehensively analyze the capacity of six PSLMs to predict experimental measurements of thermostability for variants of hundreds of heterogeneous proteins. We assess performance of PSLMs relative to state-of-the-art supervised models, highlight relative strengths and weaknesses, and examine the complementarity between these models. We focus our analyses on stability engineering applications, assessing which methods and combinations of methods can most consistently identify and prioritize mutations for experimental validation. Our results indicate that structure-based PSLMs have competitive performance with the best existing supervised methods and can augment the predictions of supervised methods by integrating insights from their disparate training objectives.

Copy rights belong to original authors. Visit the link for more info

Podcast created by Paper Player, LLC