cover of episode PortPred: exploiting deep learning embeddings of amino acid sequences for the identification of transporter proteins and their substrates

PortPred: exploiting deep learning embeddings of amino acid sequences for the identification of transporter proteins and their substrates

2023/1/27
logo of podcast PaperPlayer biorxiv bioinformatics

PaperPlayer biorxiv bioinformatics

Shownotes Transcript

Link to bioRxiv paper: http://biorxiv.org/cgi/content/short/2023.01.26.525714v1?rss=1

Authors: Anteghini, M., Martins dos Santos, V. A. P., Saccenti, E.

Abstract: The physiology of every living cell is regulated at some level by transporter proteins which constitute a relevant portion of membrane-bound proteins and are involved in the movement of ions, small and macromolecules across bio-membranes. The importance of transporter proteins is unquestionable. The prediction and study of previously unknown transporters can lead to the discovery of new biological pathways, drugs and treatments. Here we present PortPred, a tool to accurately identify transporter proteins and their substrate starting from the protein amino acid sequence. PortPred successfully combines pre-trained deep learning-based protein embeddings and machine learning classification approaches and outperforms other state-of-the-art methods. In addition, we present a comparison of the most promising protein sequence embeddings (Unirep, SeqVec, ProteinBERT, ESM-1b) and their performances for this specific task.

Copy rights belong to original authors. Visit the link for more info

Podcast created by Paper Player, LLC