3260 papers • 126 benchmarks • 313 datasets
For GO terms prediction, given the specific function prediction instruction and a protein sequence, models characterize the protein functions using the GO terms presented in three different domains (cellular component, biological process, and molecular function).
(Image credit: Papersgraph)
These leaderboards are used to track progress in protein-function-prediction-5
Use these libraries to find protein-function-prediction-5 models and implementations
A new strategy and self-supervised methods for pre-training Graph Neural Networks (GNNs) that avoids negative transfer and improves generalization significantly across downstream tasks, leading up to 9.4% absolute improvements in ROC-AUC over non-pre-trained models and achieving state-of-the-art performance for molecular property prediction and protein function prediction.
This study undertakes a comprehensive exploration of joint protein representation learning by integrating a state-of-the-art PLM (ESM-2) with distinct structure encoders (GVP, GearNet, CDConv) and achieves significant improvements over existing sequence- and structure-based methods.
It is found that GP regression achieves highly competitive accuracy for these tasks while providing with well-calibrated uncertainty quantitation and improved interpretability and xGPR may be used as part of an active learning strategy to engineer a protein with a desired property in an automated way without human intervention.
AFDP is devised, an integrated approach for protein function prediction which benefits from the combination of two available tools, AHRD and eggNOG, to predict the functionality of novel proteins and produce more precise human readable descriptions by applying the stCFExt algorithm.
This paper proposes C-HMCNN(h), a novel approach for HMC problems, which, given a network h for the underlying multi-label classification problem, exploits the hierarchy information in order to produce predictions coherent with the constraint and improve performance.
This work explores the potential of metric-based meta-learning for solving few-shot graph classification and proposes a novel framework which explicitly considers global structure and local structure of the input graph.
This model successfully learns the dynamic fingerprints of proteins and provides molecular insights into protein functions, with vast untapped potential for broad biotechnology and pharmaceutical applications.
This work proposes OntoProtein, the first general framework that makes use of structure in GO (Gene Ontology) into protein pre-training models, and constructs a novel large-scale knowledge graph that consists of GO and its related proteins, and gene annotation texts or protein sequences describe all nodes in the graph.
This work proposes a framework that aims to learn the exact discrepancy between the original and the perturbed graphs, coined as Discrepancy-based Self-supervised LeArning (D-SLA), and validate it on various graph-related downstream tasks, on which the authors' largely outperforms relevant baselines.
Adding a benchmark result helps the community track progress.