Research Connect
Research PapersAboutContact

Geometric deep learning of RNA structure

Published in Science (2021-08-27)
aionlincourseaionlincourseaionlincourseaionlincourse
Generate GraphDownload

On This Page

  • TL;DR
  • Abstract
  • Authors
  • Datasets
  • References
TL

TL;DR

A machine learning approach is introduced that enables identification of accurate structural models without assumptions about their defining characteristics, despite being trained with only 18 known RNA structures, and the resulting scoring function, the Atomic Rotationally Equivariant Scorer (ARES), substantially outperforms previous methods and consistently produces the best results in community-wide blind RNA structure prediction challenges.

Abstract

Machine learning solves RNA puzzles RNA molecules fold into complex three-dimensional shapes that are difficult to determine experimentally or predict computationally. Understanding these structures may aid in the discovery of drugs for currently untreatable diseases. Townshend et al. introduced a machine-learning method that significantly improves prediction of RNA structures (see the Perspective by Weeks). Most other recent advances in deep learning have required a tremendous amount of data for training. The fact that this method succeeds given very little training data suggests that related methods could address unsolved problems in many fields where data are scarce. Science, abe5650, this issue p. 1047; see also abk1971, p. 964 A machine learning method significantly improves scoring of RNA structural models, despite being trained on very few structures. RNA molecules adopt three-dimensional structures that are critical to their function and of interest in drug discovery. Few RNA structures are known, however, and predicting them computationally has proven challenging. We introduce a machine learning approach that enables identification of accurate structural models without assumptions about their defining characteristics, despite being trained with only 18 known RNA structures. The resulting scoring function, the Atomic Rotationally Equivariant Scorer (ARES), substantially outperforms previous methods and consistently produces the best results in community-wide blind RNA structure prediction challenges. By learning effectively even from a small amount of data, our approach overcomes a major limitation of standard deep neural networks. Because it uses only atomic coordinates as inputs and incorporates no RNA-specific information, this approach is applicable to diverse problems in structural biology, chemistry, materials science, and beyond.

Authors

Raphael J. L. Townshend

1 Paper

Stephan Eismann

1 Paper

A. Watkins

1 Paper

References58 items

1

Deep Learning

2

Improved protein structure prediction using potentials from deep learning

MedicineComputer Science
3

Boltzmann generators: Sampling equilibrium states of many-body systems with deep learning

4

Electronic Reprint Biological Crystallography the Protein Data Bank Biological Crystallography the Protein Data Bank

5

Hierarchical, rotation‐equivariant neural networks to select structural models of protein complexes

Research Impact

234

Citations

58

References

0

Datasets

7

R. Rangan

1 Paper

Maria Karelina

1 Paper

Rhiju Das

1 Paper

R. Dror

1 Paper

6

Protein-Ligand Scoring with Convolutional Neural Networks

7

ARES-specific adaptation of E3NN

8

Training code for ARES neural network

9

Rfam 14: expanded coverage of metagenomic, viral and microRNA families

10

Accelerated cryo-EM-guided determination of three-dimensional RNA-only structures

11

RNA-Puzzles Round IV: 3D structure predictions of four ribozymes and two aptamers

12

High-affinity recognition of specific tRNAs by an mRNA anticodon-binding groove

13

Evaluating DCA-based method performances for RNA contact prediction by a well-curated data set

14

Structural basis of amino acid surveillance by higher-order tRNA-mRNA interactions

15

FARFAR2: Improved de novo Rosetta prediction of complex global RNA folds

16

Crystal structure of an adenovirus virus-associated RNA

17

A^2-Net: Molecular Structure Estimation from Cryo-EM Density Volumes

18

RNA3DCNN: Local and global quality assessments of RNA 3D structures using 3D deep convolutional neural networks

19

Principles for targeting RNA with drug-like small molecules

20

End-to-end differentiable learning of protein structure

21

Biological Sequence Modeling with Convolutional Kernel Networks

22

RNA Structure: Advances and Assessment of 3D Structure Prediction.

23

Design of RNAs: comparing programs for inverse RNA folding

24

Transcending the prediction paradigm: novel applications of SHAPE to RNA function and evolution

25

SimRNA: a coarse-grained method for RNA folding simulations and 3D structure prediction

26

3dRNAscore: a distance and torsion angle dependent evaluation function of 3D RNA structures

27

Structural analysis of adenovirus VAI RNA defines the mechanism of inhibition of PKR.

28

T box riboswitches in Actinobacteria: Translational regulation via novel tRNA interactions

29

The purine riboswitch as a model system for exploring RNA biology and chemistry.

30

Direct evaluation of tRNA aminoacylation status by the T-box riboswitch using tRNA-mRNA stacking and steric readout.

31

The Noncoding RNA Revolution—Trashing Old Rules to Forge New Ones

32

Assessing the utility of coevolution-based residue–residue contact predictions in a sequence- and structure-rich era

33

An Integrated Encyclopedia of DNA Elements in the Human Genome

34

Protein 3D Structure Computed from Evolutionary Sequence Variation

35

All-atom knowledge-based potential for RNA structure prediction and assessment

36

PyRosetta: a script-based interface for implementing molecular modeling algorithms using Rosetta

37

Atomic accuracy in predicting and designing non-canonical RNA structure

38

New metrics for comparing and assessing discrepancies between RNA 3D structures and models.

39

Automated de novo prediction of native-like RNA tertiary structures

40

CONTRAfold: RNA secondary structure prediction without physics-based models

41

3DNA: a software package for the analysis, rebuilding and visualization of three-dimensional nucleic acid structures.

42

Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit

43

Auxiliary code related to the publication “Geometric Deep Learning of RNA Structure,

44

Structural data used to test a new geometric deep learning RNA scoring function emulating fully de novo modeling conditions

45

Structural data used to train, test, and characterize a new geometric deep learning RNA scoring function

46

arXiv:1802.08219 [cs.LG]

47

Computational Methods for RNA Structure Validation and Improvement.

48

Nonredundant 3D Structure Datasets for RNA Knowledge Extraction and Benchmarking

49

ENCODE Project Consortium

50

Spherical Tensor Calculus for Local Adaptive Filtering

51

On robust estimation of the location parameter

52

Materials and methods are available as supplementary materials

53

Thomas for discussions and advice. Funding: Funding was provided by National Science Foundation Graduate Research Fellowships

54

MDAR Reproducibility Checklist View/request a protocol for this paper from Bio-protocol

55

J. Behler, M. Parrinello, Phys. Rev. Lett. 98 , 146401 (2007).

56

); the Army Research Office Multidisciplinary University Research Initiative program (R.D.); the U.S. Department of Energy

57

View/request a protocol for this paper from Bio-protocol 31 August 2020; accepted 14 July 2021

58

); and National Institutes of Health grants R21CA219847 (R.D.) and R35GM122579 (R.D.). Author contributions: R

Authors

Field of Study

Medicine

Journal Information

Name

Science

Volume

373

Venue Information

Name

Science

Type

journal

URL

https://www.jstor.org/journal/science