1
Embeddings from protein language models predict conservation and variant effects
2
Light attention predicts protein location from the language of life
3
PredictProtein - Predicting Protein Structure and Function for 29 Years
5
Transformer protein language models are unsupervised structure learners
6
Embeddings from deep learning transfer GO annotations beyond homology
7
Big Bird: Transformers for Longer Sequences
8
Transforming the Language of Life: Transformer Neural Networks for Protein Prediction Tasks
9
Language Models are Few-Shot Learners
10
Language modelling for biological sequences – curated datasets and baselines
11
ProGen: Language Modeling for Protein Generation
12
Reformer: The Efficient Transformer
13
PyTorch: An Imperative Style, High-Performance Deep Learning Library
14
Modeling aspects of the language of life through transfer-learning protein sequences
15
Pre-Training of Deep Bidirectional Protein Sequence Representations With Structural Information
16
Improved protein structure prediction using predicted interresidue orientations
17
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
18
Unified rational protein engineering with sequence-based deep representation learning
19
ZeRO: Memory Optimization Towards Training A Trillion Parameter Models
20
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
21
Attention Interpretability Across NLP Tasks
22
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism
23
On the Variance of the Adaptive Learning Rate and Beyond
24
XLNet: Generalized Autoregressive Pretraining for Language Understanding
25
Evaluating Protein Transfer Learning with TAPE
26
A Multiscale Visualization of Attention in the Transformer Model
27
Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences
28
Generating Long Sequences with Sparse Transformers
29
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes
30
Generative Models for Graph-Based Protein Design
31
Learning protein sequence embeddings using information from structure
32
ElemCor: accurate data analysis and enrichment calculation for high-resolution LC-MS stable isotope labeling experiments
33
ProteinNet: a standardized data set for machine learning of protein structure
34
Transformer-XL: Attentive Language Models beyond a Fixed-Length Context
35
SCOPe: classification of large macromolecular structures in the structural classification of proteins—extended database
36
UniProt: a worldwide hub of protein knowledge
37
Dark Proteins Important for Cellular Function
38
Evolutionary couplings and sequence variation effect predict protein binding sites
39
A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation
40
Protein-level assembly increases protein sequence recovery from metagenomic samples manyfold
41
TFLMS: Large Model Support in TensorFlow by Graph Rewriting
42
Probabilistic variable-length segmentation of protein sequences for discriminative motif discovery (DiMotif) and sequence embedding (ProtVecX)
43
NetSurfP-2.0: improved prediction of protein structural features by integrated deep learning
44
Promotion of virus assembly and organization by the measles virus matrix protein
45
Assessment of hard target modeling in CASP12 reveals an emerging role of alignment‐based contact prediction methods
46
Horovod: fast and easy distributed deep learning in TensorFlow
47
Deep Contextualized Word Representations
48
End-to-end differentiable learning of protein structure
49
Evolutionary adaptations to new environments generally reverse plastic phenotypic changes
50
Universal Language Model Fine-tuning for Text Classification
51
DeepLoc: prediction of protein subcellular localization using deep learning
52
DeepLoc: prediction of protein subcellular localization using deep learning
53
Quantitative firing pattern phenotyping of hippocampal neuron types
54
MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets
55
Capturing non‐local interactions by long short‐term memory bidirectional recurrent neural networks for improving prediction of protein secondary structure, backbone angles, contact numbers and solvent accessibility
57
Attention is All you Need
58
In-datacenter performance analysis of a tensor processing unit
59
Clustering huge protein sequence sets in linear time
60
Sixty-five years of the long march in protein secondary structure prediction: the final stretch?
61
FastText.zip: Compressing text classification models
62
TMSEG: Novel prediction of transmembrane helices
63
Extreme Scale-out SuperMUC Phase 2 - lessons learned
64
Announcing Supercomputer Summit
65
RaptorX-Property: a web server for protein structure property prediction
66
TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems
67
Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields
68
Unexpected features of the dark proteome
69
Accurate contact predictions using covariation techniques and machine learning
70
JPred4: a protein secondary structure prediction server
71
UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches
72
GloVe: Global Vectors for Word Representation
73
Neural Machine Translation by Jointly Learning to Align and Translate
74
One billion word benchmark for measuring progress in statistical language modeling
75
Distributed Representations of Words and Phrases and their Compositionality
76
LocTree2 predicts localization for all domains of life
77
Approximate Nearest Neighbor: Towards Removing the Curse of Dimensionality
78
Three-Dimensional Structures of Membrane Proteins from Genomic Sequencing
79
Hydrophobic forces and the length limit of foldable protein domains
80
Protein 3D Structure Computed from Evolutionary Sequence Variation
81
What is the best multi-stage architecture for object recognition?
82
PSI-2: structural genomics to cover protein domain family space.
83
NVIDIA cuda software and gpu parallel computing architecture
84
Improving fold recognition without folds.
85
Protein flexibility and intrinsic disorder
86
PISCES: a protein sequence culling server
87
Enhanced protein domain discovery by using language modeling techniques from speech recognition
88
Evaluation and improvement of multiple sequence methods for protein secondary structure prediction
89
High-resolution structures of variant Zif268-DNA complexes: implications for understanding zinc finger-DNA recognition.
90
Pitfalls of protein sequence analysis.
91
Epitope Mapping by Label-Free Biomolecular Interaction Analysis
92
Combining evolutionary information and neural networks to predict protein secondary structure
93
Improved prediction of protein secondary structure by use of sequence profiles and neural networks.
94
Prediction of protein secondary structure at better than 70% accuracy.
95
Amino acid substitution matrices from protein blocks.
96
The SWISS-PROT protein sequence data bank.
98
Studies on the reduction and re-formation of protein disulfide bonds.
99
Supplementary Material of Optimal Gradient Checkpoint Search for Arbitrary Computation Graphs
102
Electra: Pre-training text encoders as discriminators rather than generators
103
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
104
Anaconda software distribution
105
Visualizing Data using t-SNE
106
Approximate Nearest Neighbors
107
The ENZYME database in 2000
109
Bridging the protein sequence-structure gap by structure predictions.
110
PHD: predicting one-dimensional protein structure by profile-based neural networks.
111
Characteristics of Sentence Length in Running Text
112
Enzyme nomenclature 1992. Recommendations of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology on the Nomenclature and Classification of Enzymes.
113
Press release announcing Supercomputer Fugaku. Technical report, RIKEN