The Elements of Statistical Learning: Data Mining, Inference, and Prediction
61
The PDBbind database: collection of binding affinities for protein-ligand complexes with known three-dimensional structures.
62
ESOL: Estimating Aqueous Solubility Directly from Molecular Structure
63
Announcing the worldwide Protein Data Bank
64
Greedy function approximation: A gradient boosting machine.
65
Special Invited Paper-Additive logistic regression: A statistical view of boosting
66
Artificial neural networks for computer-based molecular design.
67
A Neural Device for Searching Direct Correlations between Structures and Properties of Chemical Compounds
68
Neural Networks in QSAR and Drug Design. Edited by J. Devillers. Volume 2 in the Series: Principles of QSAR and Drug Design. Academic Press: San Diego, 1996, 284 pp. ISBN 0-12-213815-5
69
Neural Networks in QSAR and Drug Design
70
The properties of known drugs. 1. Molecular frameworks.
71
WordNet: A Lexical Database for English
72
Support-Vector Networks
73
Neural networks in chemistry
74
SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules
75
Progress in Biophysics and Molecular Biology
76
Blum and J.-L. Reymond
77
Aggregate Analysis of ClincalTrials.gov (AACT) Database
78
Deep Learning
79
N. S.; Elemento, O. Cell Chemical Biology
80
Deep Learning as an Opportunity in Virtual Screening
81
Nature
82
ChEMBL Deposited Data Set - AZ dataset
83
Bioinformatics
84
Learning and Representation Learning Workshop (NIPS
85
DeepChem: Deep-learning models for Drug Discovery and Quantum Chemistry
Journal of chemical information and computer sciences.
103
ChemNet: A Novel Neural Network Based Method for Graph/Property Mapping
104
13) Miller, G. A. Communications of the ACM
105
Chemical research in toxicology.
106
Manuscript in preparation (58) Wallach
107
RDKit: Open-Source Cheminformatics So ware
108
Experimental in vitro DMPK and physicochemical data on a set of publicly disclosed compounds
109
Note that performances of our models might be different from values in the benchmark tables due to no limitation imposed on running time(more epochs), different random splitting patterns
110
• Original result: 0.846 ∼ 0.867 for different model structure settings
111
All model validation scripts and trained models can be found in DeepChem
112
Directed Acyclic Graph models
113
International Journal of Computer Vision (IJCV)
114
Graph Convolutional models
115
MAE in kcal/mol: • Original result: 0.93 ± 0.02 with 2 DTNN layers and 100,000 training samples
116
36) AIDS Antiviral Screen Data. https://wiki.nci.nih.gov/display
117
RMSE in logS(log solubility in mol per litre): • Original
118
AIDS Antiviral Screen Data
119
Medical Dictionary for Regulatory Activities
120
Quantum Machine
121
• Reimplementation: 0.68 for valid subset, 0.58 for test subset Weave models
We evaluate the model on ESOL dataset, note that we provide performances based on a 80/10/10 random train, valid, test splitting, while the original paper reported performance under cross validation