Predicting error bars for QSAR models

Timon Schroeter, Anton Schwaighofer, Sebastian Mika, Antonius Ter Laak, Detlev Suelzle, Ursula Ganzer, Nikolaus Heinrich, Klaus Robert Müller

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Unfavorable physicochemical properties often cause drug failures. It is therefore important to take lipophilicity and water solubility into account early on in lead discovery. This study presents log D7 models built using Gaussian Process regression, Support Vector Machines, decision trees and ridge regression algorithms based on 14556 drug discovery compounds of Bayer Schering Pharma. A blind test was conducted using 7013 new measurements from the last months. We also present independent evaluations using public data. Apart from accuracy, we discuss the quality of error bars that can be computed by Gaussian Process models, and ensemble and distance based techniques for the other modelling approaches.

Original languageEnglish
Title of host publicationCompLife 2007 - 3rd International Symposium on Computational Life Science
Pages158-167
Number of pages10
DOIs
Publication statusPublished - 2007
Externally publishedYes
Event3rd International Symposium on Computational Life Science, CompLife 2007 - Utrecht, Netherlands
Duration: 2007 Oct 42007 Oct 5

Publication series

NameAIP Conference Proceedings
Volume940
ISSN (Print)0094-243X
ISSN (Electronic)1551-7616

Other

Other3rd International Symposium on Computational Life Science, CompLife 2007
Country/TerritoryNetherlands
CityUtrecht
Period07/10/407/10/5

ASJC Scopus subject areas

  • Physics and Astronomy(all)

Fingerprint

Dive into the research topics of 'Predicting error bars for QSAR models'. Together they form a unique fingerprint.

Cite this