
How do StarDrop models compare to other models?


In independent tests, the StarDrop predictive ADME models have matched or exceeded the performance of other commercially available or published models. The results for the StarDrop models are shown in the tables below. When comparing the StarDrop models with other in silico models, note that the validation results reported here are for independent test sets (see “How do you validate your models?” above).

When comparing StarDrop predictions with experimental data, it is equally important to ensure a like-for-like comparison. For example, HIA predictions cannot be compared directly with oral bioavailability data, which are affected by both absorption and first-pass clearance. Similarly, the StarDrop predictions for aqueous solubility are unlikely to correlate well with data obtained in a high-throughput kinetic assay based on dilution of DMSO compound stocks, and they will not predict the solubilities of different salt forms.

Summary of Statistical Results for Continuous QSAR models

| Model | Definition | N | R2 | RMSE |
|---|---|---|---|---|
| logP | Predicts the logarithm of the octanol/water partition coefficient for neutral compounds | 2950 | 0.92 | 0.44 |
| logD@pH7.4 | Predicts the logarithm of the octanol/buffer distribution coefficient at pH 7.4 | 257 | 0.88 | 0.67 |
| logS | Predicts the logarithm of the intrinsic aqueous solubility, S in µM, for neutral compounds | 663 | 0.82 | 0.70 |
| logS@pH7.4 | Predicts the logarithm of the solubility, S in µM, in phosphate buffered saline at pH 7.4 | 96 | 0.74 | 0.61 |
| log([brain]:[blood]) | Predicts the logarithm of the brain/blood ratio | 87 | 0.74 | 0.32 |
| hERG pIC50 | Predicts the pIC50 values for inhibition of hERG K+ channels expressed in mammalian cells | 33 | 0.72 | 0.64 |
| 2C9 pKi | Predicts the pKi values for CYP2C9 affinity | 25 | 0.62 | 0.63 |

a) N = number of compounds in the independent test set.
b) R2 gives the correlation between calculated and experimental values for the compounds in the independent test set.
c) The root mean squared error (RMSE) is the error of the calculated values relative to the experimental values for the same test set. When possible, RMSE values are calculated separately for compounds within (IN) or outside (OUT) the chemical space of the model. “unknown” is reported if there are not enough training and test compounds outside the chemical space to calculate an RMSE value.
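
As an illustration of how the two statistics in notes b) and c) relate, the following is a minimal sketch, not StarDrop's own code. It assumes that R2 is the squared Pearson correlation between calculated and experimental values; the source does not state which definition is used. The function name `r2_and_rmse` and the example values are hypothetical.

```python
import math


def r2_and_rmse(calculated, experimental):
    """Return (R2, RMSE) for paired calculated and experimental values.

    Assumption: R2 is the squared Pearson correlation coefficient.
    RMSE is the root of the mean squared difference between the pairs.
    """
    if len(calculated) != len(experimental) or len(calculated) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")

    n = len(calculated)
    mean_c = sum(calculated) / n
    mean_e = sum(experimental) / n

    # Sums of centred products and squares for the Pearson correlation.
    cov = sum((c - mean_c) * (e - mean_e) for c, e in zip(calculated, experimental))
    var_c = sum((c - mean_c) ** 2 for c in calculated)
    var_e = sum((e - mean_e) ** 2 for e in experimental)
    if var_c == 0 or var_e == 0:
        raise ValueError("correlation is undefined when one sequence is constant")

    r2 = cov * cov / (var_c * var_e)
    rmse = math.sqrt(
        sum((c - e) ** 2 for c, e in zip(calculated, experimental)) / n
    )
    return r2, rmse


# Hypothetical values: every prediction is 0.5 log units too low.
calculated = [1.0, 2.0, 3.0, 4.0]
experimental = [1.5, 2.5, 3.5, 4.5]
r2, rmse = r2_and_rmse(calculated, experimental)
print(r2, rmse)
```

In this made-up example the predictions correlate perfectly with experiment yet are uniformly offset, so a high R2 alone would hide a systematic error that the RMSE exposes. This is one reason to read the two columns together.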

Summary of Statistical Results for Classification QSAR models

| Model | Definition | N | Class | Accuracy | Specificity |
|---|---|---|---|---|---|
| HIA category | Returns a binary prediction for human intestinal absorption, based on a threshold of 30% absorbed | 245 | '-' | 66% | 91% |
| | | | '+' | 99% | 95% |
| BBB category | Returns a binary prediction for blood/brain barrier penetration | 52 | '-' | 93% | 91% |
| | | | '+' | 83% | 83% |
| P-gp category | Returns a binary prediction for P-gp transport | 51 | 'yes' | 86% | 78% |
| | | | 'no' | 68% | 79% |
| PPB category | Returns a binary prediction for human plasma protein binding, based on a threshold of 80% bound | 159 | '-' | 78% | 78% |
| | | | '+' | 77% | 77% |
| 2D6 affinity category | Returns a 4-class prediction for 2D6 affinity | 45 | | Root mean square error = 0.87 classes | |

a) N = number of compounds in the independent test set.
b) The accuracy for each class is reported as the percentage of compounds correctly classified.
c) The specificity refers to the percentage of correct classifications within the overall set of compounds predicted to be in that class.
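
The per-class statistics in notes b) and c) can be sketched as follows. This is an illustrative reading of the definitions, not StarDrop's implementation: it assumes that the accuracy for a class is the fraction of compounds observed in that class that were also predicted to be in it, and it takes the specificity exactly as defined in note c). The function name `per_class_stats` and the example labels are hypothetical.

```python
def per_class_stats(observed, predicted):
    """Return {label: (accuracy, specificity)} for paired class labels.

    accuracy    = correct / number of compounds observed in the class
                  (assumed reading of note b)
    specificity = correct / number of compounds predicted in the class
                  (as defined in note c)
    A value is None when its denominator is zero.
    """
    if len(observed) != len(predicted):
        raise ValueError("observed and predicted must have the same length")

    stats = {}
    for label in sorted(set(observed) | set(predicted)):
        n_observed = sum(1 for o in observed if o == label)
        n_predicted = sum(1 for p in predicted if p == label)
        n_correct = sum(
            1 for o, p in zip(observed, predicted) if o == label and p == label
        )
        accuracy = n_correct / n_observed if n_observed else None
        specificity = n_correct / n_predicted if n_predicted else None
        stats[label] = (accuracy, specificity)
    return stats


# Hypothetical test set of six compounds; one '-' compound is predicted '+'.
observed = ["+", "+", "+", "+", "-", "-"]
predicted = ["+", "+", "+", "+", "+", "-"]
for label, (accuracy, specificity) in per_class_stats(observed, predicted).items():
    print(label, accuracy, specificity)
```

With these made-up labels the '+' class has an accuracy of 100% but a specificity of 80%, while the '-' class has an accuracy of 50% and a specificity of 100%. The two columns therefore answer different questions: how many members of a class are found, and how far a prediction of that class can be trusted.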

Last Updated on Thursday, 20 May 2010 08:10  
