| Version: | 1.2 |
| Name: | (Q)SAR Model Reporting Format |
| Author: | European Chemicals Bureau |
| Date: | July 2007 |
| Contact: | Joint Research Centre, European Commission |
| e-mail: | qsardb@jrc.it |
| www: | http://ecb.jrc.ec.europa.eu/qsar/ |
QSAR for algae toxicity of benzene derivatives
QSARModel 4.0.3 Molcode Ltd., Turu 2, Tartu, 51014, Estonia
http://www.molcode.com
07.12.2009
Indrek Tulp
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
Tarmo Tamm
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
Gunnar Karelson
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
Dimitar Dobchev
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
Dana Martin
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
Kaido Tämm
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
Deniss Savchenko
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
Jaak Jänes
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
Eneli Härk
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
Andres Kreegipuu
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
Mati Karelson
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
Molcode model development team
Molcode Ltd.
Turu 2, Tartu, 51014, Estonia
models@molcode.com
http://www.molcode.com
09.01.2010
Karelson M, Dobchev D, Tamm T, Tulp I, Jänes J, Tämm K, Lomaka A, Savchenko D & Karelson G (2008). Correlation of blood-brain penetration and human serum albumin binding with theoretical descriptors. ARKIVOC 16, 38-60.
Karelson M, Karelson G, Tamm T, Tulp I, Jänes J, Tämm K, Lomaka A, Savchenko D & Dobchev D (2009). QSAR study of pharmacological permeabilities. ARKIVOC 2, 218–238.
Model is proprietary, but the training and test sets are available.
None to date.
Chlorella vulgaris
3.Ecotoxic effects. . 3.2.Short-term toxicity to algae (inhibition of the exponential growth rate)
EU testing method C.3. The EC50 is the concentration (mM) that induces toxicity response halfway between the baseline and maximum after 15 min.
mM
log(1/EC50)
Toxicity data [log(1/EC50)] were determined in a biochemical assay utilizing the unicellular alga C. vulgaris. Algae in the logarithmic phase of their growth cycle were used. All toxicological analyses were performed in a buffer solution with a pH of 6.9 and a temperature between 25 and 30 °C. Assays were conducted following the protocol described by Worgan et al. (2003) with a 15 min static design. The disappearance of FDA was accounted for by spectrofluorimetric measurement of fluorescein (the product of hydrolysis) (Leszczynska & Oleszkiewic 1996) at an excitation wavelength of 465 nm and an emission wavelength of 515 nm. Range-finding experiments were performed in order to determine the highest and lowest concentrations required to produce a dose-response relationship ranging from 100% inhibition of enzyme activity to no observed toxicological effect. Blank buffer solution was utilized as a control, and the relative responses to it were used to generate the dose-response curve. The 50% effective concentration was estimated by Probit analysis using the SPSS ver. 10.0 (SPSS Inc., Chicago, IL) software. The average EC50 was taken from a minimum of three analyses.
The toxicity data are taken from one publication (Cronin et al, 2004) to ensure consistency. The data were generated in on one lab and in one experimental series.
Statistics: max value: 3.1 min value: -4.06 standard deviation: 1.465 skewness: -0.422
QSAR
Multilinear regression QSAR
Multilinear regression QSAR derived with BMLR (Best Multiple Linear Regression) method
log(1/EC50) = -3.532 +0.371*Kier&Hall index (order 0) +1.233*Number of benzene rings -23.698*Difference (Pos - Neg) in Charged Part of Partial Charged Surface Area (Zefirov) -22.954*HA dependent HDCA-2/SQRT(TMSA) (Zefirov) (all)
Kier&Hall index (order 0), unitlesszero order Kier and Hall valence connectivity index
Number of benzene rings, unitlessCount of benzene rings in the molecule
Difference (Pos - Neg) in Charged Part of Partial Charged Surface Area (Zefirov), Å2total difference between the charged positive and negative charged surface areas
HA dependent HDCA-2/SQRT(TMSA) (Zefirov) (all), auArea-weighted surface charge of hydrogen bonding donor atoms
Initial pool of ~1000 descriptors. Stepwise descriptor selection based on a set of statistical selection rules:
one-parameter equations: Fisher criterion and R2 over threshold, variance and t-test value over threshold, intercorrelation with another descriptor not over threshold;
two-parameter equations: intercorrelation coefficient below threshold, significant correlation with endpoint, in terms of correlation coefficient and t-test.
Stepwise trial of additional descriptors not significantly correlated to any already in the model.
1D, 2D, and 3D theoretical calculations. Quantum chemical descriptors derived from AM1 calculation. Model developed by using multilinear regression.
QSARModel 4.0.3
Molcode Ltd, Turu 2, Tartu, 51014, Estonia
http://www.molcode.com
18.25 (73 chemicals /4 descriptors)
Applicability domain based on training set:
a) by chemical identity: benzene derivatives with one aromatic core
b) by descriptor value range: The model is suitable for compounds that have the descriptors in the following minimal-maximal ranges:
Kier&Hall index (order 0): 1.45 - 13.9
Number of benzene rings: 0 - 2
Difference (Pos - Neg) in Charged Part of Partial Charged Surface Area (Zefirov): -0.0593 - 0.00616
HA dependent HDCA-2/SQRT(TMSA) (Zefirov) (all): 0 - 0.0655
Range of descriptor values in training set with ±30% confidence. Descriptor values must fall between maximal and minimal descriptor values of training set ±30%.
QSARModel 4.0.3
Molcode Ltd, Turu 2, Tartu, 51014, Estonia
http://www.molcode.com
See 5.1
Yes
Chemname:Yes
SMILES:No
CAS RN:Yes
InChI:No
MOL file:Yes
Formula:Yes
All
All
73 data points: 34 negative values; 39 positive values
R2 = 0.921 (Correlation coefficient)
s2 = 0.427 (Standard error of the estimate)
F = 197.8 (Fisher function)
R2CV = 0.904
R2CVMO = 0.903
ABC analysis (2:1 training : prediction) on sorted (in increased order of endpoint value) data divided into 3 subsets (A;B;C). Training set formed with 2/3 of the compounds (set A+B, A+C, B+C) and validation set consisted of 1/3 of the compounds (C, B, A).
average R2 (fitting) = 0.923
average R2 (prediction) = 0.900
Yes
Chemname:Yes
SMILES:No
CAS RN:Yes
InChI:No
MOL file:Yes
Formula:Yes
All
All
18 data points: 9 negative values; 9 positive values
From sorted data each 5th was subjected to the test set starting from 3rd in order to assure the equality in distribution tails.
R2 = 0.887 (Coefficient of determination)
Descriptor value range (all in range of applicability domain):
Kier&Hall index (order 0): 3.57 - 12.9
Number of benzene rings: 0 - 2
Difference (Pos - Neg) in Charged Part of Partial Charged Surface Area (Zefirov): -0.0236 - 0.00995
HA dependent HDCA-2/SQRT(TMSA) (Zefirov) (all): 0.00548 - 0.0543
The validation coefficient of determination (R2) is close to the coefficients of internal validation (R2CV and R2CVMO).
Descriptors "Kier&Hall index (order 0)" and "Number of benzene rings" define a non-polar narcosis. They cover the toxicity baseline that is usually modelled with logP. The descriptors "Difference (Pos - Neg) in Charged Part of Partial Charged Surface Area (Zefirov)" and "HA dependent HDCA-2/SQRT(TMSA) (Zefirov) (all)" are related to the reactivity of the compounds and they represent the polar narcosis part of the toxicity.
A posteriori mechanistic interpretation, consistent with published scientific interpretations of experiments.
Provided in Cronin et al (2004)
Data taken from Cronin et al.(2004)
Cronin MTD, Netzeva TI, Dearden JC, Edwards R & Worgan ADP (2004). Assessment and Modeling of the Toxicity of Organic Chemicals to Chlorella vulgaris: Development of a Novel Database. Chemical Research in Toxicology 17, 545–554.
Worgan ADP, Dearden JC, Edwards R, Netzeva TI & Cronin MTD (2003). Evaluation of a novel short-term algal toxicity assay by the development of QSARs and inter-species relationships for narcotic chemicals. QSAR & Combinatorial Science 22, 204-209.
Leszczynska M and Oleszkiewic JA (1996). Application of fluorescein diacetate hydrolysis as an acute toxicity test. Environmental Technology 17, 79-85
Training data set Acute_toxicity_algae_73_trainingsetValidation data set Acute_toxicity_algae_18_testsetOther documents
Molcode, algae toxicity of benzene derivatives, Chlorella vulgaris