J Mol Model (2014) 20:2508 DOI 10.1007/s00894-014-2508-x

ORIGINAL PAPER

Quantile regression model for a diverse set of chemicals: application to acute toxicity for green algae Jonathan Villain & Sylvain Lozano & Marie-Pierre Halm-Lemeille & Gilles Durrieu & Ronan Bureau

Received: 25 July 2014 / Accepted: 20 October 2014 / Published online: 29 November 2014 # Springer-Verlag Berlin Heidelberg 2014

Abstract The potential of quantile regression (QR) and quantile support vector machine regression (QSVMR) was analyzed for the definitions of quantitative structure-activity relationship (QSAR) models associated with a diverse set of chemicals toward a particular endpoint. This study focused on a specific sensitive endpoint (acute toxicity to algae) for which even a narcosis QSAR model is not actually clear. An initial dataset including more than 401 ecotoxicological data for one species of algae (Selenastrum capricornutum) was defined. This set corresponds to a large sample of chemicals ranging from classical organic chemicals to pesticides. From this original data set, the selection of the different subsets was made in terms of the notion of toxic ratio (TR), a parameter based on the ratio between predicted and experimental values. The robustness of QR and QSVMR to outliers was clearly observed, thus demonstrating that this approach represents a major interest for QSAR associated with a diverse set of chemicals. We focused particularly on descriptors related to molecular surface properties.

Keywords Algae species . Ecotoxicology . Molecular surface . Outliers . Quantile regression . Support vector machine J. Villain : S. Lozano : M.

Quantile regression model for a diverse set of chemicals: application to acute toxicity for green algae.

The potential of quantile regression (QR) and quantile support vector machine regression (QSVMR) was analyzed for the definitions of quantitative stru...
955KB Sizes 0 Downloads 6 Views