Page de résumé pour/Title page for BelnUcetd-01032008-172816


Type de document Thèse/Dissertation
Auteur Le Bailly de Tilleghem, Céline
Adresse e-mail de l'auteur celine.lebailly@uclouvain.be
URN BelnUcetd-01032008-172816
Langue Anglais/English
Titre Statistical contribution to the virtual multicriteria optimisation of combinatorial molecules libraries and to the validation and application of QSAR models
Intitulé du diplôme STAT 3 - Doctorat en sciences (statistique)
Département/Domaine EUEN/STAT - Institut de statistique
Adresse e-mail du département
Jury
Nom Titre
Beck, Benoît Membre du jury/Committee Member
Lambert, Philippe Membre du jury/Committee Member
Weihs, Claus Membre du jury/Committee Member
von Sachs, Rainer Président du jury/Committee Chair
Govaerts, Bernadette Promoteur/Director
Simar, Léopold Promoteur/Director
Mots-clés
  • Desirability
  • Quantitative structure-activity relationship
  • Screening algorithm
  • Uncertainty propagation
  • Multicriteria optimisation
  • Lead optimisation
  • Delta method
  • Combinatorial library
Date de défense 2008-01-07
Type d'accès unrestricted
Résumé
This thesis develops an integrated methodology based on the desirability index and QSAR models to virtually optimise molecules. Statistical and algorithmic tools are proposed to search in huge collections of compounds obtained by combinatorial chemistry the most promising ones.

First, once the drugability properties of interest have been precisely defined, QSAR models are developed to mimic the relationship between those optimised properties and chemical descriptors of molecules. The literature on QSAR models is reviewed and the statistical tools to validate the models, analyse their fit and their predictive power are detailed.

Even if a QSAR model has been validated and sounds highly predictive, we emphasise the importance of measuring extrapolation by the definition of its applicability domain and quantifying the prediction error for a given molecule. Indeed, QSAR models are often massively applied to predict drugability properties for libraries of new compounds without taking care of the reliability of each individual prediction.

Then, a desirability index measures the compromise between the multiple estimated drugability properties and allows to rank the molecules in the combinatorial library in preference order. The propagation of the models prediction error on the desirability index is quantified by a confidence interval that can be constructed under general conditions for linear regression, PLS regression or regression tree models. This fulfills an important lack of the desirability index literature that considers it as exact.

Finally, a new efficient algorithm (WEALD) is proposed to virtually screen the combinatorial library and retain the molecule with the highest desirability indexes.

For each explored molecule, it is checked if it belongs to the applicability domain of each QSAR models.

In addition, the uncertainty of the desirability index of each explored molecule is taken into account by gathering molecules that can not be distinguished from the optimal one due to the propagation of QSAR models prediction error. Those molecules do not have a significantly smaller desirability than the optimal molecule found by WEALD.

This constitutes another important improvement in the use of desirability index as a tool to compare solutions in a multicriteria optimisation problem.

This integrated methodology has been developed in the context of lead optimisation and is illustrated on a real combinatorial library provided by Eli Lilly and Company. This is the main application of the thesis. Nevertheless, as the results on desirability index uncertainty are applicable under general conditions, they can be applied to any multicriteria optimisation problem, like it often occurs in industry.

Autre version
Fichiers
  Nom du fichier       Taille       Temps de chargement évalué (Heures:Minutes:Secondes) 
 
 28.8 Modem   56K Modem   Acces haute-vitesse 
  LeBaillyElectonicThesis.pdf 13.78 Mb 01:03:49 00:32:49 00:01:13

Parcourir toutes les thèses répertoriées par ( Auteur | Département )

Pour d'autres questions ou tout problème technique contacter theses-sceb@listes.uclouvain.be.