header image
Home arrow Tutorial arrow Statistical Molecular Design, part 3
Statistical Molecular Design, part 3 PDF Print E-mail
Written by Lennart Eriksson, Per M Andersson, Erik Johansson and Torbjörn Lundstedt   
The design of a training set intended for QSAR according to the principles of SMD, makes it possible to explore - in a systematic way - the structure-activity relationships within the data-set in question. By interrogating the resulting QSAR model it is then possible to extract clues of how to modify the chemical properties of the compounds in order to possibly enhance their biological activity profile. Particularly, it is of interest to compute predictions of biological properties of compounds which have not yet been fabricated. This process is sometimes known as virtual screening. An important benefit of QSAR-based virtual screening, is that the QSAR model itself constitutes a navigation tool. One need not make predictions for all possible combinatorial options in a molecular structure. Rather, the QSAR-model can be used to direct the virtual screening towards inducing changes only in the substituents and moieties it finds important. In order to illustrate the concept of QSAR-directed virtual screening, we will review a data set of hexapeptides for which the training set was designed according to SMD. Prior to relating this story, though, we will need to review the basic principles underpinning peptide QSAR, and particularly the concept of the "z-scales".
The two preceding tutorials here at Chemometrics.se (see previous tutorials) have concerned the use of statistical molecular design (SMD) in the design of sets of representative, informative and diverse molecules. SMD is an efficient tool to accomplish a lead-centered design in drug discovery and design. In so doing, the SMD-protocol is actually used to develop a new series of molecules. This is in sharp contrast to the situation often prevailing within e.g. environmental chemistry and toxicology, where QSAR-techniques are utilized to select sub-sets of representative compounds.
 


< Prev   Next >
Search website
Editorial flash
Editorials at Chemometrics.se
Welcome to the new Editorial section at Chemometrics.se. In the past, the Tutorial and Editorial sections were merged. Now, these have been separated.
Read more...
News flash
Metabolomics 2010 meeting

www.metabolomics2010.com

June 27- July 1, 2010

Amsterdam, The Netherlands

Last chance for Abstract submission for ORAL presentations !!

The deadline for submitting abstracts for oral presentations is approaching rapidly! You have until Friday 23rd April.
We encourtage you all to submit your proposed contributions by then.
After this date you can however still send in abstracts for posters.
As we have a limited space for just 400 posters we encourage everyone to submit their abstracts as soon as possible in order not to miss out.

To register and send in your abstracts for talks and posters click here

The Local Organisers

Thomas Hankemeier and Robert Hall

 

 

Tutorial flash
Selection of subsets of variables in linear regression
Selection of variables in linear regression is one of the most studied subjects in theoretical statistics. In applied sciences there is also considerable interest in the topic. The popular program packages like SAS, SPSS, BMDP and others have advanced programs to select variables that should be used in the modeling work.
Read more...