Introduction to 3D-QSAR

With the advancement of computational resources, there is a gradual uplifting of the used dimensions of quantitative structure–activity relationship (QSAR) descriptors. The two-dimensional (2D) and lower-dimensional models suffer from various drawbacks that led to the introduction of 3D-QSAR. This approach has been enhanced with significant advancements in order to study multiple three-dimensional (3D) features of chemicals, establishing a correlation between structure and biological activity. The 3D-QSAR techniques are broadly divided into alignment-based methods [comparative molecular field analysis (CoMFA), self-organizing molecular field analysis (SOMFA), comparative molecular similarity indices analysis (CoMSIA), receptor surface analysis (RSA), and molecular shape analysis (MSA)] and alignment-independent methods [comparative molecular moment analysis (CoMMA), weighted holistic invariant molecular (WHIM) descriptor analysis, VolSurf, Compass, comparative spectral analysis (CoSA), grid-independent descriptors (GRIND)]. The fundamental concept, methodology, and limitations of some of the major approaches are discussed in this chapter to give an overview of this topic.

Keywords

3D-QSAR; comparative molecular field analysis (CoMFA); comparative molecular moment analysis (CoMMA); comparative molecular similarity indices analysis (CoMSIA); comparative spectral analysis (CoSA); molecular shape analysis (MSA); receptor surface analysis (RSA); self-organizing molecular field analysis (SOMFA); weighted holistic invariant molecular (WHIM)

Contents

8.1 Introduction 292

8.2 Comparative Molecular Field Analysis 293

8.2.1 Concept of CoMFA 293

8.2.2 Methodology of CoMFA 294

8.2.3 Factors responsible for the performance of CoMFA 295

8.2.3.1 Biological data 295

8.2.3.2 Optimization of 3D structure of the compounds 296

8.2.3.3 Conformational analysis of compounds 297

8.2.3.4 Determination of bioactive conformations 298

8.2.3.5 Alignment of molecules 299

8.2.3.6 Calculation of molecular interaction energy fields 300

8.2.3.7 Model generation 300

8.2.4 Display and interpretation of results 301

8.2.5 Advantages and drawbacks of CoMFA 301

8.3 Comparative Molecular Similarity Indices Analysis 302

8.3.1 Concept of comparative molecular similarity indices analysis 302

8.3.2 Methodology of CoMSIA 302

8.3.3 Advantages of CoMSIA 303

8.4 Molecular Shape Analysis 304

8.4.1 Concept of molecular shape analysis 304

8.4.2 Methodology of the MSA 305

8.4.3 MSA descriptors 306

8.5 Receptor Surface Analysis 307

8.5.1 Concept of receptor surface analysis 307

8.5.2 Methodology of the RSA 307

8.5.3 RSA descriptors 307

8.6 Other Approaches 308

8.6.1 Alignment-based 3D-QSAR model 308

8.6.1.1 Self-organizing molecular field analysis 308

8.6.1.2 Voronoi field analysis 309

8.6.1.3 Molecular quantum similarity measures 310

8.6.1.4 Adaptation of the fields for molecular comparison 310

8.6.1.5 Genetically evolved receptor modeling 311

8.6.1.6 Hint interaction field analysis 312

8.6.2 Alignment-independent QSAR model 312

8.6.2.1 Comparative molecular moment analysis 312

8.6.2.2 Weighted holistic invariant molecular descriptor analysis 313

8.6.2.3 VolSurf 313

8.6.2.4 Compass 313

8.6.2.5 GRID 314

8.6.2.6 Comparative spectral analysis 315

8.6.2.7 Quantum chemical parameters in QSAR analysis 315

8.7 Overview and Conclusions 315

References 316

8.1 Introduction

The basic principle of a quantitative structure–activity relationship (QSAR) study is that the deviations in biological response among a series of compounds are accountable for the differences in the structural properties. In the classical QSAR studies, biological responses have been correlated with atomic, group, or molecular properties such as lipophilicity, polarizability, electronic, and steric properties (Hansch analysis) or with certain structural features (Free–Wilson analysis). However, in these techniques, one cannot ignore their limited utility for designing diverse functional new molecules due to the lack of consideration of the three-dimensional (3D) structures of the molecules. As a consequence, 3D-QSAR has emerged as a natural extension to the classical Hansch and Free–Wilson approaches that exploits the 3D properties of the ligands to predict their biological response by employing robust chemometric tools. The 3D-QSAR is a broad term encompassing all those QSAR methods that correlate macroscopic target properties with computed atom-based descriptors derived from the spatial representation of the molecular structures. These approaches have served as a valuable predictive tool in the design of pharmaceuticals and agrochemicals [1–3].

The prime goal of any 3D-QSAR method is to establish the relationship between biological activity and spatial properties of chemicals like steric, electrostatic, and lipophilic ones. The 3D-QSAR methodology is computationally more exhaustive and complex than 2D-QSAR approaches. Normally, it consists of several steps to acquire numerical descriptors from the compound structures:

1. The optimum (near bioactive) conformation of the compound has to be determined, either from experimental data (X-ray crystal structure or NMR) or a theoretical tool like molecular mechanics, and then optimization of the energy has to be performed.

2. An alignment of the conformers in the data set has to be generated in 3D-space.

3. The space with an immersed conformer is probed computationally for generating various descriptors.

4. Finally, the computed descriptors should be correlated with the experimental biological response of the studied compounds.

It is interesting to point out that some methods, independent of the alignment strategy, have also been developed with the progress of 3D-QSAR approaches [4].

One has to understand that the QSAR model is not a substitute for the experimental assays, although experimental techniques are also not free of inaccuracies. However, QSAR researchers are trying to develop a model that is as close as possible to the real one, and for this purpose, the 3D-QSAR techniques have to rely on some basic assumptions, which are illustrated here:

• Binding of a drug molecule or ligand with the receptor is considered directly related to the biological response. Effects on second messengers or other signaling effects between receptor binding and experimentally observed response are not normally considered.

• Molecular properties (physical, chemical, and biological) are encoded with a set of numbers or descriptors.

• It is believed in general that compounds with common structures have comparable properties, and thus they have similar binding modes and accordingly equivalent biological activities and vice versa.

• Structural properties leading to a biological response are usually determined by nonbonding forces, mainly steric and electrostatic ones.

• Another important assumption is that the biological response is shown by the ligand itself, not by its metabolite product.

• The lowest-energy conformation of the ligand is its bioactive conformation, which exerts binding effects.

• The geometry of the receptor binding site is considered rigid, though there are a few exceptions.

• The loss of translational and rotational degrees of freedom (entropy) upon binding is believed to follow a similar pattern for all these compounds.

• The protein binding site is assumed to be the same for all of the studied ligands.

• The major factors that contribute to the overall free energy of binding, like desolvation energy, temperature, diffusion, transport, pH, salt concentration, and plasma protein binding, are difficult to identify and thus are generally ignored.

The 3D-QSAR methods can be classified based on a variety of criteria, as given in Table 8.1. Most commonly and successfully employed 3D-QSAR methods are discussed in the following sections of this chapter.

Table 8.1

Categorization of 3D-QSAR techniques

Basis of classification	Type	Examples of techniques
Based on employed chemometric techniques	Linear	CoMFA, CoMSIA, AFMoC, GERM, CoMMA, SoMFA
Based on employed chemometric techniques	Nonlinear	Compass
Based on the alignment criterion	Alignment-dependent	CoMFA, CoMSIA, MSA, RSA, GERM, AFMoC, HIFA, VFA, MQSM
Based on the alignment criterion	Alignment-independent	Compass, CoMMA, HQSAR, WHIM, GRIND, VolSurf, CoSA
Based on intermolecular modeling or the information employed to develop QSAR	Ligand-based	CoMFA, CoMSIA, MSA, RSA, Compass, GERM, CoMMA, SoMFA
	Receptor-based	AFMoC, HIFA

8.2 Comparative Molecular Field Analysis

8.2.1 Concept of CoMFA

Comparative molecular field analysis (CoMFA) is a molecular field–based, alignment-dependent, ligand-based method developed by Cramer et al. [5], which helps in building the quantitative relationship of molecular structures and its response property. The method mostly focuses on ligand properties like steric and electrostatic ones, and the resulting favorable and unfavorable receptor–ligand interactions. As CoMFA is an alignment-dependent, descriptor-based method, all aligned ligands are placed in an energy grid, and by placing an appropriate probe at each lattice point, energy is calculated. The resultant energy calculated at each unit fraction corresponds to electrostatic (Coulombic) and steric (van der Waals) properties. These computed values serve as descriptors for model development. These descriptor values are then correlated with biological responses employing a robust linear regression method like partial least squares (PLS). The PLS results serve as an important signal to identify the favorable and unfavorable electrostatic and steric potential and also correlate it with biological responses.

8.2.2 Methodology of CoMFA

The formalism of the CoMFA methodology is described next:

a. Structures of all molecules are drawn using any structure-drawing software.

b. The bioactive conformation of each molecule is generated and energy minimization is carried out.

c. All the molecules are superimposed or aligned using either manual or automated methods employed in the working software, in a manner defined by the supposed mode of interaction with the receptor.

d. Thereafter, the overlaid compounds are positioned in the center of a lattice grid with a spacing of 2 Å.

e. In the 3D space, the steric and electrostatic fields are calculated around the molecules with different probe groups positioned at all intersections of the lattice. Computation of the steric field uses the Lennard–Jones equation as follows:

V=LJ4ε[(σr)12−(σr)6]=ε[(rmr)12−2(rmr)6] (8.1)

(8.1)

In Eq. (8.1), ε is the depth of the potential well, σ is the finite distance at which the interparticle potential is zero, r is the distance between the particles, and r_m is the distance at which the potential reaches its minimum. At r_m, the potential function has the value −ε. The distances are given as r_m=2^1/6σ.
Again, computation of electrostatic field follows the Coulombic interaction equation as follows:

E=[q1q24πεr] (8.2)

(8.2)

where q₁ and q₂ denote point charges, r is the distance between charges, and ε is the dielectric constant of the medium.

f. The interaction energy or field values forming a pool of the descriptor/variable matrix are correlated with the biological response data employing the PLS technique, which identifies and extracts the quantitative influence of specific features of molecules on their activity.

g. The results may be expressed as correlation equations with the number of latent variable terms, each of which is a linear combination of original independent lattice descriptors.

h. For visual interpretation, the PLS output is illustrated in the form of interactive graphics consisting of colored contour plots of coefficients of the corresponding field variables at each lattice intersection, and showing the imperative favorable and unfavorable regions in the 3D space, which are closely associated with the biological activity.

The CoMFA formalism is schematically illustrated in Figure 8.1.

Figure 8.1 Fundamental steps of the CoMFA methodology.

8.2.3 Factors responsible for the performance of CoMFA

There are diverse factors that can control the complete performance of the constructed CoMFA model. These are described in the next sections.

Stay updated, free articles. Join our Telegram channel

Tags: Understanding the Basics of QSAR for Applications in Pharmaceutic

Jul 18, 2016 | Posted by admin in PHARMACY | Comments Off

Basicmedical Key

Fastest Basicmedical Insight Engine

Introduction to 3D-QSAR

Introduction to 3D-QSAR

Keywords

8.1 Introduction

8.2 Comparative Molecular Field Analysis

8.2.1 Concept of CoMFA

8.2.2 Methodology of CoMFA

8.2.3 Factors responsible for the performance of CoMFA

8.2.3.1 Biological data

8.2.3.2 Optimization of 3D structure of the compounds

8.2.3.3 Conformational analysis of compounds

8.2.3.4 Determination of bioactive conformations

8.2.3.4.1 X-ray crystallography

8.2.3.4.2 NMR spectroscopy

Like this:

Related

Stay updated, free articles. Join our Telegram channel

Full access? Get Clinical Tree

Basicmedical Key

Fastest Basicmedical Insight Engine

Introduction to 3D-QSAR

Introduction to 3D-QSAR

Keywords

8.1 Introduction

8.2 Comparative Molecular Field Analysis

8.2.1 Concept of CoMFA

8.2.2 Methodology of CoMFA

8.2.3 Factors responsible for the performance of CoMFA

8.2.3.1 Biological data

8.2.3.2 Optimization of 3D structure of the compounds

8.2.3.3 Conformational analysis of compounds

8.2.3.4 Determination of bioactive conformations

8.2.3.4.1 X-ray crystallography

8.2.3.4.2 NMR spectroscopy

Share this:

Like this:

Related

Related posts:

Stay updated, free articles. Join our Telegram channel

Full access? Get Clinical Tree