Background

Five-membered heterocyclic ring systems constitute an important part of many biologically relevant molecules. They occur in carbohydrates (furanoses), nucleosides and nucleotides, the amino acid proline and their many derivatives. In addition, they often occur as a moiety in complex natural products. Chemical modifications of nucleic acids, often driven by the needs of antisense research, target in part the five-membered cycle or its analogues in order to tailor their conformation towards the desired needs [1, 2].

The advent of combinatorial chemistry has also revived the interest in heterocyclic rings and their conformation [3]. As a result, scaffolds containing five-membered heterocycles have received much attention for the rapid generation of potential lead compounds in pharmaceutical research [411].

Typically, the chemical and conformational space is explored by introducing a diversity of substituents at varying positions around the cycle. Depending on the position and nature of these substituents, the cycle either adopts a single conformation or may be in equilibrium between two conformations. These conformations will in turn impact on the conformational space that will be covered by the substituents, making the determination of the cycle's conformation an issue of considerable interest.

Over the years, NMR has become a well-established technique for this purpose. In particular, 3J HH scalar coupling constants are well-suited as they are mainly determined by the torsion angle over which they are measured. In the case of ring systems, the vicinal 3J HH scalar coupling constants are directly correlated to their corresponding exocyclic torsion angles (θ exo ). These are related to the corresponding endocyclic torsion angles (θ endo ) by a simple equation (1) where A and B are constants determined by the geometry of the atoms linked to the common central bond.

θ exo = A θ endo + B

As the set of all five endocyclic torsion angles in a five-membered ring fully determines its conformation, 3J HH scalar coupling constants provide a direct measure of the ring's conformation. The Haasnoot-Altona equation (3) [12] and the Diez-Donders equation (4) [13, 14], both based on the well known Karplus equation (2), describe the relation between a 3J HH coupling and the corresponding exocyclic torsion angle (θ exo ) to a high level of accuracy. In both equations this is mainly achieved by including a set of four parameters λ i (i = 1, ..., 4) that account for the influence of electronic effects contributed by the substituents [15, 16]. In some studies, the set of experimental 3J HH scalar coupling constants is further extended by 3J HF scalar coupling constants [1719] or interproton distances obtained by nOe NMR experiments. Here however, we assume that only 3J HH scalar couplings are available for conformational analysis.

3J HH = P1 cos2(θ) + P2 cos(θ) + P3

3 J H H = P 1 c o s 2 ( θ ) + P 2 c o s ( θ ) + P 3 + i = 1 4 λ i { P 4 + P 5 c o s 2 ( ( 1 ) i λ i θ + P 6 | λ i | ) } MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaaeaaaaaaaaa8qafaqabeGabaaabaWdaiabbccaGmaaCaaaleqabaWdbiabiodaZaaakiabdQeak9aadaWgaaWcbaWdbiabdIeaijabdIeaibWdaeqaaOWdbiabg2da9iabdcfaq9aadaWgaaWcbaWdbiabigdaXaWdaeqaaOWdbiaayoW7cqWGJbWycqWGVbWBcqWGZbWCpaWaaWbaaSqabeaapeGaeGOmaidaaOGaeiikaGIaeqiUdeNaeiykaKIaey4kaSIaemiuaa1damaaBaaaleaapeGaeGOmaidapaqabaGcpeGaaG5aVlabdogaJjabd+gaVjabdohaZjabcIcaOiabeI7aXjabcMcaPiabgUcaRiabdcfaq9aadaWgaaWcbaWdbiabiodaZaWdaeqaaOWdbiabgUcaRaqaamaaqahapaqaa8qacqaH7oaBpaWaaSbaaSqaa8qacqWGPbqAa8aabeaaaeaapeGaemyAaKMaeyypa0JaeGymaedapaqaa8qacqaI0aana0GaeyyeIuoakmaacmaapaqaa8qacqWGqbaupaWaaSbaaSqaa8qacqaI0aana8aabeaak8qacqGHRaWkcqWGqbaupaWaaSbaaSqaa8qacqaI1aqna8aabeaak8qacaaMd8Uaem4yamMaem4Ba8Maem4Cam3damaaCaaaleqabaWdbiabikdaYaaakiabcIcaOiabcIcaOiabgkHiTiabigdaXiabcMcaP8aadaahaaWcbeqaa8qacqWGPbqAaaGccqaH7oaBpaWaaSbaaSqaa8qacqWGPbqAa8aabeaak8qacqaH4oqCcqGHRaWkcqWGqbaupaWaaSbaaSqaa8qacqaI2aGna8aabeaak8qadaabdaWdaeaapeGaeq4UdW2damaaBaaaleaapeGaemyAaKgapaqabaaak8qacaGLhWUaayjcSdGaeiykaKcacaGL7bGaayzFaaaaaaaa@80F1@
(3)
3 J H H = i = 0 3 C i ( λ 1 4 ) c o s ( i θ ) + i = 1 3 S i ( λ 1 4 ) s i n ( i θ ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaeeiiaaYaaWbaaSqabeaaqaaaaaaaaaWdbiabiodaZaaakiabdQeak9aadaWgaaWcbaWdbiabdIeaijabdIeaibWdaeqaaOWdbiabg2da9maaqahapaqaa8qacqWGdbWqpaWaaSbaaSqaa8qacqWGPbqAa8aabeaaaeaapeGaemyAaKMaeyypa0JaeGimaadapaqaa8qacqaIZaWma0GaeyyeIuoakiabcIcaOiabeU7aS9aadaWgaaWcbaWdbiabigdaXiabgAci8kabisda0aWdaeqaaOWdbiabcMcaPiaayoW7cqWGJbWycqWGVbWBcqWGZbWCcqGGOaakcqWGPbqAcqaH4oqCcqGGPaqkcqGHRaWkdaaeWbWdaeaapeGaem4uam1damaaBaaaleaapeGaemyAaKgapaqabaaabaWdbiabdMgaPjabg2da9iabigdaXaWdaeaapeGaeG4mamdaniabggHiLdGccqGGOaakcqaH7oaBpaWaaSbaaSqaa8qacqaIXaqmcqGHMacVcqaI0aana8aabeaak8qacqGGPaqkcaaMd8Uaem4CamNaemyAaKMaemOBa4MaeiikaGIaemyAaKMaeqiUdeNaeiykaKcaaa@6ABC@
(4)

Altona and Sundaralingam showed that the description of a five-membered ring conformation can be reduced to a two-parameter pseudorotation model [20, 21] that fully describes its conformation. The first parameter, the pucker phase P, represents the phase of the conformation and indicates which ring atoms are positioned out of the ring plane. The second parameter, the pucker amplitude ν max , corresponds to the amplitude of the conformation and describes the extent to which the atoms determined by P are out of the plane. The relationship with the endocyclic torsion angles θendo, iis shown in (5).

θ e n d o , i = ν m a x c o s ( P + 4 π i 5 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaaeaaaaaaaaa8qacqaH4oqCpaWaaSbaaSqaa8qacqWGLbqzcqWGUbGBcqWGKbazcqWGVbWBcqGGSaalcqWGPbqAa8aabeaak8qacqGH9aqpcqaH9oGBpaWaaSbaaSqaa8qacqWGTbqBcqWGHbqycqWG4baEa8aabeaak8qacaaMd8Uaem4yamMaem4Ba8Maem4CamNaeiikaGIaemiuaaLaey4kaSscfa4aaSaaa8aabaWdbiabisda0iabec8aWjabdMgaPbWdaeaapeGaeGynaudaaOGaeiykaKcaaa@4CCB@
(5)

This well-known pseudorotation description, originally described for the furanose ring in nucleosides and nucleotides [20, 21], was further generalized to any five-membered heterocycle by Diez et al. [2224] who introduced two additional parameters α i and ε i for each endocyclic bond to cope with differences in bond lengths in various types of five-membered rings (Equation 6). As the phase of the conformation P is a periodic variable, polar plots called pseudorotation wheels are mostly used to depict ring conformations.

θ e n d o , i = α i ν m a x c o s ( P + ϵ i + 4 π i 5 ) MathType@MTEF@5@5@+=feaagaart1ev2aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaaeaaaaaaaaa8qacqaH4oqCpaWaaSbaaSqaa8qacqWGLbqzcqWGUbGBcqWGKbazcqWGVbWBcqGGSaalcqWGPbqAa8aabeaak8qacqGH9aqpcqaHXoqypaWaaSbaaSqaa8qacqWGPbqAa8aabeaak8qacqaH9oGBpaWaaSbaaSqaa8qacqWGTbqBcqWGHbqycqWG4baEa8aabeaak8qacaaMd8Uaem4yamMaem4Ba8Maem4CamNaeiikaGIaemiuaaLaey4kaSYefv3ySLgznfgDOfdaryqr1ngBPrginfgDObYtUvgaiqaacqWF1pG8paWaaSbaaSqaa8qacqWGPbqAa8aabeaak8qacqGHRaWkjuaGdaWcaaWdaeaapeGaeGinaqJaeqiWdaNaemyAaKgapaqaa8qacqaI1aqnaaGccqGGPaqkaaa@5EE2@
(6)

Using the above equations, 3J HH couplings can be used to derive the pseudorotation parameters of the five-membered cycle. As mentioned previously, the cycle may be in equilibrium between two conformations. Thus, most generally two sets of pseudorotation parameters (P and ν max ) and the relative population (%1, i.e. the percentage of the first conformation present with %2 = 1 - %1) need to be fitted to the experimental NMR data. In order to avoid an under-determined model, experimental data measured at different temperatures is generally used. In such cases, the model assumes that only the relative population of the two conformations varies when changing the temperature. Thus n + 4 (n being the number of temperatures used) variables will be optimized to fit the experimental data in such cases. To the best of our knowledge, the program PSEUROT [25], originally developed by Altona et al., is still the only generally available program to perform this type of analysis. Written in FORTRAN, its interface as well as its output is purely text-based. In order to facilitate the analysis of the PSEUROT results, a post-processing feature has been included in the independently developed MULDER package [26] to generate a graphical output of the PSEUROT results. In this communication, we propose an integrated, user-friendly Matlab program, including a self-explanatory graphical user interface (GUI), to facilitate the set-up, execution and subsequent analysis of pseudorotation calculations for five-membered ring systems decorated with a variety of substituents. The use of Matlab as high-level programming language enables to create, within a limited time frame, high-quality plots that provide a graphical impression of the conformational space accessible. Furthermore, due to the open-source GNU GPL license, users have the opportunity to adapt the program to their specific needs.

Implementation

The program consists of a computational core that is accessed and controlled through a GUI (Figure 1), both written in Matlab. Its goal is to search pseudorotation parameters {P, ν max } for at most two conformations as well as their relative population (%1, n) at n temperatures that fit a series of experimental NMR scalar coupling constants. The initial pseudorotation parameters are set by the user at the start of the computational procedure. The user is also given the choice to define a subset of pseudorotation parameters that have to be optimized. Using equations 1, 4 and 6, the scalar coupling constants relating to conformation defined by the initial pseudorotation values are calculated and a root-mean-square-deviation (RMSD) is determined with respect to the experimental coupling data. Next, the pseudorotation parameters are adapted so as to minimize this RMSD. The Matlab fmincon function from the Optimization Toolbox, which is based on a Sequential Quadratic Programming (SQP) algorithm [27], was used for this purpose. To restrict the optimization to physically sensible solutions, values of partition coefficients were restrained to the [0, 1] interval. Furthermore, puckering amplitudes (ν max ) were restricted to the physically relevant [0, π/3] interval.

Figure 1
figure 1

Layout of the Matlab GUI. In the main window of the GUI (upper panel), both the coupling data and additional parameters required by equations 1 and 6 can be interactively defined, using a simplified representation of the cycle. Both the atom types in the cycle and the substituents can be defined interactively. The 'electronegativity editor' (bottom panel) provides a convenient graphical aid to set up the group-electronegativities (λ i ) for many common substituents as required by equation 4.

A choice is provided between two operational modes. In the first mode, an optimization of the chosen parameters is performed as is, yielding the pseudorotation parameters that best fit the experimental data. The output generated in this operation mode is purely text-based and contains the optimized variables, their corresponding endocyclic torsion angles and a tabular comparison between experimental and fitted scalar coupling data (See accompanying manual). In the second mode of the program, the complete pseudorotation space of the cycle of interest is explored via 3600 combinations of pseudorotation parameters {P, ν max } as described hereafter. The pseudorotation space of the cycle is covered by sampling 120 values of P in the [0, 2π] interval and 30 values of ν max in the [0, π/3] interval. For each {P, ν max } thus sampled, the minimized RMSD between the experimental and calculated scalar couplings is determined. As for the output, a pseudoratation wheel is plotted and the RMSD is represented through 20 contour lines in the [Δ min , 2Δ min + 0.1] interval where Δ min is the lowest RMSD obtained over the 3600 fittings.

Whether the parameters of a second conformation and the relative populations at each temperature are optimized during each of the 3600 runs is again determined by the user's choice. This results in two possible outputs for this operating mode. For the case where the user chooses not to optimize any parameters of the second conformation, one can easily assess from the pseudoratational wheel if the experimental data can be fitted by a single five-membered ring conformation. For the case where the second conformation's parameters (and the relative population of both conformations) are also to be optimized, the pseudorotation wheel represents the conformational space accessible to the cycle when two five-membered ring conformations are fit to the experimental data. In this latter case, one additional pseudorotation wheel is created for each temperature depicting the relative population of the two fitted conformations for each set of {P, ν max } using contour levels.

This second mode of calculation has some interesting advantages over the first mode. First, a better appreciation of the conformational space of the cycle that is in agreement with the experimental data is obtained. Furthermore, the extent of the contour levels in the pseudorotation plot allow to establish whether the model is under-determined or not. Even when under-determined, such calculation can already indicate those conformations that can be excluded from further investigation.

Results

Program features

Only the main features of the program, fully described in the manual available together with the program, are discussed. The GUI is easy to use and handles the input as well as the various operational modes and data output (Figure 1). Two modes of calculation for fitting five-membered ring conformations to 3J HH scalar coupling data obtained at up to five temperatures can be selected. Both the coupling data and additional parameters required by equations 1 and 6 are defined interactively, using a simplified representation of the cycle (Figure 1, upper panel). Both the atom types in the cycle and the substituents can be defined interactively. The GUI includes an 'electronegativity editor' (Figure 1, bottom panel) which provides a convenient graphical aid to set up the group-electronegativities (λ i ) for many common substituents as required by equation 4.

Also, the program is tolerant for sparse input data. When only a sum of two or more 3J HH scalar couplings can be measured, e.g. due to overlap, this sum can be included in the model instead of the individual couplings. Contrary to the PSEUROT program, a coupling does not necessarily have to be measured at each temperature to be included in the calculation.

Four different parameterizations of equation 4 are implemented in the program. These include the 20-parameter and 12-parameter parameterization by Donders [14] and two slightly different 9-parameter parameterizations [28, 29] one of which is used in PSEUROT 6.2. The use of Diez-parameters into the calculations is fully implemented by using equation 6 for the calculation of exocyclic torsion angles out of the puckering parameters. The current version does not include the use of the Barfield-correction [30, 31] used in some studies.

Depending on the calculation mode selected, textual or graphical output is provided. As the program is written in Matlab, the latter plots can be easily post-processed to comply with the user's needs. In addition, several parameters involving the construction of the plots can be set manually. These parameters include the resolution of the search grid used for the pseudorotation scan (i.e. the increment in P and ν max ) and the minimal and maximal contour levels used in the plots. Furthermore, interpolation of these contour plots using cubic spline functions can be selected for obtaining smoothed graphs.

Standard data sets for β-D-ribose and β-D-deoxyribose as well as input data of 1 and 2 are available with the distribution of the software.

Comparison to PSEUROT

To test the performance of the computational core and the input/output handling using the GUI and compare the results with respect to PSEUROT (version 6.2), a set of two 4'-thio-2'-deoxynucleoside analogs [32] was used. For both molecules, depicted in Figure 2, literature [32] provides a full set of five coupling constants measured at five different temperatures as well as Diez-parameters for each endocyclic bond. For both compounds, two conformations were fit to the experimental data using our Matlab GUI and PSEUROT 6.2. The most recent parameterization of equation 4 was used. Details of these optimal fittings, as well as a third fit, taken from literature [32] and referred to as LHK, using the Haasnoot equation 3 [12] are presented in Table 1. This table includes the optimized pseudorotation parameters for the North and South conformations (P N , ν N , P S , ν S ) in degrees and the population of the North-conformer at each temperature for which couplings were provided. Finally, the maximal deviation and RMSD between the fitted couplings and the experimental ones is indicated.

Figure 2
figure 2

Covalent structure of compounds 1 and 2.

Table 1 Optimal puckering coordinates, partition coefficients and residual errors found for compounds 1 and 2 using this Matlab GUI (ML), PSEUROT (PS) and literature data (LHK) [32].

Quasi identical results are obtained when comparing the Matlab GUI (ML) to PSEUROT (PS). Differences between the two programs are most likely introduced by the difference in optimization method and convergence criteria. Looking at the residual deviations with respect to the experimental data, the Matlab GUI finds a slightly better fit. When comparing to the literature data (LHK), one can appreciate the influence of different Karplus equations on the fitting. Differences of 10 and 3 degrees are found for P N and ν N respectively. Also slight differences in populations are observed.

In order to get a better appreciation of the precision of these optimal values, the full pseudorotation wheel of 1 and 2 has been scanned by the Matlab program resulting in Figures 3 and 4. In Figure 3, contour plots indicate the root-mean-square-deviation between the fitted and the experimental scalar coupling couplings for 1 (left) and 2 (right). Looking at the extent of the contour lines, it is clear that the South-conformation is better defined than the North-conformation. Taking into account errors originating from measuring scalar couplings, deficiencies of the model and the Karplus equation, the optimal fit does not necessarily correspond to the real situation. Therefore, interpreting the regions that have a RMSD lower than a certain threshold (e.g. 0.2 Hz), as representative of the cycle's conformation seems more reasonable than taking the optimal fit for granted, especially when comparing different compounds. In addition to Figure 3, Figure 4 can be used to interpret the allowed population-range for each conformation. These graphs represent the population of both conformations of 1 at 285K (left) and 353K (right). The thick black contour is the highest contour level of Figure 3. Within this black contour, a variety of populations can be found, ranging from 27 to 40 percent of the North-conformation at 285K and from 35 to 49 percent at 353K. Again, this gives a more realistic view of the conformational behavior of compound 1.

Figure 3
figure 3

Pseudorotation wheel for compounds 1 and 2. Contour lines indicate the total RMSD with the experimental data (Hz). Figures are directly obtained from the GUI. No post-processing has been performed.

Figure 4
figure 4

Pseudorotation wheel of partition coefficients of compound 1 for the two extreme temperatures (285K and 353K). Contour lines indicate the percentage of conformation present in the best fit. The thicker black contour line is the outer contour line of Figure 3. Figures are directly obtained from the GUI. No post-processing has been performed.

Conclusion

A matlab program with easy to use graphical user interface for the calculation of five-membered ring conformations has been presented. This program has been made freely available online under a GNU GPL license. The performance of the program was tested on a set of two 4'-thio-2'-deoxynucleoside analogs. It has been shown that identical results can be obtained as in PSEUROT. Furthermore, high-quality graphical output can be generated, facilitating the interpretation of the calculations.

Availability and requirements

Project name: Matlab Pseudorotation GUI

Project home page: http://sourceforge.net/projects/matlabpseudorot

Operating system(s): Platform independent (tested in WinXP)

Programming language: Matlab (tested in release R2007a)

Other requirements: Matlab Optimization Toolbox

License: GNU GPL