Molecular modelling of crystal structures -- part 1

Calculations on structures from the CSD

In this excercise, you will retrieve structures from the CSD and make working Cerius2 models out of those.

1. Estrone
How many structures of the steroid estrone are there in the CSD, and what are the diferences between them?
To answer these questions, you will have to log in to the server at the cmbi, and run the CSD interface.

Using the CSD interface

* log in at cmbi1.cmbi.kun.nl with the username/password you got earlier from the CMBI system manager.
To log in, open a terminal window on your workstation (Desktop | Open Unix Shell), and type
ssh -X cheminf.cmbi.ru.nl
in the window.
If you username here is different from the one you got at the CMBI, you have to type
ssh -X username@cheminf.cmbi.ru.nl
where username is your username at the CMBI.

* Start the cheminformatics menu by typing xmenu & in the terminal window

* When the menu program has started, select ConQuest from the Cheminformatics menu.

After a few seconds, the following window should appear:


Find your own way through the interface, and get and save (in cif format) all estrone structures. Remember, we want just estrone, not any of the derivatives that are there as well in the database!


Transferring the structures

Now that you have the structures, get them to your workstation via the 'secure copy' command, scp:
* in the cmbi1 terminal window, type
scp estron-data-file vstofchem17.sci.kun.nl:
(replacing estron-data-file by whatever you called your data file), and provide your password as requested.

Note that there is a ":"-sign following vstofsgi.sci.kun.nl! The file will now be copied to your local (VSC) home directory.
NB. If your local username is different from the one at the CMBI, type
scp estron-data-file local-username@vstofsgi.sci.kun.nl:
where local-username is your local user name.


Analysing the structures in Cerius2

You now have the data on the workstation, and can load them into Cerius2, via File | load; set the file type to "CIF".

How many structures are there? Are they all different polymorphs?

If a chemist would sketch an estrone molecule, the sketch would look somewhat different from the molecules as they appear on the screen.

Name two important things which are missing in (some of) the structures.

It may not always be easy to see whether two structure are the same, almost the same, or different. The X-ray powder diffraction (XRPD) pattern may be helpful then: if structures are (almost) the same, their XRPD patterns are also. In Cerius2, you can calculate XRPD patterns, and compare patterns from different models by subtracting one from the other.

In the card deck, go to the Analytical 1 card, as shown below.


Calculate the XRPD pattern for the each structure by clicking Calculate Diffraction, which opens a new window, and clicking CALCULATE difraction pattern there. Note: the calculation is done for the currently active structure.
To compare patterns, you can look at their difference pattern. To do so, use the preferences for powder patterns (see the window below). Now, each time a pattern is calculated, the difference with the previous pattern is plotted as well.


Compare all structures by their XRPD pattern. What are your findings?

2. OD14
Get both structures of the steroid known as OD14 from the CSD. You should find two polymorphs.

Give the crystallographic highlights of the polymorphs; what is common, what not?

Clean up the structures by:
* 'unbuilding' the crystal (Card deck Builders1; Crystal Building | Unbuild Crystal)
* using the 3D-Sketcher from the Build menu (not card deck) to get the molecules right
* re-building the crystal
Save the structures to disk so you can use them later, using the .msi format.

Hydrogen bonds can be spotted automatically by Cerius2, via Build (in the main Vidualizer window) | Edit H-Bonds.... Have a look at the hydrogen-bonding scheme of the polymorphs. Do they make sense? How have they been influenced by your cleaning-up the structures?

The molecules have some interesting conformational differences. Let's look at the energy difference between the conformations. To do so, make a new model for each conformation. Load a force field (OFF SETUP | OPEN FORCE FIELD | Load; choose the Dreiding2.21 force field), and put atomic charges on the molecules. Save the structures for later use.

Wich are the main methods to assign atomic charges? What are the most important differences? Why are Gasteiger charges a practical choice in this case?

To get a realistic value for the energy, the energy should be calculated after optimization of the geometry, and not on the structure as it is stored in the data base. First make a copy of the structure by generating a new model space (1), making the structure you want to copy also visible (2), and set the arrangement of visible structures (3), as shown below. Then drag the visible model into the empty model space (4).


Make the original model not-visible again. Now unbuild the crystal structure, so you get just the asymmetric unit. It contains two molecules (which are not related by symmetry). Repeat the above procedure to copy a model, to obtain another model of the asymmetric unit. Then delete one molecule from each of the models, so you end up with a model of just one molecule, and another model of only the other molecule.

To do the energy calculation, load the Dreiding force field: select the OFF SETUP card, click Load, and pick DREIDING2.21. Then go to the OFF METHODS card, and click Run. This will bring up a window where you can calculate the structure's current energy, or do an energy minimization.


What are the absolute/relative energies of the two conformations? What happens if you do not optimize the geometry?

Finally, we want to calculate the energy difference between the two polymorphs. Be sure to handle the non-bonded interactions correctly, via the OFF SETUP | OPEN FORCE FIELD | Energy Terms | van der Waals/Coulomb windows in the card deck. Note that the settings that you enter here, may be undone when you (re-) load the force field, so always remember to do that first!
Also, note that the numbers you get from Cerius are in kcal per mol unit cells! So if there are N molecules per unit cell, you have to divide the number by N in order to arrive at kcal/mol.

What is the problem with non-bonded interactions in large systems, and how are these interactions handled best for 2/3D periodic systems?
What is the energy difference between the polymorphs?


You can compare the optimized structures with the original ones from the CSD by overlaying both in the models window. Is the optimized structure still similar to the original? You can also compare their XRPD patterns, like you did for estrone. How similar are the patterns?

Once you have done all this, get back to the main course page.