9.1 Target MD and steered MD simulations of Ribose Binding Protein (RBP) in water

GENESIS prepares several functions that restrain selected distance/angle/dihedral angles. In this section, these restraint functions are described.

Preparation

First of all, let’s download the tutorial file (tutorial22-9.1.zip). This tutorial consists of five steps: 1) simulation system, 2) energy minimization, 3) equilibration, 4) production run with a restraint function, 5) Targeted MD simulation, and 6) Steered MD simulation. Control files for GENESIS are already included in the download file.

# Download the tutorial file
$ cd /home/user/GENESIS/Tutorials
$ mv ~/Downloads/tutorial22-9-1.zip ./ 
$ unzip tutorial22-9.1.zip 
$ cd tutorial-9.1
$ ls
1_system  2_minimize  3_equilibrate  4_MD_restraint  5_targeted_MD  6_steered_MD

A direcotry in paramter files can be linked as shown in Tutorial 3.1.

1. Simulation System

In this tutorial, we use the all-atom model of the Ribose Binding Protein (RBP). RBP has two domains and adopts closed and open conformations related to ligand binding.  We are going to explain restraint energy functions using the protein. The 1_system directory contains two pdb and a psf files for the system.

2. Minimization

First, we execute the energy minimization of the system to remove steric clashes in the initial structure. For further details of the control file and choice of parameters, please refer to the basic tutorials section (see Tutorial 3.2). In the 2_minimize directory, a control file, “INP”, is given.

# Change directory to 2_minimize
$ cd 2_minimize
$ ls
INP
# Energy minimization
$ export OMP_NUM_THREADS=3
$ mpirun -np 8 /home/user/GENESIS/bin/spdyn INP > log

3. Equilibration

Equilibration runs are executed in the following steps: 1) equilibration in the NVT ensemble, 2) relaxation of the simulation box in the NPT ensemble with positional restraints, and 3) equilibration in the NVT ensemble with positional restraints. The procedure is similar to the basic tutorial section, except that the step numbers are doubled. (see Tutorial 3.2)

$ cd ../3_equilibrate
$ ls
1_md 2_comdist
$ cd 1_md
INP1 INP2 INP3
$ export OMP_NUM_THREADS=3
# equilibration in NVT ensemble (w/ positional restraints, VVER 2fs)
$ mpirun -np 8 /home/user/GENESIS/bin/spdyn INP1 > log1
# relaxation of the simulation box (w/ positional restraints, VVER 2fs)
$ mpirun -np 8 /home/user/GENESIS/bin/spdyn INP2 > log2
# equilibration in NVT ensemble (w/ positional restraints, VRES 2.5 fs)
$ mpirun -np 8 /home/user/GENESIS/bin/spdyn INP3 > log3

To see the structural changes of the protein, we sometimes need to restrain the distances between the centers of mass (COMs) of two domains (groups).

Before running simulations with the restraints, we need to calculate the distance between two groups from the trajectory. 

$ cd ../2_comdist
$ cat INP
$ /home/user/GENESIS/bin/trj_analysis INP > log

INP is a control file for trj_analysis. In the [OUTPUT] section, the distance between COMs of groups is calculated if comdisfile is defined. The atom masses are read from the molecule files such as psffile. Atoms in each group are selected via the [SELECTION] section.  See the GENESIS manual for instructions on how to specify the group. Please use com_distance1 instead of distance1 in the [OPTION] section.

[INPUT]
psffile = ../../1_setup/rbp_closed.psf
reffile = ../../1_setup/rbp_closed.pdb
[OUTPUT]
comdisfile = output.comdis           # COM distance file
[TRAJECTORY]
trjfile1 = ../1_md/eq3.dcd           # trajectory file
md_step1 = 40000                     # number of MD steps
mdout_period1 = 400                  # MD output period
ana_period1 = 10                     # analysis period
repeat1 = 1
trj_format = DCD                     # (PDB/DCD)
trj_type = COOR+BOX                  # (COOR/COOR+BOX)
trj_natom = 0                        # (0:uses reference PDB atom count)
[SELECTION]
group1 = segid:PROA & (resno:1-100 | resno:236-259) and an:CA
group2 = segid:PROA & (resno:108-231 | resno:269-271) and an:CA
[OPTION]
check_only = NO                        # only checking input files (YES/NO)
allow_backup = NO                      # backup existing output files (YES/NO)
com_distance1 = 1 2                    # Analyze the time courses of COM distance
$ /home/user/GENESIS/bin/trj_analysis INP > log
$ ls
INP  log  output.comdis
$ tail -n5 output.comdis
96 29.422
97 29.446
98 29.475
99 29.463
100 29.437

The COM distance at the last step is used in the next simulation.

4. Restraint function of group distances

A simulation with restraints on the COM distance can be used in conventional MD and REUS simulations. In this section, we show how to use restraints on the COM distance.

$ cd ../../4_MD_restraint
$ ls
1_md 2_comdist
$ cd 1_md
$ ls
INP
$ mpirun -np 8 /home/user/GENESIS/bin/spdyn INP > log

To restrain the COM distance, we set keywords in [SELECTION] and [RESTRAINTS] sections in the following way.

[SELECTION]
group1 = segid:PROA & (resno:1-100 | resno:236-259) and an:CA
group2 = segid:PROA & (resno:108-231 | resno:269-271) and an:CA
[RESTRAINTS]
nfunctions = 1
function1 = DISTMASS
reference1 = 29.4
constant1 = 1.0
select_index1 = 1 2

After the simulation, you can check the COM distance during the simulation using trj_analysis.

$ cd ../2_comdist
$ cat INP
$ /home/user/GENESIS/bin/trj_analysis INP > log

5. Targeted MD simulation

Targeted MD (TMD)[1,2] is a method that applies holonomic forces to drive the system towards a targeted RMSD of the system during the simulation.

$ cd ../../5_targeted_MD
$ ls
1_md 2_rmsd_analysis 3_comdist 4_crd_concert
$ cd 1_md
$ ls
INP
$ mpirun -np 8 /home/user/GENESIS/bin/spdyn INP > log

The keywords, target_md and final_rmsd, should be set in the [DYNAMICS] section. In TMD, the initial RMSD value (ininital_rmsd) is calculated automatically. (in the [RESTRAINT] section, reference1 is set, however, the reference is updated during the simulation).

Caution: the number of steps (nsteps) is small for the tutorial purpose. Please increase the number when you execute the simulation in your research.

[INPUT]

(skip)
psffile = ../../1_setup/rbp_closed.psf
pdbfile = ../../1_setup/rbp_closed.pdb
reffile = ../../1_setup/rbp_open.pdb
rstfile = ../../3_equilibrate/1_md/eq3.rst

(skip)


[DYNAMICS] integrator = VRES # [LEAP,VVER,VRES] nsteps = 40000 # number of MD steps (100 ps) timestep = 0.0025 # timestep (2.5fs) eneout_period = 400 # energy output period (1ps) crdout_period = 400 # coordinates output period (1ps) rstout_period = 40000 # restart output period nbupdate_period = 10 # nonbond update period elec_long_period = 2 # period of reciprocal space calculation thermostat_period = 10 # period of thermostat update barostat_period = 10 # period of barostat update target_md = YES # targeted MD final_rmsd = 0.5 # target RMSD value (skip) [RESTRAINT] nfunctions = 1 function1 = RMSDMASS reference1 = 4.3 constant1 = 1.0 select_index1 = 1

In the TMD and SMD, reffile in the [INPUT]  section should be a structure file of the target coordinate. The number of atoms and the order of atoms must be the same as in psffile (as well as in the pdbfile and rstfile).

After the simulation, you can check the structural change from open to closed forms via vmd.

In addition, GENESIS prepare analysis tools. In this tutorial, the control files for rmsd_analysistrj_analysis, and crd_convert are also prepared.

# mass-weighted RMSD
$ cd ../2_rmsd_analysis
$ /home/user/GENESIS/bin/rmsd_analysis INP > log

This figure shows that RMSD goes down linearly in this simulation.

 

# COM distance
$ cd ../3_trj_analysis
$ /home/user/GENESIS/bin/trj_analysis INP > log

 

The COM dist also changes during the simulation.

You can select coordinates from the trajectory for making a movie.

# Convert crd for movie 
$ cd ../4_crd_convert 
$ /home/user/GENESIS/bin/crd_convert INP > log

6. Steered MD simulation

Steered MD (SMD)[3] is a method that applies forces to change the conformations. Targeted MD applies a holonomic constraint to keep RMSD value, whereas Steered MD applies a harmonic restraint force that changes the reference value throughout the simulation. GENESIS has two types of simulations.

$ cd ../../6_Steered_MD
$ ls
1_md 2_rmsd_analysis 3_comdist 4_crd_concert
$ cd 1_md
$ ls
INP
$ mpirun -np 8 /home/user/GENESIS/bin/spdyn INP > log

The keywords, steered_md and final_rmsd, should be set in the [DYNAMICS] section. As well as in TMD, the initial RMSD value (ininital_rmsd) is calculated automatically, too. (in the  [RESTRAINT] section, despite reference1 being set, the reference is updated during the simulation). Since Steered MD applies a harmonic restraint force, it is necessary to set a reasonable force constant (constant1 in this example). The value will change the speed of structural change. Later we will show the difference between the force constants (FCs).

Caution: the number of steps (nsteps) is small, and FC is slightly large for the tutorial purpose. Please increase the number of steps when you execute the simulation in your research. The value of FC should be determined with care, taking into account the aspect of possible structural changes.

[DYNAMICS]
integrator = VRES      # [LEAP,VVER,VRES]
nsteps = 40000         # number of MD steps (100 ps)
timestep = 0.0025      # timestep (2.5fs)
eneout_period = 400    # energy output period (1ps)
crdout_period = 400    # coordinates output period (1ps)
rstout_period = 40000  # restart output period
nbupdate_period = 10   # nonbond update period
elec_long_period = 2   # period of reciprocal space calculation
thermostat_period = 10 # period of thermostat update
barostat_period = 10   # period of barostat update
steered_md = YES       # steered MD
final_rmsd = 0.5       # target RMSD value
(skip)
[RESTRAINT]
nfunctions = 1
function1 = RMSDMASS
reference1 = 4.3
constant1 = 10.0
select_index1 = 1

After the simulation finishes, you can check the structural change from open to closed forms using vmd.

In addition, GENESIS provides analysis tools. In this tutorial, the control files for rmsd_analysistrj_analysis, and crd_convert are also prepared.

# mass-weighted RMSD
$ cd ../2_rmsd_analysis
$ /home/user/GENESIS/bin/rmsd_analysis INP > log

We examine two FCs (10 and 100). Please notice that the behaviour of the RMSD curve changes when a different value of FC is used.

# COM distance
$ cd ../3_trj_analysis
$ /home/user/GENESIS/bin/trj_analysis INP > log

As shown in the RMSD plot, the amounts of change in the COM dist are also different.

You can select coordinates from the trajectory for making a movie.

# Convert crd for movie 
$ cd ../4_crd_convert 
$ /home/user/GENESIS/bin/crd_convert INP > log

7. References

  1. J. Schlitter, et. al.  Mol. Sim., 10, 291-308, (1993).
  2. J. Schlitter, et. al.  J. Mol. Graph., 12, 84–89, (1994).
  3. H Grubmüller et al. Science, 271, 997–999, (1996).

Written by Chigusa Kobayashi@RIKEN R-CCS. March, 4, 2022

Updated by Chigusa Kobayashi@RIKEN R-CCS. May, 9, 2022