Users' Guide to the Monte Carlo Program MMC
for Computer Simulation of Molecular Solutions.
Mihaly Mezei
Department of Pharmacological Sciences
Icahn School of Medicine at Mount Sinai
E-mail address:
Mihaly.Mezei@mssm.edu
April 04, 2022
TABLE OF CONTENTS
I. GENERAL DESCRIPTION
The present document describes the Force-Biased Metropolis
Monte Carlo program for computer simulation studies of molecular
liquids and solutions in the canonical, isobaric-isothermal
and grand-canonical
ensembles and the analysis of such calculations based on the
Proximity Criterion.
This section provides an overview of the
capabilities of the program.
I.1. Systems modeled
The present program was originally written for simulations on
infinitely dilute solutions (one solute in (numsol)
solvent molecules).
As a result of this origin, the program has developed two
different treatments of molecules.
Solvent molecules have to be rigid and be of the same type
(this allows the use of special, rapid energy calculation code).
The collection of all other molecules (i.e., the one(s) not
declared to be solvents) are called by the program the
solute.
Thus the solute can consist of a single rigid
molecule, a collection of freely translating and rotating rigid
molecules or a collection of freely translating and rotating
molecules with certain (user specified) torsion angles freely
rotating.
The program establishes the connectivity of the solute
molecule(s)
based on interatomic distances and built-in threshold values.
By default H-H bonds and bonds between an H and a heavy atom on different
groups (residues) are ignored, but this can be changed
(see the keys BDHH and
BDRH).
Also, the bond thresholds can be changed with the
MODA key.
This sometimes leads to missing or extra bonds, especally if the
input structure (given by the SLTA key)
is obtained from a molecular dynamics simulation.
Such defects can be identified by using the
FCGA and
BNDL keys and corrected by the keys
MAKB or
BRKB or forced to be ignored (ok for molecules
without active torsions) with the MOLD key.
Free energy simulations can be performed on selected
molecules
of the solute (see Sec. I.5.).
The torsion angles to be changed on the solute are specified
by the two atoms defining the bond of the torsion
(with the key TORD).
The torsions can be grouped and
it is the group of torsions that is selected for change. The main
reason for this is that if a torsion affects atoms that are on an
other rotatable bond then it may be efficient to move the
affected bond as well. These moves, however, are not correlated
(i.e., each is generated by a different random number).
There is also an option of changing the torsion angles of a
loop segment (with the key LOOP).
The torsion potential parameters can be obtained from a CHARMM or
AMBER parameter file.
It is possible to impose periodicity on the solute,
either along the X axis (with the key SLTA
POLY)
i.e., with period equal to the edge of the cell in the X direction
- an infinite polymer,
or in all three directions
(with the key SLTA XTAL) with its
unit cell (or a multiple of a unit cell) coinciding with the
periodic cell of the simulation (i.e., a crystal).
The periodic extension of the solute outside the
periodic cell is done automatically by the program.
The intermolecular interactions are assumed to be of the usual 12-6-1 type,
i.e., Lennard-Jones terms for the repulsion and dispersion plus
electrostatics based on partial charges on atoms (or pseudo atoms).
The parameters can be obtained from libraries stores by the program
or defined via input.
The interactions can be truncated
(see keys SVVC,
SUVC, SUUC)
either by a spherical cutoff
(each atom interacts with all others within a certain distance)
or by the minimum image convention
(each atom interacts with one periodic image of all the other atoms).
For the purpose of cutoff or image calculation, distances usually refer
to molecule-molecule or residue-residue distances.
When spherical truncation is used, the long-range electrostatic energy
can be estimated by the reaction field formula
(provided all residues are neutral) - see key
RFCR.
I.2. Properties
calculated during a simulation
The major outputs of a Monte Carlo simulation (besides the
configurations saved for further analysis by the
TRAJ key) are as follows.
- Macroscopic thermodynamic properties of the system:
internal energy, pressure, heat capacity (constant-volume
or pressure, depending on the ensemble used), the density far from
the solute (see key MOND). Simulations
in the grand-canonical ensemble (see key GCEN) also provide the density, excess
chemical potential, isothermal compressibility, and
expansivity. Simulations in the isothermal isobaric ensemble
(see key IBEN) also provide the density
and the thermal expansion coefficient.
- Several methods to calculate free energy have been
implemented (see key FREE) by relating
it to either some computed energy values or to some distributions.
- Bulk solvent-solvent and solute-solvent distribution functions
(see key DSTC):
radial distribution function based on center of mass (COM),
quasi-component distribution functions for coordination number,
binding energy and pair energy, and dipole correlation functions.
- The derivative of the solute and total energy with respect
to the solute potential parameters can also be calculated as an
ensemble average (see key SENS). This
may be used for the development of
potential parameters and for the sensitivity of the calculated
energy to the potential parameters.
- Calculation of the mean forces exerted by the solvent on the solute atoms
(see key SENS) and the electric field
gradients at the solute and solvent sites (see key
FLDG) can also be requested.
- For systems with variable torsion angles the torsion
angle distribution can be calculated
(see key TAND)
- Proximity analysis, i.e., distributions based on the
original form of the Proximity Criterion
or its enhanced version can also be
performed. This analysis partitions the solvent according to the
proximal solute atoms and calculates distributions for each
partition.
The solute atoms are partitioned into functional
groups either defined by their chemical identity
(see key FCGA) or into alternative
functional groups specified by the user
(see key FCGD).
The proximity analysis results are presented for individual solute
atoms, functional groups, and solute residues (see key
SLTA) and also are
averaged over functional groups of the same kind.
I.3. Potential
functions
Several potential functions are built into the program. For
the solute-solvent nonbonded interactions
(specified with the SUPT key).
Clementi et al.'s
library, EPEN/QPEN, Kollman et al.'s libraries (AMBER-94 and
AMBER-2002), CHARMM22 library, Berendsen-Van Gunsteren library
(GROMOS87 and GROMOS96);
and Jorgensen's OPLS library.
Note that for the GROMOS force field only the combination
rule can be used - the non-bonded parameters for pairs
are not implemented.
Except for the EPEN form, parameters are stored internally, but
they can be modified from input and allowance is made for
inputting new atom types besides the stored ones. As these
parameter libraries are continually updated and extended, it is
advisable to check if the value stored is the best one
(using the PLBP key).
For aqueous solutions, the solvent-solvent interaction
(specified with the SUPT key)
can be either
in the form of MCY-CI type or TIPS/TIP3P/SPC type or
TIPS2/TIP4PB-F type or TIP5P type potential. For the MCY-CI type the
parameters of both the MCY-CI-II and the Yoon-Morokuma-Davidson
parametrizations are built in. For TIPS2/TIP4P/B-F types, the A, B
and q parameters may also be provided by the user, but the TIPS,
SPC, TIP3P, Bernal-Fowler, TIPS2, TIP4P and TIP5P water model parameters
are built in. For solvent other than water, 12-6-1 type potential
can be used with parameters from the AMBER, GROMOS or OPLS
libraries.
There is a framework for adding field-dependent potentials to the
currently implemented atom-atom potentials, but this requires some
additional coding.
I.4. Sampling techniques
Various sampling techniques can be selected with the
MOVE key.
Translations and rotations can be sampled with the original
Metropolis Method or with the
Force-Biased algorithm of
Pangali, Rao and Berne (not implemented for
multimolecular solute moves, though).
Force biasing, of course, necessitates the
recalculation of energies and forces at every accepted step,
thereby significantly increasing the computer time required
to perform a move. The improvement in convergence, however, more
than outweighs the added cost. By default a lambda value of
1/2
is used but the scaled force-bias technique where lambda is
scaled down near the solute has also been
implemented.
Novel torsion-angle sampling techniques include the
extension-biased torsion moves and
the
local moves with the reverse proximity criterion.
Molecules to be moved can be selected randomly, in a cycle
or in a
shuffled cycle or using the preferential
sampling scheme of
J.C. Owicki, which leads to improved
statistics for the local solution environment of the solute (see
key MOVE PRF*).
The cavity-biased insertion technique for
grand-canonical ensemble simulations and the
virial-biased volume-change
for the isobaric ensemble have also been implemented.
I.5. Free energy methods
Free energy simulations can be performed using a coupling
parameter lambda (CP in the followings) in the energy expression.
There are two
fundamentally different approaches
to generate a path: (A) simultaneous
creation/annihilation and (B) continuous deformation. In the
first the Hamiltonians of the initial and final states are used
in unchanged and the coupling parameter is used to combine the
two; in the second case there is a single hybrid Hamiltonian that
is based on a parametrization that is a mixture of the two
systems. For multimolecular solute, the free energy simulation
can be applied to just one or two solute molecules, depending on
the type of coupling.
I.5.A. Creation/annihilation path
I.5.A.1. Creation/annihilation path to be used in
thermodynamic integration (FREE TICA).
Here
E(CP)=CPk*E1+(1-CP)k*Eo
where Eo and E1 represent the
Hamiltonians initial and final
state, respectively. k=1 represents linear coupling, k>1 has been
referred to as or 'almost linear' coupling (e.g., k=4 for
1/r12
repulsion). It is also possible to use different k values for
the different types of energy terms
(1/r12, 1/r6, and 1/r) with the
so-called polynomial TI.
Eo and E1 can differ in conformation,
partial charges, and chemical identity (the present version
requires equal number of atoms in the two states - this can
always be achieved with he use of dummy atoms. It is also
possible to provide two more coupling parameter values
CPo and CP1 such that
CPo < CP < CP1.
The program will compute the free energy
difference between states represented by E(CPo) and
E(CP1) with the
perturbation method as the sum of two free energy differences:
A1-Ao =
-kT ln<exp((E(CP1)-E(CP))/kT)>CP
+ kT ln<exp((E(CPo)-E(CP))/kT)>CP.
<E(CPo)>CPo and
<E(CP1)>CP1 are also computed for
consistency check.
The two molecules have to contain the same number of residues
(groups), but can have different number of atoms.
I.5.A.2. Perturbation method based on a sequence of
intermediate states (FREE TILI). This
option performs a
perturbation method calculation between two states generated
using the deformation-type path described below in I.5.B.
The reference state will be
(E(CP1)-E(CPo))/2. In this case, the
inital and final states have to have the same number of atoms also.
I.5.B. Deformation path
I.5.B.1. The program also provides option for a
fundamentally nonlinear coupling where the coupling parameter
changes both the solute coordinates and the potential parameters
to generate intermediate states (continuous deformation).
Besides creating intermediate Eo and E1
states for the linear or
'almost linear' couplings described above in I.5.A.2, this
coupling can be used with the probability ratio method
(FREE PMF1) or with the overlap ratio
method (FREE PMNL), as
described
below in I.5.B.2 and I.5.B.3, respectively. The initial and final
states may differ in position, partial charge or identity of
atoms (there are limitations for EPEN).
In each of the following cases, the path involves one or two
molecules of the solute. The following paths are supported:
I.5.B.1.1. Linear combination of two conformations of a
single molecule (FREE PMF1 GENL). In
this case all atoms of the molecule may
change positions independently. The configurations at the
beginning and at the end of the path are given on input and the
intermediate configurations are convex linear combinations (i.e.,
R(CP)=CP*R1+(1-CP)*Ro) of these two.
I.5.B.1.2. Moving two molecules of the solute in a
correlated fashion (FREE PMF1 WRMM). In
this case, the
intramolecular distances within the two parts should remain
constant. Note, that if the orientation of either part is
different in the initial and final states then at the
intermediate states the molecule will be distorted.
I.5.B.1.3. Varyig the distance between two solute molecules
of the solute but keeping the one of them fixed
(FREE PMF1 WRFM).
In this case the configurations at the two ends of the path
differ only in the conformation of the moving part of the solute.
This option may be more economical when one part is large and the
other is small.
I.5.B.1.4. Moving and rotating one molecule while keeping
the other one fixed (FREE PMF1 WRFR).
In this case the center of
mass of the first part is moved between the center of masses at
the two endpoints and the orientation of the moving part is also
changed to transform it from the initial to the final
orientation. This option differs from I.5.B.1.3 in that at each
point of the path the moving molecule has the
same intramolecular conformation.
I.5.B.1.5. Correlated rotation of a set of torsion angles
(FREE PMF1 TORS). In this case a set of
torsion angles are
changed in a user-specified interval using a common CP parameter:
CP=0 sets all the selected angles to the beginning of their
interval and CP=1 sets all of them to the end of their interval.
I.5.B.2. The solvent contribution to the free energy change
between two solute conformations can also be determined by
computing the ratio of the Boltzmann probabilities as sampled
along a line in the conformation space of the solute (described
by any of the paths described in Sec. I.5.B.1,
FREE PMF1 **** *USS).
This approach is called the probability ratio method.
Calculations of this type usually require umbrella sampling
techniques. The program allows either the use of a harmonic
potential to be given from input or the self-consistent
determination of a tabulated weighing function using the
Adaptive Umbrella Sampling Method.
I.5.B.2. Any of the paths descibed in Sec. I.5.B.1 can be
used to estimate free energy changes simultaneosly using the
overlap ratio method of
Jacucci and Quirke] and the
perturbation method (FREE PMNL). Here
the calculations
are run at a given fixed E(CP) and two more coupling parameter
values, CPo and CP1 are specified,
CPo < CP < CP1. For the overlap
ratio method, the program will calculate the CP averaged
distribution of <E(CP)-E(CPo)>CP
and <E(CP1)-E(CP)>CP. Separate
calculations at CPo and CP1 will give the
needed CPo and CP1 averaged
distributions to be used with the overlap ratio method.
Simultaneously, the program also calculates the perturbation
method differences between the states CPo and CP as
well as CP and CP1. Notice that this kind of
perturbation method calculation is
different from the calculation described with the
creation/annihilation path above: there the middle state E(CP) was
a linear combination of the initial and final states,
E(CPo) and E(CP1).
For consistency check, (similarly to the perturbation
method calculations described above) the CPo average
of E(CPo) and
the CP1 average of E(CP1) are also
calculated.
I.5.C. Chemical potential.
Free energy can be also obtained from the chemical potential.
The excess chemical potential can be obtained either from
grand-canonical
ensemble simulations (key GCEN),
from calculating the excess free energy
of changing a molecule to ideal gas (with one of the methods
described
above, e.g., with key FREE TICA)
or by the Widom insertion method
(key FREE WIDO).
The program implements a version that is based on
insertions into cavities.
I.6. Solute move selection
The sampling of the moves is specified as follows. The
overall frequency of solute moves is specified by input provided
with the STEP key. As discussed above,
the whole solute can be a) rotated and displaced;
b) one constituent molecule of the solute
can be rotated and displaced; c) two constituent molecules can be
swapped; d) a group of torsion angles can be changed; and e) the
coupling parameter used in a potential of mean force calculation
can be changed. If moves of type b, c, d, and e are required,
then the program will request a relative frequency (weight) of making that
move. Regular solute moves and coupling parameter changes are
both given a weight of 1.0 . Thus giving a weight of 2.0 for type
3 moves, out of 4 solute move attempts, 1 will change the solute
position, 1 will change the coupling parameter, and 2 will change
one of the randomly selected torsion angle groups (on the
average, of course). Clearly, the more types of moves on is using,
the more frequently has the solute to be moved.
For the free energy options, the treatment of moves of type 2
or 3 depends of the type of path. For the creation/annihilation
path using TI (FREE TICA)
the torsions are independently sampled on
both copies, since these can be, and generally are, different
molecules. For the continuous path
(FREE PMF1 or
FREE PMNL) or
the cration/annihilation PM (FREE PMLI),
the torsions are treated
identically in each copy since here it is assumed that the
molecule is changing smoothly. This means, that for a
creation/annihilation TI these moves have to be sampled twice as
frequently.
I.7. Initial
configuration
The starting configuration for the Monte Carlo
(see key CNFG) is either read from a
previously prepared file, generated from the solute's description
(see key SLTA) and randomly placed solvents
or from a configuration of another system through various
transformations.
Grand-canonical ensemble simulation
(see key GCEN) is also an efficient way
of obtaining a random initial configuration.
I.8.
Information needed for a simulation
Several steps in the preparation of the input to and the analysis of
the results form MMC can be helped by the program
Simulaid.
The list of its features are described in a
J. Comp. Chem. paper.
I.9.1. T, temperature at which the simulations are to be
run (specified with the TEMP key).
I.9.2. The coordinates of the nuclei, electrons and
pseudoatoms (if any) in a local coordinate framework, and the
charges associated with these centers for the solute and
(optionally) for the solvent
(specified with the SLTA key).
The program
Simulaid
can help convert structures in different file formats
(e.g., PDB, CHARMM .CRD) to MMC format and can
optimize the orientation of a solute within a periodic box
(to maximize the smallest inter-image distance and thereby minimize
the artifactual effect of the solute images), within a boundig box,
or optimally center a solute in a sphere (for droplet-type
calculations).
I.9.3. Potential specifications.
For each solute atom the potential type has to be specified from
input (except for EPEN where for each atom and pseudoatom
the A, B, and C parameters have to be specified individually).
The program
Simulaid
can help to convert from PDB, CHARMM, AMBER or Macromodel
input files to MMC format.
Several water potentials are implemented with specific codes,
but solvent can be defined similarly to the solute as well.
I.9.4. numsol, the number of solvent molecules used in the
simulation,
specified with the NSLV key.
I.9.5. Size and shape of the periodic cell
(specified with the PBCN key).
I.9.6. When generating a new configuration the program can
determine nmolec and the simulation cell size parameters
from the partial molar volumes of the solute and solvent and
the required number of solvent shells around the solute.
I.9.7. For partial solute displacements the groups of solute
atoms (independent molecules) to be moved separately
(specified with the PARD key).
I.9.8. For solute torsions
(specified with the TORD key).
the atoms defining the torsion
angle and the torsion potential parameters.
I.9.
Self testing
There is an extensive
self testing routine in the program that
tests the integrity/consistency of the data.
It protects against both corruption of data and program errors.
It is invoked at several ways:
- Automatically at startup
- Periodically, as specified by the user with the
SLFT key.
- Optionally at the end of the run, as specified by the user with
the STOP key.
- Automatically at every 2,500,000th step (but not at the last
one).
I.10. General analysis
Except for the the bulk distributions and most thermodynamic
variables,
all the the properties discussed above in
Sec. I.2., can be calculated from
the history of a simulation as well. Note, that such analyses can
be done
on simulation trajectories generated by other programs, e.g., by
CHARMM or AMBER. In addition to the analyses described above,
several other types of analyses are implemented as summarized
below.
- Generic solvent sites can be calculated
(see key GENS)
- Two configurations can be compared and the changes in the
solute molecules can be calculated
(see key CMPC)
- Cavities in the solute can be visualized by a grid technique
(see key PRTG) and their volume be
estimated (see key STVG)
- The solute-solvent and solute solute energies can be separated
into terms involving each solute atom
(see key ENGL)
- The distribution of the solvents around the solute can be
visulaized by creating a configuration with the solvent coordinates
from a set of selected configurations
(see key DENF)
- A simulation history can also be filtered according to various
criteria, e.g., energetic, geometric or simply eliminating segments
of the system (see key FILT)
II. FILE ORGANIZATION.
Beside then standard input (Fortran unit 5) - for the information
needed
for the run - and the standard output file (unit 6) - printed
output, diagnostics, described in Section
V. - the program uses a number of different
files. In the followings,
writing to the standard output will be refererred to as printing.
File names are limited to 80 characters and
are structured the following way:
<jobname>[.<numrun>][_<version>].<file
type>.
where
- <jobname> is any legitimate filename, common to
all files generated by the run (specified by the
FILE key);
- <numrun> is a number
greater than 1 and less than 99 (i.e., the omit
the run number from the file name for <numrun>=1
and limit the number of runs to 98);
After each RUNS command,
<numrun> gets incremented by one.
This allows the execution of several different MC runs
(equilibrations, free energy runs with different coupling
parameters, etc.) within a single job.
Also, <numrun> can be modified
by the FILE command.
- <version> is an integer between 2 and 9999
(i.e., the omit
the version number from the file name for <version>=1.
It is used whenever a given runnumber deals with several files of the
same type during a run
(e.g., FILT,
TRAJ
CHKP or
the SCKP keys)
- <file type>, as explained below, is one of
{ckp, crd, hst, dst, idl,
pxc, pxp, pxi, slt, wmp,
dat, grd, pxv, wmp,
pdb, fgd, CRD}
- File type=ckp: Checkpoint file storing the
complete program status (checkpoint). It is created automatically
when the RUNS,
WCKP or
SCKP keys are used.
It allows the resumption of the
calculation with the RCKP command by
restoring all common
blocks to their value prior to the last write.
It starts with the
coordinates of the current configuration, in the same (binary)
format as on file .crd
(except for the coupling parameter that is stored differently).
Checpoint file is written before a selftest
(see key SLFT),
at every nplt-th step
(before printing the distributions
- see key RUNS)
and at every nrecd step.
By default, nrecd is set
to the smaller of nplt or
1000000, 750000, 500000, 200000, 100000 for systems with
number of atoms <3000, <5000, <7500, <10000, >10000,
respectively but it can be set to any value with the key
CHKP.
- File type=crd: The coordinates of the initial
configuration are stored here.
Additional records may store the coupling parameter
(see key FREE),
the number of molecules
(see key GCEN)
and/or the simulation cell sizes
(see key IBEN)
It is read by the CNFG command,
and can be written by the WCNF command
- File type=hst: Trajectory file where the simulation
history is (to be) stored. It's use is activated by the
TRAJ command.
- File type=dst: The bulk solvent distribution functions
calculated during the simulation may
be saved after every nplt steps (the g(R), various QCDF's
(Quasi-component distribution
functions) on this file. It can later
be used with plot programs for graphical displays. Examples for
QCDF's are the distribution of coordination numbers or binding
energies.
- File type=idl: The log of insertions and deletions are
stored here, as described for the command
GCEN.
- File type=pxc: The checkpoint file for proximity
analys is generated when keys
PXAN or
SCAN are used.
- File type=pxp: The file containing the proximity
analysis distributions for further plotting, written when the
PXPL key is present.
- File type=pxi: The file containing the proximity
information (generated by PXWR)
for each configuration analyzed.
- File type=slt: The file containing the solute
description (read after the key SLTA).
- File type=wmp: The file of matched PMF segments prepared
for plotting by the WMAT command.
- File type=dat: Configuration file written for InsightII
input (generated by WCNF SVAN).
- File type=grd: Grid point file written for cavity
analysis, generated by PRTG.
- File type=pxv: Proximity region visualization file,
generated by GENV WRIT.
- File type=pdb: Configuration written in
PDB format.
- File type=fgd: Field-gradient file, generated by the
FLDG key.
- File type=CRD: Coordinate file in CHARMM CRD format
(with or without the CHARMM headers).
III. INPUT DESCRIPTION
The program input is based on keyword driven commands.
The structure of a command is as follows:
- Leading keyword
- Auxiliary keyword(s)
- Unformatted data
- Formatted data
Of these only the leading keyword appears all the time.
Each keyword is a four-character, upper case string.
Keywords and unformatted data have to be on the same line.
Commands requiring
more than one line require the continuation character "~" at the
end of the line that is not the end of the command.
Several
commands can be put on the same line if they are separated by the
separator character "|". Text after the comment character "!" is
neglected (i.e., considered just comment).
The type of items required are defined by the leading keyword.
Auxiliary keywords and data may have default values.
The order of keywords and data is fixed, so if there is a needed
auxiliary keyword or data, all other items have to be specified
that precede the needed one.
Formatted data (when required) always start in a line after the
keywords and it can not be omitted, but for items with
default value, can be secified as zero.
While there is no rigid order of the leading keywords, there
are certain rules. Certain leading keywords have
prerequisite
keywords (i.e., the prerequisite keywords all have to be inputted
earlier) and potential precursor keywords that, if
used, must be read before the current one or after the next
RUNS) or
SCAN).
If KEY1 is a potential precursor of KEY2 then Key2 is a
potential successor of KEY1.
If the same leading keyword is used
more than once before a RUNS command, a
warning is given and the last occurrence will be used (except for
TITL).
During input and initializations the data read is cheked
for consistency and for current program limits.
This may result in messages prefixed by
>>>>> OVERRIDE,
----- WARNING, ===== STRONG WARNING or
***** ERROR.
At the conclusion of the run, the number of each type of messages
is printed.
ERROR messages prevent a simulation or an analysis run
but don't abort immediately the input.
When the input has more than one ERROR messages,
some of the additional ERROR messages may
only be the consequence of the first one.
WARNING message indicates
that under some circumstances the input may be incorrect
and STRONG WARNING message indicates
that the input is likely but not necessarily incorrect or that the
error is not fatal.
OVERRIDE messages are generally harmless, but they can
indicate that the input was not what the user intended.
This section describes the input for each leading keyword
and specifies their prerequisites and potential precursors.
The description of the commands is given by leading keywords.
For example
CNFG Key1 [Key2 Key3 Data] Fdata
means that the
leading keyword CNFG has to be followed
by at least one other auxiliary
keyword and optionally two more ("Key 2 Key 3"),
followed by free-format numerical data ("Data")
(all on the line containing the leading keyword), followed by
formatted data, started in a new line.
Note also that the type of (or the need for) auxiliary keys,
free-format and formatted
data may depend on the actual choice of auxiliary keys.
The possible keywords to be used for
Key1, Key2 and Key3 and their effects are explained subsequently,
as well as the nature and (for Fdata) the format of the data.
Keywords are always given with upper case BOLD
characters while variable (data) names are given in lower case
bold characters. Input item that is optional is given in
[ brackets ] and the default values are given in { braces }.
The default value for an auxiliary keyword
is the first one on the list.
A record type number (RECTYPE) is assigned
to every formatted data record. It is used in error messages
on the program's output to
identifying the type of data read when the error was encountered.
III.1. File handling keywords
- FILE: Job name
- WCNF: Configuration
saving
- WCKP: Creation of a
checkpoint file
- SCKP: Periodic archiving of
checkpoint files
- RMCK: Checkpoint file
removal
- TRAJ: History file
specification
- MINE: Minimum energy extraction into
history file
- IDAC: Proximity
insertion/deletion acceptance rate
- IDLG: Insertion/deletion
log file specification
- PXPL: Proximity analysis
distribution plot file writing
- PXWR: Proximity information
file specification
- WTRA: Trajectory
conversions
- STIR: Contact elimination
- LPAT: Path of local disk directory
- BUFF: History file buffer
save
III.01.01. Job name
FILE Data
Data: jobname[, numrun]
III.01.02. Configuration saving
WCNF
Prerequisite: CNFG,
SLTA
Related key:
PDBT.
Key1 (coordinate file format):
- BNRY - Configuration is saved in binary
- ASCI - Configuration is saved in ASCII, (3f15.5)
- ASAN - Configuration is saved in ASCII, and in , each
atom's coordinates are preceded by its atomic symbol and followed
by the group number and charge (a4,1x,3f15.5i5,f10.5)
- PDB - Configuration is exported to
PDB format;
last two colums contain the atomic radius and charge
- PDBO - Like Key1=PDB , but the
last two colums of solvent atoms contain
the fractional occupancies and RMS's determined by
GENS or read by
CNFG.
PDB format;
- PDBQ - Save the configuration in GRASP PDB format #1
(last two colums contain the atomic radius and charge)
- CHRM - Configuration is exported to CHARMM CRD format
Key2 (possible manipulations) - for any of the keys above:
- UNCH - Save the configuration unchanged
- FULL - When the FLXR key
is present, this option will save the reassembled full structure
since otherwise only the moving atoms are saved.
- CENT - The solute is centered in the simulation cell
before saving.
- FIXC - Change all solute molecules to conform to the
geometry defined by the SLTA key.
- SW23 - Switch between 2-copy and 3-copy free energy
solutes before saving the configuration:
- MLCT - Extract from the configuration the centers of the
solute molecules and discard the rest (including the solvent)
before saving
- REPL - Extend the system with its periodic replicas
as specified by the PBCN key.
If extensions required only in a few directions then the
required replica cell centers should be read with the
PBCN READ (cell center coordinates
can be printed with the PRPB key).
All solvent molecules are printed after all the solute images and
the solute atoms affected by the key
PARD
are also printed contiguously.
- REP0 - Extend the system with its periodic replicas
but don't replicate the solute. This option is useful for enlarging an
existing system.
- NOSV - Don't write the solvents
- PRET - Apply the pre-transformation that has been specified
by the solute data header read with the
SLTA key or defined by the
CNFG RANC key
Data: numrunr
- numrunr - runnumber to use on the new file
- TRRT - Translate and rotate the system before saving
- RTTR - Rotate and translate the system before saving
Data (for TRRT or RTTR): numrunr,
dx, dy, dz,
R11,R21,R31,
R12,R22,R32,
R13,R23,R33
- numrunr - runnumber to use
- dx, dy, dz - the components of the
displacement vector
- R - the rotation matrix, specified column-wise
- PXSR - Rearrange the solvents so that the ones non-proximal
to selected solute atoms will be put first.
Data (for PXSR): numrunr,
nasltrange, iasltrange1, iasltrange2, ...
- numrunr - runnumber to use
- nasltrange, iasltrange1, iasltrange2, ...
The selected solute atoms fall into nasltrange segments and
the limits of each segment are
iasltrangei and iasltrangei.
- SPSR - Rearrange the solvents so that the ones outside a
user-defined spehere will be put first.
Data (for SPSR): numrunr,
x, y, z, rsph.
- numrunr - runnumber to use
- x, y, z - center of the sphere
- rsph - radius of the sphere
- PDBD - Save the configuration in
PDB format
(last two colums contain general data in free format
- see Key2 and Key3 below and key KMNP)
- PDBG - Save the configuration in
Grasp PDB format #2
(last two colums contain general data in free format
- see Key2 and Key3 below and key KMNP)
- PDTD - Same as PDBD except the data in the two
data colums will come from the difference between two datasets,
generated by the PXDT key.
- PDTG - Same as PDBG except the data in the two
data colums will come from the difference between two datasets,
generated by the PXDT key.
- CHRD - Save the configuration in CHARMM CRD format. The
content of the data column after the coordinates will be determined
by Key2 as described below.
Key2 (for Key1=CHRD only);
Key2, Key3 (for Key1=PD*D or Key1=PD*G
only):
Any (two) of the following keywords, specifying the
two quantities to be printed in the last (two) data column(s)
of the atom records
- VFSH Volume of the first solvation shell
- V2SH Volume of the first two solvation shells
- NFSH Number of solvents in the first solvation shell
- N2SH Number of solvents in the first and second
solvation shells
- DFSH Solvent density in the first solvation shell
- DSSH Solvent density in the first and second solvation
shell
EBFS Solute-solvent binding energy in the first
solvation shell
- EPFS Solute-solvent binding energy in the first
solvation shell
- TOBE Total binding energy
- TOTN Total number of solvents
- NFVV Solvent-solvent coordination number
- EPVV Solvent-solvent pair energy
- EBVV Solvent-solvent binding energy
- MRDF First minimum of the primary RDF
- KRDF Coordination number using the actual
first minimum of the primary RDF
- VRDF First shell volume using the actual
first minimum of the primary RDF
- DRDF First shell density using the actual
first minimum of the primary RDF
- CRCV Circular variance, based on all solute atoms
- CRCH Circular variance, based on the solute heavy atoms only
- INAC insertion acceptance rate (saved with the
IDAC key)
- DLAC deletion acceptance rate
Data: numrunr, rcrcv
- numrunr {numrun} - The configuration is saved
to the file <job name>.<numrunr>.dat when
Key1 is SVAN,
to the file <job name>.<numrunr>.pdb when
Key1 is PDB*,
otherwise to file <job name>.<numrunr>.crd
When Key2 or Key3 = CRC*:
- rcrcv {6.0} - The
circular variance will be calculated
using solute atoms within a radius of rcrcv.
The circular variance for a point R is defined as
CV = 1.0 - [ SUM (<R>-ri)
/ SUM (|<R>-ri| ]
where ri are the coordinates of solute atoms
within
rcrcv. When R is inside the solute, CV is near one,
and
when R is outside the solute, CV is near zero.
III.01.03. Creation of a checkpoint file
WCKP [Data]
Prerequisite: FILE
Data: numrunw
- numrunw {numrun} - Checkpoint file
<jobname>.numrun.ckp is to be written.
III.01.04. Periodic archiving of checkpoint files
SCKP [Data]
Data: nsavckpf
- nsvckpf - Frequency of writing a separate checkpoint
file.
Each checkpoint file will have the runnumber of the current run
and consecutive version numbers, starting at 2.
This is option allows the later calculation of proximity analysis
averages over different segments of the run (see key
CMPC),
provides a backup procedure in case the program
crashes while writing the checkpoint file, and is
useful for debugging.
III.01.05. Checkpoint file removal
RMCK [Data]
Data: numrunp
- numrunp {<current runnumber> -1} - if positive,
the checkpoint file(s) for
run number.
numrunp will be removed
(by default, the checkpoint file created by the last
RUNS or SCAN
will be removed).
If numrunp is negative then the checkpoint file(s) for
the <current runnumber> - |numrunp| will be removed.
This option is useful for preventing the disk to become too full
when the calculation is broken up into several
RUNS or SCAN
steps.
III.01.06. History file specification
TRAJ Key1, [Key2,Key3] [Data]
Can not be followed by: MINE
Potential successors:
RCKP
Prerequisite of:
DENF, WTRA
Variable to set to 'T' in the preprocessor: ME
Related keys:
MINE,
WTRA,
BUFF.
For MC runs this key sets up the saving or reading
of the run history on
file <jobname>.<runnum>_<runvers>.hst
(For runnum=1 the run number part is omitted and for
runvers=1 the version part is omitted).
See also the key BUFF.
For analysis (SCAN) or
filtering (FILT **** TRAJ)
this command
specifies the format of the history file. An input
history can consist of several files with the same runnumber and
with consecutive versionnumbers (i.e., the
second, third, etc. piece should have names
<jobname>.<runnum>_<runvers+1>.hst ,
<jobname>.<runnum>_<runvers+2>.hst , etc.).
Note, that this naming convention also applies to
non-MMC formats that have customary extensions
(e.g., .pdb, .DCD) requiring either renaming
these files or setting up aliases for them.
Key1 (trajectory file format - if any):
- NONE - No saving of run history.
- ALLE -
The following information is stored at every accepted step in
blocks of 250 when no variable coupling parameter is applied:
- estac - Total energy of the configuration
- cst - Coordinates of the first three atoms of the
moved molecule istc.
- istc - Number (label) of the solvent molecule moved
(0 when the whole solute was moved).
- nmcst - Step number in the MC walk.
- bngst - Binding energy of the molecule moved. When
variable coupling parameter is used (i.e.,
FREE PMF1
run), the coupling parameter value is saved instead of the binding
energy.
This option allows the full reconstruction of the
run's history but works only when neither the key
PARD nor the key
PART is in use and
only in the canonical ensemble.
- ALLV - Same as ALLE, but the total viral sum is
saved instead of the binding energy.
Same limitations apply as for ALLE.
- ALLC - Trajectory file contains full configurations
in formatted (ASCII) records as follows:
- 1st line: number of atoms written (natomp), number of
solvent molecules+1, total number of atoms, first solute atom written
out, last solute atom written out, MC stepnumber, coupling
parameter in format(i6,9x,i6,8x,i6,7x,2i6,5x,i9,4x,f8.0)
- Next natomp lines: x,y,z coordinates (in &%197;)
(3f15.5) format
- Last -1 line: ins/del stepnumber, total energy, number of
accepted insertions and deletions, umbrella sampling weight
factor for the configuration
- Last line: solute-solvent energy and its contributions for
each interaction term. Important: for water solvents only
the first three atoms of each solvent molecule are written out.
For (T,P,N) ensemble runs the three edge parameters (see key
PBCN) are also saved
in an additional record in (3f10.5) format.
- ALLA - Trajectory file contains full configurations
file in formatted (ASCII) records as follows:
- 1st line: number of atoms written (natomp), number of
solvent molecules+1, total number of atoms, first solute atom written
out,i last solute atom written out, MC stepnumber, coupling
parameter in format(i6,9x,i6,8x,i6,7x,2i6,5x,i9,4x,f8.0)
- Next natomp lines: atomic symbol, x,y,z coordinates
(in Å), group (residue) number, partial charge with
(a4,1x,3f15.5,i5,f10.5) format
- Last -1 line: ins/del stepnumber, total energy, number of
accepted insertions and deletions, umbrella sampling weight
factor for the configuration
- Last line: solute-solvent energy and its contributions for
each interaction term. Important: for water solvents only
the first three atoms of each solvent molecule are written out.
- For (T,P,N) ensemble runs the three edge parameters (see key
PBCN) are also saved in an additional
record.
- ALLB - Trajectory file contains full configurations
in binary format:
- First record: number of solvent
molecules, number of atoms, umbrella sampling weight factor for the
configuration, MC stepnumber, insertion/deletion stepnumber, number
of accepted insertions, number of accepted deletions, first atom
written out, total energy, solute-solvent energy and its
contributions for each interaction term
- Second record:
the whole configuration
- For free energy runs the coupling parameter is written as an
additional record. Again, for water solvents only the
first three solvent atoms are written out.
- For (T,P,N) ensemble runs the three edge parameters (see key
PBCN) are also saved in an additional
record.
- ALLP - Trajectory file contains full configurations
in PDB format.
The first record is a REMARK giving the same information as the
first line of the ALLA format in
format(9x,i6,3x,i6,5x,i6,7x,2i6,5x,i9,4x,f8.0) and the next REMARK
may give the total energy and solute solvent energy in the format
(9x,e16.8,17x,e13.6); additional REMARKs may follow.
- ALLH - Trajectory file contains full configurations
in CHARMM CRD format.
The first title line gives the same information as the
first line of the ALLA format in
format(9x,i6,3x,i6,5x,i6,7x,2i6,5x,i9,4x,f8.0) and the next title line
may give the total energy and solute solvent energy in the format
(9x,e16.8,17x,e13.6); additional title lines may follow.
- CHRM - Trajectory file contains full configurations
in CHARMM binary trajectory format.
For runnum=1 the .DCD extension is also allowed.
- AMBR - Trajectory file contains full configurations
in AMBER format. The program will
check if the system is centered around (0,0,0) or
(Ex/2,Ey/2,Ez/2).
Key2 (level of solute changes):
- RGFX - The solute is rigid and is not moved during the
simulation
- RGMO - The solute is rigid and but it is allowed to
move during the simulation
- FLEX - The solute is flexible
Key3 after Key1=ALL*
(level of solute atom saving):
- ALST - All solute atoms are saved
- MVST - Only the solute atoms affected by partial
solute moves (PARD key) are saved
- NOST - Solute atoms are not saved
- NOSV - Solvent atoms are not saved
Key3 after Key1=AMBR (PBC information):
- NOBX - No box-size information after configurations
- BOX - After each configuration the simulation cell
edges are written in a new line
- BOXX - Read the box description after each
configuration but keep using the values inputted with the
NSLV command
Data: natskip, runnum, runvers,
tfilename, incrun, alttrajext
- natskip {0} - When natskip > 0 then
the last natskip atoms of each frame will be skipped.
When natskip < 0 then
the first natskip atoms of each frame will be skipped.
Ignored with MMC trajectories.
- runnum {current run number} - The
run number
of the trajectory file.
- runvers {1} - The version number of the (initial)
trajectory file.
- tfilename {jobname} - The trajectory file
is named tfilename.runnum_runvers.hst
- incrun {0} - If zero, additional trajectory segments will be searched
with increasing the version number; if one, with increasing the run number.
- alttrajext - the trajectory extension (instead of hst).
It has to be exactly three characters.
When running a MC simulation, full configurations
(Key1=ALL*)
will be written on the history file
at every nmcrec-th MC step
(read with the RUNS key).
III.01.07. Minimum energy extraction
MINE Key1
Related key:
TRAJ
Key1 (trajectory or overall):
- NONE - Don't extract the minimum energy
- TRAJ - extract the minimum energy
over every nmcrep MC step segment and
save it on the trajectory (.hst) file.
Requires the TRAJ key.
Also, save the overall lowest energy and write it on a PDB file
at the end of the run defined by the
RUNS key.
- ALLC - Only extract the minimum energy and write it
on a PDB file at the end of the run defined by the
RUNS key.
- WANN - Besides the function of ALLC
the configuration at each temperature change is written to a file
jobname_annh.runnum.pdb
- WANN - Besides the function of ALLC
the reassembled full configuration at each temperature change is written to a file
jobname_annfh.runnum.pdb (valid only when the
TRAJ key is present).
III.01.08. Proximity insertion/deletion acceptance rate
IDAC [Data]
Data: rpxidlim, nrange, irangestart, irangestop
(nrange times)
- rpxidlim {4.0} - Insertion and deletion acceptance
rates within rpxidlim of the nearest solute heavy atom will be
calculated for each solute heavy atom.
- nrange {0} - Number of solute atom index ranges
- irangestart, irangestop (nrange times) - proximity
index calculation will be restricted to heavy atoms in the rage(s)
irangestart - irangestop
III.01.09. Insertion/deletion log file specification
IDLG Key1 [Data]
Prerequisite: GCEN.
Prerequisite of:
IDAG
Related key: IDAG.
Key1 (ins/del log file creation):
- OFF - No log is written.
- ASCI - For each accepted insertion and deletion the
program writes +1 or -1 (for insertion or deletion), the solvent
molecule index, the number of solvent molecules after the
insertion/deletion, the MC
stepnumber and the coordinates of the center of mass of the
molecule inserted/deleted on the .idl file.
- ASPX - Same as ASCI, but the atom number
of the nearest solute atom to the insertion/deletion site as well as
the distance between them are also printed.
Data: rpxidlim
- rpxidlim {4.0} - Solute atoms with insertion and deletion sites
within rpxidlim will be listed with the respective counts.
III.01.10. Proximity analysis distribution plot file
writing
PXPL
If this keyword is present, the program will write an ASCII
file containing all the calculated proximity distributions to the
.pxp file.
The distributions written on the .pxp file can be plotted
with
the companion (Fortran-77) program pxps.f.
pxps.f produces Postscipt files of plots, 8 plots per page.
It can put two distributions on the same plot, but both have to use
the
same dependent variable.
The right-side vertical axis will belong to the second plot.
A typical run of pxps.f is as follows (user input is
displayed in bold face):
% pxps
Proximity plot generator Version:/07/22/98
Data file name root=wt1
File wt1.pxp opened <- the file wt1.pxp
was generated by
the PXPL key
Use @ for: @f=Greek fnt, @n=Normal fnt, @+=superscript
@-=subscript, @m=newline, @#n#=font size n, @@=@symbol
Title of graph page:
GnRH Wild Type, Extended Conformation
Do you want grid lines drawn (y/n)? n <- If yes, only one plot
can be plotted
Plot types:
1 Solute-solvent radial distribution functions <- from key PXGR
2 Solute-solvent pair energy distribution functions <- from key PXBE
3 Mean solvent orientation as a function of R <- from key PXDP
4 Distribution of first shell solvent orientations <- from key PXDP
5 Solvent-solvent radial distribution functions <- from key PXWW
Plot type number=1
Data colums: 1 R 2 gpx(R) 3 Kpx(R) 4 gt(R) 5 Kt(R)
Data column number=2
Do you want a second function also plotted (y/n)? y
Plot types:
1 Solute-solvent radial distribution functions
2 Solute-solvent pair energy distribution functions
3 Mean solvent orientation as a function of R
4 Distribution of first shell solvent orientations
5 Solvent-solvent radial distribution functions
Plot type number=1
Data colums: 1 R 2 gpx(R) 3 Kpx(R) 4 gt(R) 5 Kt(R)
Data column number=3
Number of graphs to print (0: will ask for range)=0
First and last distribution function=1,18
Data set not found (skipped): gkr is= 1 <- empty proximity
regions
Data set not found (skipped): gkr is= 10
File wt1_2_9_gpx_Kpx.ps opened
File wt1_11_18_gpx_Kpx.ps opened
More plots (y/n)? n <- allows to plot other distributions
%
The format of the .pxp file is as follows:
- Header:
- nsltpx, nsltpxgr (2i5)
- ianslt (20i4)
- resnam,atnam (20a4)
- nsltpx - Number of solute atoms analysed
- nsltpxgr - Number of rdf's generated
- ianslt - atomic numbers of all solute atoms analysed
- resnam,atnam - 8-character label of all solute atoms
analysed
- Radial distribution functions (PXGR
key was present). For each distribution function the program writes
- is, mxgrd (' is=',i4,' ngrids=',i4,' gkr')
- (grd(i), gr(i), rcoord(i),
grt(i), rcoort(i), i=1,mxgrd) (5f8.3)
- is - rdf number
- mxgrd - Number of gridpoins
- grd(i) - Radius of the i-th gridpoint
- gr(i) - Primary rdf
- rcoord(i) - Primary running coordination number
- grt(i) - Total rdf
- rcoort(i) - Total running coordination number
- Primary solute pair energy distributions
(PXBE key was present).
For each distribution function the program writes
- is, mxgrd (' is=',i4,' ngrids=',i4,' xpe')
- (egr(i),sltep(i),rcoore(i),i=1,mxgrd)
(10f8.4)
- is - rdf number
- mxgrd - Number of gridpoins
- grd(i) - Energy of the i-th gridpoint
- sltep(i) - Primary pair energy distribution value
- rcoord(i) - Primary running coordination number
- Primary solvent-solvent rdfs
(PXWW key was present).
For each distribution function the program writes
- is, mxgrd, nijgvvp
(' is=',i4,' ngrids=',i4,' ngvv=',i2,' gvv')
- mxgrd lines: grd(i),(groo(i,iv),v=1,nijgvvp)
(10f8.4)
- is - rdf number
- mxgrd - Number of gridpoins
- nijgvvp - Number of rdf types (between different
solvent centers)
- grd(i) - Radius of the i-th gridpoint
- groo(i,iv) - Primary solvent-solvent rdf
- Solvent mean orientation
(PXDP key was present).
For each distribution function the program writes
- is, mxgrd (' is=',i4,' ngrids=',i4,' tav')
- mxgrd lines: (grd(i),tav(i),i=1,mxgrd) (10f8.4)
- is - rdf number
- mxgrd - Number of gridpoins centers)
- grd(i) - Radius of the i-th gridpoint
- tav(i) - Primary mean solvent-solute angle
- Solvent orientation rdf (first shell)
(PXDP key was present).
For each distribution function the program writes
- is, mxgrd (' is=',i4,' ngrids=',i4,' xtd')
- mxgrd lines: (grd(i),xtd(i),i=1,mxgrd) (2f9.4)
- is - rdf number
- mxgrd - Number of gridpoins
centers)
- grd(i) - Angle THETA of the i-th gridpoint
- xtd(i) - Primary solvent-solute angle distribution
III.01.11. Proximity information file specification
PXWR Key1 [Key2 Data]
Prerequisite: PXCR,
Key1 (file format - if any):
- OFF - Proximity information writing is turned off.
- ASCI - The proximity information file is an annotated
ASCI file containing the following data for each configuration
analyzed:
(nsv,nmc,ncnfpx,
(iproxi(i),resname(i),resno(i),atnam(i),rmin(i),esvst(i),i=1,nsv)
where
- nsv is the number of solvent molecules
- nmc is the MC stepnumber
- ncnfpx is the number of configurations analyzed so far
- iproxi(i) is the solute atom nearest to solvent i
- rmins(i) is the corresponding minimum distance (in Å)
- esvst(i) is the interaction energy of solvent i with the solute
(in kcal/mol).
- BNRY - The proximity information file is a
binary file containing one record for each configuration analyzed:
(nsv,nmc,ncnfpx,(iproxi(i),rmin(i)2,esvst(i),i=1,nsv)
Important: iproxi is stored as INTEGER*2 to save space.
Note, that the size of a file created with the BNRY option
is about the tenth of the size of a file
created with the ASCI option.
- BNRR - The proximity information file is a
binary file containing one record for each configuration analyzed:
(nsv,nmc,ncnfpx,(iproxi(i),rmin(i)2,i=1,nsv) .
This file is about 2/3rd of the size of the file written with
the BNRY option.
Key2 (read or write):
- WRIT - Write the proximity information
- READ - Read the proximity information from an existing
.pxi file
Data:
- numrunpxi - The
run number
of the .pxi file to read from
(only read with Key2=READ)
The proximity information file has extension .pxi.
III.01.12. Converting/shifting a trajectory
WTRA Key1 Key2 [Data]
Prerequisite: TRAJ
Key1 (transformation type):
- UNCH - Don't change coordinates
- STCT - Shift the system to put the solute's center of
mass at the cell center
- SHCT - Shift the system to move the origin from the
lower left corner of the box to the center (update PBC images
accordingly)
Key2 (trajectory format):
- One of ALLC, ALLA, ALLB, ALLP -
see key TRAJ.
Note, that ALLB is the most compact.
Data: nmcmax, nmcfreq, nmcskip,
incvers, nmcmaxseg
- nmcmax {100000000} - gather configurations until the
nmcmax-th step.
- nmcfreq {1} - use configuration saved at every
nmcfreq-th simulation step
- nmcskip {0} - skip the first nmcskip simulation steps
- incvers {10} - The converted trajectory will
have a version number that is incremented from the initial
by incvers
- nmcmaxseg {0} - When > 0, after nmcmaxseg steps,
open a new output file with version number incremented by one.
This key allows the conversion of the input trajectory to a new
format
and gives the option of centering it in two different ways.
III.01.13. Contact elimination
STIR [Data]
Prerequisite: CNFG
Data: eijmin, disp, dispa
- eijmin {1000.0}, disp {3.0}
- solute molecule pairs with energy above
eijmin kcal/mol will be pulled apart by disp Å
- dispa {90.0} - solute torsion group members with intramolecular
energy exceeding eijmin will be incremented by dispa
degrees.
This option is useful for 'cleanng up' a starting configuration
generated with the RANC option.
III.01.14. Local disk directory
LPAT [Data]
Data: scratchpath
- scratchpath - The full path to the directory where temporary files
and checkpoint files are written during the run.
The checkpoint files wil be copied to the originating directory only
at the end of the run.
This option allows the reduction of network traffic.
III.02. System descriptor keywords
- NSLV: Number of
solvent molecules
- TEMP: Simulation
temperature
- TITL: Description
- PBCN: Periodic boundary
conditions
- GCEN: Grand-canonical (T,V,mu)
ensemble
- STPX: Stop GCE run at
nx
- BTUN: Tune GCE B
parameter
- LIMG: Limit the cavity
grid
- IBEN: Isothermal isobaric
(T,P,N) ensemble
- LCMP: Virial biasing factor
- CNFG: Initial
configuration
- SCRM: Scramble torsion angles
- SCAL: Simulation cell
rescaling
- OVST: Overlap of the second
solute copy with the first
- SPRD: Spread out solute
molecules
- PBGR: Solute reset with
grids
- LIGA: Multi-copy ligand
- STUN: Stepsize tuning
- TAC0: Stepsize accumulator reset
- SANN: Simulated annealing
- SACP: Simulated annealing of the chemical potential
III.02.01. System size
NSLV Data
Can not be followed by:
HRDW
Potential successors:
CNFG, RCKP
Data: numsol, nsolfix
- numsol - Number of solvent molecules.
For grand-canonical ensemble runs (GCEN key)
using CNFG RAN* will generate
numsol randomly placed solvents to start with.
- nsolfix {0} - The first nsolfix solvent molecules
will be kept fixed during the simulations.
nsolfix > 0 requires that the solute does not undergo
translation/rotation (cedslt and rtxslt should both be
0.0 - see the STEP key).
If the input has no NSLV key then the number of solvent molecules
will be established from the .crd or .hst files.
III.02.02. Simulation temperature
TEMP Data
Prerequisite of:
RUNS, SANN
Data: T
- T - the simulation temperature in Kelvin
III.02.03. Title
TITL Data
Potential successor:
RCKP
Data: description of the calculation.
The program is prepared to read at most two title lines.
III.02.04. Periodic boundary conditions and cell
dimension.
PBCN Key1 [Key2 Key3] Data [Fdata]
Prerequisites:
SUVC, SVVC
Potential successor:
RCKP
Prerequisite of:
SLTA, GCEN,
MOND, LIMG,
VORO, FREE,
PRPB, GENS
Key1 (cell shape):
- RECT - Rectangular (cubic) boundary conditions (SC).
Data: edgex [edgey, edgez] (in Å).
- edgex, edgey {edgex}, edgez
{edgex} - The X,Y,Z edges of the rectangle.
- Vol=edgex*edgey*edgez Å3,
Rinsc=min(edgex,edgey,edgez)/2
- FCC - Face-centered cubic boundary conditions
(FCC).
Data: edg
- edg- The edge of the rombohedral dodecahedron is
edg*sqrt(3)/2
- Vol=2*edg3Å3,
Rinsc=edg/sqrt(2)
- TOCT - Truncated octahedron boundary conditions.
Data: edg
- edg - distance of the squrre face from the center
- Vol=4*edg3Å3,
Rinsc=edg(sqrt(3)/2)
The coordinate axes go through the center of the square faces.
- HCP - Hexagonal close-packed boundary conditions.
Data: edg
- edg - diameter of the inscribed sphere
- Vol=edg3/sqrt(2)Å3,
- Rinsc=edg/2
- HEXG - Hexagonal prism boundary conditions (HP).
Data: plen, pedge
- plen - The length of the prism
- pedge - The edge of the hexagon.
- Vol=plen*pedge2*3*sqrt(3)
Å3,
Rinsc=min(plen, pedge*sqrt(3)/2.
- The axis of the prism is the X axis and the vertex of the
hexagon goes through the Z axis.
For analyzing trajectories with different axis conventions, use the
PXYZ key.
- SPHR - Sphere boundary conditions (SPH)
Data: rsph
- rsph-radius of the sphere.
- PHS - Primary hydration shell
(Beglov & Roux)
Variable to set to 'T' in the preprocessor: PH
Key2 (solute atom radius source):
- RVDW - Standard Van der Waals radii will be used for
the solute atom radius.
- RSIG - Lennard-Jones SIG/2 values will be used for the
solute atom radius.
Key3 (target energy definition):
- UTTO - uphsref read is the
total restraining energy
- UTPM - uphsref read is the
per molecule restraining energy
- UPML - uphsref read is the
per molecule restraining energy, but solvents farther than
a threshold are ignored for the reference shell energy calculation
Data: uphsref, phsk, rphs,
rphsmin, rphsmax, denphs, nmcphs,
ranboxedge,
rphswid
(only for Key3=UPML!),
naphsrange, iaphsrange1, iaphsrange2, ...
- uphsref - reference hydration shell energy
- phsk - force constant of the restraining potential
- rsph, rphsmin, rphsmax - initial,
minimum and maximum radius of the shell, resp.
- denphs - solvent bulk density (used to calculate g(r)'s)
- nmcphs - frequency (in MC steps) of shell radius updates
- ranboxedge {1000.0} - edge of cube within which random
points will be generated for proximity RDFs
- rsphwid {rphsmax} - width of the
restraining shell contributing to
the per molecule restraining energy (used only with Key3=UPML).
- naphsrange {0} - the number of solute atom ranges the
primary hydration shell is restricted to (when naphsrange=0, all
solute heavy atoms will be used)
- iaphsrange1, iaphsrange2 (naphsrange times)
- the first and last solute atoms of each range to use
- READ - Input boundary conditions.
Fdata:
- RECTYPE 29: rinscr, vol, rmax (3F10.0)
- rinscr - Radius of the largest inscribed sphere into
the cell.
- vol - Volume of the unit cell (in Å3)
- rmax - For minimum image convention the cutoff
radius used is rmax.
- RECTYPE 39: ncell (I5)
- ncell - Number of cell-center coordinates to be read.
- RECTYPE 38's: (cic(k,i),k=1,3) (3F15.0)
- ncell times repeated.
- cic - The negatives of the
coordinates of center i (i.e., -X, -Y, -Z).
Note: Input boundary conditions work only if the image cell
can be found by comparing the distances of a point from the
various cell centers, the smallest distance will belong tot he
wanted cell number. It works when the periodic system
can be described by an orthogonal crystal coordinate
system. For other cases, more complicated calculations are
required - to start with, a transformation to
non-orthogonal coordinate system - but it is not
implemented yet.
For pure liquids, FCC is the best since this maximizes the
distance of a molecule from its image. On the other hand,
RECT is conceptually the simplest. Furthermore, for
RECT the dimensions of the cell can be different along the
three axes, allowing for various types of crystals. Solutes that
are far from spherical can also be wrapped around by waters more
efficiently.
(In this case, however, the solute rotation should be prevented
- see key STEP.) HEXG have been
developed for long
solutes or solutes representing an infinite polymer chain, to
maximize the distance of the chain from its image. SPHR
simply encloses the system into a sphere of inputted radius. Note
that the FCC calculations are only partially vectorized.
For the rectangular, face-centered cubic, hexagonal prism
and sphere boundary conditions there are special algorithms in
the program, one for each. When boundary conditions are inputted,
the cell number of the periodic image cell where a molecule is
located is determined simply by comparing the distance of the
molecule from the various cell centers read in.
The unit cell parameters are determined from the bulk
density or molar volume of the solution, Nmol, and the
choice of boundary conditions. If the density of the system is d
g/ml , then
d (g/ml) = weight in g of Nmol / volume of
Nmol in ml =
(Nmol/L)*Mol.wt / (Vol/1024)
where L is Avogadro's number. The cell parameter(s) can be
obtained from the resulting Vol value.
III.02.05. Grand-canonical ensemble selection
GCEN[ Key1 Key2 Key3 Key4 Key5 Data ]
Fdata
Prequisite: PBCN
Can not be followed by: LIMG
Prerequisite of:
PRTG, PRFI,
BTUN, STVG,
STPX, SACP
Related keys:
IDLG,
IDAG
Variable to set to 'T' in the preprocessor: CB
Grand-canonical ensemble simulation using the cavity-biased
method with grid-insertion or the original,
random insertion method.
Key1 (GCE insertion strategy):
- OFF - Return to canonical ensemble.
- CAVB - Cavity biased insertions at gridpoints
- CVBF - Cavity biased insertions at a rectangle around
the gridpoints
- UNBI - Unbiased (random insertions)
Key2 (solute atom radius source):
- RVDW - Standard Van der Waals radii will be used for
the solute atom radius.
- RSIG - Lennard-Jones SIG/2 values will be used for the
solute atom radius.
- RSGH - Like RSIG, but hydrogens with SIGMA=0 will
be assigned SIGMA=1.
Key3 (insertion/deletion selection strategy):
- ALTI - Alternating insertion and deletions attempts
- RANI - Randomly decided insertion and deletions
attempts
Key4 (alternative Pcav definitions):
- ALLG - Pcav is defined as the number of free grids /
total number of grids
- SLVG - Pcav is defined as the number of free grids /
number of grids not covered by the solute
Key5 (strategy to use electrostatic term
for insertion/deletion steps):
- LJQI - All potential terms are used for the I/D
decision.
- LJNC - The electrostatic contributions are
neglected for the insertion/deletion decision. Ensemble averages
for the thermodynamical quantities are not corrected
for the applied approximation.
- LJCN - As in LJNC, but the ensemble
averages for the thermodynamic quantities are corrected for by
utilizing the applied umbrella sampling.
- LJCU - As in LJCN, but the umbrella
sampling is applied to both the insertion/deletion and the regular
moves, and thermodynamic quantities are
corrected for the biasing.
Data: ucspma, usmixp.
- ucspma - Average solute-solvent electrostatic
energy(in kcal/mole). If the supplied average electrostatic
solute-solvent energy is less than -1000 kcal/mole,
then the program uses the starting configuration's
value for the average solute-solvent electrostatic
energy. Umbrella sampling bias is Ew =
usmixp*(us1-n*ucspma) where
us1=solute-solvent electrostatic
energy, and n=number of solvent molecules at a given
configuration.
Fdata:
- RECTYPE 13: ba,
diamslv,
pmvslt, ngridx, ngridy, ngridz,
nslvxp,
nmolfx, idfreq, idrepf, nmcransh
(3F10.0,6I5,2I10)
- ba - The B parameter of
Adams. The larger it is, the larger will the
chemical potential be.
- diamslv - The solvent diameter in Å, to be used as the cavity radius.
- pmvslt - The molar volume of the solute (to
be used in the density calculation).
- ngridx, ngridy, ngridz - The
number of gridpoints in the X, Y and Z directions (not used for
GCEN UNBI runs).
By default, the whole simulation cell is covered by the grid,
but a smaller portion can also be specified with the
LIMG key.
- nslvxp - The targeted number of solvent molecules
referred to the key STPX
- nmolfx {0}- The number of solvent molecules that are
not allowed to be deleted.
- idfreq - The frequency of insertion/deletion
attempts (in MC steps).
- idrepf - The frequency of summary line (in
Ins/Del steps)
- nmcransh - The inital grid will be
regenerated after every nmcransh-th MC step with a random
shift in its position.
Other aspects of a GCE simulation are controlled by the keys
PRFI, IDLG,
IDAG, STPX,
LIMG, and BTUN.
III.02.06. Stop GCE run at nx
STPX Data
Prerequisite: GCEN,
Data nslvxp
- nslvxp {nslvxp read with the
GCEN key}
- target number of solvents to stop at.
When this key is present the run will stop if the number of
solvent molecules reached nslvxp.
III.02.07. Tune the GCE B parameter
BTUN Key1 [Data]
Prerequisite: GCEN,
MOND or BLKW
Key1 (tuning algortithm)
- NONE - Turn off tuning, continue with the last B value
(not recommended)
- AVRG - Turn off tuning, continue with the average B value
over the last run
- ADPT - Adaptive algorithm
- FLUC - Fluctuation-based algorithm
- PICL - Process control algorithm
Data for Key1=ADPT and Key1=FLUC:
targetden, nmctunave, nmctunskip,
chabmax, dentol, inorout
- targetden {0.997} - The targeted density in the bulk
region as defined by the MOND key.
- nmctunave {100000} - The number of MC steps to use for
collecting data before each B parameter change.
- nmctunskip {nmctunave} - The number of MC steps
to skip (as equilibration) after each B parameter change.
- chabmax {1.0} - The maximum absolute change allowed for
the B parameter.
- dentol {0.01} - The density range around
targetden within which the tuning is considered converged.
- inorout {2} - when =1, the region inside the cube defined
by the MOND key will be assumed bulk;
when =2, the region outside the cube is assumed bulk.
- targetn {-50.0} - the targeted number of solvent molecules
in the region selected by inorout. If it is positive, it will
override the targetden value read above.
Data for Key1=PICL:
targetden, integraltime, gain, inorout
- targetden {0.997} - The targeted density in the bulk
region as defined by the MOND key.
- integraltime {13000.0} - integraltime
- gain {-113.0} - gain
- inorout {2} - see above.
- targetn {-50.0} - see above.
The program will periodically change the GCE B
parameter
to achieve the targeted density in the 'bulk' region - defined as
the outside region by the MOND key.
Note that the keys NONE and AVRG are only meaningful
after having restored a checkpoint file
(key RCKP).
III.02.08. Limit the cavity grid
LIMG [Key1,Data]
Prerequisite:
PBCN
Potential successor:
GCEN
Key1 (used only for GCE simulations to specify treatment of
boundary crossings):
- NOWA - Free movement of solvents in and out of the grid
region
- WARC - Free movement of solvents in and out of the grid
region but counts of border crossings are maintained
- WALL - Movement of solvents in and out of the grid
region
is prevented, counts of crossing attempts are maintained
Data:
glimminx, glimmaxx, glimminy, glimmaxy,
glimminz, glimmaxz
- The limited grid will extend from
glimminx to glimmaxx,
glimminy to glimmaxy,
glimminz to glimmaxz in the x, y, and z directions,
respectively.
For any glim*value that reaches or exceeds the limits of the
simulation
cell (centered at (0,0,0), the periodic boundary conditions will
also be invoked.
III.02.09. Isothermal isobaric ensemble simulation (TPN)
IBEN Key1, Key2, Data
Can not be followed by:
VCHA
Potential successors:
CNFG, VCHA
Prerequisite of:
LCMP
Related key:
LCMP.
Variable to set to 'T' in the preprocessor: IB
Key1 (volume sampling technique):
- NONE - Turn off (TPN) ensemble sampling
- UNIF - (TPN) ensemble sampling with uniformly
distributed steps
- VB3D - (TPN) ensemble sampling with
virial-biased sampling of Mezei.
- VB1D - (TPN) ensemble sampling with refinement of the
virial-biased sampling of Mezei,
using the components of the virial sum to generate cell size
changes
in the X, Y, and Z directions, respectively.
Key2 (volume change (un)isotropy):
- ISOT - Volume change involves all three axes
- IXYZ - Volume change alternates between changes in the
X, Y or Z directions.
- ISYZ - Volume change alternates between changes in the
X dimensions and in the Y-Z dimensions.
directions
Data:
- press {1.0} - The external pressure specified (in
atm)
- vrange {100.0} - The volume change range (in
Å3)
- nvchfreq {nmolec} -
The frequency of volume change attmpts (in MC steps)
- nvchrep {nmolec} - The frequency of reports on
volume change
attmpts (in number of volume change attempts)
- vlam {0.5} - The virial biasing lambda factor.
For Key2=IXYZ:
- ixyzfreeze(1), ixyzfreeze(1), ixyzfreeze(1)
{0,0,0} - If ixyzfreeze(k) is > 0 then no change will be done along
the k-th axis.
Note, that putting the key IBEN after CNFG
will result in ignoring the cell size on the .crd file
and the vaulues inputted with the PBCN key will be used
instead.
III.02.10. Virial-biasig factor
LCMP Data
Prerequisite:
IBEN
Data:
- complfac {0.5} - The scaling factor to use for virial
biasing (corresponding to the lambda factor for force biasing).
III.02.11. Initial configuration
CNFG Key1 [ Key2 Key3 Data ] [Fdata]
Prerequisites:
SLTA, FILE
Can not be followed by:
FREE, TORD,
CMPC, NSLV,
IBEN, VCHA,
PXYZ
Potential successors:
RCKP, SPST
Prerequisite of:
RUNS, SCAL,
PRTG, REGE,
SPRD, VORO,
STIR, SCRM,
TORT, LPST,
GENT
Related keys:
GENT,
SPRD,
SCRM,
STIR.
Key1 (data source):
- READ : Initial configuration is read from user supplied
.crd file. For Key2=PDB or Key2=CHRM,
the extensions .pdb and .CRD, resp., will be
assumed first.
- RDEX : When part of the system is only used for grid
generation (see key FLEX) the full
configuration is read from a PDB or Charmm CRD file and the
moving atoms will be extracted and saved in ASCI format on
a .crdfile.
Key2 (file format):
- ASCI - Coordinate file will be read/written in
ASCII.
- BNRY - Coordinate file will be read/written in
binary.
- ASAN - Coordinate file is an annotated file written
with WCNF SVAN.
- PDB - Coordinate file is imported from
PDB format
If the program is dimensioned for generic sites
(nmolec < #MH) then the solvent fractional occupancies and
RMS's are also read from the last two data columns.
- CHRM - Coordinate file is imported from CHARMM CRD format
Key3 (configuration check procedure):
- NOFX - Abort if solute on the .crd file
differs form the input definition
- FXCR - If solute on the .crd file differs
form the input definition by more than 0.01 Å,
use the input definition for both
- FXCC - If solute on the .crd file differs
form the input definition, by more than 0.001,
use the input definition for both
- FXIN - If solute on the .crd file differs
form the input definition, use one on the .crd file for
both
- IGND - Ignore if solute on the .crd file
differs form the input definitition
Data: numrunr, irunvers, jobnamer
- numrunr {numrun},
irunvers {1},
jobnamer {jobname} - The program will look for the file
<jobnamer>.<numrunr>.crd
(<jobnamer>.crd when numrunr=1) to read the
coordinates from. If the filename is different from the
default coordinate file name, the program will open a new
coordinate file name using numrun as well, and copy the
coordinates there.
- RANC - The starting configuration is obtained by
- Placing the solute
(as defined by the SLTA key)
at the cell center,
with its COM shifted to the cell center.
This shift is added to the information that was read by the
SLTA key.
- Inserting the number of solvents
requested by the NSLV key at random
location and orientation. Overlap with the solute van der Waals
envelop is prevented.
- When solute molecules are cloned, the cloned copies will be
placed at randomly selected points with randomly selected
orientations in the cell,
with their COM outside the rest of the solute.
No tests for overlap will be performed.
- Force bias and the calculation of
distribution functions will be turned off.
See also the key STIR
for the elimination of persistent contacts.
Key2 (file format): see CNFG READ
Key3 (determination of cell size and number of solvents):
- SIZE - The cell dimensions given
by the PBCN key
and the number of solvents given by the
NSLV
key will be used
- NTOS - Cell dimensions determined from externally
provided partial molar volumes and number of solvent molecules.
given by the
NSLV key.
For
PBCN RECT: all edges are the same;
for PBCN HEXG: provide the
prism length as plen with PBCN HEXG
- STOS - Cell dimensions are determined from
externally provided solute and solvent partial molar volumes and
number of required shells. Data inputted with the keys
NSLV and PBCN
will be disregarded.
- STON - Number of solvents is determined from the
externally provided partial molar volumes and cell dimensions
given by the PBCN key.
Data: pmvslt, pmvslv, rnshll,
cplpar
- pmvslt - Partial molar volume of the solute in
ml/mole.
- pmvslv {18.12004 (water at 25 deg C)} - Partial
molar volume of the solvent in ml/mole.
- rnshll {2} - The number of solvent shells required
around the solute in the simulation cell (only used for
Key3=STOS)
- cplpar {0.0} The coupling parameter value to use
for the creation of the third copy (for
FREE PMF1 or
FREE PMNLruns)
- RANI - Same as RANC, except that the solute will
be placed into the simulation cell unchanged (i.e., as defined by the
SLTA key).
- TRAJ : The initial configuration is gathered from the
trajectory file opened previously.
Key2(file format): see CNFG READ
- RPSU : A built-in procedure is used. The first molecule
of an old system is replaced by the solute. The COM of the old and
new solute will coincide. Solvents overlapping with the Van der
Waals envelope of the new solute will be discarded. The
resulting configuration will be written on the .crd file.
Also, DSTC NONE and
SAMP METC will be set.
Key2(file format): see CNFG READ
Data: numrunr, jobname, noslt
- jobname, numrunr - Old configuration is to be
read from file <jobname>.<numrunr>.crd
- noslt - Number of solute atoms in the old
configuration.
Fdata:
- RECTYPE 23: iclslt(1),...,iclslt(noslt)
(14I5)
- iclslt(i) - Atomic number (for EPEN potential) or
atom type (for Clementi potential) of the I-th atom in the old
solute.
- RPUV : The starting configuration is obtained from an
old system by replacing both the solute and solvent molecules in
such a way that the orientation matrices are preserved. Both
the number of solute and solvent atoms can change.
Key2: see CNFG READ
Data: numrunr, jobname, noslt,
noslv, iop37o
- jobname, numrunr - Old configuration is to be
read from file <jobname>.<numrunr>.crd
- noslt - Number of solute atoms in the old
configuration.
- noslv - Number of solvent atoms in the old
configuration.
- iop37o - Potential identifier of the old solvent
(0-1-2 <=> MCY-TIP3P-TIP4P).
Fdata:
- RECTYPE 25's: (not needed for
SLTA SLVT).
- noslt records - one for every atom of the old
solute:
- ioslt(i),(coslt(j,i),j=1,3) (I5,3F15.10)
- ioslt - Atomic number for atom i (6 for C, 8 for
O, etc.) for EPEN potential and atom type for all the other
potentials.
- coslt - Coordinates of the old solute.
- RECTYPE 26's: (read only when iop37o .ne. 0)
- noslv records - one for every atoms of the old
solvent:
- ioslv(i),(coslv(j,i),j=1,3) (I5,3F15.10)
- ioslv - atomic number for solvent atom i.
- coslv - X,Y,Z coordinates of atom i in the old
solvent.
- PMFN - Create an input file for conformational
transition from a file with a fixed solute conformation. It takes
the conformation given after SLTA
(RECTYPEs 14) as Ro reads in the R1
conformation
(together with new charges) and prepares the linear combination of
Ro and R1.
All three solute conformations
will be stored as a single solute on the .crd file, along
with the value of the parameter CP (read from input). Note, that it
assumes that the solute in the input configuration is in the
Ro conformation.
Key2: see CNFG READ
Data: numrunr, jobname, nstfa0new,
cplpar
- jobname, numrunr - The input file containing
a full configuration with the solute in its initial conformation is
<jobname>.<numrunr>.crd . The solute
conformation has to be the one described by the data after
SLTA.
- nstfa0new - the number of FE solute atoms in the
file <jobname>.<numrunr>.crd
.
- cplpar - The starting value of the coupling
parameter CP.
Fdata:
- RECTYPE 31: iclslt(i),(cslt(j,i),j=1,3),
qslt(i) (I5,4F15.5)
- iclslt, cslt, qslt - Description of
the second solute conformation. Identical meaning as for the data
after SLTA. The number of real solute
atoms is nslt
in the above description. This same number should appear also
with the keyword SLTA. However, the
program will
replace it with 3*nslt, corresponding to the fact that
it stores three copies of the free-energy solute, one each of the
it stores three copies of the free-energy solute, one each of the
two extremal conformations and the combined
conformation.
- RPUU - Same as RPSU, but no solvent molecule is
discarded.
III.02.12. Scramble torsion angles
SCRM
Prerequisite: CNFG
When this key is present, all torsion angles are given a random value and
the solute is regenerated in this random conformation.
III.02.12. Simulation cell rescaling
SCAL Data
Prerequisite:
CNFG
Data: e1, e2, e3
- e1, e2, e3 - Same meaning as edgex,
edgey, edgez for the PBCN
key, defining the new cell. The COM's of the solvents will be
scaled
to fill in the new box evenly.
III.02.13. Overlap of the second solute copy with the
first
OVST Data
Data: i1, i2, i3
- i1, i2, i3 - The second copy of the solute
will be shifted and rotated to best overlap with the first. The
overlap is based on the coordinates of atoms i1, i2,
i3. If any of them are zero, the first three atoms will be
chosen. This is useful for free-energy calculations to reduce the
change in the solute that the calculation has to model.
III.02.14. Spread out the solute molecules for display
SPRD [Data]
Prerequisite: CNFG
Data:
- numrunw {numrun} - the runnumber of the PDB file
to be written.
This key generates a PDB file containing the solute molecules
spread out
in the x-y plane so that they don't overlap.
III.02.15. Force solute reset with grids
PBGR
When this key is present, the grid calculations will reset
each solute atom to the periodic cell before calculating
the cover list. This is a safety option, since it is automatically
set when the inital structure contains solute atoms outside the
simulation cell.
III.02.16. Multi-copy ligand
LIGA Data
Data:
- ligands -
The number of solute molecules that are different copies
of the same ligand (they will not see each other).
III.02.17. Stepsize tuning
STUN Key1, Key2, Key3, Data
Prerequisite:
MVRT
Can not be followed by:
MVRT
Prerequisite of:
TAC0
Related key: TAC0.
Key1 (move type):
- TRAN - Tune solute molecule translation stepsize
- ROTA - Tune solute molecule rotation stepsize
- TORS - Tune solute molecule torsion stepsize
- LOOP - Tune solute molecule loop move stepsize
Key2 (tune type):
Data: targetacc, wsum, timeint, gain,
wstepsum, tunedstepmax
- targetacc {0.3} - target acceptance rate (< 1.0)
- wsum {0.999} - current acceptance rate Pacci
for tuning purposes is
Pacci - wsum*Pacci-1 + (1-wsum)*iacci
where iacci is 1 for accepted and 0 for rejected moves.
- timeint {2000.0} - integraltime of tuning
- gain {-50.0} - gain of tuning
- wstepsum {wsum} - weight factor used to calculate the
average tuned stepsizes, analogously to the calculation of the
acceptance rates.
- tunedstepmax - the maximum value the tuned stepsize(s) can
take. Default value is 2 Å for Key1=TRAN, 180o otherwise.
Note that the solute sampling should be set to alternating
translation and rotation with the
MVRT
key. For cloned solute molecules (see the key
CLON) the tuning will generate identical
stepsizes for each instance of a torsion angle.
III.02.19. Stepsize accumulator reset
TAC0 Key1, Data
Prerequisite: STUN
Key1 (move type): same as Key1 of
STUN
Data: nmc_zeroacc
- nmc_zeroacc - the stepsize accumulator for the step type
specified by Key1 will be reset at the nmc_zeroacc-th MC step.
nmc_zeroacc
has to be a multiple of the frequency of printing full results
(nplt read by the RUNS key).
This option is designed to work with
STUN **** OFFC.
III.02.19. Simulated annealing
SANN Key1, Data
Prerequisite: TEMP
Key1 (annealing schedule):
- NONE - turn off simulated annealing
- LINE - temperature is decreased by constant increments
- EXPO - temperature at the nstep-th step is set as
T=T0*exp(-nstep*expfac)
- LOG - temperature at the nstep-th step is set as
T=coef_log/ln(nstep+2)
- LIST - temperature values at each step are specified from input
Data for Key1=LINE: nmctempstep, tempinc
- nmctempstep - temperature is changed at every
nmctempstep MC step
- tempinc - temperature decrement at each step
- If tempinc=0.0 then tempfin, the final temperature;
MMC will calculate tempinc
Data for Key1=EXPO: nmctempstep, expfac
- nmctempstep - temperature is changed at every
nmctempstep MC step
- expfac - temperature decrement exponent at each step
- If expfac=0.0 then tempfin, the final temperature;
MMC will calculate expfac
Data for Key1=LOG: nmctempstep, coef_log,
- nmctempstep - temperature is changed at every
nmctempstep MC step
- coef_log - prefactor of the logarithmic formula
- If coef_log=0.0 then tempfin, the final temperature;
MMC will calculate coef_log
Data for Key1=LIST: nmctempstep, ntemplist,
templist(1),...,templist(ntemplist)
- nmctempstep - temperature is changed at every
nmctempstep MC step
- ntemplist - number of temperature values to read (< 101)
- templist(i) - the temperature at the i-th temperture step.
After 100 steps, the temperature stays at the last list value.
The run stops if a negative value is reached.
III.02.20. Simulated annealing of the chemical potential
SACP Key1, Data
Prerequisite: GCEN
Key1 (iteration length)
- FIXS - Iteration lengths are fixed, independent of the number of molecules
- SCLS - Iteration length is proportional to the number of molecules
Data: bincr, nmcsacpstep, fract_lim
- bincr - B parameter increment
- nmcsacpstep - for Key1=FIXS
the B parameter will decreased by bincr at
every nmcsacpstep MC step;
for Key1=SCLS the B parameter will decreased at every
nmolec*nmcsacpstep MC step.
- fract_lim - Cavity biasing will be turned off if the fraction of
the number of free grids exceeds fract_lim
When this option is used and the trajectory writing
(key TRAJ)
is not used, at each B parameter change the solvent coordinates
will be saved on the history file as PDB MODELs.
III.3. Free-energy calculation keywords
- FREE: Free energy
options
- RAUS: AUS parameter
change
- WMAT: PMF matching
- WPLT: AUS iteration plot
- TIQU: TI quadrature
- OVRA: Overlap ratio
- FETK: Temperature scaling for TI
evaluation
III.03.01. Free energy options
FREE Key1, [Key2-4 Data Fdata]
Prerequisite: PBCN
Potential successors:
CNFG, SLTA,
TORD, RCKP,
PARD
Related keys:
TIQU,
WMAT,
WPLT.
Key1 (free-energy method choice):
- WIDO - Excess chemical potential is calculated using the
Widom insertion method with cavity modification.
This option requires either a subsequent SCAN key
or the PXAN key before the
RUNS key
Data: rspha, ngridx, ngridy,
ngridz,
ngtry, nitry, ishftx, ishfty,
ishftz, nmolwid; nmolwid times
(ifdummy, ew0)
- rspha - Cavity radius (solute atom sizes are defined
by the vdw radii see key PRAT)
- ngridx, ngridy, ngridz -
Number of grid points in the x, y, z directions, respectively.
By default, the whole simulation cell is covered by the grid,
but a smaller portion can also be specified with the
LIMG key (preceding the
FREE key)
- ngtry {1} - Number of different grids (different random shifts)
to use for each configuration
- nitry {1} - Number of insertion trys per grid
- ishftx {0}, ishfty {0}, ishftz {0} -
When isftx=1, no grid shift is applied in the x direction;
same for y and z.
When isftx=2, succesive grids will be centered at i/ngtry.
nmolwid {1} - Number of different solutes to calculate free energy for.
For each solute:
- ifdummy {0} - When ifdummy=1, the first atom of the
inserted molecule is assumed to be non-interacting (used only to
ensure the best centering into the cavity).
- ew0 {0.0} - expected energy value (will be subtracted
from all calculated energies before forming exponentials, added
back at the end)
- ADDW - An excess chemical potential-like quantity
is calculated for the mutation of a solute atom into
a new group, using the Widom insertion method formalism.
In the figure below, atom X will be replaced
by different groups. These groups have to be defined as
separate molecules, counted as free-energy molecules and
they have to include at least the atoms that are
in 1-4 position to the atom X. In the figure
below they are labeled as A.
L A A
L-L | | |
| L L----A D A
| | / \ / \
L-L--L--L A--R A--R
| | \ / \ / \
| L L----A X A G(X)--G
L-L | | | |
| L A A G
This option requires either a subsequent SCAN key
or the PXAN key before the
RUNS key
Key2 (ligand flexibility):
- FIXL - the conformation of the ligand (atoms L, A, R, D and X)
is unchanged during the simulation
- FLEX -
the conformation of the ligand varies during the simulation
Data:
nwgtry, nwitry, nmolwid, nmcransh,
nghanchor;
nghanchor+1 times ighanchor;
nmolwid times ew0
- nwgtry {1} - Number of different orientations of the
replacement group to try (evenly spaced)
- nwitry {1} - Number of different orientations of the atom
to be replaced
- nmolwid {1} - Number of different groups to use as a replacement.
- nmcransh {0} - At each nmcransh-th simulation step
a new random increment is generated for the orientational sampling.
- nghanchor {1} - Number of anchor atoms
- ighanchor - the atomindex of the anchor atoms - negative index
means to drop that atom from the ligand (atom D above);
the (nghanchor)th is the atom R bonded to the atom
X to be replaced and
the (nghanchor+1)th is the atom X to be replaced.
Thus the nganchor-1 atoms of type A and D should be followed by
the atom R, the atom G(X) and the rest of the G atoms (if any)
For each of the nmolwid group:
- ew0 {0.0} - expected energy value (will be subtracted
from all calculated energies before forming exponentials, added
back at the end)
- CHIM - the excess Helmholtz free energy
of changing a solvent molecule into a new substance
is calculated using the
perturbation formula without a coupling parameter.
Each solvent molecule is changed to its new form and the energy
of the new substance is evaluated and fed into the perturbation
method formula. Clearly, it is a low-accuracy technique.
This option requires either a subsequent SCAN key
or the PXAN key before the
RUNS key
Key2 (Grid direction):
- XGRD - free energy profile aling the X axis will be calculated
- YGRD - free energy profile along the Y axis will be calculated
- ZGRD - free energy profile along the Z axis will be calculated
Data: ngrid, nwitry, nmolchim,
nmolchim times (ifdummy, ew0)
- ngrid -
Number of grid points in the direction chosen by Key2.
- nwitry - Number of insertion trys in the place of each deleted
molecule
nmolchim {10} - Number of different solutes to calculate free
energy for.
For each solute:
- ifdummy {0} - When ifdummy=1, the first atom of the
inserted molecule is assumed to be non-interacting (used only to
ensure the best centering into the cavity).
- ew0 {0.0} - expected energy value (will be subtracted
from all calculated energies before forming exponentials, added
back at the end)
- TICA - Energy coupling is used to change from one solute
to an other:
E(CP) = CPtiexp*E1 +
(1-CP)tiexp*Eo
The control function of the TI integrand will be plotted at the
end of the run and error estimates will be provided for it.
- PMLI - Same as TICA except that
the program will calculate the free energy difference between the
states precombined with the CIMX or CCMX key
using the perturbation method.
The control function of
<exp(-(E(uscp1)-E(cplpar))/kT)>cplpar
will be plotted at the end of the run and error estimates
will be provided for it.
- PMNL
- The calculation is performed at a fixed
coupling
parameter value CP (atomic coupling) to get free energy dfferences
between states CP and CPo and states CP and
CP1 with both the perturbation method and overlap ratio
method.
- Key2 for Key1=TICA, PMLI, or PMNL
(pre-combining):
- NOMX - No combining of states. The (excess) free
energy difference will be calculated between the two states
represented by the two free energy solutes defined with the
SLTA key.
- CIMX - Prepare the initial and final states as
the linear combinations of the inputted ones (after the
SLTA
key) with coupling parameter values uspar0, uspar1.
The potential coefficients will be similarly combined.
During the run, these states
will then be considered to correspond to coupling parameters 0 and
1, respectively. The coupling parameter cplpar will
transformed accordingly to maintain the ratio of the forward
and backward coupling parameter change.
- CCMX - The same linear combinations will be done as
for CIMX but both for the conformation pair given after
SLTA and for the conformation on the
.crd file. In
addition to that, the two solute conformations on the .crd
file are assumed to have been be obtained previusly as linear
combinations of the two inputted ones with coupling parameter
values uscpo0, uscpo1 (read with formatted input,
see below). This allows the preparation of a new
'window' from a previously used (and equilibrated) 'window'.
Data (for Key1=TICA only):
ngquadp, igquadp,
epstol, sigtol, qtol, xyz2tol
- ngquadp {0}, igquadp {0} -
if not zero, the coupling parameter
will be the igquadp-th point of the ngquadp-point
Gaussian quadrature
- epstol {0.0} - When the LJ sigma values of all atoms
in a group differ by less than epstol then that group will
be coupled linearly, i.e., tiexp = 1 will be used.
- sigtol {0.0}, qtol {0.0}, xyz2tol
{0.0} - similar tolerance values for the LJ epsilon, atomic charge
and distance square.
Data for key1=PMNL:
gmor0k, gmor1k, gdvork (X20,5F10.0)
- gmor0k, gmor1k - the energy difference
distributions will be accumulated
for the overlap ratio method calculation
in a grid starting at gmor0k, gmor1k
for the initial and final states, respectively.
Each distribution contains 100 grids with a grid width of
gdvork.
Fdata (for Key1=TICA, PMLI, or PMNL):
- RECTYPE 5: tiexp, cplpar, uspar0,
uspar1, uscpo0, uscpo1 (3F5.2,5X,5F10.0)
- tiexp - Coupling parameter exponent array. Its
elements give the coupling parameter exponents for the
1/r12, 1/r6 and 1/r potential energy terms
(Polynomial TI).
When tiexp(i)>0, in using TI
(FREE TICA), the CP and
(1-CP) factors for the corresponding potential energy
terms are raised to the tiexpi(i)-th power).
- cplpar - Coupling parameter value to be used in
the energy expression. Ignored when ngquadp > 0 !!
- uspar0, uspar1 -
when Key2 = CIMX or CCMX
the free energy difference will be determined
between states represented by these two
coupling parameter values (instead of between the
two states defined by the key
SLTA),
using the perturbation method (Key1 = PMLI or PMNL)
and the overlap ratio method (Key1 = PMNL).
- uscpo0 {0}, uscpo1 {1} - The coupling
parameter values that had been used to prepare the solute
previously on the .crd file. It is used when a starting
configuration is generated for an intermediate coupling parameter
value from a configuration that itself was already an
intermediate one (Key2 = CCMX).
- PMF1
- The conformational coupling parameter CP will
be sampled with umbrella sampling using either harmonic or adaptive
weighing.
Key2 for Key1=PMF1 (coupling type):
- GENL - Linear coupling in Cartesian space
- WRMM - Linear coupling in Cartesian space, but the
solute has two rigid molecules that move together as the coupling
parameter changes.
- WRFM - Like WRMM, but the second molecule
solute is kept fixed. The second molecule is not considered part of
the free energy solute and should be specified among the non-free
energy solutes.
- WRFR - Like WRFM but the first molecule is not
only displaced but also rotated.
- TORS - Coupling parameter is defined along torsional
coordinate(s). Requires also the TORD
key.
- WRTR - Linear coupling of the center of mass of two
positions of a single molecule, followed by translation and
rotation
in certain directions.
Key3 (solute change type):
- GEOC - Only the solute geometry changes
- GEQC - Only the solute geometry and partial charges
change
- GDQC - The solute geometry, partial charges and atom
types all change
Key4 (umbrella sampling type):
- HUSS - Umbrella sampling is based on a harmonic
potential.
- AUSL - Adaptive umbrella sampling is used with linear
interpolation for the AUS weighths
- AUSE - Adaptive umbrella sampling is used with exponential
interpolation for the AUS weighths
Data: (for FREE PMF1 WRF* ****
**** nmolcnt2
- nmolcnt2 - The second molecule of the PMF calculation
that is kept fixed. This information is used
to generate the common center of the two PMF molecules to make
sure that
all molecules interact with the same images of both PMF molecules
Data: (for FREE PMF1 WRTR ****
**** cdpmfx, crpmfx, cdpmfy,
crpmfy, cdpmfz, crpmfz
- cdpmfx, cdpmfy, cdpmfz - Center of mass
displacement step size limits
(see key STEP) in
the X, Y, and Z directions, respectively. Zero value in any
direction
will restrict the motion to the other, nonzero directions.
- crpmfx, crpmfy, crpmfz - Rotation
step size limits (see key STEP) around
the X, Y, and Z axes, respectively.
Fdata for
FREE PMF1 **** **** **SS):
- RECTYPE 6:
nsweep, cplmin, cplmax, delcpl,
c0cplh,
p0cplh, wcplcha (15x,i5,6F10.0)
- nsweep {0}. If > 0, an adaptive umbrella
sampling iteration will continue if the interval was traveled at
least nsweep times.
- cplmin, cplmax - minimum and maximum of
the coupling parameter CP to be sampled in this run.
- delcpl - Stepsize parameter for CP.
- c0cplh, p0cplh - With HUSS, the
umbrella sampling weights are obtained as
exp(c0cplh*(CP-p0cplh)2).
- wcplcha {1.0} - Relative weight of coupling
parameter moves (compared to other types of solute moves).
Fdata (additional) for
FREE PMF1 **** **** AUS*):
- RECTYPE 7: iopnrm, iopeql, iopenc,
nitssk, faclim, fcenc1,
- smplmx, ratmax, rldvmx, diffmx
(4I5,6F10.0)
- RECTYPE 8: ngovmn, nsubmn, ngrcor,
nwtst, fcenc2, encexp, tolera
(4I5,6F10.0)
- iopnrm - Option to select matching algorithm in
adaptive US.
- iopnrm < 0: linear n-step optimized:
- iopnrm = -1: matching is done on the
probabilities
(not recommended).
- iopnrm = -2: matching is done on the
potential of mean force.
- iopnrm = -3: matching is done on the potential of
mean force, with each increment matched separately (Dutch method).
- iopnrm = 0: nonlinear n-step optimized.
- iopnrm > 0: nonlinear n-step optimization,
with
regenerating initial guess from successive 1-step
optimization at every iopnrm-th iteration (after
resorting the iterations based on the range covered).
- iopeql - When >0, self test for equilibration in
the adaptive umbrella sampling is performed by screening previous
iterations at every iteration and dropping
iterations 'contradicted' by subsequent ones.
- iopenc - Controls the strategy to force the
adaptive umbrella sampling to extend the sampled range of the
coupling parameter by temporarily modifying the weights.
- iopenc < 0 : Instead of Boltzmann probabilities, the US
weights will be calculated from their sampling frequencies
- iopenc = 0 : No weight modification.
- iopenc = 1 : Weights are modified gradually as
the number of iterations during which a grid is unsampled
grows. ("global encouraging")
- iopenc = 2 : Suddenly abandoned regions
will get higher weight:
- iopenc = 3 : 1 and 2 combined.
- nitssk - Number of iterations to be skipped when
equilibration is found to be necessary during adaptive US.
- faclim {0.5} - K factor used to limit the
sampling to the requested interval.
- fcenc1 {2.0} - C factor used to encourage
sampling of new, undersampled region (iopenc = 1 or 3 or < 0),
- smplmx, ratmax {2.0} - The allowed ratio
between W(CP) over neighboring grid points is limited to the range
[1/ratmax, ratmax] when the frequency of sampling of
that grid point is under smplmx.
- rldvmx {0.0001} - Maximum relative deviation
allowed in nonlinear optimization.
- diffmx {0.9} - Threshold value for the iteration
difference indicator.
- ngovmn {4} - Minimum number of grids to overlap
for two iterations to be considered in the screening.
- nsubmn {5} - Iteration is rejected if one out of
every nsubmn subsequent overlapping iterations give
difference indicator > diffmx.
- ngrcor {5} - Number of gridpoints over which the
weighting is to be modified when sampling
encouragement is used.
- nwtst - Partial results of adaptive step is
printed for iterations until nwtst. If nwtst=#WI
(see Section VI.), stop the run after
saving the data on the iteration.
- fcenc2 {1.1} - For global encouraging, unsampled
grid weight will be multiplied by fcenc2 in each
iteration.
- encexp {0.75} - for global encouraging (iopenc = 1 or 3),
sampled grids modifying weights will be replaced by their encexp-th
power.
- encexp {1.00} - for sampling-frequency based US weights
(iopenc < 0), the sampling frequencies will be raised to their
encexp-th power.
- tolera {0.06} - The maximum distance of the
coupling parameter from the interval defined by cplmin and
cplmax.
III.03.02. Adaptive Umbrella Sampling parameter change
RAUS Key1, Fdata
Key1 (initialization option):
- NOIN - No accumulator/counter is changed
- CTIN - Initialize the sampling frequency accumulator
- ALIN - Initialize all AUS-related accumulators, counters
Fdata:
- RECTYPE's 6-8 as described with the
FREE PMF1 **** **** AUS*
keywords followed by
- RECTYPE 9: iterw0, newlim (2i5)
- iterw0 - When iterw0 > 0, use only iterations
from iterw0 for the matching.
When iterw0 < 0, restore dropped iterations.
- newlim - If iterw0 = 1, iterations are discarded
until the newly inputted region is reached and further
nitskp iterations are performed (as equilibration).
Note that cplmin and cplmax have to stay at their original value.
III.03.03. PMF matching
WMAT Key1, Key2, Key3, Key4, Data, Fdata
This function joins the adaptive umbrella sampling estimates
of the potential of mean force obtained from the checkpoint files
of calculations on overlapping coupling parameter ranges (windows).
Key1 (checkpoint list specification):
- REGL - checkpoint file runnumbers are evenly spaced
- LIST - checkpoint file runnumbers are irregularly spaced
Key2 (plot option)
- NONE - matched distributions will not be put on .wmp files
- PLPS - matched distributions and sampling frequency values
will be written on the file <jobname>.<numrun>.wmp
(for further plotting)
and a Postscript plot
will be written on the file <jobname>.<numrun>.wmp.ps.
- NOPS - only the .wmp file will be written
- NOPL - only the Postscript plot will be written
Key3 (plot cropping option)
- NOCR - distribution beyond match points will be printed/plotted
on the .wmp file(s)
- CROP - distribution beyond match points will be croppped
(to produce a contiguous plot)
Key4 (matching algorithm)
- MATW - Match at the best sampled gridpoint in the overlap range
- MATD - Find the matching factor by minimizing the
weighted deviation square sum in the overlap range
- MATC - Find the matching factor by first minimizing the
weighted deviation square sum in the overlap range and then select the gridpoint
where the matched curves are the closest and match at that point
Data for Key1=REGL:
nruns, numrunr, incrnumrun
- nruns - Number of runs to match
- numrunr {numrun} - Run number for the checkpoint
file containing the first segment (window)
- incrnumrun {1} - Run number increment, i.e., the i-th
window's checkpoint file was generated by run number
numrunr+incrnumrun*(i-1).
Data for Key1=LIST: nruns, numrun1,
numrun2
...
- nruns - Number of runs to match
- numrun* -
Run numbers
of the checkpoint files to match (nruns number)
Fdata:
- RECTYPE 48: delpl1,
delpl2, r0, cx, pmn, imatread,
cplzero, xlab
(5F10.0,I5,A)
- delpl1, delpl2 - To be added to the values of
w(CP) and p(CP) (to shift plots).
- r0, cx - The CP values on the .crd file
will be transformed into r0+cx*CP.
- pmn - Minimum sampling frequency for a grid point to
be written on the .wmp file (to keep noise out of the plots).
- imatread - Number of match points to read (if any)
- cplzero - when not zero, the match PMF will be shifted in
such a way that the PMF at that coupling parameter value will be zero.
- xlab - The X axis label when r0>0 or cx is not one.
- RECTYPE 49: (read if imatread > 0):
(match(i),i=1,nruns+1) (11I5)
- match - matching point specification. match(1)
and match(nruns+1) specify the beginning and the end
point of the complete coupling parameter interval. match(2)
is the gridpoint used to match runs 1 and 2. In general,
match(i+1) is the matching point between runs i and i+1.
Zero at any place results in default values. The default matching
points are determined based on the combined sampling frequencies
and the interval defaults to the [0.0,1.0] coupling parameter
interval. The whole record can be omitted.
III.03.04. AUS iteration plot
WPLT Key1, Data, [Data]
Key1 (scale for data points):
- LIN - circles with radius proportional to the extent of
sampling will be drawn at each data point
- SQRT - radius will be proportional to the square root of
the sampling extent
- LOG - radius will be proportional to the logarithm of
the sampling extent
Data: PSfile, nfstplt, maxit,
nfrplt, nrad
- PSfile - name of the file the plot will be written
in Postscript format
- nfstplt {1}, maxit {number of iterations},
nfrplt {1} - plot data for every nfrplt -th iteration
from iteration nfstplt to iteration maxit
- nrad {5} - the radius of the largest circle (in pixels)
This key will produce a family of curves showing the PMF calculated from
each iteration as lines, with circles representing the extent of sampling
at each coupling parameter grid. The matched PMF is shown with
large filled circles.
III.03.05. TI quadrature
TIQU Key1,[Key2] Data, [Fdata]
Key1 (data source specification):
- DATA - Just read the integrand values
- REGL - checkpoint file runnumbers are evenly spaced
- LIST - checkpoint file runnumbers are irregularly spaced
Key2 (contribution type, only for Key1=REGL or Key1=LIST)
- ALL - use both solute-solvent and solute solute (if any) contributions
- STSV - use only the solute-solvent contribution
- STST - use only the solute-solute contribution
Data for Key1=DATA: nquad, integrand values
- nquad - Number of quadrature points. Currently allowed
values: 3, 5 and 8.
- integrand values - nquad numbers
Data for Key1=REGL:
nquad, numruns, incrnumrun
- nquad - Number of quadrature points.
- numruns {numrun} -
Run number for the checkpoint
file containing the result of the first quadrature point
- incrnumrun {1} - Run number increment, i.e., the i-th
quadrature point checkpoint file was generated by run number
numrunr+incrnumrun*(i-1).
Data for Key1=LIST: nquad,
numrun1, numrun2 ...
- nquad - Number of quadrature points.
- numrun* - Run numbers
of the quadrature point checkpoint files (nquad number)
Fdata: (only for Key1=REGL or Key1=LIST)
- RECTYPE 41: ncread, nfac,
iskipread, itrapread, nendread (6I5)
- ncread - If > 0, the cumulative averages of the TI
integrands will be read from STDIN (unit 5) (RECTYPE's 43-45), ncread
numbers per line. Maximum value: 8.
- nfac - The TI integrands read from STDIN (unit 5) will be divided by
nfac if nfac > 0 (ignored if ncread=0).
- iskipread{0} - if > 0, allow for a-posterior equilibration
(read RECTYPE 42 below).
ncntin long segments) to skip from the beginning of the run
(a posteriori equilibration).
- itrapread {0} - If > 0, perform trapezoid rule integration
(read RECTYPE 46 below).
- nendread {0} - If > 0, calculate the TI integral over
different intervals (read RECTYPE 46 below).
- RECTYPE 42: (read if iskipread > 0)
(nskp(iq),iq=1,nquad) (8I5)
- nskp(iq) - Number of control function blocks (i.e.,
ncntin long) to skip from the beginning for quadrature point
iq.
- RECTYPE 43: (read if ncread > 0) ident
(a80,/,a80)
- ident - 2-line description of the data.
- If iread > 0 then nquad times (one data set
for each quadrature point):
- RECTYPE 44: nbl, ncntin, cplpr
(2I10,F10.0)
- nbl - The number of control function blocks to read
in.
- ncntin - The length of a control function block.
- cplpar - The coupling parameter value.
- RECTYPES 45: ticums(ib,iq),ib=1,nbl,
ncread numbers per line (6E12.6)
- ticums - The cumulative average of the integrand at
each control function block.
- RECTYPE 46: (read if itrapread > 0 ):
idef, ymin, ymax (I5,2F10.0)
- idef,-
If idef=0, the fitted polynomial will be used to calculate
the integrand at the endpoints.
- ymin, ymax -
The value of the TI integrand at
coupling parameters 0 and 1 (if available) for trapezoid rule
integration (to be used when idef > 0).
- RECTYPE 47: (nendread diffferent lines): xmin,
xmax (6F12.6)
- xmin, xmax - New coupling parameter limits. The
program will calculate the integral of the Gaussian approximating
polynomial between these limits. This line is only read if
RECTYPE 43 was present.
This function performs the numerical (Gaussian) quadrature on
thermodynamic integration runs. It prepares a plot of the
polynomial fitting the quadrature ponts and, optionally, performs
other numerical integrations.
III.03.06. Overlap ratio evaluation
OVRA Data
This function obtains the free energy from two overlap ratio method
runs.
Data: numrun1, numrun2
Run number
- numrun1, numrun2 - The
run numbers
that generated the two checkpoint files containing the
energy distributions the method requires.
III.03.07. Temperature scaling for TI
Prerequisite: TEMP
FETK Data
This option scales the temperature by dividing it with
cplparkexp to represent a run where the total system
is made to turn into ideal gas
Data: kexp, nquad, iquad
- kexp {4} - the coupling parameter exponent
- nquad - the number of Gaussian quadrature points (3, or 5 or 8)
- iquad - the quadrature index (≤ nquad)
III.4. Molecule descriptor keywords
- SLTA: Solute
description
- SLVA: Solvent
description
- CLON: Solute cloning
- MAKB: Make bonds
- BRKB: Break bonds
- BDHH: Forcing H-H bonds
- BDRH: Forcing bonds of
an H to a different group
- MOLD: Define solute
molecule limits
- RDBD: Bond list read
- TORD: Torsion angle
definitions
- CSEG: CHARMM segment id
handling
III.04.01. Solute description
SLTA Key1 Key2 Key3, Data, [Data, Fdata]
Prerequisites:
FILE, PBCN,
SVPT, SUPT
Can not be followed by:
SLVA, PMOD,
SLVA, PFRD,
FREE, PROT,
p
FLXR, CNST,
CHRG, CLON,
MAKB, BRKB,
RFSL, MIXR,
SETC, MODA,
MOLD, STEP,
RDBD
Potential successors:
RCKP, PRMF
Prerequisite of:
CNFG, RUNS,
TORD, WCNF,
FCGA, PARD,
FCGD, PMFM,
AROM, ENGL,
STVG, CMPC,
BNDL, HBMO,
SVDP, GENV,
PXCR, SCAN,
HBBR
Related keys:
CLON,
MAKB,
BRKB,
MOLD,
RDBD.
Key1 (possible solute periodicity):
- SMPL - Simple solute.
- POLY - Periodic solute along the X axis. Displacement
not made along the X axis.
- XTAL - Solute is periodic along all 3 axes (crystal).
Global solute moves are forbidden.
Key2 (input format):
- MMC - MMC format - specific to this program
(the program
Simulaid
can help convert structures in different file formats
(e.g., PDB, CHARMM .CRD) to this format).
- MMC4 - Older version of MMC format (4-character PF labels)
- PDB - PDB
format input - atomnames will
be used to deduce atomic numbers only
- CRD - CHARMM CRD format input -
atomnames will be used to deduce atomic numbers only
Key3 (data source):
- READ - Read the solute description data
(the formatted input below describing each solute atom)
from the input file, starting from the next line
- FILE - Read the solute description data
from a file with extension .slt .
For Key2=PDB, .pdb is also allowed,
For Key2=CRD, .CRD is also allowed.
Data: nslt, nsltfe, nsltfe0,
nsltpr, numrunr, jobnamer
The program is currently prepared to handle atoms up to atomic
number 86. The atomic number for lone-pair is 89 (in case
of EPEN, it is disregarded) and atomic number 90 is an
electron.
Atomic numbers 91-93 have been reserved for the united atoms
CH1-CH3, respectively. The various
atomtypes for the different
potential libraries implemented and the actual functional form of
the potentials are specified in a separate document.
Important: For EPEN, the different types of centers have to
be grouped together. Lone-pairs should follow nuclei and
electrons should follow nuclei and lone-pairs.
When the coupling parameter is used to determine
conformational transition (FREE PMF1),
three sets of solute
coordinates are used: the two conformations Ro and
R1 and their
combination CP*R1+(1-CP)*Ro. The solute
description thus has to
contain the coordinates, charges of the two conformations as well
as room for the mixture. In this third set, only the atomtypes
have to be given. When CNFG PMFN,
however, only the Ro conformation is required here since
the R1 conformation will be
given at a different part of the input. It is important that the
different descriptions of the conformation give the solute atoms
in identical order. Solute atom type change during the transition
can not involve EPEN.
III.04.02. Solvent description
SLVA [Data Fdata]
Prerequisite: SVPT
Potential successors:
SLTA, RCKP
Prerequisite of:
SVDP
Data: nslv, islvrep, label, islv4, filename
- nslv - Number of solvent atoms per solvent molecule.
- islvrep {1} - The central atom to be used for
representing the solvent molecule when only one center is needed,
e.g., for proximity analysis, solute-solvent distances will be
calculated from the islvrep-th solvent atom
- label {HOH } - The residue label to use with this
molecule.
- islv4 - {0} When islv4=1 the atom records use only
four characters for the potential label
- filename - when present, the solvent desription (see Fdata below)
will be read from a file filename.slv, in the same format as
the Fdata for the solute atoms (key SLTA)
Fdata:
- RECTYPE 15's: nslv records - one for every atoms of the
solvent:
itypslv(i), (cslv(j,i),j=1,3), qslv(i),
labslv(i)
- For nslv > 0:
For islv4=0:(1x,a6,4F10.0,2x,a4) or (1x,i6,4F10.0,2x,a4)
For islv4=1:(1x,a4,4F10.0,2x,a4) or (1x,i4,4F10.0,2x,a4)
- For nslv = 0: (1x,a6,4f10.0,11x,a4)
Also, a character G in colum4 53 can override islvrep read above.
- itypslv - for general solvent
(SVPT GENL) it
is the atom type (either a potential label, left-adjusted,
or number, right adjusted - see key
SLTA), otherwise the atomic number of atom i.
- cslv - X,Y,Z coordinates in Å of the atom i.
- qslv - charge on atom i to be used in the SLT-SLV
interactions.
- labslv - label of atom i
Pseudo-atoms are to be placed after the regular atoms.
Note: This key is ignored if the solvent is not
GENL type.
Note: the dipole of the solvent molecule should be parallel
to the X axis for the purpose of the near-neighbor solvent dipole
correlation function computation
(DSTC ALL).
III.04.03. Solute molecule cloning
CLON Data
Potential successor:
SLTA
Data: nclone, iclonef(1), iclonel(1),
ncopy(1), ...
iclonef(nclone), iclonel(nclone),
ncopy(nclone)
III.04.04. Making additional bonds
MAKB Data
Potential successor:
SLTA
Data: nmake, imake1(1), imake2(1),
...,
imake1(nmake), imake2(nmake)
- nmake - Number of bonds to make
- imake1(i), imake2(i) - The i-th new bond
connects atoms imake1(i) and imake2(i)
Note that this key is not implemented for the case when cloning
(CLON) is in effect.
III.04.05. Breaking bonds
BRKB Data
Potential successor:
SLTA
Data: nbreak, ibreak1(1), ibreak2(1),
...
ibreak1(nbreak), ibreak2(nbreak)
- nbreak - Number of bonds to break
- ibreak1(i), ibreak2(i) - The i-th bond to
be broken
connects atoms ibreak1(i) and ibreak2(i)
Note that this key is not implemented for the case when cloning
(CLON) is in effect.
III.04.06. Forcing H-H bonds
BDHH
By default, the topology search excludes bonds between hydrogens.
If this key is present, these bonds will be kept.
III.04.07. Forcing bonds of an H to a different group
BDRH
By default, the topology search excludes bonds of hydrogens to heavy
atoms of a different group/residue.
If this key is present, these bonds will be kept.
III.04.08. Defining solute molecule limits
MOLD Data
Potential successor:
SLTA
Data: molslt, ilastm(1), ilastm(2),
...
- molslt - Number of molecules in the solute
- ilastm(i) - Last atom of molecule i.
This key overrides the molecular topology for defining the
molecules within the solute (or just a part of it).
This is useful when the
molecule is distored and too many incorrect bonds are
found or too many bonds are missing.
When applied to cloned (CLON) solute
atoms, the definitions have to be
repeated for all clones.
The solute atoms not covered by this operation
have to be placed contiguously at the end of the
solute definition read with
the (SLTA) key.
III.04.09. Bond list input instead of calculating
RDBD Key1 [Data]
When this key is present, the solute bond list is read from a file.
Key1 (bond list type):
- MAKB - Bond list is read from a file whose syntax
is the same as the input for the
MAKB key (including the key itself).
Simulaid can prepare this
file along with the .slt file
- CPSF - Bond list is read from a file
that is a Charmm topology (.psf) file.
- ATOP - Bond list is read from a file
that is an Amber topology file.
Data file name
- file name - bond list is a file called file name.mkb,
file name.psf, and file name.top for Key1 =
MAKB, CPSF, and ATOP, resp.
Default file name: jobname.
III.04.10. Torsion angle definitions
TORD Key1 Key2 Data Fdata
Prerequisite:
SLTA
Can not be followed by:
PRMF, SETC,
LOOP, FREE,
PARD, SKWT
Potential successors:
CNFG, SPST
Prerequisite of:
REGE, TORT,
LPST, GENT,
TAND
Related keys:
LOOP, PART,
TORT, LPST.
Key1 (torsion definiton source):
- INPT
Data: ntorsinp, nsltcop, rngfac,
rngfacl, inrcat
- ntorsinp - Number of torsion angles on the solute.
- nsltcop {1} - number of solute copies to apply the
inputted torsions. Applicable for free energy simulations where two
or three solute copies are needed (see the key
FREE for details.
- rngfac {1.0}
(nsltcop numbers, one for each solute copy) -
The torstep values read in RECTYPE 24's will
be all multiplied with rngfac.
- rngfacl {1.0}
(nsltcop numbers, one for each solute copy) -
The steploop values read in RECTYPE 24's will
be all multiplied with rngfacl.
The different solute copies may have different factors.
- incrat {0} - the atom indices read in below will be
incremented
by incrat (to allow the insertion of new molecules into the
solute).
Fdata (when ntorsinp > 0):
- RECTYPE 24's (for each torsion) ik, il,
steploop, torstep, itorgrp,
isense, wtors, deldih
targetang, iskew (2i5,2f10.0,2i5,3f10.0,i5)
- ib, ik - The two (bonded) atoms defining the
bond of torsion. The program will assign automatically one
neighbour
each of ik and il to define the actual torsion angle.
For solute molecules that were sepcified with the help of the
CLON key, ik and il
refer to the atoms in the original (uncloned list). Torsions on
cloned molecules will also be cloned automatically.
For free-energy calculations using three solute copies
(FREE PMF1 or
FREE PMNL )
the atom numbers should relate to the third copy.
- steploop - The torsional displacement stepsize range
for loop moves.
It is analogous to wpslt* (given with the
PARD key).
Extension biasing is not applied to loop moves.
- torstep - The torsional displacement stepsize range.
It is analogous to wpslt* (given with the
PARD key) and its meaning also varies
depending on Key 1 of PART.
- itorgrp - The torsion group number. For
FREE PMF1 TORS the first group
is the group of torsions moved by the coupling parameter.
Torsions belonging to the same torsion group
will be changed simultaneously.
All torsions of a group must refer to the same solute molecule.
- isense - The direction of rotation for
deldih.
+1: clockwise, -1: counterclockwise.
- wtors {1.0} Weight for selecting this torsion.
- deldih - For torsions involving the coupling parameter
the program will generate the second conformation by a
rotation of deldih degrees (in the direction specified by
isense) whenever it is not zero.
This saves the user the need to actually generate
the coordinates of the transformed solute conformation.
For torsions not involved in the coupling parameter, the program will
keep this torsion within deldih degrees of its initial value
(as defined by the coordinates read by the
SLTA key)
whenever it is not zero.
deldih can not be given as negative.
targetang and iskew are read only when the
SKWT key is present:
- targetang - angle toward which the sampling may be skewed
- iskew - when it is not zero, the sampling of the
corresponding torsion will be skewed
- ALL
Key2 (torsion group definition):
- MOLG - Each solute molecule will be a torsion group
- SING - Each torsion will be a torsion group
- BRAG - Each branch from the root atom will be a
torsion group
Data: defrot, deflooprot, maxtorchain,
nfixbondtyp,
atypx1, atypy1, atypx2, atypy2, ...
- defrot {10.0 deg or 0.5 for EXB*} - The torstep value
to be used for all torsions
- deflooprot {10.0} - The steploop value to be used for all
torsions
- maxtorchain {nslt} - Simple torsions moving chains longer
than maxtorchain atoms will be disabled
- atypx1, atypy1 - list of nfixbondtyp pairs of
atomtypes. Torsion angles around bonds between such pairs of atoms will be
held fixed.
>LI>RDBB
Same input as Key1=INPT, but in addition the program will ensure that
protein backbones (if any) use backbone atoms for the four atom defining
each backbone torsion angle.
>LI>ALBB
Same input as Key1=ALL , with the same added function as for
Key1=RDBB
III.04.11 CHARMM segment id handling
CSEG
When this key is present, the last 4 characters of spgroup
(read with SLTA **** MMC) will be
used for the segment id when CHARMM CRD format is written
with the WCNF CHRM or
DENF CHRM keys.
III.5. Potential descriptor keywords
- SUPT: Solute potential
type
- SVPT: Solvent potential
type
- MIXR: LJ parameter mixing
rule
- PRMF: Torsion potential
specification
- PMOD: Potential library
modifications
- MODA: Element
modification
- PFRD: Potential library input
- SUVC: Solute-solvent
cutoff
- SUUC: Solute-solute
cutoff
- SVVC: Solvent-solvent
cutoff
- INCT: Solvent inner
cutoff
- PROT: Minimum repulsion for
general solvent
- RFCR: Reaction-field correction
- SETC: Set various constants
- CHRG: Solute charge
manipulations
- SPST: Field-dependent potentials
- SPPS: Solute
molecule-dependent potentials
- AROM: List of aromatic carbons
- SVIN: Solvation parameter input
- ENHB: Hydrogen-bond potential
- HBMO: Non-default hydrogen-bond
- FLXR: Residues to sample
- CNST: Constraint potential
- VVNE: No solvent-solvent elecrostatic in
Solvation parameter input
- IGTT: Solute-solute potential is set to zero
III.05.01. Solute-solvent potential
SUPT : Key1 [Key2]
Potential successor:
RCKP
Prerequisite of:
SLTA, PROT,
RFSL
Related keys:
PMOD,
MODA,
PFRD,
MIXR.
Key1 (solute PF library source):
III.05.02. Solvent-solvent potential
SVPT Key1 Key2 [Data]
Potential successor:
RCKP
Prerequisite of:
SLTA, SLVA
Key1 (solvent PF type):
- MCY - Solvent is MCY-CI water
Key2 (solvent PF parameters):
- MCY - Use the stored MCY-CI-II parameters.
- YMD - Use the stored Yoon-Morokuma-Davidson
parameters (YMD).
- TIP3 - Solvent is TIPS/SPC/TIP3P (type) water.
Key2 (solvent PF parameter source):
- TIPS - Use the stored parameters for Jorgensen TIPS
- SPC - Use the stored parameters for SPC.
- TIP3 - Use the stored parameters for Jorgensen
TIP3P.
- SPCE - Use the stored parameters for SPC/E.
- READ - Input water-water interaction parameters.
Data: c6, c12, qh (in kcal, Å and
electron)
- c6 - Coefficient of the -1/r6 term
in kcal Å6.
- c12 - Coefficient of the 1/r12 term
in kcal Å12.
- qh - Charge on the hydrogen
- TIP4 - Solvent is TIPS2/TIP4P/B-F (type) water.
Key2 (solvent PF parameter source):
- BRFW - Use the stored parameters for Bernal-Fowler
- TIP2 - Use the stored parameters for Jorgensen TIPS2
- TIP4 - Use the stored parameters for Jorgensen
TIP4P.
- READ - See above
- TIP5 - Solvent is TIP5P (type) water. Currently implemented only for
HRDW SCLR
Key2 (solvent PF parameter source):
- TIP5 - use the stored parameters for TIP5P water.
- READ - See above
- GENL - Solvent is general 1-6-12 molecule.
Key2: The parameter library used (see
SUPT for the list).
Key2=GENL also requires the key
SLVA READ
III.05.03. Lennard-Jones parameter mixing rules
MIXR Key1
Potential successors:
SLTA, RCKP
Key1:
- GEGE - Geometric mean for both the SIG's and the
EPS's
- ARGE - Arithmetic mean for the SIG's and geometric mean
for the EPS's.
ELJ(r)=4*EPS[(SIG/r)12-(SIG/r)6]
= c12/r12-c6/r6.
Relevant only for AMBER, CHARMM, GROMOS and OPLS potentials.
III.05.04. Torsion potential parameter input
PRMF Key1 Data
Can not be followed by: SLTA
Potential successor: TORD
Related key: PFRD
Key1: Same as Key1 of SUPT
- the name of the potential library.
Data: Name of the file containing the parameters.
The input syntax for the parameter files is defined by the
respective libraries.
At present, only AMBER, CHARMM, OPLS (in CHARMM format)
and GROMOS libraries can be read.
Furthermore, for GROMOS input the file name has to be of the
form <name>bon.itp and the file <name>.rpt also
has to be present.
III.05.05. Potential (re)definition
PMOD Key1 Data Fdata
Potential successor:
SLTA
Key1:
- Key of the potential function library to be modified (see
key SUPT). All parameters
will be echoed on the output listing.
Data: ntyps
- ntyps - Number of new types
Fdata
III.05.06. Atom (element) descriptor modifications
MODA Data Fdata
Potential successor:
SLTA
Data: nmod
- nmod - Number of element descriptors to modify
Fdata:
- RECTYPE 10 (nmod records):
ian, ianpr, ifgr, namr,
awr, vdwr, ramaxr, nv
(3i5,1x,a4,3f10.0,i5)
- ian - atomic number to modify
- ianpr - new atomic number for print
(to be used with united atom types)
- ifgr - new functional group index
(see key PMOD)
- namr - new name
- awr - new atomic weight
- vdwr - new Van der Waals radius
- ramaxr - new covalent radius
- nv - new valence
Using this key the built in constants for each chemical
element can be modified or new ones can be
defined.
Fields left blank or given as zero will retain their original value
III.05.07. Potential library input
PFRD Key1 Data
Potential successor: SLTA
Related key: PRMF
Key1 (library type):
- CHRM - Read the NB parameters from a Charmm parameter file
- OPLS - Read the NB parameters from a Charmm-formatted
parameter file for OPLS types
Data: paramfile iswitchcase
- paramfile - the name of the parameter file
- iswitchcase {0} - when =1 the potential labels read
will be converted from lower case to upper case
III.05.08. Solute-solvent cutoff
SUVC Key1 [Data]
Potential successor:
RCKP
Prerequisite of:
PBCN
Key1 (cutoff type):
- SPCC - Spherical cutoff based on the solute COM
- MICC - Minimum image convention based on the solute COM
- ISOE - Isoenergy cutoff
- SPGC - Spherical cutoff based on solute group COM's
- MIGC - Minimum image convention, based on solute group
COM's
Data: Cutoff radius {Rinsc - see
PBCN}
For the particular case of a single solute molecule in nmolec-1 solvent it
is advisable to use the minimum-image cutoff for solute-solvent
interaction to avoid the single discontinuity at the boundary of
the cutoff sphere.
Multimolecular solutes or for solutes with torsional degree of freedom
require group-based cutoffs.
In the minimum-image cutoff for each pair of molecules only
the interaction between the nearest images are taken into
account, irrespective of the intermolecular separation. This is
equivalent to the use of a cutoff cube, prism, etc., identical in
size, shape and orientation with the periodic cell.
The cutoff radius may be based on the centers of solute
groups (see SPGC or MIGC above) - useful for large
solutes. Also,
interionic PMF calculations can use the
iso-energy cutoff introduced by Mezei.
III.05.09. Solute-solute cutoff
SUUC Key1
Potential successor:
RCKP
Prerequisite of:
PARD, PART,
ENGL
Key1 (cutoff type):
- NONE - No cutoff, no periodic images considered
- MIMC - Minimum image convention based on solute molecule
centers
- SPMC - Spherical cutoff based on solute molecule centers
- MIGC - Minimum image convention based on solute molecule
group (residues) centers
- SPGC - Spherical cutoff based on solute molecule
group (residue) centers
Data: rc
- rc {Rinsc} - Cutoff radius
(see PBCN)
Note, that the **GC options will update the energies of all
atoms that are in a residue when a torsion move moves
the group center atom.
To reduce the extra calculations, a) you should not use too large
groups when this option is active and b) whenever possible, group
torsions
so that they don't leave too many group members unchanged.
III.05.10. Solvent-solvent cutoff
SVVC Key1 [Data]:
Potential successors:
RCKP, PART
Prerequisite of:
PBCN
Key1 (cutoff type):
- SPCC - Spherical cutoff
- MINI - Minimum image convention
Data: rc, exprc, maprc
- rc {Rinsc - see
PBCN} - Cutoff radius
- exprc {5.5 Å} - Special cutoff for the
exponential term in the MCY potential.
- maprc {rc+2Å} - Cutoff for inclusion to the
solvent-solvent near-neighbor bitmap
III.05.11. Inner cutoff on the solvent
INCT Key1 [Data]
Potential successor:
RCKP
Key1:
- NONE - None used
- EXPL - Exponential repulsion at short distances
Data: rwwina
- rwwina {2.0 Å} - For water-water distance rww <
rwwina, replace the energy by
10.0*exp(rwwina2-rww2) + 1000.0
kcal/mol.
III.05.12. Minimum repulsion.
PROT Data
Prerequisite:
SUPT
Potential successor:
SLTA
Data: c12prot
- c12prot - For general solvent only, solute-solvent
interactions where no LJ interaction is specified by the potential
library are modified ('protected') with a repulsion term of
c13prot/r1212 kcal/Å12 .
This can be used to avoid numerical instabilities during
creation-annihilation TI (FREE TICA)
that can result from unlike charges getting too close to each
other.
III.05.13. Reaction-field correction
RFCR [Data]
Variable to set to 'T' in the preprocessor: RF
Data: epsrf
- epsrf {80.0} - The dielectric constant in the
reaction field correction formula.
epsrf=1.0 represents vacuum (i.e., no correction);
epsrf=80.0 represents water.
The use of the reaction field correction requires the use of
identical cutoffs for the
solvent-solvent (SVVC),
solute-solvent (SVVC) and
solute-solute (SUUC) interactions.
Also, the cutoffs are to be group based and each group has to be
electrically neutral.
III.05.14. Set various constants
SETC Key1 Data
Potential successors:
SLTA, TORD
- Key1=DIEL:
Data: dielc
- dielc - An overall dielectric constant,
applied (as a divisor) to
all electrostatic interactions.
If a distance-dependent dielectric function code
is activated it multiplies whichever (Mehler-Solmayer or epr(r)=r)
distance-dependent dielectric function is calculated
(see the preprocessor).
- Key1=DDIE:
Constants for the Mehler-Solmayer dielectric function
eps=A +(eps0-A)/[1+ k * exp(-lambda*(eps0-A)*r)]
Data: eps0_ddd,
a_ddd, rlambda_ddd, rk_ddd
- eps0_ddd {78.4} - eps0
- a_ddd {6.0294} - A
- rlambda_ddd {0.18733345} - lambda
- rk_ddd {213.5782} - k
- eps1_dd_ddd {8.0} - when eps1_dd_ddd > 0 then the dielectri
constant will be set to eps1_dd_ddd for distances that are less
than the distance at which the Mehler-Solmayer function reaches
eps1_dd_ddd
NOTE: This dielectric constant is not applied to interactions involving any
of the explicit water models, i.e., only to interactions involving the
solute and a general solvent.
- Key1=FC14:
Data: esf14, vdwf14
- esf14 {0.833333} Factor to apply to the 1-4
electrostatic interactions
- vdwf14 {0.5} Factor to apply to the 1-4 Lennard-Jones
(i.e., Van der Waals) interactions.
If FC14 is not used then the program sets the default values
that are potential dependent: (1.0, 1.0) for CHARMM,
(0.5, 0.5) for OPLS, (0.833333, 0.5) for AMBER and all the others as well.
- Key1=FCIN: Data: fcintra
- fcintra Factor to apply to the solute intramolecular
energies when summing them into the total energy
- Key1=BLKS: Data: nblskip, nbllim
- nblskip {0} - Number of control function blocks to skip when
calculating error estimates (to drop equilibration part).
- nbllim {#MI} - Number of the last control function blocks to use
calculating eror estimates.
III.05.15. Solute charge manipulations
CHRG Key1
Potential successor:
SLTA
Key1:
- ZERO - Set the solute and solvent charges to zero.
This is advisable when a solvent configuration is generated
randomly since it will avoid solvents getting stuck to the solute due
to unphysically close distances involving atoms that have no
repulsive core assigned to them.
- INPT - Use inputted charges (after
SLTA) for OPLS
III.05.16. Field-dependent solute potential
SPST Key1 Data
Prerequisite: FLXR
Can not be followed by:
AROM, SVIN,
ENHB, PARD,
TORD, CNFG,
HBMO
Related keys: AROM,
ENHB
Variable to set to 'T' in the preprocessor: GM
Key1:
- NONE - No field-dependent potential
- EMAP - Additional solute potential depending on energy grid maps
are read from files (see below).
- CMAP - Energy grid maps are calculated.
- CMPW - Energy grid maps are calculated and written to the
map files.
Data: ningridmap, ewvdmax
- ningridmap {131} - Number of gridpoints in the energy maps - has
to be an odd number and has to be the same as the
dimension symbol #GM
- ewvdmax {1000.0} - VdW energies above ewvdmax will be
truncated to ewvdmax
- scalemap {1.0} - All maps read will be multiplied by scalemap
Additional data for Key1=CMAP or CMPW:
edgemap, gridcent_x, gridcent_y, gridcent_z,
gridbuffer
- edgemap - the edge of the (cubic) grid map.
- gridcent_x, gridcent_y, gridcent_z -
the coordinates of the center of the grid map.
- gridbuffer - atoms in a box extended by gridbuffer
will contribute to the energy maps.
For Key1=EMAP,
after establishing the grid constants the program reads the energy maps from
the files
prot.pi,
lp.map,
pa.map,
pc.map,
pd.map,
ph.map,
pn.map,
po.map,
pp.map,
ps.map.
for the electrostatic energy, van der Waals energy for
C, A, N, O, P, S and H atoms, respectively (A is aromatic C) and,
whenever needed, the file lp.map containing the contribution of the
(Autodock 4) solvation term and the file ov.map
containing the O-H VdW term.
These are binary files containig the 3-dimensional array
e(ningridmap,ningridmap,ningridmap) in a single record.
The electrostatic map (read first) also contains a second record containig
title, nx, ny, nz, ep, es, micro, stern, radd, ho, hi2, akp, aks
where
- title: 40-character description
- nx,ny,nz: number of grids in the x, y,and z directions
(unless nx=ny=nz=ningridmap the program aborts)
- ep,es: dielectric constant inside and outside the solute
- hi2: grid spacing
- akp,akps: Debey screening constant inside and outside the solute
map(hi2*[(ningridmap-1)/2+1],
hi2*[(ningridmap-1)/2+1], hi2*[(ningridmap-1)/2+1]) is
the energy at <0.0,0.0,0.0>;
map(1,1,1) is the energy at
<-hi2*(ningridmap-1)/2, -hi2*(ningridmap-1)/2, -hi2*(ningridmap-1)/2>
The solvation term Esolv(i,j) contribution from atoms i and j
is of the form
Esolv(i,j) = (ViSj +
VjSi)*exp{r(i,j)/gaussdist2}
Si=Ai + qk_par*abs(q(i))
where gaussdist2, and qk_par are constants
Vi, and Ai are atom type-dependent constants
and r(i,j) is the distance between atoms i and j and
q(i) is the partial charge on atom i.
All constants have built-in values that can be changed
with the SVIN key
(the built-in values serve as defaults).
Furthermore, if the map file hb.map is present
then the program will treat all fixed oxygens as hydrogen bond donors
and include a 12-10 hydrogen-bond potential.
Hydrogen-bond acceptors can not be included in such a map since the H-X bond's
orientation is conformation-dependent. It can be calculated explicitly
during the simulation
with the ENHB key
Note that on restart the energy maps have to be read again the same way
as for startup.
For Key1=CMAP or Key1=CMPW, the program needs the description of
the atoms contributing to the grid potentials.
These are assumed to be in
a file <jobname>_2.slt that is in the same format
as the records read
by the key (SLTA, but only the fields
tslt(i), (cslt(j,i),j=1,3), qslt(i),
labslt2(i), indxrdf(i) are read.
There should be a record
for each atom that is not explicitly represented.
III.05.17. Molecule-dependent solute potential
SPPS Key1 Data
Key1:
This key allowed the use of potential terms depending on the field or on the
molecule's center, orientation, etc. To use this option, the
user also have to provide the corresponding code to the body of the
subroutine esltmolec.
III.05.18. List of aromatic carbons
AROM Key1 Data
Prerequisite: SLTA
Potential successor:
SPST
Prerequisite of:
SVIN
Related key: SPST.
Key1:
Solute potential type see key SUPT.
Data: narom, pflab(1),...,pflab(narom)
- narom - number of labels to read
- pflab - potential label of aromatic carbon (narom labels)
This information is needed for deciding what type of energy maps
to use with these atoms - see, e.g., SPST EMAP.
III.05.19. Desolvation parameter input
SVIN Data
Prerequisite:
AROM
Potential successor:
SPST
Related keys: SPST,
AROM.
Key1:
- ALLA -
Adds a desolvation term (Autodock 4 parametrization)
acting between all solute atoms.
Aliphatic carbons will be treated as united atoms
(the charges of hydrogens on carbons are added to the carbon charge).
- CARB - same as ALLA but the desolvation term is only applied
between carbon atoms.
- CARA - same as CARB but the desolvation term is only applied
between non-polar carbon atoms.
A carbon is considered polar if its charge is > polarcarblim
- NOHP - same as ALLA but the desolvation term is not applied
to pairs involving a hydrogen atom.
- NOPP - same as ALLA but the desolvation term is not applied
between polar atoms: N,O,S, polar H and polar C.
Data: gaussdist2, gausslimfac, qp_par,
6*[A_solv(i), V_solv(i)],
polarcarblim
- gaussdist2 {24.50}
- gausslimfac {4.0} - desolvation terms will not be calculated for
distances that are larger than gausslimfac*gaussdist2.
- qk_par {0.01097}
- A_solv(1) {-0.00143} Carbon
- V_solv(1) {33.5103 } Carbon
- A_solv(2) {-0.00052} Aromatic carbon
- V_solv(2) {33.5103 } Aromatic carbon
- A_solv(3) {-0.00162} Nitrogen
- V_solv(3) {22.4493 } Nitrogen
- A_solv(4) {-0.00251} Oxygen
- V_solv(4) {17.1573 } Oxygen
- A_solv(5) {-0.00214} Phosphorus
- V_solv(5) {33.5103} Phosphorus
- A_solv(6) {-0.00214} Sulphur
- V_solv(6) {33.5103} Sulphur
- A_solv(7) {-0.00051} Hydrogen
- V_solv(7) { 0.0 } Hydrogen
- polarcarblim {0.2} - the carbon polarity threshold
III.05.20. Hydrogen bond potential
ENHB Key1 Key2 Data
Prerequisite: SLTA
Potential successor:
SPST
Prerequisite of:
HBMO
Related keys: SPST,
HBMO,
SSVIN.
Key1 (solute-solute HB potential type):
- NONE - Hydrogen-bond terms will be not be used for the
solute-solute interactions.
- SMPL - Hydrogen-bond putential is of the form
(c12/rHB12-c10/rHB10)*cos(A...H-X)
- 2ANG - Hydrogen-bond putential is of the form
(c12/rHB12-c10/rHB10) *
cosiexptheta(A...H-X) *
cosiexpomega(LP-A...H)
Key2 (solute-map HB potential type):
Same options as for Key1
Data: rmaxhb, rminhb, rminhbnc,
qmaxhbdon, qminhbacc, iexptheta, iexpomega,
colpangle
- rmaxhb {3.0} - upper threshold of the HB potential
- rminhb {1.65} - lower threshold of the HB potential
- qmaxhbdon {0.0} - when not zero, hydrogens with partial charge >
qmaxhbdon will not act as hydrogen-bond donors
- qminhbacc {0.0} - when not zero, oxygens with partial charge <
qminhbacc (i.e.,more negative) will not act as hydrogen-bond acceptors
- iexptheta {2} - For Key*=2ANG,
the exponent on the theta factor
- iexpomega {4} - for Key*=2ANG,
the exponent on the omega factor
colpangle {120} - The angle between the two lone-pair directions,
used for the calculation of the omega angle
When this key is used
a hydrogen-bond term will be used between polar hydrogens and oxygens
as specified by Key1 and Key2, subject to the filters specified by
rmaxhb, rminhb, qmaxhbdon, qminhbacc.
This term may be used between non-bonded solute atoms and
- if used - the grid-based energy term
(SPST EMAP).
When the grid energy is used, and there are hydrogen bonds between the
explicitly represented atoms and the atoms contributing to the grid
then the atoms (that are otherwise fixed) involved in hydrogen bonds
have to be specified.
The hydrogen-bond donors and
acceptors will be extracted from the non-felxible part of the solute
as specified by the FLXR key.
III.05.21. Non-default hydrogen bond
HBMO Key1 Data
Prerequisites:
ENHB, SLTA
Potential successor:
SPST
Related key: SPST,
ENHB.
Key1 (type of action):
- ADDT - Add new hydrogen bonding type(s). Default types are:
- "D>NH": H on a nitrogen
- "D-OH": H on an oxygen
- "DNH+": H on a protonated nitrogen
- "AC=O": Carbonyl oxygen
- "AH-O": Non-carbonyl oxygen
- "AC-O": Ester or ether oxygen
New types are defined by their potential label:
Data for Key1=ADDT:
pflib, ntype, pflab(1), ..., pflab(ntype)
- pflib - Potential library of the labels
(see key SUPT)
- ntype - Number of new types
- pflab(i) - Potential type label of the ith new type.
- ADDC - Add new hydrogen bonding coefficients:
Data for Key1=ADDC:
hbtyp1, hbtyp2, c12, c10, rminhbheavy
- hbtyp1, hbtyp2 - hydrogen bond type labels for the
donor and acceptor if this type of bydrogen bond (either the
default labels listed above or the potential labels read with the key
HBMO ADDT)
- c12, c10 the c12 and c10 coefficients of
this type of bydrogen bond.
If both are given as zero then this type of bydrogen bond will not be used.
- rminhbheavy {0.0} - For Key*=2ANG,
hydrrogen bond between types hbtyp1, hbtyp2,
oxygen acceptor with one bond only,
the lower threshold of distance between the donor hydrogen's
heavy-atom and the atom the oxygen is bonded.
- EXCL - Exclude atom types from hydrogen bonding
Data for Key1=EXCL:
pflib, nhbexclude, pflab(1),
...,pflab(nhbexclude)
- pflib - Potential library of the labels
(see key SUPT)
- nhbexclude - number of labels to read
- pflab - potential label to exclude (nhbexclude labels)
III.05.22. Residues to sample
Potential successor:
SLTA
FLXR Key1 Data
Related key: TORD
Prerequisite of: SPST
Key1 (input syntax):
- INDX - Ranges of flexible residues are listed as pairs of integers
- PDB - PDB-formatted ATOM records, one for each flexible residue
Data for Key1=INDX: n_residue_range,
nresidue_range times: irf, i2f
Data for Key1=PDB: n_residue,
nresidue times: PDB ATOM records specifying the residue numbers of the
flexible residues
For calculations using the energy map
this key defines the residues to be sampled. All other residues will
contribute to the energy maps.
III.05.23. Constraint potential
Potential successor:
SLTA
CNST Key1 Key2 Data
Key1 (new or modify)
- NONE - No constraints
- INIT - New constraint list
- UPDT - Modifying constraint list
Key2 (constraint bond creation)
- NOCT - No CONECT records will be written on PDB files
- WCNT - CONECT records will be written on PDB files representing the
constraints as bonds
Data for Key1=INIT: nconstr_pair,
nconstr_pair times: iatom1, iatom2, d_target,
force_target, tol_target
Data for Key1=UPDT: nconstr_pair,
nconstr_pair times: d_target, force_target, tol_target
- nconstr_pair: The number of solute atom pairs whose distance is
to be constrained
- iatom1, iatom2: The two atoms whose distance is
constrained with this constraint.
- d_target: The target distance for this constraint
- force_target: The constraint force constant. The constraint energy
will be
force_target*(d_target-actual distance)2-
force_target*tol_target2
- tol_target: Distance tolerance
- the constraint energy is set to zero if the actual distance is within
tol_target of d_target
III.05.24. No solvent-solvent electrostatic interaction
Potential successor:
SLTA
VVNE
When this key is present solvent-solvent interactions will mot include the
electrostatic terms.
III.05.25. Ignore solute-solute energy
IGTT
When this key is present all solute-solute interactions are set to zero.
III.6. Sampling-related keywords
- MOVE: Move selection
strategy
- SAMP: Sampling
technique
- NFBU: Solute force bias
- STEP: Stepsizes
- STPS: Stepsize scaling
- PARD: Solute molecule
displacement
- SWAP: Solute molecule
swap
- MV2S: Correlated 2-solute move
- PART: Solute torsion
- MVRT: Alternating
translation and rotation
- PRFI: Insertion/deletion
pref. sampl.
- SEED: Random number seed
input
- NONB: Non-Boltzmann
solute samplings
- STSC: Temporary stepsize scaling
- LOOP: Loop torsion moves
- IGJA: Ignore Jacobian
- SKWT: Skewed torsion sampling
- IGSV: Non-interactive solvent
- RNDG: Random number generator type
III.6.01. Molecule selection strategy specification
MOVE Key1 [Data] [Fdata]
Potential successor:
RCKP
Prerequisite of:
PRFI
Key1 (molecule selection strategy):
- RAND - Molecules are selected randomly (uniformly) for
move
- CYCL - Cyclic selection - increasing order
- CYCI - Cyclic selection - decreasing order
- SHCY - Shuffled cyclic procedure
of Mezei is used.
- PRSL - Preferential sampling with piecewise linear
weight function. Preferential sampling (PS) systematically selects
solvents near the solute more frequently to be moved to improve the
satatistics of the solute-related properties.
- PRSP - Preferential sampling with polynomial weigth
function
- PRSE - Preferential sampling with exponential weigth
function
Key2 (distance choice for PS):
- COM - PS is based on (solvent's COM's) distance from
solute COM
- NRST - PS is based on (solvent's first atom's)
distance from the nearest solute heavy atom. Not valid for EPEN
solute.
- CXYZ - PS is based on distance from inputted center
center
For Key2=CXYZ Data:
- CXYZ - The coordinates of the center from where the
PS distance is calculated.
Fdata for Key1=PRSL:
- RECTYPE 1: npf (I5)
- RECTYPE 2: (rpf(i),apf(i),i=1,-npf)
(10F7.0)
- rpf, apf - The weight function value w(r) at each
rpf(i) is apf(i). For r < rpf(1)
w(r)=apf(1) while, for r > rpf(npf)
w(r)=apf(npf). For intermediate r's (rpf(1)
< r < rpf(npf)) w(r) obtained by linear
interpolation betwen the two nearest rpf's.
Fdata for Key1=PRSP:
- RECTYPE 1: npf (I5)
- npf = 1+ the smallest exponent having non-zero
coefficient.
- RECTYPE 3: (apf(i),i=1,npf) (10F7.0)
- apf - The coefficients in the weighing polynomial.
Units are in Å. The weighing function is constant for r <
apf(1) and
If r > apf(1) then w(r)=apf(2)+apf(3)/r
+apf(4)/r2+ ... . w(apf(1))
gives the value of the constant.
Fdata for Key1=PRSE:
- RECTYPE 4: stiff, stvar(2F7.0)
- stiff,stvar - The weighting function is
exp(-(stiff*/stvar2)*r2) where
r is the solute-solvent distance as specified by Key2.
III.6.02. Sampling technique selection
SAMP Key1 [Data Fdata]
Variable to set to 'T' in the preprocessor
for Key1=FBPR or FBSC: FR
Key1 (possible displacement biasing strategy):
- FBPR - Force biased.
Data: fblam {0.5} .
- fblam - the lambda factor for force biasing (the
strength of the bias)
- METC - Standard Metropolis, moves in a cube.
- METS - Standard Metropolis, moves in a sphere.
- METX - Standard Metropolis, fixed length moves (half of
cedslt or cedslv as read by the key
STEP
- METR - Standard Metropolis, length of the moves
should fall within a range
- FBSC - Force biased, scaled depending on the slt-slv
distance.
For Key1=METR: Data: rngminfac
- rngminfac {0.8} - the step magnitude will be between
rngminfac*ceds** and ceds**
For Key1=FB**:
Data: fblam
- fblam {0.5} - the lambda factor for force biasing (the
strength of the bias)
For Key1=FBPR:
Fdata:
- RECTYPE 27: nfblam (I5)
- nfblam - Number of data points in the table specifying
the force-bias scaling.
- RECTYPE 28:
(rfblam(i),afblam(i),i=1,nfblam)
(10F7.0)
- rfblam(i), afblam(i) - The force-bias lambda
parameter will be multiplied by a scale factor that is interpolated
between the values afblam(i) at solute-solvent distance
rfblam(i).
III.6.03. Solute sampling technique selection
NFBU
If present, the Metropolis method will be used for
for moves translating/rotating the whole solute
(se key STEP),
even when the key SAMP specifies
force-biased sampling.
In this case leaving the C@TS lines
comments saves some computer time.
III.6.04. Solute and solvent stepsizes
STEP Data
Potential successor:
SLTA
Prerequisite of:
RUNS
Data: cedslt, rtxslt, cedslv,
rtxslv, nsltfreq
- cedslt - Shift interval in Å for the COM of the solute:
The displacement of the solute COM will be limited into the
interval [-cedslv/2,cedslv/2].
- rtxslt - Rotation interval in degrees in which the
solute is to be rotated.
- cedslv - Shift interval in Å for the COM of the
solvent.
- rtxslv - Rotation interval in degrees by which the
solvent is to be rotated.
- nsltfreq - The frequency of all solute moves. Solute
move may be any of the following type: displacement-rotation
(cedslt > 0.0 and nmolfx=0 ); coupling parameter
change (FREE PMF1); partial solute move
(PARD);
partial solute torsion (PART) or
solute group swap (SWAP).
The above variables are used to perturb a chosen molecule,
solute or any of the solvent during the Metropolis walk. As a
simple rule of thumb, these variables are chosen by trial and
error to obtain 30-40% acceptance rate.
For water the recommended values are:
0.30 Å for the COM of the solvent and 30 degrees for the rotation
angle for standard Metropolis and 0.55 Å and 40 degrees for Force
Bias. The stepsize of the solute should be adjusted to
provide similar acceptance rate as the solvent. In general, the
larger the solute the smaller stepsizes are to be used.
III.6.05. Stepsize scaling
STPS Data
Data: notmovcyc
- notmovcyc {1} - Solvent stepsizes are scaled down by 10%
when a solvent is found that was not moved during the last
10*notmovcyc cycles. One cycle is nmolec MC steps.
III.6.06. Partial displacement of the solute
PARD Key1, Key2, Data
Prerequisites: SLTA,
SUUC
Can not be followed by: FREE
Potential successors:
TORD, SPST
Prerequisite of:
SWAP, MV2S
Related key:
SWAP.
Key1 (rotation stepsize strategy):
- UNIF - Rotation stepsize is kept constant
- EXBI - Rotation stepsize is scaled inversely with the
maximum extension of the molecule
in the plane perpendicular to the rotation axis.
- EXBQ - Rotation stepsize is scaled inversely with the
square root of the maximum extension
- EXBA - Rotation stepsize is scaled inversely with the
square root of the root mean square extension
Key2 (molecule selection strategy):
- RAND - Molecules are selected randomly (uniformly) for
move
- CYCL - Cyclic selection - increasing order
- SHCY - Shuffled cyclic procedure
of Mezei is used.
- CYCI - Cyclic selection - decreasing order
Data: wsltpdis, cedpsltx, rtxpsltx,
wrotx, cedpslty, rtxpslty, wroty,
cedpsltz, rtxpsltz, wrotz, nsltpardis
- wsltpdis - Relative frequency of partial solute
displacements
- cedpslt* - Displacement size parameters
corresponding to the x,y, and z directions, resp.
- rtxpslt* - For Key1=UNIF: Rotation size
parameters
corresponding to the x,y, and z directions, resp.
- rtxpslt* - For Key1=EXBI: Rotation size
parameter will be calculated as
rotmax=rtxpslt* / extmax where extmax is the maximum extension of
the molecule to be rotated in the plane perpendicular to the
rotation axis.
- rtxpslt* - For Key1=EXBQ: Rotation size
parameter will
be calculated as rotmax=rtxpslt* / (2.0 * sqrt(extmax))
- wrot* - Weight for selecting an axis for rotation.
- nsltpardis - When >0, the molecule(s) containing the
first nsltpardis atoms after the free energy molecule (if any)
will be subjected to independent moves (translation and
rotation).
When negative, the molecules containing the last -nsltpardis
atoms will be moved (This allows their treatment in subsequent
analysis as solvents).
Note, that nsltpardis refers to the explicite solute atom
list,
and not the full list of solute atoms after cloning (if
any).
III.6.07. Solute molecule swap
SWAP Data
Prerequisite: PARD
Data: wswap
- The relative frequency of attempting to swap
two randomly selected solute molecules
III.6.08. Correlated 2-solute molecule move
MV2S Data
Prerequisite: PARD
Data: wmv2s, r2scut, rtxcslt, iaxis2s
Key1 (stepsize selection strategy): See Key1 of
key PART
- wmv2s - The relative frequency of attempting to make
a correlated move of two solute molecules
- r2scut - The distance threshold for selection of the
solute molecule pairs
- rtxcslt - The solute rotation stepsize (see
key PARD)
- iaxis2s - The axis of correlated rotation
III.6.09. Partial torsion of the solute
PART Key1, Key2, Data
Prerequisites: SLTA,
SUUC
Can not be followed by: SVVC
Prerequisite of:
TAUC
Related keys:
LOOP,
TORD.
Key1 (stepsize selection strategy):
- UNIF - Rotation stepsize is kept constant
- EXBI - Rotation stepsize is scaled inversely with the
maximum extension of the part of the molecule
that is affected by this torsion,
calculated from the line of the torsion bond
- EXBQ - Rotation stepsize is scaled inversely with the
square root of the maximum extension
- EXBA - Rotation stepsize is scaled inversely with the
square root of the root mean square extension
Key2 (strategy for selecting the solute molecule for
torsion):
- RAND - Molecules
are selected randomly (uniformly) for move
- CYCL - Cyclic selection - increasing order.
- SHCY - Shuffled cyclic procedure
of Mezei is used for selecting the
molecule for torsion change.
- CYCI - Cyclic selection - decreasing order.
Key3 (strategy for selecting the torsion on a solute
molecule):
- RAND - Torison groups are selected randomly
(uniformly) for change
- CYCL - Cyclic selection - increasing order.
- CYCI - Cyclic selection - decreasing order.
Data: wsltptor
- wsltptor - Relative frequency of partial solute
torsion
III.6.10. Alternating translation and rotation
MVRT Key1
Potential successor:
STUN
Prerequisite of:
STUN
Key1 (solute and/or solvent moves)
- BOTH - Both the solute and the sovent will be
translated and rotated simulatneously
- SVAL - Translations and rotations will alternate for
the solvent
- STAL - Translations and rotations will alternate for
the solute
- TVAL - Translations and rotations will alternate for
both the solute and the solvent
Data: wsltdisp, wsltrot, wslvdisp, wslvrot
- wsltdisp, wsltrot - Relative frequences of
solute translatons and rotations
- wslvdisp, wslvrot - Relative frequences of
solvent translatons and rotations
III.6.11. Insertion/deletion preferential sampling
PRFI Key1 Key2 [Data] [Fdata]
Prerequisites: GCEN,
MOVE
The syntax is the same as for the preferential sampling for the
solvent moves. Thus, except for not allowing RAND,
CYCL, SHCY, CYCI for
Key1, the input is the same as for the key
MOVE.
III.6.12. Random number seed initialization
SEED Data
Data: ix1, ix2, ix3, ix4, ixscr
- ix1, ix2, ix3, ix4 - the new random
number seed will be
ix1*2^48+ix2*2^32+ix3*2^16+ix4
- ixscr - the new scrambler seed will be ixscr
The random number generator will be reinitialized with these seeds.
III.6.13. Non-Boltzmann solute samplings
NONB Key1 Data
Key1 (sampling type):
- SCTR - sampling will use reduced solute torsion terms
- SCRP - sampling will use reduced solute-solvent repulsion
term
- SCNB - sampling will use reduced intra-solute non-bonded terms
- SCTN - sampling will use reduced torsion and
intra-solute non-bonded terms
- TMOD - solute energy will be scaled with a different
temperature
- TSAL - sampling follows Tsallis statistics
(REF.) for the solute energy
Data: param(s), ew0, signincfac
- param(s) - the extent of non-Boltzmann sampling:
- For Key1=SCTR,
the scale factor (< 1) to apply on the
torsion energy.
- For Key1=SCRP, the scale factor (< 1) to apply on the
solute-solvent repulsion.
- For Key1=SCNB, the scale factor (< 1) to apply on the
intra solute non-bonded term.
- For Key1=SCTN, the scale factor (< 1) to apply on the
torsion energy, followed by the scale factor (< 1) to apply on the
intra solute non-bonded term.
- For Key1=TMOD, the temperature for the solute energy
(> T (see key TEMP))
- For Key1=TSAL, the q parameter
- ew0 {value at start} - the value to
subtract from energy factor of the weight factor's exponent
- signincfac {1.1} - for Key1=TSAL only:
Add to the exponential argument
-signincfac times its initial value if
it turns out to be negative (to avoid taking
the log of a negative number)
III.6.14. Temporary stepsize scaling
STSC Key1 Data
Key1 (move type to scale):
- SVTR - Scale solvent translation stepsize
- SVRO - Scale solvent rotation stepsize
- STTR - Scale whole solute translation stepsize
- STRO - Scale whole solute rotation stepsize
- SMTR - Scale solute molecule translation stepsizes
- SMRO - Scale solute molecule rotation stepsizes
- NTOR - Scale normal torsion stepsizes
- LTOR - Scale loop torsion stepsizes
Data: scalefac
- scalefac {1.0} - the factor to use for this type move
The scaled stepsizes will only be used for this run - the next
calculation (initiated by a new
RUNS key) will use the originally inputted
values.
III.06.15. Loop torsion moves
LOOP Key1, Key2, [ Data]
Potential successor:
TORD
Prerequisite of:
LPST
Variable to set to 'T' in the preprocessor: LO
Related key:
IGJA
When this key is present the program will analyze the torsion angle
list and determine which torsions can be the first torsion of a loop
move. Loop moves change six consecutive torsions in such a way that
the bond of the first and last torsion remain fixed,
thereby limiting the change in the molecular geometry
due to the change in the torsionangle(s).
The formulae giving the loop-closing torsion angles
were obtained based on a distance geometry approach.
The current version is limited to loops where the torsions are
on consecutive bonds (-------),
or on consecutive bonds broken by a rigid bond (-=-----, --=----,
---=---, ----=--, -----=-)
or on a peptide backbone with rigid peptide bonds
(-[=--]3, -[-=-]3).
Key1 (solution selection algorithm):
- PROX - Search only in the vicinity of the reference
conformation for the nearest solution.
- SCAN - Scan the whole solution space to obtain all solutions
and select the nearest. Currently it is implemented only for
consecutive torsions.
- JACW - Scan the whole solution space to obtain all solutions
and select one with Jacobian weight. Currently it is implemented only for
consecutive torsions.
Key2 (iteration direction(s))
- FORW - Loop-closing iteration is in the 'forward' direction only
- BCKW - Loop-closing iteration is in the 'backward' direction only
- TRNG - Loop-closing iteration direction is chosen based on the
closeness of the triangle inequlities
- BOTH - Loop closing iterations are run in both direction and the
solution with better precision is chosen
Data: wloop, d3grid, d3tol, dijtol,
d24tol, drevtol, dextol, dihpmtol
- wloop {1.0} - Probability of selecting a loop torsion move
whenever possible (instead of a simple rotation)
- d3grid {0.1} - r3 range will be searched
by d3grid increments (or smaller when a root appears near).
- d3tol {0.000001} - Convergence tolerance of the distance
between putative r3 positions in the loop-closing algorithm
- dijtol {0.01} - Tolerance of the bond length squares
for the no-torsion debug option
- d24tol {0.1} - Tolerance of (r2-r4)^2
in the loop-closing algorithm
- drevtol {0.2} - Distance square tolerance for the
check of the reverse proximity criterion - only used for
Key2=PROX
- dextol {0.2} - Tolerance of the filtering distance squares
- dihpmtol {5.0} - Frozen bond torsion angle signs will be
conserved for angles that deviate from planarity by
less than dihpmtol degrees.
III.06.16. Ignore Jacobian
IGJA
Related key:
LOOP,
When this key is present, the acceptance probability of a local move
(see key LOOP) will ignore the Jacobian.
This will allow skipping the calculation of the loop-closure problem
for the reverse move and thereby speeding up the calculation.
The speedup, however, comes at the expense of the loss of Boltzmann
sampling and thus this option is to be used for search or minimization runs.
III.06.17. Skewed Torsion sampling
SKWT Key1 Data
Related key: TORD,
Key1 (type of skewing)
- DRIV - for selected bonds,
torsion attempts will be made wskew times
more frequently in the target direction than away from it.
- LOOP - for loops involving selected bonds,
the loop solution closest to the target angles will be chosen
wskew times more likely than a randomly chosen solution.
- DRLP - a combination of both
options described by the keys DRIV and LOOP
- PROX - for loops involving selected bonds,
the loop move will be rejected with 1-1/wskew probability whenever
the Rloop torsions moved away from their target (in the least-square sense).
- DRPR - a combination of both options described
by the keys DRIV and PROX
- DLPR - a combination of the options described
by the keys DRIV, LOOP, and PROX
.
Data: wskew, wskewstepmin, resumefac
- wskew - the weight used for skewing the sampling for the
torsions selected for skewed sampling.
- wskewstepmin {1.0) - for stepsize tuning runs, the skewed
sampling is suspended whenever the stepsize falls below
wskewstepmin degrees.
- resumefac {1.2} - for stepsize tuning runs, the skewed
sampling is resumed whenever the stepsize is raised above
wskewstepmin*resumefac degrees.
When this key is present then
the formatted data after the key TORD
will specify a target angle for some of the torsions.
III.06.18. Non-intractive solvent
IGSV
When this key is present then solvent-solvent interactions will be ignored.
III.06.19. Random number generator information
RNDG Key1 Data
Key1 (generator type)
- MMC - MMC RNG: 64-bit congruantial RNG; scrambled in 64 number segme
- LCG - Linear congruential generator
- MRTW - Mersenne twister generator
- INPT - A list of random numbers is read from a file; before reusing
they are scrambled
- INNS - A list of random numbers is read from a file; they are reused
unchanged when the list is exhausted
- OUTP - A list of random numbers generated by MMC RNG is written to a
the run will stop after writing the file
Data for Key1=LCG : lcg_fac, lcg_add,
lcg_exp, seed
- lcg_fac {314159269} - factor C (see below)
- lcg_add {453806245} - additive constant A (see below);
default values from Forsythe.
- seed {1357} - First random number
- lcg_exp {31} - modulus exponent E (see below)
Data for Key1=MRTW: seed
- seed {4357} - random number seed
Data for Key1=INPT or Key1=OUTP:
rndgfile, nrand
- rndgfile - name of the file the random numbers
are read from or written to (Key1 = INPT or OUTP)
- nrand - number of random numbers to write (Key1 = OUTP)
III.7. Calculation flow keywords
- RUNS: Run MC
simulation
- SCAN: Scan trajectory for
proximity analysis
- PXAN: Proximity analysis
frequencies
- RCKP: Checkpoint file read
- STOP: Stop the run
III.07.01. Execution of a MC simulation
RUNS Data
Prerequisites:
SLTA, CNFG,
TEMP, FILE,
STEP
Data: nmcmax, nmcrep, nplt,
nmcrec, ncntin, nmcadp
- nmcmax - Number of MC steps to run.
- nmcrep {nmcmax/100} -
After every nmcrep steps certain useful
intermediate output, e.g., energy, energy-minimum, molecule moved,
fraction of accepted steps, solute binding energy is printed.
After reading a checkpoint file (see key RCKP)
the rest of the data is ignored.
- nplt - After every nplt steps, various statistics
and distribution functions, such as the radial distribution
functions g(R), binding energy distributions QCBE, coordination
number distributions QCCN, pair-energy distributions QCPE, etc.
are printed and are optionally stacked on the .dst plot file
see DSTC **** FILE)
(for later analysis). nplt must
be an integer multiple of nmcrep. If it is not, the program
will automatically change it to the nearest multiple.
- nmcrec {2*nmolec} - Frequency of saving a full
configuration on
the .hstfile (see key TRAJ)
- ncntin {greater of 10000 and nmcmax/100} -
Length of the interval for
control-function (block average) calculations. Some of the control
functions thus obtained will be plotted on the printer at the end
of the output. (Recommended value: 100000.)
- nmcadp {100000} - Frequency of adaptive umbrella
sampling weight updates. nmcadp must be an integer multiple
of nrecd
(see also key CHKP.
If it is not, the program will automatically
change it to the nearest multiple.
III.07.02. Proximity analysis from history
SCAN [Key1] Data
Prerequisite:
SLTA
Key1 (source of data):
- TRAJ - analyze a trajectory
- TRNC - analyze a trajectory that is the continuation of
the previous trajectory (but with different runnumber),
i.e., don't initalize the analysis accumulators.
This key assumes that a previous SCAN TRAJ was
already executed in the same run.
TRAJ ALLV.
- CONF - analyze the current configuration
For Key1=TRAJ or TRNC:
Data: nmcpxmax, navgpx, nranpx,
nmcpxdsc, npxres, npxcntin,
lumppr, nsymdim, ifrsdim
- nmcpxmax - The program will scan the history file until
the nmcpxmax-th configuration and perform proximity
analysis.
If no
CNFG command reading in a configuration
preceded SCAN, the
initial coordinates will be gathered from the history file.
- navgpx - frequency of selecting a structure for
the proximity analysis contributions
- nranpx {0} - number of random points to generate (for
volume-element estimation) after the analysis of each
configuration
- nmcpxdsc {0} - Analysis to be started after the first
nmcpxdsc-th configuration
- npxres {npxmax} - Proximity analysis results printed
after each npxres-th configuration
- npxcntin {ncntin} - Block size
(number of configurations to average) for proximity
analysis error estimates
- lumppr {2} - Printed error estimates are based on
2lumppr*npxctn long blocks
- nsymdim {3} - For each random point symmetry-related
points will be generated in nsymdim dimension
- ifrsdim {1} - Start symmetry generation at the
ifrsdim-th dimension
For Key1=CONF:
Data: nranpx, lumppr, nsymdim,
ifrsdim - see above.
Structures in history files in CHARMM
(TRAJ CHRM) or AMBER
(TRAJ AMBR) trajectory syntax
are labeled 1,2,... while structures in all other syntaxes
are labeled by the MC stepnumber they were saved at.
III.07.03. Proximity analysis frequency parameters
(concurrent to MC run)
PXAN Data
Can not be followed by: PXGR,
PXDP, PXBE,
PXWW, TAND,
ATFR, FLDG,
RTIM, DIFC,
VORO, CHKP
Related keys:
PXLM,
PCPA,
PXYZ.
Data: navgpx, nranpx, nmcpxdsc,
npxres,
npxcntin, lumppr, nsymdim,
ifrsdim
These variables are the same as defined for the keyword
SCAN.
Using this keyword the proximity analysis will be performed
concurrently to the MC simulation (eliminating the need of saving
the history file).
III.07.04. Continue from a checkpoint file
RCKP [Key1 Key2 Data]
Prerequisite:
FILE
Can not be followed by:
TITL, PBCN,
SUPT, SVPT,
TRAJ, INCT,
MIXR, SUVC,
SVVC, SUUC,
NSLV, MOVE,
FREE, SLTA,
SLVA, CNFG
Potential successor:
VOLE
Related keys:
CHKP,
WCKP,
RMCK,
SCKP.
The program will read the content of the current checkpoint file,
.ckp and restore from there
the state of the variables, including
the file unit numbers and the run number numrun (!),
allowing the continuation of an earlier run as if it never stopped.
If you want to reinput some of the data used by the run the run may abort
as certain inputs are forbidden in different context.
In this case either use the
RUNS key with zero steps to run or the
FILE key.
Use with caution, though!
Key1 (checkpoint file checking procedure):
- STPD - Stop whenever the self test discovers a
discrepancy.
- IGND - Ignore whenever the self test discovers a
discrepancy.
- FIXD - Attempt to fix discrepancies, but stop if fails
to.
- FIXD - Attempt to fix discrepancies, but don't stop if fails
to.
Key2 (trajectory checking procedure):
- NOFX - Abort if trajectory file is damaged
- FXTR - Run with damaged trajectory file.
Data: numrunr, jobnamer
- numrunr {numrun}, jobnamer
{jobname} - The program will look for the file
<jobnamer>.<numrunr>.ckp to restore the data
from. If the filename is different from the default checkpoint file
name, the program will open a new checkpoint file name with the
default name, i.e., <jobname>.<numrun>.ckp and
save subsequent checkpoint data there.
If the checkpoint file was found too short,
the program will assume that the last configuration was still properly
read (since that is the first item in the file) and write it on a new file
<jobname>.99.crd in binary form (to be read with
CNFG READ BNRY NOFX 99).
III.07.05. Termination of the calculation
STOP [Key1, Key2, Data]
Key1 (possible exit self test request):
- SLFT - Perform a self-test before
stopping.
Key2 (Same as Key1 for SLFT see details there):
- BASC - Basic test
- FULL - Full test (This is the default here)
- ALL - FULL test plus BASC test on selected
molecules:
Data for Key2=ALL : nmolck
- nmolck {nmolec} - BASC tests are
performed on molecules 1 - nmolck
- NOSF - Don't perform a self-test
before stopping.
III.8. Output related keywords
- OUTP: Print various
system charateristics
- PRNT: Input echo level
- DSTP: Bulk solvent
distribution
print
- PRAC: Acceptance rate print
detail
- BNDL: Listing of bonds
found
- PRCO: Compilation option
listing
- PLBP: Potential parameter
echo
- KMNP: PDB file property
threshold
- EPLT: Switch between
Cv and Utot in convergence plot
- PRPL: Print plotted data
- NMVP: 'Not-moved'
threshold to print solvents
- PXPA: Empty proximity
region print
- PXPR: Proximity analysis
printing options
- PDBT: PDB atomname convention
- PLCV: Postscript convergence plots
III.08.01. Print various system characteristics
OUTP Key1
Key1 (type of output):
- REPT -
The program prints a full description of the system (normally
printed before each simulation run).
- PRAT -
The program prints a list of atomic constants stored (atomic
weights, radii, valences). The vdw radii stored can be used
for cavity search (see keys GCEN and
FREE WIDO) or solute size definition
(see keys FILT and
PBCN PHS ).
- PRPB - The program prints the centers of the periodic cells.
- PRFC - The program prints the list of functional groups.
- PROP - The program prints the internal option and
debug option arrays.
If PRNT ECHO was present,
for each option number that is defined by a main key MMC lists
these keys with their option number.
III.08.02. Input echo level selection
PRNT Key1
Key1 (level of echoing input):
- NECH - The lines with unformatted input will only be
echoed
- ECHO - Each individual unformatted datum will be echoed,
use of default values will be stated
(note, that in some instances these inital defaults may be
overridden),
opening and closing of files will be announced,
extra information about the cavity grid will be printed.
- DETL - Besides the output described with
Key1=ECHO,
the formatted input will be also echoed labeled with the
corresponding
record type number.
III.08.03. Bulk solvent distribution function print level
specification
DSTP Key1
Related key: DSTC.
Key1:
- NONE - Do not print distribution functions at all.
- GRKC - Print only the overall solvent-solvent and
solute-solvent g(R), coordination number distributions.
When torsions are active and cloned, print the
individual torsion angle distributions.
The functions are printed in a compact form.
- ALL - Print output for all the properties, including
g(R) and coordination number distribution.
The functions are printed in a compact form.
- GKCC - same as GRKC but the
functions are printed in a longer format to allow easy plotting
- ALLC - same as ALL but the
functions are printed in a longer format to allow easy plotting
III.08.04. Acceptance rate listing detail
PRAC Key1[,Key2]
Key1:
- SUMR - Only the mean,minimum and maximum acceptance
rates are printed.
- INDI - Individual acceptance rates are also printed
(wherever collected)
- NTRY - Individual number of trials (or rate of trials)
are also printed.
Key2:
- USED - The periodic output prints the acceptance rate
for the last steps movetype
- ALLA - The periodic output prints the acceptance rates
for all types of moves
III.08.05. Bond list print
BNDL [Key1 Data]
Prerequisite: SLTA
Key1:
- DIST - Print only the neighbor list and bond lengths
- ANGL - Print also the bond angles
- TORS - Print also the torsion angles
- ANRS - Print also both the bond and the torsion angles
- CNCT - Print PDB-style CONECT records for the bonds found
Data: nf, nl
- nf {1} - Start neighbour list at atom nf
- nl {nslt}- End neighbour list at atom nl
Note: atoms with one neighbor will not be listed separately, only
as neighbors.
III.08.06. Compilation option print
PRCO [Key1 Data]
Key1:
- NOSA - don't create a preprocessor
- SAVE - create a preprocessor reproducing the compilation
- SVMN - create a preprocessor with minimal size, tailored
to the currently running system
Data for Key1=SAVE: numrunr
Run number
- numrunr {numrun} - the
run number
of the checkpoint file to use. If numrunr=0, then the
currently running program options will be printed.
The program prints all compilation options and array sizes as
well as version dates used either by the currently running
ptogram or the program that created the checkpoint file
with jobname.numrunr.ckp.
If requested by Key1, a file pre_jobname.f will be
generated that can be inserted into the preprocessor pre.f
to compile the program the same way.
Furthermore, if the file pre.f is present in the current
directory then the file pre_jobname.f will contain
the full source code of the modified preprocessor.
III.08.07. Potential parameter listing
PLBP Key1 [Key2]
Key1:
- USED - Print the parameters in the potential libraries used.
If the solute was not read yet, the list will be printed with the
run description
- ALL - Print the parameters of all potential libraries
- COMU - Like Key1=USED, but for general solvent,
the combined solute-solvent parameters will also be listed.
- COMU - Like Key1=ALL , but for general solvent,
the combined solute-solvent parameters will also be listed.
- SNGL - Print the parameters of a selected single library
Key2 (for Key1=SNGL only)
- PPPP - the potential code that can be specified with the
SUPT key
III.08.8. Coordination number threshold for property
output
KMNP [Data]
Data: rkpdbmin
- rkpdbmin {0.0} - When writing a PDB file with the
WCNF PDB* command, properties
(other then first and second shell coordination numbers and
first shell volume) belonging to
solute atoms whose primary coordination number is below
rkpdbmin are set to zero.
III.08.9. Switch between Cv and Utot
in the convergence plot
EPLT Key1
Key1:
- UICV - Energy block averages and Cv are
plotted
- UIUT - Energy block averages and cumulative total energy
are plotted.
III.08.10. Listing of function values for plots
PRPL Key1
Key1:
- NONE - Draw the plots only on the output file
- CPLD - List the plotted Umbrella Sampling distribution
values.
- ENRG - List the plotted energy and heat-capacity values.
- CONV - List the data plotted with the
PLCV key
- ALL - List all plotted data
III.08.11. 'Not-moved' threshold
NMVP Data
Data: notmoved
- notmoved {2} - Solvents with acceptance rate <
notmoved percent and their distance from the nearest solute
atom and their energy with the solute will be listed when a) the
run is longer than 200*nmolec steps and b) more than 10
attempts were made to move that solvent. However, for
PRNT DETL the list will be produced
anyway. The list is sorted by the acceptance rate.
III.08.12. Empty proximity region print
PXPA Key1
Key1:
- ALL - All proximity regions will be printed
- NOEM - Proximity regions not containing solvent will not
be printed (neither to the standard output nor to the .pxp
file).
III.08.13. Proximity analysis print levels
PXPR [Key1 Key2 Key3 Key4]
Key1 (distribution function print level):
- DSTR - Print all calculated distribution functions
- NDST - Don't print the calculated distribution
functions
- NDSF - Don't even print the calculated forces on the
solute atoms
Key2 (volume element function print level):
- NVLE - Don't print the radial volume element
estimates
- VOLE - Print the radial volume element
estimates
Key3 (functional group analysis):
- FULT - Print the full functional group analysis
table before the condensed list
- NFLT - Don't print the full functional group
analysis table before the condensed list
- NFGT - Don't print functional group analysis table
at all
- FFGT - Print (only) the full functional group
analysis table
Key4 (parameter echo):
- NECH - Don't echo the run parameters before each
proximity analysis table
- FECH - Echo the run parameters before each
proximity analysis table
The proximity analysis results consist of two parts: distribution
functions
(and their characteristics) and tables of calculated avareges.
The distribution printing is governed by Key1 and Key 2.
The program can print the the averages arranged in three different
tables:
- In the original order, further averaged by residues
- Ordered by functional groups (chemical or user defined - see
the key
FCGD)
- Condensed in the same way as the RDF's are (as governed by
indxrdf - see key SLTA). Key3
governs which of these tables will be printed.
III.08.14. PDB atomname convention
PDBT Key1
Key1 (name conversion):
- INPT - no conversion
- REGU - convert to PDB convention
- LEFT - left adjust atomnames
When printing a structure in PDB format (e.g., keys
WCNF,
GENS,
FILT)
this option allows the choice of PDB conventions.
III.08.15. Postscript convergence plots
PLCV Key1, Data
Key1 (Property):
- ENRG - energy
- VOLU - volume (after a run with IBEN)
- NUMB - Number of molecules
(after a run with GCEN)
- BPAR - GCE B parameter
(after a run with BTUN PICL)
- FETI - Free energy (after TIQU)
and the polynomial fitting to the TI integrand
Data: xmn, xdv, nx,
ymn, ydv, ny,
linwid
- xmn {0} - the beginning of the X-axis
- nx {10} - the number of ticks on the X axis
- xdv {Nmc(max)/10} - the distance between ticks on the X axis
Note: Default values will be used for all when nx=0
- ymn {Y(min)} - the beginning of the Y-axis
- ny {10} - the number of ticks on the Y axis
- ydv {(Y(max)-Y(min))/10} -
the distance between ticks on the Y axis
Note: Default values will be used for all when ny=0
- linwid {1} - the plot line width
This key can be used to generate a Postscript plot of the block averages
and cumulative averages of the energy, volume, number of molecules or the
free energy calculated with thermodynamic integration.
The name of the plot file will be of the form
<jobname>.<ext>.<runnumber<.ps
where <ext> is
eng, vol, nml, bpr, fen for
Key1= ENGR, VOLU, NUMB, BPAR, FETI,
resp.
If the .ps file already is present, a new file will be opened vith
an added (or incremented) version number.
III.9. General analysis keywords
- DSTC: Bulk solvent
distribution calculation
- FCGA: Functional group
analysis
- ATFR: Calculation of mean
force on solute atom
- FLDG: Field gradient
calculation on solvents
- SENS: Sensitivity
analysis
- ENGL: Energy contribution
calculation
- FILT: Solvent filtering
- HBBR: Calculate
hydrogen-bond bridges
- CMPC: Configuration
comparison
- DENF: Combined solvent
conformation
- PRTG: Print cavity
grids
- STVG: Estimate solute and
cavity volumes with multiple grids
- CPOC: Calculate cavity/pocket occupancie
- GENS: Calculate generic
solvent sites
- IWSL: Prepare site assignment file,
entropy input
- GSAN: Generic site analysis choice
- GSAO: Generic site analysis options
- GABF: Generic site graph analysis
by frame
- GENT: Configuration with
input torsion angles
- TAND: Calculate torsion
angle distribution
- TORT: Energy analysis
along a torsion
- LPST: Energy analysis
of backbone loop solutions
- IDAG: Aggregate insertion
and deletion sites
- SUPI: Trajectory superimposition
- SVDP: Solvent dipole distribution
- MOND: Bulk density
monitoring
- BLKW: Bulk density
monitoring
III.09.01. Bulk solvent
distribution function calculation level specification
DSTC Key1 [Key2]
Related key: DSTP.
Key1 (level of distribution function calculations):
- NONE - Do not calculate any bulk solvent distribution
function
(to save time, particularly in vector mode).
- SOME - The pair properties, such as the pair energy
distribution and dipole-dipole correlation functions are not
calculated.
- ALL - All distribution function calculations are
done.
Key2 (creation of plot file):
- NOFI - Nothing is written on the .dst file.
- FILE - g(r)s, QCDFs are written on
the .dst file for plot.
Fdata:
- RECTYPEs 32-33 are read only for
DSTC SOME or
DSTC ALL.
- RECTYPE 32: rdmx, ri, rfsslv,
empslv, egrslv (5F15.5)
- rdmx {ct, the solvent cutoff} - Maximum range
in Å for the COM g(R) for the solute an the solvent. Its default
(and maximum) value is the same as that of CUTOF. Beyond this
distance the g(R) computation would have to include more
than one image of a molecule, leading to both the inclusion
of unwanted correlations and additional computational
complications.
- ri {0.05} - Grid interval in Å for g(R) computation.
In the program, g(R) is computed in tabular form at each grid
interval starting from 0 to rdmx.
- rfsslv {3.3} - First solvation shell radius (in Å) of the
solvent molecule. This quantity is used in conjunction with coord.
no. calculations. All molecules which lie inside a sphere
of radius rfsslv (centered at the COM of a given solvent
molecule) are said to be coordinated to that solvent
molecule. It is usually set at the location of the first
minimum of the g(R).
- empslv - The midpoint of the solvent binding energy
distribution (in kcal/mol) for the quasi-component of the
binding energy calculation (QCBE).
- egrslv {0.25} - Grid interval in kcal/mol for the
above QCBE calculation. The QCBE is again tabulated for 100
gridpoints starting from (-50*egrslv+empslv) to
(50*egrslv+empslv).
- RECTYPE 33: rssmin, rssmax, rfsslt,
empslt, egrslt, ksltmn (5F15.10,I5)
- rssmin {rfsslt}, rssmax {rdmx}
- The solvent-solvent distributions (g(R), coordination number
distributions, near-neighbor pair energy and orientational correlation
distributions;
solute-solvent pair energy distributions and
dipole correlation functions are computed
only for solvent molecules
in the concentric shell around the solute between rssmin and
rssmax.
- rfsslt {5.0} - First solvation shell radius in Å for the
solute. Used in conjunction with the coordination number
calculation.
- empslt - Midpoint (in kcal/mol) for the QCBE of
solute.
- egrslt {0.5} - Grid interval (in kcal/mol) for the
QCBE of solute.
- ksltmn {0} - Solute coordination number will be
collected from k=ksltmn.
- RECTYPE 34: (only for DSTC ALL)
dmnslt, dmxslt, dmnslv, dmxslv (4F15.10)
- dmnslt {0}, dmxslt {rfsslt} -
solute-solvent pair energy distributions and
solute-solvent dipole correlation functions are computed
only for solvent molecules
in the concentric shell around the solute between dmnslt and
dmxslt.
- dmnslv {0}, dmxslv {rfsslv} -
near-neighbour pair energy and
solvent-solvent dipole correlation functions are computed
only for solvent-solvent distances
between dmnslv and dmxslv.
- RECTYPE 35: (only for DSTC ALL)
emnslt,egpslt,emnslv,egpslv (4F15.10)
- emnslt - Minimum of the solute-solvent pair-energy
distribution scale (in kcal/mol).
- egpslt {0.5} - Grid interval (in kcal/mol) for the
above pair-energy distribution.
- emnslv - Minimum of the solvent-solvent pair-energy
distribution scale (in kcal/mol).
- egpslv {0.25} - Grid interval (in kcal/mol) for the
above pair-energy distribution. The pair-energy distributions are
very helpful to locate any surprise (overlooked) minimum in
the potential function.
III.09.02. Functional-group analysis of the solute.
FCGA [Key1]
Prerequisite:
SLTA
Potential successor:
FCGD
Key1 (specifies groups to be printed):
ALL - All functional groups will be listed
OERR - Only the ???? and VERR functional groups will be
listed. Atoms in these groups are likely to have
missing or superfluous bonds.
The solute is broken up into functional groups and a list of
bonded atoms is given (based on atom-atom distances).
It is also useful for the detection of errors in the
connectivity generated by the program.
The errors found can be corrected with the keys
MAKB or BRKB
III.09.03. Calculation of average force on the solute
atoms
ATFR
Potential successor:
PXAN
Also requires the
PXAN or the
SCAN key.
III.09.04. Calculation of average field gradient at the
solvents
FLDG Key1, Key2, Data
Prerequisite:
PXCR
Potential successor:
PXAN
Variable to set to 'T' in the preprocessor: FG
Key1 (specifies the type of file to write the gradients):
- NONE - Turn of field gradient calculation
- BNRY - Binary file, with all contributions
- ASCI - Ascii file, with all contributions
- BNTO - Binary file, only the total field gradients
- ASTO - Ascii file, only the total field gradients
Key2 (do or don't do solvent sites)
- ALLG - Calculate field gradients at solute and solvent sites
- NOSV - Don't calculate field gradients at the solvent sites
Data: fgruucut, fgruvcut, fgrvvcut,
rproxmax
- fgruucut - solute-solute cutoff
- fgruvcut - solute-solvetn cutoff
- fgrvvcut - solvent-solvent cutoff
- rproxmax - field gradients will be calculated only at
solvents nearer than rproxmax to their proximal solute atom
Only works for general solvent.
Also requires the
PXAN or the
SCAN key.
III.09.05. Sensitivity coefficent calculation
SENS Data
Data: logfreq
- logfreq - The sensitivity coefficient contributions
w.r.t. the LJ potential parameters and the solute site charges will
be calculated at every 10logfreq step. Valid only
for three-site water solvents.
III.09.06. Calculate energy contributions
ENGL Key1 Key2 [Data]
Prerequisites:
SLTA, SUUC
Key1 (source):
- AVRG - Print only the averages over a trajectory
- INDI - Print results for each frame of a trajectory
Key2 (sum exclusion):
- ALLI - Include all interactions
- NOGI - Exclude interactions within each solute group/residue
- NOMI - Exclude interactions within each solute molecule
Key3 (limit type):
- ARNG - Interaction range specification refers to atom numbers
- GRNG - Interaction range specification refers to
group (residue) numbers.
Data: iafrst, ialast,
iafrst2, ialast2, ctonnb
- iafrst {1}, ialast {nslt} - Restrict analysis to solute
atoms/groups from iafrst to ialast
- iafrst2 {1}, ialast2 {nslt} - Restrict interactions
of the atoms/groups selected for analysis with
atoms/groups from iafrst2 to ialast2.
- ctonnb {cutslt} - switching function cutoff
will be used from ctonnb to cutslt
(read with key SUVC)
For a configuration analyzed, the program prints
ia, ig,E(es), E(d), E(r), E14(es), E14(d), E14(r)
(2i10,3e12.5,3e12.5)
ia, Esw(es), Esw(d), Esw(r)(i10,3e12.5)
ia, ER(es), ER14(d), ERsw(r), sigma, Nnb, N14, Nw
(i10,3e12.5,3i6)
where E(*) refers to different terms of energy between solute atom
ia and the rest with (es), (d) and (r) indicating electrostatic
(1/r), dispersion (1/r6) and repulsion
(1/r12) terms, respectively;
14 refers to 1-4 interactions; w refers to solute-water
interactions;
R refers to the sensitivity coefficient of the energy with respect
to the Lennard-Jones sigma of solute atom ia;
Nnb, N14, Nw are the number of non-bonded, 1-4 and solute-water
terms contributing, respectively.
The calculated contributions are also calculated for each group (residue)
as defined in the .slt file.
For trajectory analysis,a the calculated properties are averaged over
the trajectory; the standard deviations are also calculated.
Also requires the
PXAN or the
SCAN key.
III.09.07. Solute and/or solvent filtering
FILT Key1 Key2 Key3 Key4 Key5 Data :
Prerequisite: CNFG or
TRAJ
Key1 (basis of filtering):
- SOLV - Filter solvents by solvation state (inside, invagination
or shell)
- GEOM - Filter solvents by position, system size and coupling
parameter
- NMBR - Filter out solute and/or solvent segments
- ENRG - Filter solvents by solute-solvent energy
- FRAM - Filter frames from a trajectory
Key2 (target of filtering), skip when Key1=FRAM :
- CONF - Filter the current configuration
- TRAJ - Filter a trajectory
- TRAC - Filter a trajectory, center each frame
- TRAO - Filter a trajectory, and write the file of original solvent
numbers
- TRCO - Filter a trajectory, center each frame and
write the file of original solvent numbers
Note: there is some (optional)
additional data input described
at the end of the section.
Key3 (format of filetered result):
- For Key2=CONF: see Key2 of CNFG
- For Key2=TRAJ or Key1=FRAM: see Key1 of TRAJ
Note: each data input can be followed by the file version increment,
incvers
For Key1=SOLV:
Key4 (filtering specifics):
- INSD - Discard all solvents that are not 'inside'
the solute, based on solvent neighborhood analysis.
- INSV - Discard all solvents that are not 'inside'
the solute or in invaginations of the solute,
based on solvent neighborhood analysis.
- INVG - Discard all solvents that are not in
invaginations of the solute, based on solvent neighborhood
analysis.
- CRCV - Discard solvent based on the
circular variance of
the vectors from the solvent to its neighboring solute atoms, as
specified by the data items rslv and cvlim.
- CRCH - Same as CRCV above, except the circular
variance is calculated using the solute heavy atoms only.
- CVLS - Similar to CRCV above, but
a specified number (leavemol) of innermost (cvlim > 0)
or outermost (cvlim < 0) solvents
- SHL1 - Discard all solvents that are outside the
first solvent shell.
- SHL2 - Discard all solvents that are outside the
second solvent shell.
- SHL3 - Discard all solvents that are outside the
third solvent shell.
- NCLS - Keep a specified number (leavemol)
solvents that are nearest to the solute.
Key5: Like Key2 of GCEN
(Sec. III.2.5) to specify the solute radius source
(i.e., RVDW, RSIG, RSGH)
to be used for determining solvent connectivity.
For Key4 = INSD, INSV, or INVG:
Data: rslv, nnoffset, minmem
- rslv {1.4} - Solvent radius (in Å) to be used for
determining solvent connectivity.
- nnoffset {0} - when filtering for solvents in
invaginations, solvent will be considered to be in an invagination
if the number of its solvent neighbors is not less than the number
of
its solute atom neighbors minus nnoffset and is part of an
'outside' cluster. Thus increasing
nnoffset will retain solvents that are deeper in
invaginations.
- minmem {100} - When > 0, all solvent clusters
having at least minmem members will be considered 'outside'
(instead of the largest one).
When Key1=SOLV, the program determines connected
solvent clusters and considers either the largest or all clusters
having at least minmem members to be 'outside' solvent and
the rest to be 'inside' (unless the nearest solute atom to the
inside
candidate solvent is outside the first two solvent layer). Two
solvents are connected if their COM's are within 2*rslv. A
solute atom is neighbor to a solvent if the solvent's COM is within
RVdW+rslv to that solute
atom.
For Key4 = NCLS:
Data: leavemol,
nafltrange, iafltrange1, iafltrange2, ...
- leavemol - The number of solvent molecules
nearest to the solute to be kept
- nafltrange {0} - the solute atoms to solvate will
be restricted to nafltrange ranges
- iafltrange1, iafltrange2 (nafltrange times)
- the first and last solute atoms of each range to solvate
For Key4 = SHL1, SHL2, or SHL3:
Data: rslv,
nafltrange, iafltrange1, iafltrange2, ...
- rslv {1.4} - Solvent radius (in Å) to define a shell
(1st shell width: rslt+rslv; 2nd shell width: 2*rslv)
- nafltrange {0} - the solute atoms to solvate will
be restricted to nafltrange ranges
- iafltrange1, iafltrange2 (nafltrange times)
- the first and last solute atoms of each range to solvate
For Key4=CRCV or CRCH
Data: rcrcv, cvlim, nsltcvfst,
nsltcv
- rcrcv {10.0} - Circular variance is calculated with
solute atoms within rcrcv + R(solute atom).
It can be considered as the radius of the test sphere rolled over
the surface
used to calculate solvent accessibility.
- cvlim {0.5} - When cvlim>0 then solvents
with circular variance > cvlim are kept (inside the
solute);
when cvlim<0 then solvents with circular variance <
-cvlim
are kept (outside the solute).
- nsltcvfst {1} - the first solute atom to use for the
circular variance calculation
- nsltcv {nslt} - the last solute atom to use for the
circular variance calculation
For Key4 = CVLS:
Data: rcrcv, cvlim, nsltcvfst, nsltcv, leavemol
- rcrcv, cvlim, nsltcvfst, nsltcv - See above
- leavemol - The number of innermost/outermost
solvent molecules to be kept
For Key1=GEOM:
Data: rminx, rmaxx, rminy,
rmaxy, rminz, rmaxz, centx,
centy, centz, rmin, rmax,
cplmin, cplmax, nmolmin, nmolmax,
isltfilt.
- rminx, rmaxx, rminy, rmaxy,
rminz, rmaxz {box limits}-
Solvents whose COM falls between
(rminx,rminy,rminz)
and (rmaxx,rmaxy,rmaxz)
will be kept (default leaves all solvents)
- rmin {0}, rmax {outer box radius} -
solvents whose COM's distance from
cent is betwen rmin and rmax will be kept
(default leaves all solvents).
- cplmin {0.0}, cplmax {1.0} - Whenever applicable,
configurations where the coupling parameter is outside the range
[cplmin,cplmax] will be dropped.
- nmolmin {1}, nmolmax
{maxmol} - Whenever applicable,
configurations where the number of solvent
molecules is outside the range
[nmolmin,nmolmax] will be dropped.
- isltfilt [0] - If =1 then the solute atoms will also be filtered.
For Key1=NMBR:
Data:
nsltseg, ifsltseg1, ilsltseg1,
ifsltseg2, ilsltseg2, ..., nslvseg,
ifslvseg1, ilslvseg1, ifslvseg2,
ilslvseg2
- nsltseg, ifsltseg1, ilsltseg1, ... -
There will be nsltseg solute atom segments to be
deleted, and their limits (inclusive) are
ifsltseg1, ilsltseg1, etc. (up to a maximum of 250
segments).
i*sltseg* values refer to atom numbers.
- nslvseg, ifslvseg1, ilslvseg1, ... -
There will be nslvseg solvent molecule segments to be
deleted, and their limits (inclusive) are
ifslvseg1, ilslvseg1, etc. (up to a maximum of 250
segments).
The solvent number does NOT include the solute; i.e., solvent No 1 is
molecule No 2.
For Key1=ENRG:
Data: esusvmin, esusvmax
- esusvmin, esusvmax -
Drop solvents with solute-solvent energy outside the range
[ esusvmin, esusvmax] kcal/mol.
For Key1=FRAM:
Key4 (energy recalculation option) :
- ETRA - Use the total energy on the trajectory file
- ECAL - Calculate the total energy for each frame on
the trajectory file
Data: etotmin, etotmax,
nslvmin, nslvmax
- etotmin, etotmax - minimum and maximum total energy
of frames (configurations) kept
- nslvmin, nslvmax - minimum and maximum number of solvents
of frames (configurations) kept
ADDITIONAL INPUT:
Data: incvers
- incvers {10} - The filtered trajectory will
have a version number that is incremented from the initial
trajectory file version number (runvers, see key
FILE) by incvers.
ADDITIONAL INPUT for Key2=TR**:
Data: nfrmmax, nfrmfreq, nfrmskip
- nfrmmax {100000000} - gather configurations until the
nfrmmax-th frame (snapshot).
- nfrmfreq {1} - use every nfrmfreq-th frame only
- nfrmskip {0} - skip the first nfrmskip frames
III.09.08. Calculate hydrogen-bond bridges
HBBR Key1 Data
Prerequisite:
SLTA
Key1 (anchor atoms specification)
- ALST - Solute atoms that can anchor a hydrogen-bond bridge
are listed
- BBON - Hydrogen-bond bridges are anchored by
protein backbone atoms only
- TLST - Solute atom names that can anchor a
hydrogen-bond bridge are listed
- CHRG - Solute atoms with partial charge above a threshold
can anchor a hydrogen-bond bridge
Data rxhmax, hbanglemin
rxhmax {3.0} - maximum X...H distance for a hydrogen bond
hbanglemin {90.0} - minimum X...H-Y angle for a hydrogen bond
Additional data for Key1=ALST: nanchor,
ianchor(1), ianchor(2), ...
- nanchor - number of solute anchor atoms
- ianchor(i) - ith anchor atoms (nanchor numbers)
Additional data for Key1=TLST: ntype,
typename(1), typename(2), ...
- ntype - number of solute anchor atoms
- typename(i) - th anchor atom type (ntype numbers)
Replace leading blanks by "."
Additional data for Key1=CHRG: qbmin
qbmin {0.3} - solute atoms with partial charge less than
-qbmin or solute hydrogens with partial charge
greater than qbmin can anchor hydrogen-bond bridges
III.09.09. Configuration comparison
CMPC Key1, Key2, Key3, Data
Prerequisite: SLTA
Potential successor:
CNFG
Key1 (input file format):
- ASCI, BNRY, ASAN, PDB , CHRM
-
File format. See Key2 of Key CNFG
Key2 (comparison operation):
- SHRT - Shift and overlay the second conformation's
molecules on the first (valid only when the
PART key is active
- SHON - Shift the second conformation's molecules on the
first
- ALLD - Don't shift or rotate the molecules relative to
each other
Key3 (possible special treatment for lipids):
- GENL - General solute molecules
- LIPI - Lipid bilayer: 2nd half is
in mirror image orientation to the first.
Data: numrun0, numrun1, numrund,
ix1, dx1, ix2, dx2,
numshow
- numrun0 - first conformation's
run number
- numrun1 - second conformation's run number
- numrund - output files' run numbers will be incremented
by numrund
- ix1, dx1 - molecules will be dx1 Å
apart
along the ix1 axis
- ix2, dx2 - molecules will be dx2 Å
apart
along the ix2 axis
- numshow {all} - show only the top (most different)
numshow molecules
If Key3=LIPI:
ixflip, lipfst, liplst
- ixflip - invert the sign of the ixflip-th
coordinate
for the second half of the lipid molecules
- lipfst {1} - the first lipid is the
lipfst-th solute molecule
- liplst {all} - the last lipid is the
liplst-th solute molecule
This operation reads in two configurations, calculates the average,
minimum and maximum RMS between the solute molecules
(possibly subject to the operation specified by Key2)
and spreads out the compared molecules of the solute
in the ix1-ix2 plane for comparison.
The centers of the molecules will be shifted
to this plane in a rectangular arrangement.
III.09.10. Combining solvents from a trajectory
DENF Key1 [Data]
Prerequisite: TRAJ
Key1 (output format):
- PDB - Combined file will be in
PDB format
- CHRM - Combined file will be in CHARMM CRD format
- INSG - Combined file will be in Insight free (sxyzrq)
format
Data: nmcmax, nmcfreq, nmcskip
- nmcmax {100000000} - gather configurations until the
nmcmax-th step.
- nmcfreq {1} - use configuration saved at every
nmcfreq-th simulation step
- nmcskip {0} - skip the first nmcskip simulation steps
This operation scans the trajectory specified and puts the
first atom of all the solvent molecules into a single
configuration onto a file
<jobname>_den.dat (with the run and version number
taken from the trajectory file's name)
in InsightII free format) as He atoms
If the output file already exists, a file with incremented version
number
will be created.
If parts of the solute were allowed to move,
the affected solute atoms are
also collected. The resulting configuration is a simple
approximation to the three-dimensional density.
III.09.11. Cavity grid listing
PRTG Key1 Key2 Key3 [Data]
Prerequisites: CNFG,
GCEN
Potential successor:
MOND or
BLKW
Variable to set to 'T' in the preprocessor: PG
Grid points representing cavities around the solute will be
enumerated and separated into mutually disconnected clusters.
The first cluster is the set of gridpoints surrounding the
solute and the remaining ones (if any) represent cavities
inside the solute. The program writes out the solute and the
internal cavities to the file
<jobname>.<numrunw>.grd
and, if requested, the solute and external gridpoints (in the same format) to
<jobname>.<numrunw>_2.<ext> iwhere
<ext> is pdb, CRD or grd for PDB, CHARMM, Insight
free format, resp..
The gridpoints are written as helium atoms with zero charge
and the group (residue) number is the group number of the last
solute atom plus the cluster number.
Note, that only a layer of gridpoints near the solute atoms are
written
providing a bounding shell for each cavity.
Key1 (grid file format):
- NOLP - No grid file is written
- PDB - Grid files are in PDB format
- CHRM - Grid files are in CHARMM CRD format
- ASAN - Grid files are in annotated ASCII format
(see WCNF SVAN)
Key2 (need for grids in the interior of a cavity)
- ALLG - All gridpoints in a cavity will be written
- SRFG - Only gridpoints on the surface of a cavity
will be written
Key3 (controls printing of cavity/pocket environment)
- NOLS - No atom or residue list lining a cavity/pocket will be given
- GRLS - List of residues lining a cavity/pocket will be printed
- ATLS - List of heavy atoms
lining a cavity/pocket will be printed
- AGLS - Both list of heavy atoms and of residues
lining a cavity/pocket will be printed
Data: nnexmin, nnexmax, cvlim, rcutcv,
volcavmin, volpckmin, rnearlim, rsltfac,
cvlimcav, randsh, diamslvnew,
nlshave, maxnnsurf, noputbacksfg, numrunw
- nnexmin, {1} nnexmax {0} - external gridpoints will be
written only if the number of their grid neighbor is between
nnexmin, nnexmax (inclusive). Can be a huge file!
- cvlim {0.0}
- When cvlim > 0.0 external grid points will be dropped
when their circular variance
is < cvlim.
The remaining points will be clustered and printed, filtered by
volcavmin or volpckmin.
- volcavmin - {0.0} minimum volume estimate of a cavity
to be printed
- volpckmin - {0.0} minimum volume estimate of a pocket
to be printed
- rcutcv - {10.0} the cutoff for all XV calculations
- rnearlim - {101/2} atoms lininig a cavity/pocket
have to be closer then rnearlim Å to a cavity grid.
- rsltfac - {0.9} Solute atom radii will be multiplied by
rsltfac for the calculation of grid coverings.
- randsh {0.5} - grid will be shifted by randsh*gridsize
- cvlimcav - {0.0} circular variance threshold for the cavity centers:
cavites whose centers circular variance is below cvlimcav will be
dropped from consideration
- diamslvnew {0.0} - When diamslvnew > 0.0 the grids
are also generated with cavity radius diamslvnew. If it is
significantly smaller than the original rsph value inputted
after GCEN then the second set might
partition into more
discrete clusters, indicating the presence of invaginations, in
addition to real cavities. These new gridpoints will be written to
files <jobname>.<numrunw>_3.<ext>,
<jobname>.<numrunw>_4.<ext>.
- nlshave {0} - When nlshave > 0 both the cavities and
the pockets will be subjected by a sub-cluster search by removing
nlshave layers from each cluster and check if the resulting set of
gridpoints fall apart. If yes, these sub-clusters will be also written
to the corresponding PDB files with chain id 'S'.
- maxnnsurf {3} - Grids with maxnnsurf or less
neighbors will be considered surface grids and will be shaved off.
- noputbacksfg {0} - When noputbacksfg=1 the shaved
gridpoints will be left out from the subclusters
- numrunw {numrun} - the runnumber of the PDB files
to be written.
III.09.12. Estimate solute and cavity
volumes with multiple grids
STVG [Data]
Prerequisites: CNFG,
GCEN,
SLTA
Variable to set to 'T' in the preprocessor: PG
Data: diamslvnew, ngridrep
- diamslvnew {diamslv} - Calculate the number of grids covered by
the solute using diamslvnew as the solvent radius and
estimate the solute volume from it.
If diamslvnew is different from diamslv (read by the
GCEN key) then the solute radii will be also
scaled by diamslvnew/diamslv.
- ngridrep {10} - repeat the grid cover calcculation with
ngridrep
randomly shifted grid and print the average volumes.
The progam estimates the volume of the solute and the volume of
each cavity
based on the number of gridpoints covered/left uncovered, resp.
The first 'cavity' is the volume outside the solute.
The averaging assumes that the cavities occur in the same order in
each sample.
III.09.13. Calculate cavity occupancies
CPOC Key1 Key2 Key3
Prerequisite:
CNFG,PRTG,
GCEN
This key calculates the (average) number of solvents in each cavity
as determined by the key PRTG
Key1 (select cavity/pocket)
- CVTY - calculate the occupancies of internal cavities
- PCKT - calculate the occupancies of surface pockets
Key2 (input type specification)
- CONF - calculate occupancies for a single structure (read by the
CNFG key)
- TRAJ - calculate average occupancies for a trajectory (specified by the
TRAJ key). Note that the pockets/cavities are defined
by the conformation read at the start of the run thus it is assumed that
the solute is rigid
Key3 (saving cavity solvents)
- NOSA - cavity/pocket solvents will not be saved
- SAVE - When Key2=CONF, a PDB file containing the solute
and solvents found to be in a cavity/pocket will be saved
III.09.14. Calculate generic solvent sites
GENS Key1 Key2 Key3 Key4 Key5 Data
Prerequisite: PBCN
Can not be followed by: IWSL
Related keys: GABF,
GSAN,
GSAO, IWSL
Variable to set to 'T' in the preprocessor: PG
Key1 (type of calculation)
- CALC - Calculate generic solvation sites from a
trajectory
with the algorithm of Mezei and Beveridge
- CCAL - Continue the generic solvation site calculation.
This option requires a preceding RCKP key.
- COMP - Compare two site estimates, read from two files,
by performing a Hungarian method matching. If the solute is also present,
write a PDB file containing the sites, sorted by their circular variance
(inside sites first).
- CLST - Cluster sites that are within a threshold distance
in a a single site and combine their occupancies.
For Key1=CALC or Key1=CCAL:
Key2: (updating option)
- FIXI - Except for the first scan (iteration)
of the history file, keep
the site estimate fixed during an iteration (recommended).
- CONT - Continually update the site estimate during an
iteration.
Key3: (inital site estimate source)
- GETC - Obtain initial sites from a system read
previously
by the CNFG key.
- GETH - Obtain initial sites from the trajectory file
- RDAS - Read initial sites from a file specified here in
ASCI format (see key CNFG)
- RDBI- Read initial sites from a file specified here in
BNRY form (see key CNFG)
- RDAN - Read initial sites from a file specified here
in ASAN format (see key CNFG)
- RDPD - Read initial sites from a file specified here
in PDB format (see key CNFG)
- RDCH - Read initial sites from a file specified here
in CHRM format (see key CNFG)
Key4: (output file format - see key CNFG)
CNFG
- WPDB - Output file format is PDB, the file
<jobname>_gs. will contain the sites only
- WASC - Output file format is ASCI, the file
<jobname>_gs. will contain the sites only
- WANN - Output file format is ASAN, the file
<jobname>_gs. will contain the sites only
- SPDB - Output file format is PDB, the file
<jobname>_gs. will include the solute of the last frame
- SASC - Output file format is ASCI, the file
<jobname>_gs. will include the solute of the last frame
- SANN - Output file format is ASAN, the file
<jobname>_gs. will include the solute of the last frame
- RPDB - Output file format is PDB, the file
<jobname>_gs. will include the solute read at the start
- RASC - Output file format is ASCI, the file
<jobname>_gs. will include the solute read at the start
- RANN - Output file format is ASAN, the file
<jobname>_gs. will include the solute read at the start
Key5: (order of generated sites)
- FSRT - Generated sites will be sorted in the order of
decreasing occupancy, fully occuped sites also sorted in the order of
increasing site RMS
- DSRT - Generated sites will be sorted in the order of
increasing site RMS
- CSRT - Generated sites will be sorted in the order of
increasing site convergence
- PSRT - Generated sites will be sorted in the order of
increasing distance from the solute
- CVSR - Generated sites will be sorted in the order of
decreasing site CV wrt the solute (going from inside to outside)
- PXSR - Generated sites will be sorted in the order of
increasing proximity solute index
Data: ns, nmcmax, nmcfreq,
nmcdisc, rcutxst, rmateps, epsgsite,
rmaxmatch,
rmix, maxiter,
nsmin, nsmax, cplpmin, cplpmax,
cvfilt_traj, cvfilt_trajcut,
froccmin, cvfilt, distsltmax,
rmsdlim, froccminrep,
clstmerge_dmax, froccpairmax, rmaxrematch
- ns - Generate ns sites to start with. If ns=0,
the initial number of sites will be the number of solvents in the input
structure.
When a structure assigns a solvent to each site and there are solvents
left over those solvents will added as new sites.
- nmcmax - trajectory file will be scanned until simulation step
number nmcmax
- nmcfreq - Use every nmcfreq-th configuration
from the trajectory file
- nmcdisc - Discard the first nmcdisc
configurations.
Note: for MC history files,
nmcmax, nmcfreq, nmcdisc
should refer to MC stepnumbers, not structure numbers.
- rcutxst {8.0} - For the matching, consider only sites
within rcutxst Å from each solvent (just to speed up the
matching procedure)
- rmateps {0.01} - Convergence threshold for optimizing
the Hungarian method matching
- epsgsite {0.1} - Convergence threshold for the RMSD
between subsequent site estimates
- rmaxmatch {1000.0} - If a solvent is farther than
rmaxmatch Å from its assigned site, it will be added as
a new site to the site list and ns will be increased accordingly.
- rmix {0.0} - When rmix > 0.0 the previous
site estimate will be mixed with the new site estimate:
new=rmix*old+(1-rmix)*new . This might help the
iteration to converge.
- maxiter {100} - maximum number of iterations over the
trajectory. If it is less than the iterations already performed,
only the results will be printed.
- nsmin {0}, nsmax {#MH} - Use only configurations
where the number of solvents is in the (inclusive) range
[nsmin,nsmax] (meaningful only for analyzing
grand-canonical ensemble simulations - see the
GCEN key).
- cplpmin {-0.1}, cplpmax {2.0} - Use only
configurations where the coupling parameter is in the range
(meaningful for analyzing PMF calculations, i.e.,
FREE PMF1)
[cplpmin,cplpmax]
- cvfilt_traj - When >0.0 the solvents read from the trajectory
with circular variance (w.r.t the solute) < cvfilt_traj
will be dropped.
- cvfilt_trajcut - The CV will be calculated
using a distance cutoff of cvfilt_trajcut Å.
- froccmin {0.0} - Sites with fractional occupancy <
froccmin will be eliminated.
- cvfilt - When >0.0 sites whose CV (w.r.t. the solute) is
< cvfilt will be eliminated.
- distsltmax {0.0} - When >0.0 sites farther than
distsltmax from the nearest solute atom will be eliminated.
- rmsdlim {0.0} - When >0.0 sites with site rmsd
> rmsdlim will be dropped.
- froccminrep {0.0} - When >0.0 a representative
configuration where all sites with occupancy > froccminrep
(must be > froccmin) will also be generated.
- clstmerge_dmax {0.0} - When >0.0 sites that are closer to each
other than clstmerge_dmax will be combined into a single site.
- froccpairmax {2.0} - Site pairs whose sum of occupancies
exceed froccpairmax will not be paired for clustering
- rmaxrematch {0.0} - When > 0.0 the assigments of solvents
to sites will be repeated using the filtered sites only (see froccmin,
cvfilt, distsltmax, above). For this match only solvents whose
distance from the site is < rmaxrematch Å will be accepted.
When Key3=RD** additional Data:
- file, numrun, irunvers, nslt, nslv,
islvrep - information specifying the content of the input
file as described above with Key1=COMP
For Key1=COMP:
Key2, Key3 (input file syntaxes)
- ASCI or BNRY or ASAN or PDB or
CHRM - the formats of the two input files as described with
the CNFG key.
Key4 (site distance measure M)
- DIST - M = site-site distance
- ENVN - M = <# of neigbours> - # of identical neighbors
(both within a user defined threshold).
- ENVI - M = (<# of neighbors>+1)/(# of identical neighbors+1)
- ENVM - M = (site-site distance) *
(<# of neighbors>+1)/(# of identical neighbors+1)
- TANI - M = (1 - Tanimoto coef of the neighbor lists)
If nexp > 1 then M => M^N
If nexp < -1 then M => M^(1/N)
Data: ns1, ns2,
file1, numrun1, numvers1, nslt1, nslv1,
islvrep1,
file2, numrun2, numvers2, nslt2, nslv2,
islvrep2, rcutxst, rmateps, rcutcrcv,
rmsdgrid, rneigsite, nexp
- ns1, ns2 - Number of sites in systems 1 and 2,
resp. If either of them is zero, the number of sites will be
the number of solvents in the corresponding structure.
- file1,file2 - jobnames of the files
for systems 1 and 2, resp.
- numrun1,numrun2 - runnumbers of the files
for systems 1 and 2, resp.
- numvers1,numvers2 - version numbers of the files
for systems 1 and 2, resp.
- nslt1,nslt2 - number of solute atoms in the files
<jobname>.<runnumber>.crd
for systems 1 and 2, resp.
- nslv1,nslv2 - number of solvent atoms (per
molecule) on the files for systems 1 and 2, resp.
- islvrep1,islvrep2 - index of the solvent atom
representing the site for systems 1 and 2, resp.
- rcutxst {3.0} - For the matching, consider only sites
within rcutxst Å from each solvent
- rmateps {0.01} - Convergence threshold for optimizing
the Hungarian method matching
- rcutcrcv {8.0} - Cutoff for the circular variance calculation
- rmsdgrid {0.15} - The grid size for the matching distance
distribution
- rneigsite {5.0} - The neighbor distance threshold when
Key4 is not DIST
- nexp {1} - the exponent to use with Key4=TANI
For Key1=CLST:
Key2 (input file syntax)
- ASCI or BNRY or ASAN or PDB or
CHRM - the format of the input file as described with
the CNFG key.
Data: ns,
file, numrun, nslt, nslv,
islvrep, clstmerge_dmax
- ns - Number of sites in the system
If ns=0, the number of sites will be
the number of solvents in the input structure.
- file - jobname of the files
containing the generic sites to be clustered
- numrun - runnumber of the file
- numvers - version number of the file
- nslt- number of solute atoms in the file
- nslv - number of solvent atoms (per
molecule) in the file
- islvrep - index of the solvent atom
representing the site
- clstmerge_dmax {2.0} - Sites that are closer to each
other than clstmerge_dmax will be combined into a single site.
- froccpairmax {2.0} - Site pairs whose sum of occupancies
exceed froccpairmax will not be paired for clustering
Both GENS CALC and GENS COMP
are based on using the so-called Hungarian method
to obtain an optimal matching of solvent coordinates to (generic)
site coordinates.
With GENS CALC the calculation scans a simulation trajectory
and dedudeces iteratively a set of generic sites. For each
configuration, the solvent coordinates are matched to the site
estimates, allowing for the same solvent to contribute to different
sites during the simulation.
When GENS CALC is used,
the program will write several files:
- <jobname>_gs.: the generated sites.
The PDB data columns give the fractional occupancy and site RMS.
- <jobname>.gs_rep.pdb:
The configuration found to be closest to the configuration
represented by the sites.
The PDB data columns give the fractional occupancy and distance between
the representative solvent and the site CV w.r.t the solute.
- <jobname>.pdb_gs_rep_all.pdb: An other representative
configuration selected with the additional criterion that all sites
with occupancy > froccminrep are present
- <jobname>.pdb_gs_cmp.pdb: a composite configuration
where for each site (with occupancy > froccmin)
the solvent closest to it is selected.
The PDB data columns give the fractional occupancy and site RMS.
Note that these solvents may
be in incompatible conformations, especially when low occupancy
sites are included.
- <jobname>._gs_avg.pdb: a composite configuration
where the waters are in their average orientation (prepared
for water solvent only).
The PDB data columns give the fractional occupancy and
the site CV w.r.t the solute.
- <jobname>._gs_cls.pdb: the sites after clustering
those that are closer than clstmerge_dmax to each other.
The PDB data columns give the fractional occupancy and
the site RMSD.
- <jobname>._gs_cla.pdb: only for water solvent - waters in
their average orientation at the clustered sites.
The PDB data columns give the fractional occupancy and
the site CV w.r.t the solute.
- <jobname>._gs_fcl.pdb: the sites after filtering by
CV and clustering by distance
The PDB data columns give the fractional occupancy and
the site CV w.r.t the solute.
- <jobname>._gs_fcl.inf: Detailed information
(coordinates, occupancy, CV, RMSD, energy, proximal solute atom)
on the sites in each cluster and their averages over the cluster members
Depending on Key4, the sites or solvents may be preceded by
the full solute.
Representative and the composite configurations list
the solvents in the order of the sites they match to,
but may be offset due to missing matches.
Also, the standard output has a combined list of the sites
and their representatives with their properties.
The run and version number for these files are taken from
the trajectory file and extension defined by Key3.
If the output file already exists, a file with incremented version
number will be created.
III.09.15. Prepare site assignment file, entropy calculation,
clustering sites
IWSL Key1, Data
Potential successor: GENS
Key1 (ask for entropy input)
- WSTO - Write solvent site assignment file (.wsl) and
an input file (.sto) for the program STOW to calculate entropy.
- NOST - Write only solvent site assignment file (.wsl).
Data
clstfin_dmax, nclust_clstfin, cvfilt_stow,
froccmin_stow, rmsdlim_stow, maxclstmem_stow,
subreground
- clstfin_dmax {0.0} - When > 0.0
the filtered sites will be clustered
(using single-link clustering) with a distance cutoff of clstfin_dmax.
If Key1=WSTO than the default value is 3.5 A.
- nclust_clstfin {0} - When > 0
the filtered sites will be clustered (using single-link clustering)
by varying the distance threshold to yield nclust_clstfin
clusters.
- cvfilt_stow {0.0} - When >0.0 the clusters where the
lowest CV value is less then cvfilt_stow will be skipped
from the STOW input.
- occfilt_stow {0.0} - When >0.0 the clusters where the
lowest occupancy value is less then occfilt_stow will be skipped
from the STOW input.
- rmsdlim_stow {0.0} - When >0.0 the clusters where the
highest site rmsd exceeds rmsdlim_stow will be skipped
from the STOW input.
- maxclstmem_stow {10} - Clusters with more than maxclstmem_stow
members will be skipped from the STOW input.
- subreground {0.7} - When calculating the number of neighbour sites,
the mean number of waters in the neighborhood of a site
will be rounded up if the fractional part is > subreground
III.09.16. Generic site analyses
GSAN Key1 [Key2 [Data]]
Prerequisite: IWSL
Can not be followed by: GENS
Related key: GSAO
Key1 (analysis type)
- DPCV - the covariance matrix among the water
dipoles corresponding to each site will be
written to a file with extension .cov.
- DPCR - the correlation matrix among the water
dipoles corresponding to each site will be
written to a file with extension .cor.
- ENRG - the matrix of the average energies between waters belonging
all site pairs will be written to a file with extension .eng.
- HBND - the matrix of the fraction of time
waters belonging to each site are hydrogen bonded
will be written to a file with extension .hbd and
site-solute hydrogen bond statistics will be calculated.
- HBTT - in addition to the site-site hydrogen-bond occupancy
matrix, the matrix of site-solute and solute solute hydrogen bond
occupancies will be printed.
The solute hydrogen bonds are condensed by residues and residues not
forming hydrogen bond with sites are omitted.
- OCCC - the matrix the joint and alternative occupancies
will be written to a file with extension oro.
- JMPA - the average number of frames between solvent jumps
will be calculated
Key2 (eigenvalue/vector calculation, normalization, order)
For Key1=DPCV, DPCR, ENRG, or HBND only:
- NOEV - Don't calculate the eigenvalues/eigenvectors of this matrix
- CAEV - Calculate the eigenvalues/eigenvectors of this matrix
and write it to a file with extension .eiv
For Key1=OCCC only:
- ONRM - Normalize the occupancies calculated with the occupancy
expected from uniform distribution
For Key1=JMPA only:
- RDOR - Read in the original solvent numbers from a file created
by FILT TR*O
Data for Key1=HBND: rxhmax, hbanglemin,
occmin_at, occmin_res
- rxhmax {3.0} - maximum X...H distance for a hydrogen bond
- hbanglemin {90.0} - minimum X...H-Y angle for a hydrogen bond
- occmin_at {0.0} - minimum occupancy to show in the
solute atom - site hydrogen bond list
- occmin_res {0.0} - minimum occupancy to show in the
solute residue - site hydrogen bond list
Data for Key1=JMPA and Key2=RDOR:
numrunorig, iversorig
- numrunorig {numrun}, iversorig {1} -
the run number and version number of the original solvent number file
This key activates the selected analysis options after a generic
site calculation (with the key
GENS)
III.09.17. Generic site analysis options
GSAO Key1 Key2 Key3 [Data]
Prerequisite: IWSL
Can not be followed by: GENS
Related key: GSAN
Key1 (matrix sort option)
- NOSR - print the matrix in the original site order
- SXYZ - print the matrix sorted by one of the site coordinates
- RXYZ - print the matrix sorted by the site-origin distance
* sign of the one of the coordinates
- LEXI - print the matrix sorted by a quasi-lexicographical order
Key2 (output format specification)
- ASCI - formatted output, five numbers per line (text file)
- ASCL - formatted output, one line per site (text file)
- BNRY - unformatted output (binary file)
Key3 (vector to use for covariance when
Key1 of GSAN
=DPCV or DPCR)
- DIPL - Covariances are calculated for the HOH bisector
- HHVC - Covariances are calculated for the H-H vector
- DPHH - Covariances are calculated for the vector
normal to the HOH plane
Data for Key1=SXYZ or RXYZ: iaxis
- iaxis sorting wil be based on the x, y, or z coordinates
for iaxis = 1, 2, or,3, resp.
Data for Key1=LEXI:
w, iaxis1 iaxis2 iaxis3
- w, iaxis1 iaxis2 iaxis3 - Sites will be
sorted based on
w^2*c(iaxis1) +w*c(iaxis2) +c(iaxis2)
III.09.18. Generic site graph analysis by frame
GABF Key1 [Data]
Key1 (print option)
- NODP - No frame-by-frame printout
- DTPR - Frame-by-frame printout
- DLPR - Frame-by-frame printout with list of hydrogen bonds
Data rxhmax, hbanglemin, hbmatmin
- rxhmax {3.0} - maximum X...H distance for a hydrogen bond
- hbanglemin {90.0} - minimum X...H-Y angle for a hydrogen bond
- hbmatmin {0.0} - the clustering will use an adjacency matrix
whose elements are 1 when the frame-averaged connectivity probability is
> hbmatmin
Calculate the hydrogen-bond matrix for each frame,
find the connected clusters and accumulate (and average) the matrix where
all cluster members are connected.
The matrix of the connectivity probabilities will be written to a file
with extension .hbf.
III.09.19. Configuration with input torsion angles
GENT Data
Prerequisites: CNFG,
TORD.
Data ntorch
- ntorch - Number of torsion angles to specify
Fdata: ntorch records
- ia1, ia2, ang (2i5,f10.0)
- ia1, ia2 - Atoms of torsion bond to change
- ang - New torsion angle
If the atoms ia1 and ia2 are listed in the reverse order
on the torsion list, the torsion angle used will be -ang.
III.09.20. Calculate torsion angle distribution
TAND
Prerequisite:
TORD
Potential successor:
PXAN
When this key is present, analysis of a configuration or trajectory
(activated by the SCAN key)
will include the calculation of torsion angle distributions for
torsions defined by the TORD key.
The number of angular grids will be determined by the
value of #TD set by the preprocessor.
Besides the distribution, the program prints the standard deviation
(meaningful for not too wide distribution only), the
circular variance
(on a scale of 0 to 1) and the number of grids sampled.
III.09.21. Energy analysis along a torsion
TORT Key1 Data
Prerequisites: CNFG,
TORD.
Key1 (reference energy)
- ABSE - The actual energy terms will be printed
- RELE - Each energy term represents the difference
between the current and initial conformation
Data ntsamp times (ia1, ia1),
anginc, PDBfile
- ia1, ia1 - atomindices of the
atoms forming the torsion bond to be sampled
- anginc {5.0} - angle increment for the scan (in degrees)
- engmax {99999.9} - maximum energy to print
- PDBfile - if present, each structure generated will be
written to PDBfile.pdb
This key calculates the various non-bonded and torsion terms
as the selected torsion angle is driven through the full circle.
The values printed are relative to the initial torsion angle.
If a conformation was generated previously with the
GENT key then the torsion scan will
start at those angles (otherwise at 0).
III.09.22. Energy analysis of backbone loop solutions
LPST Key1 Data
Prerequisites: CNFG,
LOOP, TORD
Key1 (reference energy)
- ABSE - The actual energy terms will be printed
- RELE - Each energy term represents the difference
between the current and initial conformation
Data ia1, ia1, PDBfile
- ia1, ia1 - atomindices of the
atoms forming the torsion bond
- PDBfile -if present, each structure generated will be
written to PDBfile.pdb
This key calculates all possible backbone solutions for the six
torsions following the selected torsion angle,
and calculates the same energy contributions as the
TORT key.
HBBR Key1 Data
III.09.23. Aggregate insertion and deletion sites
IDAG Key1, Key2, Data
Prerequisite: IDLG
Related key:
GCEN
Key1 (select insert/delete/both)
- BOTH - Both insertion and deletion sites will be extracted
- INSR - Only insertion sites will be extracted
- DELE - Only deletion sites will be extracted
Key2 (filtering option)
- NOFT - All sites will be considered
- FILT - Sites will be subject to a proximity filter:
Data rmax, nslt1, nslt2, nmcscan
- rmax, nslt1, nslt2 -
Only present when Key2=FILT.
Sites selected should be closer than rmax to one of the
solute atoms in the range nslt1-nslt2
- nmcscan {0} - when not zero, the log file will be scanned
until the nmcscan-th MC step.
This option will prepare a PDB file containing the solute and
the sites of insertions and/or deletions as read from the
insertion deletion log file
(created by the IDLG key during a
grand-canonical ensemble run (see key GCEN)).
Insertion sites will be represented as oxygens
while deletion sites as nitrogens.
For solute atom records,
the occupancy and temperature factor columns will contain the distance of
the closest deletion and/or insertion sites (as specified by Key1)
and the number of such sites, resp.
For the deletion and/or insertion site records,
the occupancy and temperature factor columns will contain the index of
the proximal solute atom and the distance from it, resp.
III.09.24. Trajectory superimposition
SUPI Key1, Key2, Data
Prerequisite: TRAJ
When filtering a trajectory, the filtered configuration
will be superimposed to reference configuration.
Key1 (reference structure)
- CONF - Reference structure is the structure read with
the CNFG key
- TRAJ - Reference structure is the first structure read with
the TRAJ key
Key2 (atoms to be used)
- RNGE - Solute atoms to be used will be defined as index range(s)
- PROX - Solute atomd to be used will be defined as the
vicinity of a selected atom in the reference structure
Data for Key2=RNGE:
nsltseg, ifsltseg1, ilsltseg1,
ifsltseg2, ilsltseg2, ..., nslvseg,
ifslvseg1, ilslvseg1, ifslvseg2,
ilslvseg2
There will be nsltseg solute atom segments to be used,
and their limits (inclusive) are ifsltseg1, ilsltseg1, etc.
(up to a maximum of 250 segments).
i*sltseg* values refer to atom numbers.
Data for Key2=PROX: iaslt_cent, range
Atoms within range Å of solute atom
iaslt_cent will be used for superimposition
III.09.25. Solvent dipole distribution
SVDP Data
Prerequisites:
SLVA, SLTA
Data exa, eyb, ezc
- exa, eyb, ezc - unit vectors in the direction
of coodinate axes (default: Cartesian axes)
The program will calculate the distributions of the angles between the
solvent's dipole vector and the three coordinate axes.
III.09.26. Monitoring bulk solvent density
MOND [Data]
Prerequisite:
PBCN
Prerequisite of:
BTUN
Can not be followed by:
PRTG
Data:
glimminx, glimmaxx, glimminy, glimmaxy,
glimminz, glimmaxz
- The solvent density will be monitored inside and outside the
rectangle defined by the ranges
glimminx to glimmaxx,
glimminy to glimmaxy,
glimminz to glimmaxz in the x, y, and z directions,
respectively.
For non-rectangular simulation cells the center of the rectangle
has to be the origin of the coordinate system.
III.09.27. Monitoring bulk solvent density
BLKW [Data]
Prerequisite:
PBCN
Prerequisite of:
BTUN
Can not be followed by:
PRTG
Data: wlayer
- The solvent density will be monitored inside and outside the
rectangle defined by the ranges
glimminx to glimmaxx,
glimminy to glimmaxy,
glimminz to glimmaxz in the x, y, and z directions,
respectively, where these limits of the inside box are determined
to be wlayer away from the edges of the simulation rectangle.
It is only applicable with rectangular or cubic bundary condition and
the center of the rectangle
has to be the origin of the coordinate system.
III.10. Proximity analysis keywords
- PXYZ: Permute coordinates
read
- PXCR: Proximity criterion
choice
- FCGD: Functional group
definition
- PXLM: Omitting atoms from
proximity considerations
- RFSL: Set first shell
radii
- RMOD: First shell radius
modifications
- DBLG: Double proximity
radial gridsize
- DIPC: Dipole
fluctutations
- GENV: Proximity volume
element generation
- GRAV: Proximity RDF
averageing
- PXTD: Proximity table
difference calculation
- PXDP: Solute-solvent
proximity dipole distr. calculation
- PXGR: Solute-solvent
proximity g(r) and coordination number calculation
- PXBE: Solute-solvent
proximity binding energy calculation
- PXWW: Solvent-solvent
proximity distribution calculation
- VOLE: Volume element
regeneration
- VORO: Voronoi analysis
- RTIM: Solvent residence
time
- DIFC: Solvent diffusion
constant
- TAUC: Torsion angle autocorrelation
III.10.01. Permute coordinates read from history file
PXYZ Data
Potential successor:
CNFG
Data: ix, iy, iz
- ix, iy, iz - the X, Y, and Z coordinates
will be permuted to the ix-th, iy-th and iz-th
place before analysis or before writing out a new trajectory
with the key WTRA.
For example, to switch to the convention used for CHARMM
(Z axis is the prism axis, hexagon vertex on the Y axis) use
PXYZ 1 3 2 .
The coordinates will be permuted before analysis or before writing
the
new trajectory.
This functionality allows the use of a trajectory file where
the hexagonal prism's orientation is different from
the convention MMC is using.
Note, that it is only meaningful to use this key with
hexagonal prism cell.
Note also that the change is applied only to the configurations
analyzed.
Thus, depending on whether the last configuration read was
analyzed,
the configuration current after a SCAN
may or may not have the coordinates permuted.
III.10.02. Proximity criterion choice
PXCR Key1 Key2 Key3 [Data]
Prerequisite:
SLTA
Prerequisite of:
GENV, PXGR,
PXWW, PXBE,
PXLM, TIMM,
FLDG, RTIM,
DIFC, PXDP,
PXTD, PXWR
Key1 (solvent density source):
- TDEN - Solvent density is obtained form the cell volume
and solute p.m.v.
- PDEN - Solvent density is obtained from the considered
primary region volume
- WDEN - Solvent density is obtained from the solvent's
p.m.v.
- MDEN - Solvent density is obtained form the cell volume
and the solute volume (including the layer of solvent radius), as
calculated from onsets of the primary RDF's and the Monte Carlo
estimates of the shell volumes.
Key2 (tesselation strategy):
- BISE -
Original proximity criterion based on
bisectors
- RPVW -
Radical plane-based proximity criterion,
using Van der Waals radii
- RPCH -
Radical plane-based proximity criterion,
using atomic charges
Key3 (ignoring atoms):
- ALL - Use all atoms on the solute for proximity
index calculation
- NOHD - Ignore hydrogens on the solute for proximity
index calculation
- URBG - Use only solute atoms in the ranges
(specified below) for proximity index calculation
- URNH - same as Key3=URBG but, in addition,
ignore all hydrogens
Solute atoms ignored by Key3 will not contribute
to any rdf either.
Data:
- For Key3=RNG* only:
nrange; irangestart, irangestop (nrange
times)
- only atoms in the ranges irangestart - irangestop
will be used for proximity index calculation
- For Key2=RPCH only:
rqwmin, rqwmax - the atomic charges will be
mapped (linearly onto the [rqwmin,rqwmax] interval to
be used as radii in defining the radical planes.
Default values: rqwmin {0.0}, rqwmax {1.5}.
- pmvslt, pmvslv
- pmvslt {0.0} - solute partial molar volume in
ml/mol
- pmvslv {18.12004} - solvent partial molar volume
in ml/mol
III.10.03. Functional group definition
FCGD Key1 Data Fdata
Prerequisite: SLTA
Can not be followed by: FCGA
Key1 (definition type):
- STND - Standard (chemically defined) functional groups
will be used.
- INTO - Functional groups will be defined by the user.
The information will only be used for tabulating proximity
results.
- INTG - Functional groups will be defined by the user.
The information will be used for
tabulating proximity results and for grouping the
calculated distributions, overriding the
indxrdf
information given after the SLTA
key.
Data:
- ninpgrp - Number of functional groups to be defined.
- ninpspecd - Number of descriptors to be read.
Fdata:
- RECTYPE 11: (ninpgrp times):
short name, long name, scope (a4,1x,a8,i5)
- short name, long name - The names of the newly
defined functional groups.
- scope {0} - The scope of a definition.
- scope=0: All atoms satisfying the definition(s)
will form a single functional group
- scope=1: Atoms satisfying the definition(s)
and belonging to different residues (solute groups)
will form different functional groups
- scope=2: Atoms satisfying the definition(s)
and belonging to different solute molecules
will form different functional groups
- RECTYPE 12: (ninpspecd times):
resnam, atnam, labfg8, ifgrp
(a4,1x,a4,1x,a10,i5)
- resnam, atnam, labfg8, ifgrp
- Solute atoms with residue name resnam, atom name
atnam,
(see labslt for key SLTA)
long atom label (spgroup, see key SLTA)
of labfg8 are defined to belong to the ifgrp-th
functional group.
Blank field for any of these three variables will match everything.
Asterisk in any of these fields will match any character.
Note, that care must be exercised when this option is used after
restoring a checkpoint file.
If the INTG key was used before resulting in the non-default
grouping of rdf's then using it again or using the STND key
(to revert to the default) will abort the run since the original
atom by atom information is lost.
III.10.04. Omitting atoms from proximity considerations
PXLM Data
Prerequisite: PXCR
Data: nfadel, nladel
- nfadel {0} - The first nfadel solute atoms will
be
omitted from proximity considerations (i.e., when assigning
solvents
to solute atoms, these omitted atoms will be ignored).
- nladel {0} - The last nladel solute atoms will be
omitted from proximity considerations.
III.10.05. First shell radius definition
RFSL Key1 [Data]
Prerequisite:
SUPT
Potential successor:
SLTA
Can not be followed by: SLTA
Key1 (source of radii):
- STOR - Use stored first shell radii (shown on the list
obtained by PLBP.
Data: rfsfac, rfsadd
- rfsfac {1.0}
- All first shell radii will be multiplied by rfsfac.
- rfsadd {0.0}
- All first shell radii will be incremented by rfsadd
(after multiplication by rfsfac).
- SIGM - First shell radii will be obtained as the mean of
the L-J sigma parameters of the solvent and solute atoms.
The solvent's sigma is the sigma of the islvrep-th atom
(see key SLVA) of a general solvent or
the sigma of the water oxygen.
Data: rfsfac, rfsadd, sigmah
- rfsfac {1.05}
- All first shell radii will be multiplied by rfsfac.
- rfsadd {0.0}
- All first shell radii will be incremented by rfsadd
(after multiplication by rfsfac).
- sigmah {0} When Key1=SIGM and sigmah is
not zero, sigmah will be the sigma value used for the solute
hydrogen atoms.
- RDFM - Replace the first shell radii with the calculated
minima
of the primary RDF's.
III.10.06. First shell radius modification
RMOD Data
Data: nmod, nmod times : pfll, pfl,
rfsl
- nmod - number of potential types to modify
- pfll - potential library label
(see key SUPT)
- pfl - atom type label or atom type number to modify
- rfsl - new first shell radius for potential type
pfl in the potential library pfll
III.10.07. Radial gridsize doubling
DBLG
This key will double the gridsize for the proximity analysis
radial grid, modify all the accumulators involved
and print the full result pages with the modified gridsize.
If the PXPL key was used earlier, the
proximity distributions will also be written on the .pxp
file.
Note, that the change is permanent for the remainder
of the calculations.
III.10.08. Dipole fluctuation calculation
DIPC
The fluctuation of the dipole moments will be calculated
in the proximity regions to give estimates of the dielectric
constant.
III.10.09. Monte Carlo volume element estimation
GENV Key1 Data [Data]
Prerequisites:
SLTA, PXCR,
PXGR
Key1 (possible visualization):
- NOWR - Don't write .pxv file
- WRIT - Write file .pxv to visualize proximity
regions
Data: nransn, iwritefs, iwritels
- nransn - The number of (additional) random points to
generate this time. This will increase the precision of volume
estimates, but only when the solute conformation remains
fixed during the simulation.
- iwritefs {1}, iwritels {nslt} - points
in the proximity of solute atoms from iwritefs to
iwritels will be written to the
file <jobname>.<numrun>.pxv
- maxprint {100} - maximum number of points written per
proximity region
The calculation of the proximity RDF's require the volume element
of
shells in the various proximity regions. This volume is calculated
by a Monte Carlo method: random points of uniform distributions are
generated and the volume of each shell is assumed to be
proportional to
the number falling into them.
Generating a certain number of random points
after each snapshot analyzed
(see keys SCAN or PXAN),
instead of generating them in one step at the beginning or end,
allows for an approximate accounting of the variation of these
volume element as a result of conformational changes in the solute.
III.10.10. Calculating RDF extrema averages
GRAV Key1 [Data]
Prerequisite:
PXGR
Key1 (averaging strategy):
- ELEM - Averages calculated over chemical elements
- PTYP - Averages calculated over potential types
Data: rkgrmin, r2kgrmin, rtkgrmin
- rkgrmin {0.5} - Minimum first shell coordination number
to contribute to averages
- rkgrmin {2.5} - Minimum first+second shell coordination
number to contribute to averages
- rkgrmin {10.0} - Minimum total coordination number to
contribute to averages
If primary or total RDF's were calculated, the program averages
the location and values of the first and second maxima, and the
first minima for atoms that are sufficiently solvated
as defined by the threshold values inputted above.
III.10.11. Proximity table difference generation
PXTD Data
Prerequisite:
PXCR
Data: filename1, runnum1, filename2,
runnum2
- filename1 - File name root of the first checkpoint file
- runnum1 -
Run number
of the first checkpoint file
- filename2 - File name root of the second checkpoint file
- runnum2 - Run number of the second checkpoint file
The program reads both checkpoint files and calculated the
differences of
the table entries (shell volumes, coordination numbers, densities,
energies, etc.). The two checkpoint files should refer to the same
system.
III.10.12. Solute-solvent dipole distribution function
calculation
PXDP
Prerequisites:
PXCR, PXGR
Potential successor:
PXAN
This key will result in the calculation of two functions for the
solute atoms,
based on the angle THETA between the line drawn from the solute
atom to the
first solvent atom and the bisector of the angle S2-S1-S3
(i.e., assuming that the solvent is water and the first atom
is the oxigen, the dipole direction):
- The distribution of THETA for waters in the first shell around
the solute atom.
- The average of THETA as a function of the waters' distance from
their proximal solute atom.
III.10.13. Solute-solvent proximity radial distribution
function and coordination number calculation
PXGR Key1 Key2 Key3 Data
Prerequisite:
SLTA
Prerequisite of:
GENV, PXGR,
PXWW, PXBE,
PXLM, TIMM,
FLDG, RTIM,
DIFC, PXDP,
PXTD, PXWR
Key1 ('total' solute-solvent g(r) calc.):
- NOTO - Don't calculate total solute-solvent g(r)'s
- PXTO - Calculate total solute-solvent g(r)'s (centered
at each solute atom)
Key2 (proximity solute-solvent g(r) calc.):
- NOPR - Don't calculate solute-solvent proximity
g(r)'s or proximity Kc's
- PRKC - Calculate proximity coordination numbers
- PRIM - Calculate solute-solvent proximity g(r)'s,
coordination numbers and their distributions
Key3 (density weighting for proximity regions):
- NOWT - No density weighting is used for the primary
solute-solvent g(r)'s
- TDWT - Primary solute-solvent g(r)'s are weighted by
the total primary area solvent density
- PDWT - Primary solute-solvent g(r)'s are weighted by
the density of the primary area used for the g(r) calculation
Data: rgpxmax, rpxdiv, rpxnear
- rgpxmax {10.0} - Range of the solute-solvent g(r)'s in Å
- rpxdiv {0.1} - Grid size to be used with the
solute-solvent g(r)'s in Å
- rpxnear {1.0} - When a solvent gets closer than
rpxnear Å to the solute, a strong warning is issued
and the configuration is saved
The total g(r) is calculated for the solute atoms by considering
all solvents.
The proximity g(r) is calculated for solute atoms by considering
only
solvents proximal to it.
Coordination numbers (and their distributions) will be calculated
if either
total g(r) or proximity g(r) calculation was requested.
III.10.14. Solute-solvent proximity binding energy
calculation
PXBE Data
Prerequisite:
PXCR
Potential successor:
PXAN
Data: epmink, epdivk
- epmink - Minimum of the solute-solvent pair
energy distributions (in kcal/mol)
- epdivk - Grid size of the solute-solvent pair energy
distributions (in kcal/mol), assuming that there are 100 grid points.
This calculation will generate the distribution of the solute
solvent
energies for solvents in the first shell, as well as the sum of
solute solvent
energies, summed both for all solvent is the first shell and
for all solvents in the whole proximity region.
III.10.15. Solvent-solvent proximity distribution
calculation
PXWW Key1 Key2 Data
Prerequisite:
PXCR
Potential successor:
PXAN
Key1 (coordination number and pair energy distr. calc.):
- NOPE - No proximity solvent-solvent coordination number
and pair-energy averages will be calculated
- PEWW - Calculate proximity solvent-solvent
coordination number and pair-energy averages
Key2 (g(r) calculation):
- NOGR - No proximity solvent-solvent g(r) will be
calculated
- GRWW - Calculate proximity solvent-solvent g(r)
Data: rsolmin, rsolmax,
[rfsww, rceww,]
[rgpxmax, rpxdiv,
nijgvv, nijgvv times: islvirep(i),
islvjrep(i) ...]
- rsolmin, rsolmax: solvent-solvent proximity
analysis will use only solvents whose distance from
their proximal solute atom is in the range
[rsolmin,rsolmax]
If Key1 = PEWW:
- rfsww: the solvent first shell
radius to be used in the coordination number and pair energy
calculations.
- rceww: the solvent binding energy calculations will
include all solvents within a sphere of radius rceww.
If Key2 = GRWW:
- rgpxmax {10.0} - Range of the solvent-solvent g(r)'s
- rpxdiv {0.1} - g(r) grid size to be used.
If the key PXGR is also present,
the larger of the rgpxmax values will be used and
the two rpxdiv values have to be the same.
- nijgvv {0} - The number of different solvent-solvent
RDF's to calculate.
- islvirep(i), islvjrep(i): the i-th
solvent-solvent RDF
will be calculated between solvent centers islvirep(i)
and islvjrep(i).
The solvent-solvent coordination number is the mean number of
solvents
within a sphere of radius rfsww, averaged over all solvents
considered. The solvent-solvent pair energy is the average of
water-water energies, calculated for waters within a distance of
rfsww and
at least one of the pair being considered for the solvent-solvent
analysis.
The solvent-solvent binding energy is the sum of water-water
energies, averaged fover all solvents considered.
III.10.16. Volume element preservation/regeneration
VOLE Key1
Can not be followed by: RCKP
Key1:
- DROP - Volume element estimates are kept for successive
runs using the same solute
- KEEP - Volume element estimates are regenareted for each
run.
III.10.17. Voronoi polyhedra analysis
VORO Key1, Key2, Key3, Data
Prerequisites:
CNFG, PBCN
Potential successor:
PXAN
Key1 (use of PBC):
- PBC - Analyze with periodic images included
- NPBC - Analyze with periodic images ignored
Key2:
- CONF - Analyze single configuration
- TRAJ - Analyze trajectory
Key3:
- ALL - Use all atoms in Voronoi construction
- NOHD - Ignore hydrogens in Voronoi construction
NOT IMPLEMENTED YET
III.10.18. Solvent residence time calculation
RTIM Key1, Data
Prerequisite:
PXCR
Potential successor:
PXAN
Variable to set to 'T' in the preprocessor: DR
Key1 (shell boundary)
- FRST - Residence time within first shell
- SCND - Residence time within first two shells
- READ - Residence time within user-defined shell radius
Data: timestep, - aucmin,
- nzerosignore,
rtimlim
- timestep - The simulation timestep used
- aucmin - The exponential fit will use the autocorrelation
function until its value drops below aucmin
- nzerosignore {0} - In the list of 0's and 1's representing out
and in the solvation shell, resp., set to one zeros when there are at most
nzerosignore zeros are between ones.
- rtimlim {10.0} - Residence times will be calculated for
staying within rtimlim Å of the solute (only when Key1=READ)
The program will calculate the autocorrelation function
of the residence function (0: out; 1: in) and save it
on a file <jobname>.rtm
to be processed into actual times by a seperate program.
III.10.19. Solvent diffusion constant calculation
DIFC Data
Prerequisite:
PXCR
Variable to set to 'T' in the preprocessor: DR
Data: timestep, dcgrid, rdc1, rdc2
- timestep - The simulation timestep used
- dcgrid - The gridsize (in Å) for the diffusion
calculation
- rdc1 {0.0}, rdc2 {10.0}, - Diffusion constants
will be calculated for solvents whose distance from the solute
is in the range of [rdc1,rdc2] Å
(in addition to calculations for the first and second solvation shell).
The program will calculate the conformational average of the
solvent displacements in each proximity region; within the first shell,
the first two shells and the range defined by rdc1 and rdc2.
III.10.20. Torsion angle autocorrelation calculation
TAUC Data
Prerequisite: PART
Data: nmctorauc,tauc_timestep, tauc_min,
ntaucprint
- nmctorauc - The torsion angles are saved at every
nmctoraucth step for autocorrelation calculation.
- tauc_timestep {1} - The time elapsed between two samples
- tauc_min {0.0} - The minimum of the autocorrelation function
to be used for the fit
- ntaucprint {20} - the number of autocorrelation function values
to print (evenly distributed)
III.11. Numerical precision related keywords
- HRDW: Computer
architecture
- SLFT: Self test
- SLFT: Self test tolerances
- FIXD: Round-off error
elimination
- REGE: Regenerate from
torsion angles
III.11.01. Computer architecture-dependent algorithm
selection
HRDW Key1
Potential successor:
NSLV
Key1 (code-determining hardware selection):
- SCAL - 'Conventional' scalar machine
(32-bit words)
- VC32 - Vector machine, 32 bits word, requiring split
loops to preserve argument order (e.g., SGI, IBM)
- VC64 - Vector machine, 64 bits word (e.g., Cray)
III.11.02. Periodic self test
SLFT Key1 [Key2] [Data]
Related key:
PDBT.
At each nrep-th (see key RUNS) step
the program performs a
consistency check as specified by Key1.
Key1 (self test level):
III.11.03. Input self-test tolerances
STOL Data
Prerequisite: CNFG
Data: engtol, virtol, tortol,
comtol, zmattol, cslttol, d12tol,
d13tol, wsumtol
Self test tolerances for various quantities:
- engtol {0.0001} - for relative deviation in energies
- virtol {0.001} - for relative deviation in virial sums
- tortol {0.1} - for torsion angles
- comtol {0.001} - for centers of masses
- zmattol {0.0001} - for orientation matrix orthonormality
- cslttol {0.001} - for solute atom positions (on distance squares)
- d12tol {0.1} - for bond lengths
(see LOOP LIMM)
- d13tol {0.3} - for 1-3 distances
(see LOOP LIMM)
- wsumtol {0.0001} - preferential sampling weight sums
The default of engtol is relaxed by a factor of 10 for
(TPN) ensemble run (see key IBEN),
for runs with more than 1000 solute atoms or 1000 solvent
molecules, for systems with active torsions
(see key PART), or when
the C@G7 lines are active;
the default of tortol is relaxed to 20.0
when loop moves don't even touch the rest of the chain
(see LOOP LIMM);
the default of cslttol is relaxed to 0.001
for systems with more than 200 solute atoms.
III.11.04. Energy contribution recalculation
to eliminate round-off errors
FIXD Key 1 Data
Key1: (action on self-test failure):
- STPD - Stop run if self-test run before energy recalculation fails
- IGND - Ignore if self-test run before energy recalculation fails
- FIXD - Attempt to fix if self-test run
before energy recalculation fails, stop if fix is unsuccesfull
- FXDI - Attempt to fix if self-test run
before energy recalculation fails, but run no matter how the fix worked
Data: nmcacord, nmcgcord
- nmcacord - Recalculate from scratch all the current
energy contribution values and solute geometries
after every nmcacord MC step.
- nmcgcord - Recalculate from scratch only the
solute geometries after every nmcacord MC step.
There are several quantities that the program calculates by
accumulating the changes to them. Round-off errors may cause
some self tests to fail. This option eliminates periodically these
round-off errors.
III.11.05. Regenerate solute from torsion angles
to eliminate round-off errors
REGE
Prerequisite: TORD,
CNFG
When this keyword is invoked, the program takes the torsion angles
read from the .crd file and applies them to the solute molecule
given as
input after the SLTA key.
The rigid core of the molecule will be taken from the .crd file.
This way the input molecule's bond lengths and bond angles will be
made to agree with the input definition.
III.12. Developpers' keywords
- TEST: Special code-testing
- OPTN: Direct option
input
- DBUG: Input debug keys
listing
- CHKP: Checkpoint file saving
frequency
See also the key OUTP PROP.
III.12.01. Special code testing
TEST Key1 [Data]
Key1: (test type):
- PBCT - Periodic boundary condition test by system
translation
Data: xcn, ycn, zcn
- xcn {10.0} - x coordinates of the new center
- ycn {20.0} - y coordinates of the new center
- zcn {30.0} - z coordinates of the new center
The program shifts the system to
(xcn,ycn,zcn) and applies the PBC shifts
to the part of the system that is found to be outside the cell.
This will
result in a system that gives a different representation of the
same periodic system as before.
Comparison of the total energy (and its components) calculated on
the system with and without the shift should not show any
difference
(apart from round-off related deviations). Any difference should be
indicative of incorrect PBC handling.
- CPLW - Test coupling parameter weight calculation
- FSLT - Test solute force calculation by
finite difference quotients
Data: nfrq, dx
- nfrq {1} - every nfrq-th solvent molecule
(starting with the first) will be tested
- dx {0.1} - the delta x value for the finite difference calculation
- FSLV - Test solvent force calculation by
finite difference quotients
Data: nfrq, dx
- nfrq {1} - every nfrq-th solute
atom (starting with the first) will be tested
- dx {0.1} - the delta x value for the finite difference calculation
III.12.02. Direct option input
OPTN Data
Data: nopt, nopt times: index, value
- index, value - Set the internal option array
iop as iop(index)=value.
This key allows to set the internal option arrays without the
keyword
mechanism normally used.
III.12.03. Input debug keys
DBUG Data
Data: nkeys, nkeys times: index, value
- index, value - Set the internal debug option
array
(idebug) as idebug(index)=value.
numbers and the file unit numbers will be also listed. If
III.12.04. Checkpoint file saving frequency
CHKP Data
Potential successor:
PXAN
Data: nrecd
- nrecd - After every nrecd steps,
information in all of the common blocks are saved on the .ckp file.
This option overrides the default checkpoint file saving frequency
allowing the localization of a program failure (detected usually by a
failed self test).
See also the key SCKP for additional
options.
III.01.15. History file buffer save
BUFF
When this key is present, the buffers of the open history files are
written out whenever a new checkpoint file is saved.
This is currently achieved by rewindig the file and skipping to
the end.
III.13. Keyword summary
- AROM 05.18 Aromatic carbon list
- ATFR 09.03 Calculation of
mean force on solute atoms
- BLKW 09.27 Bulk density
monitoring
- BNDL 08.05 Listing of bonds
found
- BRKB 04.05 Break bonds
- BTUN 02.07 Tune GCE B
parameter
- BUFF 01.15 History file
buffer save
- CPOC 09.13 Cavity/pocket occupancy
- CHKP 12.04 Checkpoint file
saving frequency
- CHRG 05.15 Solute charge
manipulations
- CLON 04.03 Solute cloning
- CMPC 09.09 Configuration
comparison
- CNFG 02.11 Initial
configuration
- CSEG 04.11 CHARMM segment
id handling
- CNST 05.23 Constraint potential
- DBLG 10.07 Double proximity
radial gridsize
- DBUG 12.03 Input debug keys
- DENF 09.10 Combined solvent
conformation
- DIFC 10.19 Solvent diffusion
constant
- DIPC 10.08 Dipole
fluctutations
- DSTC 09.01 Distribution
calculation
- DSTP 08.03 Distribution
print
- ENGL 09.06 Energy
contribution calculation
- ENHB 05.20 Hydrogen-bond term
- EPLT 08.09 Energy
convergence plot form
- FCGA 09.02 Functional group
analysis
- FCGD 10.03 Functional group
definition
- FILE 01.01 File name root
- FILT 09.07 Solvent
filtering
- FIXD 11.04 Round-off error
elimination
- FLDG 09.04 Field gradient
calculation on solvents
- FLXR 05.22 Residues to sample
- FREE 03.01 Free energy
options
- GABF 09.18 Generic site
graph analysis by frame
- GCEN 02.05 Grand-canonical
ensemble
- GENS 09.14 Generic
solvation site calculation
- GENT 09.19 Configuration with
input torsion angles
- GENV 10.09 Proximity volume
element
- GRAV 10.10 Proximity RDF
averageing
- GSAN 09.17 Generic site analyses
- HBBR 09.08 Hydrogen-bond
bridges
- HBMO 06.21 Non-default hydrogen-bond
- HRDW 11.01 Computer
architecture
- IBEN 02.09 (TPN) ensemble
- IDAC 01.08 Proximity
- IDAG 09.23 Aggregate insertion and
deletion sites
- IDLG 01.09 I/D log file
- IGSV 06.18 Non-interactive solvent
- IGJA 06.16 Ignore Jacobian
- INCT 05.11 Solvent inner
cutoff
- IWSL 09.15 Prepare site assignment file,
entropy input
- KMNP 08.08 PDB file
property threshold
- LCMP 02.10 Virial biasing factor
- LIGA 02.16 Multi-copy ligand
- LIMG 02.08 Limit the cavity
grid
- LOOP 06.15 Loop torsion move
- LPST 09.22 Loop solution list
- MAKB 04.04 Make bonds
- MINE 01.07 Minimum energy structure extr
into history file
- MIXR 05.03 LJ parameter
mixing rule
- MODA 05.06 Element
modification
- MOLD 04.08 Define solute
molecules
- MOND 09.26 Bulk density
monitoring
- MOVE 06.01 Move selection
- MVRT 06.10 Alternate translation
and rotation
- MV2S 06.08 Correlated
2-solute molecule move
- NFBU 06.03 Solute force
bias
- NMVP 08.11 Print solvents
not moved
- NONB 06.13 Non-Boltzmann
samplings
- NSLV 02.01 Number of solvent
moleculess
- OPTN 12.02 Direct option
input
- OUTP 08.01 Print various
system characteristics
- OVRA 03.06 Overlap ratio
evaluation
- OVST 02.14 Solute copy
overlay
- PARD 06.06 Solute molecule
displacement
- PART 06.09 Solute torsion
- PBCT 11.01 Periodic
boundary condition test by system translation
- PBCN 02.04 Periodic
boundary conditions
- PBGR 02.16 Solute reset
with grids
- PDBT 08.14 PDB atomname convention
- PFRD 05.07 Potential
library input
- PLBP 08.07 Potential
parameter echo
- PLCV 08.15 Postscript
convergence plots
- PMOD 05.05 Potential
library modifications
- PRAC 08.04 Acceptance rate
print detail
- PRCO 08.06 Compilation
option listing
- PRFI 06.11
Insertion/deletion pref. sampl.
- PRMF 05.04 Torsion
potential specification
- PRNT 08.02 Input echo level
- PROT 05.12 Minimum
repulsion for general solvent
- PRPL 08.10 Plotted data print
- PRTG 09.11 Print cavity
grids
- PXAN 07.03 Proximity
analysis frequencies
- PXBE 10.14 Solute-solvent
proximity
- PXCR 10.02 Proximity
criterion choice
- PXDP 10.12 Solute-solvent
proximity dipole distr. calculation
binding energy calculation
- PXTD 10.11 Proximity table
difference calculation
- PXGR 10.13 Solute-solvent
proximity g(r) calculation
- PXLM 10.04 Omitting atoms
from proximity considerations
- PXPA 08.12 Empty proximity
region print
- PXPL 01.10 Proximity
analysis distribution plot file writing
- PXPR 08.13 Proximity
analysis printing options
- PXWR 01.11 Proximity
information file
- PXWW 10.15 Solvent-solvent
proximity
distribution calculation
- PXYZ 10.01 Permute
coordinates read
- RAUS 03.02 AUS parameter
change
- RCKP 07.04 Checkpoint file
read
- RDBD 07.04 Bond list input
- REGE 11.05 Regenerate from
torsion angles
- RFCR 05.14 Reaction-field
correction
- RFSL 10.05 Set first shell
radii
- RMCK 10.05 Checkpoint file
removal
- RMOD 10.06 First shell
radius modifications
- RNDG 06.19 Random number
generator information
- RTIM 10.18 Solvent
residence time
- RUNS 07.01 Run MC
simulation
- SAMP 06.02 Sampling
technique
- SANN 02.19 Simulated annealing
- SACP 02.30 Simulated annealing
of the chemical potential
- SCAL 02.13 Scale simulation
cell
- SCAN 07.02 Scan trajectory
for proximity analysis
- SCKP 01.04 Archive checkpoint
files
- SCRM 02.12 Scramble torsion angles
- SEED 06.11 Random number
seed input
- SENS 09.05 Sensitivity
analysis
- SETC 05.14 Set various constants
- SKWT 06.17 Skewed torsion sampling
- SLFT 11.02 Self test
- SLTA 04.01 Solute
description
- SLVA 04.02 Solvent
description
- SPPS 05.17 Special potential
- SPRD 02.15 Spread out
solute molecules
- SPST 05.16 Field-dependent
potentials
- STEP 06.04 Stepsizes
- STIR 01.13 Contact elimination
- STOL 11.03 Self test
- STOP 07.05 Stop the run
- STPS 06.05 Stepsize scaling
- STPX 02.06 Stop GCE run at
nx
- STSC 06.14 Temporary stepsize scaling
- STUN 02.17 Stepsize tuning
- STVG 09.12 Estimate solute
volume
- SUPI 09.24 Trajectory superimposition
- SUPT 05.01 Solute potential
type
- SUVC 05.08 Solute-solvent
cutoff
- SUUC 05.09 Solute-solute
cutoff
- SVDP 09.25 Solvent dipole distribution
- SVIN 05.19 Solvation parameter input
- SVPT 05.02 Solvent
potential type
- SVVC 05.10 Solvent-solvent
cutoff
- SWAP 06.07 Solute molecule
swap
- TAC0 02.18 Stepsize accumulator reset
- TAND 09.20 Torsion angle
distribution
- TAUC 10.20 Torsion angle autocorrelation
- TEMP 02.02 Simulation
temperature
- TITL 02.03 Description
- TIQU 03.05 TI quadrature
- TORD 04.10 Torsion angle
definitions
- TORT 09.21 Energy analysis
along a torsion angle
- TRAJ 01.04 History file
specification
- VOLE 10.16 Volume element
regeneration
- VORO 10.17 Voronoi analysis
- VVNE 05.24 Solvent-solvent electrostatic
- WCKP 01.03 Write checkpoint
file
- WCNF 01.02 Configuration
save
- WMAT 03.03 PMF matching
- WPLT 03.04 AUS iteration plot
- WTRA 01.12 Trajectory
conversions
IV. EXAMPLES
In this section a number of commented input files are presented
illustrating several different options.
- Glycine in water - minimal input; restart from checkpoint
- Ethanol in ethanol (neat liquid)
- Grand-canonical ensemble
- Constant pressure ensemble
- Dimethylphosphate-Sodium complex with
freely moving sodium
- Torsion angle sampling
- Potential of mean force between dimethyl
phosphate and
sodium ions with adaptive umbrella sampling
- Creation/annihilation polynomial path
thermodynamic integration
- Finite difference thermodynamic
integration
- Widom insertion method
- Proximity analysis, trajectory
filtering
- Conformation filtering
- Generic site calculation
- Primary hydration shell calculation
- Cavity and pocket determination
IV.1. Glycine in water - minimal input; restart from checkpoint file
This is a simple canonical ensemble MC run of a simple solute in
water.
FILE glycine ! File names start with glycine.
! Run # = 1
NSLV 215 ! One glycine and 215 waters
TEMP 298.0 ! 298 Kelvin
SVVC SPCC 7.75 ! Solvent-solvent cutoff=7.75 Å
SUVC MICC 0.0 ! MI on the solute
PBCN FCC 14.81526 ! Face-centered cubic cell
STEP 0.15 10.0 0.55 40.0 40
! Solute and solvent stepsizes, slt move frequency
SUPT AM94 ! Solute-solvent potential is AMBER
SVPT TIP4 TIP4!Solvent-solvent potential: TIP4P
SLTA SMPL MMC READ 10 ! 10 solute atoms
35 2.131 -1.202 -0.377 -.278623
34 2.096 1.058 0.348 -.260774
33 -0.549 -1.370 0.0 -.385556
1 1.526 0.0 0.0 .295148
18 0.0 0.0 0.0 -.040730
20 -1.602 -1.374 0.044 .147624
20 -0.249 -1.903 -0.836 .158476
22 -0.351 0.514 -0.895 .061766
22 -0.381 0.536 0.868 .086565
23 3.068 -0.947 -0.337 .216103
CNFG RANC ASCI SIZE !Random initial configuration
! Run 100000 steps
RUNS 100000 10000 25000 0100000 100000 100000
! Run an other 100000 steps, starting from the end of the first run.
! Run # = 2
RUNS 100000 10000 25000 0100000 100000 100000
STOP
Click HERE to view the output of this run
Files needed for run: none
Files created by the run:
- glycine.crd: initial configuration;
- glycine.2.crd: final configuration after the first run
- glycine.3.crd: final configuration after the last run
- glycine.ckp: checkpoint file after the first run ;
- glycine.2.ckp: checkpoint file after the last run ;
Suppose we decide to extend the run for an other 100000 steps.
It can be done by reading in the checkpoint file created after the
run:
! Extend the previus run with an other 100000 steps
FILE glycine 2! Set Run # to 2
RCKP ! Restore the status from the checkpoint file
! glycine.2.ckp
RUNS 100000 10000 25000 0100000 100000 100000
STOP
Click HERE to view the output of this run
Files needed by the run:
- glycine.2.crd: starting configuration of the second run
- glycine.2.ckp: checkpoint file after the second run
Files modified by the run:
- glycine.3.crd: final configuration after the last run
- glycine.2.ckp: checkpoint file after the last run
IV.2. Ethanol in ethanol (neat liquid).
This input runs a MC simulation in liquid ethanol.
One ethanol molecule
is considered the 'solute'
FILE etnl
PRNT ECHO ! Echo formatted input
TITL Ethanol
HRDW SCLR ! Scalar machine
SVVC SPCC 10.0 ! Solvent-solvent cutoff
SUVC SPGC 10.0 ! Solute-solvent cutoff
PBCN RECT 26.0
! Unit cell is a cube with 26 Å egdes
NSLV 181 ! One solute and 181 solvents
TEMP 298.0 ! Simulation temperature: 298 K
MOVE SHCY
! Molecules are selected by the shuffled cyclic scheme
STEP 0.50 30.0 0.50 30.0 40
! Solute and solvent stepsize params
SUPT AM94 ! AMBER solute
SVPT GENL AM94 ! General solvent, AMBER library
SAMP METC ! Metroplis sampling
SLVA 9 ! Read 9 solvent atoms
18 3.108016 0.653187 -8.526237 -0.198000
22 2.814817 -0.348721 -8.761003 0.066000
22 2.517394 1.015322 -7.710808 0.066000
22 2.956134 1.278341 -9.381230 0.066000
18 4.596542 0.674197 -8.131964 0.131000
22 4.748424 0.049042 -7.276970 0.066000
22 5.187164 0.312062 -8.947392 0.066000
36 4.988388 2.013194 -7.818209 -0.566000
23 5.982338 2.026292 -7.548412 0.303000
SLTA SMPL MMC READ 9 !read 9 solute atoms
18 3.108016 0.653187 -8.526237 -0.198000 1
22 2.814817 -0.348721 -8.761003 0.066000 1
22 2.517394 1.015322 -7.710808 0.066000 1
22 2.956134 1.278341 -9.381230 0.066000 1
18 4.596542 0.674197 -8.131964 0.131000 1
22 4.748424 0.049042 -7.276970 0.066000 1
22 5.187164 0.312062 -8.947392 0.066000 1
36 4.988388 2.013194 -7.818209 -0.566000 1
23 5.982338 2.026292 -7.548412 0.303000 1
DSTC NONE ! No distribution function calc.
CNFG READ ASCI NOFX
! Read initial coordinates in ASCII from etnl.crd
RUNS 1000000 10000 500000 100000 10000
!Run 1M steps
RUNS 1000000 10000 500000 100000 10000
!The second run will start at the end of the first.
STOP
Click HERE to view the output of this run
Files needed for run:
etnl.crd: initial configuration.
Files created by the run:
- etnl.ckp: checkpoint file after the first run ;
- etnl.2.ckp: checkpoint file after the second run;
- etnl.2.crd: final configuration after the first run;
- etnl.3.crd: final configuration after the second run.
IV.3. Grand-canonical ensemble
This input runs MC simulation in the grand-canonical ensemble.
The solute is the protein BPTI
(from the PDB).
FILE bpti
PRNT DETL !Echo formatted data, print extra inf
HRDW VC32 ! 32-bit vector
SVVC SPCC 7.75 ! Solvent cutoff
SUVC MIGC 0.0 ! MI on the solute
PBCN RECT 24.87 40.39 66.2 !Rectangular PBC
MOVE RAND ! Random selections
TEMP 298
NSLV 10 !10 solvent molecules to start with
STEP 0.00 00.0 0.55 40.0 60
! slt, slv stepsize, slt move freq.
SVPT TIP4 TIP4
SUPT AM94
SAMP FBSC 0.5 ! Scaled force-biased sampling
2
4.00 0.2 7.0 1.0
PMOD AM94 1
75 27
8 1.6 0.2
SLTA SMPL MMC READ 892
N3 -2.55400 7.05800 12.80100 -0.37640 1 ARG N
.....
..... 891 more lines describing solute atoms ....
.....
DSTC NONE
GCEN CAVB RSIG ALTI
! Grand-canonical ens., cavity biased,
! LJ sigma-based cavity radii, alternating i/d
1.0 2.5 1000.0 30 30 30 1000 1 1 0000100 50000
CNFG RANC ASCI SIZE
!Random configuration of 10 (from NSLV) waters
! using the given cell size and writing ASCII file
RUNS 10000 100 25000 100000 100000 100000
STOP
Click HERE to view the output of this run
Files needed for run: none.
Files created by the run:
- bpti.ckp: checkpoint file after the run;
- bpti.crd: initial configuration.
- bpti.2.crd: final configuration after the run.
IV.4. Constant pressure ensemble run
This input runs MC simulation in the isothermal-isobaric ensemble.
The solute is the protein a Na+ - DMP- ion pair.
FILE dmpna
TITL run 2 slt molecs in water, only moves
TITL NA+DMP-: KOLLMAN-STRAATSMA(NA+)
HRDW VC32 ! 32-bit vector
SVVC SPCC 7.75 ! Solvent cutoff
SUVC MIGC 0.0 ! MI on the solute
PBCN RECT 32.031 21.354 21.354
TEMP 298
NSLV 484
STEP 0.10 00.0 0.55 40.0 05
SVPT TIP3 TIP3
SUPT AM94
SAMP FBSC 0.5 ! Scaled force-biased sampling
2
4.00 0.2 7.0 1.0
PMOD AM94 1 ! Potential library modification
49 46
11 1.59955E+00 0.478833E-01
SLTA SMPL MMC READ 8
49 2.70000 0.0 0.00000 1.00000 1
38 0.0 0.0 0.0 0.91200 4
35 0.74000 0.0 1.28171 -0.65500 4
35 0.74000 0.0 -1.28171 -0.65500 4
37 -0.99164 -1.25560 0.00000 -0.41000 4
37 -0.99164 1.25560 0.00000 -0.41000 4
4 -1.92075 1.43300 1.07251 0.10900 4
4 -1.92075 -1.43300 -1.07251 0.10900 4
IBEN UNIF ISOT 1.0 250.0 500 100 !(TPN) ensemble
CNFG READ ASCI
RUNS 2000000 50000 1000000 0100000 100000
STOP SLFT
Click HERE to view the output of this run
Files needed for run:
- dmpna.crd: initial configuration.
Files created by the run:
- dmpna.ckp: checkpoint file after the run;
- dmpna.crd: initial configuration.
- dmpna.2.crd: final configuration after the run.
IV.5. Dimethylphosphate-Sodium complex with freely moving
sodium
This input show how to allow tranlational/rotational degrees of
freedom for molecules considered 'solute' by the program.
FILE dmpna
TITL Na+DMP-: Kollman-Straatsma(Na+)
TITL SPC water. H-R PBC
HRDW VC32 ! 32-bit vector algorithm
SVVC SPCC 7.75 ! Solvent spherical cutoff
SUVC MIGC 0.0
! MI on the solute based on group center distances
SUUC MIMC ! Molec-based MI for intrasolute
PBCN RECT 32.031 21.354 21.354
! Rectangular PBC, long box
TEMP 298 | NSLV 484
!Simulation at 298 Kelvin with 485 solvent molecules
STEP 0.1 0.0 0.55 40.0 10
! 0 slt stepsize, 0.55 Å, 40 deg solvent steps
SVPT TIP3 TIP3 ! Solvent-solvent pot: TIP3P
SUPT AM94 ! Solute-solvent pot: AMBER
SAMP FBSC 0.5 ! Scaled force-biased sampling
2
4.00 0.2 7.0 1.0
MOVE PRSP NRST
! Preferential sampling based on nearest slt atom dist
3
3.00 3.5 4.0 03.0 7.0 1.0
PMOD AM94 1 ! Potential library modification
49 46
11 1.59955E+00 0.478833E-01
! Solute's "own" solvent will be used for slt-slv
SLTA SMPL MMC READ 8
! 8 solute atom - regular solute, read from input stream
38 0.0 0.0 0.0 0.91200 1
35 0.74000 0.0 1.28171 -0.65500 1
35 0.74000 0.0 -1.28171 -0.65500 1
37 -0.99164 -1.25560 0.00000 -0.41000 1
37 -0.99164 1.25560 0.00000 -0.41000 1
4 -1.92075 1.43300 1.07251 0.10900 1
4 -1.92075 -1.43300 -1.07251 0.10900 1
49 2.70000 0.0 -0.00000 1.00000 2
PARD UNIF SHCY 1.0 0.3 0.0 1.0 0.3 0.0 ~
1.0 0.3 0.0 1.0 -1 ! Last slt group is independently moved
DSTC NONE ! No distribution function calc
SLFT BASC ! Basic self tests performed
CNFG READ ASCI ! Input from dmpna.crd in ASCII
RUNS 2000000 50000 1000000 0100000 100000
! Run 2M steps
STOP
Click HERE to view the output of this run
Files needed for run:
dmpna.crd: initial configuration.
Files created by the run:
- dmpna.ckp: checkpoint file after the first run;
- dmpna.2.crd: final configuration after the run.
IV.6. Torsion angle sampling
This input show how to sample torsional degrees of freedom
for molecules considered 'solute' by the program.
FILE aa3
TITL Three amino acids in vacuum
TITL Solute molecules move, swap and torsions
HRDW VC32 ! 32-bit vector
SVVC SPCC 12.00 ! Solvent cutoff
SUVC MIGC 0.0 ! MI on the solute
SUUC MIMC ! Molec-based MI for intrasolute
PBCN HEXG 99.00 99.00 99.00 !hexagonal PBC
TEMP 998
NSLV 0 ! Only solute
STEP 0.00 00.0 0.55 40.0 10
SVPT TIP3 TIP3 ! Solvent-solvent pot: CHARMM
SUPT CHRM
SLTA SMPL MMC READ 34
NH1 1.24900 2.31500 -0.79400 -0.47000 1 ILE N
CT1 1.55500 3.73000 -0.91700 0.07000 1 ILE CA
C 1.31700 4.34000 0.45300 0.51000 1 ILE C
O 2.09800 5.17100 0.91200 -0.51000 1 ILE O
CT1 0.73100 4.43400 -2.05000 -0.09000 1 ILE CB
H 0.48500 2.01200 -1.35900 0.31000 1 ILE HN
HB 2.61200 3.83400 -1.12700 0.09000 1 ILE HA
HA -0.33600 4.20200 -1.80500 0.09000 1 ILE HB
NH1 1.49900 2.78700 3.28800 -0.47000 2 LEU N
CT1 2.55800 2.23900 4.11700 0.07000 2 LEU CA
C 3.78000 3.10300 3.86100 0.51000 2 LEU C
O 4.51100 3.44400 4.79000 -0.51000 2 LEU O
CT2 2.81700 0.74400 3.78200 -0.18000 2 LEU CB
CT1 3.92200 0.01400 4.58300 -0.09000 2 LEU CG
CT3 4.04200 -1.44600 4.12400 -0.27000 2 LEU CD1
CT3 3.68900 0.07700 6.10000 -0.27000 2 LEU CD2
H 1.16000 2.16100 2.58800 0.31000 2 LEU HN
HB 2.28700 2.35700 5.15800 0.09000 2 LEU HA
HA 1.86200 0.19700 3.95500 0.09000 2 LEU HB1
HA 3.03300 0.64900 2.69400 0.09000 2 LEU HB2
HA 4.90300 0.49900 4.36300 0.09000 2 LEU HG
HA 4.86700 -1.95800 4.66500 0.09000 2 LEU HD11
HA 3.09800 -1.99800 4.32300 0.09000 2 LEU HD12
HA 4.25600 -1.49900 3.03500 0.09000 2 LEU HD13
HA 4.49200 -0.47200 6.63700 0.09000 2 LEU HD21
HA 3.69100 1.12200 6.47100 0.09000 2 LEU HD22
HA 2.71400 -0.38500 6.36400 0.09000 2 LEU HD23
NH1 3.65600 6.57500 5.52300 -0.47000 3 GLY N
CT2 3.90500 6.38700 6.93600 -0.02000 3 GLY CA
C 5.38300 6.32500 7.16600 0.51000 3 GLY C
O 5.89600 6.89700 8.12500 -0.51000 3 GLY O
H 3.16200 5.83500 5.07000 0.31000 3 GLY HN
HB 3.47900 5.44100 7.23900 0.09000 3 GLY HA1
HB 3.51900 7.24100 7.47600 0.09000 3 GLY HA2
PRMF CHRM par_all22_prot_na.inp
!Torsion pot param file
PARD UNIF RAND 1.0 0.5 30.0 1.0 0.5 30.0 ~
1.0 0.5 30.0 1.0 34 ! Note the continuation character ~
PART UNIF RAND RAND 1.0
!Torsion strategy and frequency
TORD INPT 7 !Definiton of active torsions
5 2 180.0 30.0 1
2 3 180.0 30.0 1
10 9 180.0 30.0 2
10 11 180.0 30.0 2
14 15 180.0 30.0 3
13 14 180.0 30.0 4
14 16 180.0 30.0 4
CNFG RANC ASCI
SWAP 0.2 !Specify swapping frequency
RUNS 100000 100 50000 10000 5000
STOP SLFT
Click HERE to view the output of this run
Files needed for run:
- aa3.crd: initial configuration.
- par_all22_prot_na.inp: torsion potential parameters
Files created by the run:
- aa3.ckp: checkpoint file after the run;
- aa3.crd: initial configuration.
- aa3.2.crd: final configuration after the run.
IV.7. Potential of mean force between dimethyl phosphate and
sodium ions with
adaptive umbrella sampling
This input performs one type of
free energy simulation:
it calculates the potential of mean force between two solute
molecules using adaptive umbrella sampling.
FILE dmpna_pmf ! SPC water. H-R PBC
TITL Na+DMP-: Kollman-Straatsma(Na+),
TITL #1 4 Å range MI/RC G/A Run A
HRDW VC32 ! 32-bit vector hardware
SVVC SPCC 7.75 ! Solvent-solvent cutoff
SUVC MIGC 0.0 ! MI on the solute
PBCN RECT 32.031 21.354 21.354 !Rectangular PBC
MOVE SHCY ! Shuffled-cyclic selections
TEMP 298 | NSLV 485
STEP 0.10 0.0 0.55 40.0 10
! 0.1 Åslt stepsize, no slt rotation, 0.55 Å,
!40 deg solvent steps; solute moves at every 10th MC step
SVPT TIP3 TIP3 ! Solvent-solvent pot: TIP3P
SUPT AM94 ! Solute-solvent pot: AMBER
SAMP FBSC 0.5 ! Scaled force-biased sampling
2
4.00 0.2 7.0 1.0
FREE PMF1 WRMM GEOC AUSE 2
!2-part PMF both parts of the slt moves
0.00 0.25 0.050 100.0 0.1
2 0 3 1 0.01 17. 0.0 0.0 0.00002 0.9
4 5 5 05 2.43 0.75
PMOD AM94 1 ! Potential library modification
49 46
11 1.59955E+00 0.478833E-01
SLTA SMPL MMC READ 24 16 8
!3*8 solute atom - regular solute, last 1 atom is in the 2nd part
38 0.0 0.0 0.0 0.91200 1
35 0.74000 0.0 1.28171 -0.65500 1
35 0.74000 0.0 -1.28171 -0.65500 1
37 -0.99164 -1.25560 0.00000 -0.41000 1
37 -0.99164 1.25560 0.00000 -0.41000 1
4 -1.92075 1.43300 1.07251 0.10900 1
4 -1.92075 -1.43300 -1.07251 0.10900 1
49 2.70000 0.0 0.00000 1.00000 2
38 0.0 0.0 0.0 0.91200 1
35 0.74000 0.0 1.28171 -0.65500 1
35 0.74000 0.0 -1.28171 -0.65500 1
37 -0.99164 -1.25560 0.00000 -0.41000 1
37 -0.99164 1.25560 0.00000 -0.41000 1
4 -1.92075 1.43300 1.07251 0.10900 1
4 -1.92075 -1.43300 -1.07251 0.10900 1
49 6.70000 0.0 -0.00000 1.00000 2
38 0.0 0.0 0.0 0.91200 1
35 0.74000 0.0 1.28171 -0.65500 1
35 0.74000 0.0 -1.28171 -0.65500 1
37 -0.99164 -1.25560 0.00000 -0.41000 1
37 -0.99164 1.25560 0.00000 -0.41000 1
4 -1.92075 1.43300 1.07251 0.10900 1
4 -1.92075 -1.43300 -1.07251 0.10900 1
49 2.70000 0.0 0.00000 1.00000 2
DSTC NONE
CNFG READ ASCI NOFX
! input from dmpna_pmf.crd in ASCII
RUNS 5000000 20000 2500000 0 100000 200000
STOP
Click HERE to view the output of this run
Files needed for run:
dmpna_pmf.crd: initial configuration.
Files created by the run:
- dmpna_pmf.ckp: checkpoint file after the first run;
- dmpna_pmf.2.crd: final configuration after the run.
IV.8. Creation/annihilation polynomial path
thermodynamic integration
This is an example of a different
free energy simulation
type:
calculation the solvation excess Helmholtz free energy
of NO in water with polynomial thermodynamic integration
over a creation/annihilation path.
FILE NO_ti 10
TITL Excess Helmholtz free energy of solvation
of NO in water
TITL Calculated by three-point Gaussian
quadrature
PRNT ECHO
HRDW VC32 ! 32-bit vector
SVVC MINI ! Solvent cutoff
SUVC MIGC 0.0 ! MI on the solute
PBCN RECT 14.74 !Rectangular PBC
MOVE RAND ! Random selections
!Set creation/annihilation TI using the first quadrature point
!Lambda exponents: 4, 3, 2 (for 1/r^12, 1/r^6, 1/r terms, resp.)
FREE TICA NOMX
4.0 3.0 2.0 0.112702
TEMP 298
NSLV 108
STEP 0.00 00.0 0.40 35.0 50
SVPT TIP3 TIP3 ! Solv-solv potential is TIP3P
SUPT CHRM
!Add NO parameters
PMOD CHRM 2
150 70 3.3 N/NO
7 1.824 0.17
151 70 3.3 O/NO
8 1.751 0.159
SAMP METC 0.5
!Make sure the dummy atoms in the first copy are treated as one molecule
MOLD 2 2 4
!Solute has 4 atoms, 4 free energy atoms, and the first copy has 2 atoms
SLTA SMPL MMC READ 4 4 2 !Dummy solute
DUM -.570 000.0 0.00000 0.00000 1
DUM 0.58 000.0 0.00000 0.00000 1
150 0.57000 000.0 0.00000 0.02800 2
151 -.58000 000.0 0.00000 -0.02800 2
DSTC NONE
CNFG READ ASCI NOFX
FIXD 2500000
!Equilibration
RUNS 2000000 100000 100000 500000 100000
RMCK !Remove unneded checkpoint file
!Production run, runnumber=11
RUNS 10000000 100000 100000 500000 100000
FILE NO_ti 20
!Set lambda to the 2nd quadrature points
FREE TICA NOMX
4.0 3.0 2.0 0.500000
CNFG READ ASCI NOFX 12
RUNS 2000000 100000 100000 500000 100000
RMCK !Remove unneded checkpoint file
!Production run, runnumber=21
RUNS 10000000 100000 100000 500000 100000
FILE NO_ti 30
!Set lambda to the 3rd quadrature points
FREE TICA NOMX
4.0 3.0 2.0 0.887298
CNFG READ ASCI NOFX 22
RUNS 2000000 100000 100000 500000 100000 00000
RMCK !Remove unneded checkpoint file
!Production run, runnumber=31
RUNS 10000000 100000 100000 500000 100000
!Evaluate the quadrature from the data on the three checkpoint files
TIQU REGL ALL 3 11 10
0 001 0
STOP SLFT
Click HERE to view the output of this run
Files needed for run:
- NO_ti.10.crd: initial configuration.
Files created by the run:
- NO_ti.11.crd: configuration after equilibration
- NO_ti.11.ckp: checkpoint file with the data for the 1st
quadrature point
- NO_ti.12.crd: configuration after 1st quadrature run
- NO_ti.21.crd: configuration after equilibration
- NO_ti.22.crd: configuration after 2nd quadrature run
- NO_ti.21.ckp: checkpoint file with the data for the 2nd
quadrature point
- NO_ti.31.crd: configuration after equilibration
- NO_ti.32.crd: configuration after 3rd quadrature run
- NO_ti.31.ckp: checkpoint file with the data for the 3rd
quadrature point
IV.9. Finite difference thermodynamic integration
This is a example combines an other two
free energy simulation
types.
It calculates the Helmholtz free energy difference between ethanol
and
acetone in water, using the
finite differenece thermdynamic
integration.
The integrand at each quadrature point is
obtained by the popular perturbation method.
FILE pm 1
TITL Perturbation method test ethane ->
acetaldehyde
TITL 3-point Gaussian quadrature (probably
inadequate)
HRDW VC32
SVVC SPCC 10.0
SUVC MICC
PBCN FCC 19.57563
TEMP 298 | NSLV 500
STEP 0.10 10.0 0.55 30.0 30
SVPT TIP3 TIP3
SUPT CHRM
!Lambda=0.112702 run
FREE PMLI CCMX
0.112702 0.102702 0.122702 0.0 1.0
SLTA SMPL MMC FILE 16 16 8 16 1
CNFG READ ASCI NOFX 50
!Equilibrate
RUNS 1000000 100000 10000 1000000 100000
RMCK !Remove equilibration checkpoint file to
save disk space
!Production run
RUNS 5000000 100000 10000 1000000 100000
!New coupling parameter - lambda=0.5
FREE PMLI CCMX
0.50 0.49 0.51 0.102702 0.122702
!Restore solute to original initial and final state
SLTA SMPL MMC FILE 16 16 8 16 1
!Read starting config from disk to prepare the new
!initial and final states
CNFG READ ASCI NOFX
!Equilibrate
RUNS 1000000 100000 10000 1000000 100000
RMCK
!Production run
RUNS 5000000 100000 10000 1000000 100000
!New coupling parameter - lambda=0.887298
FREE PMLI CCMX
0.887298 0.877298 0.897298 0.49 0.51
!Restore solute to original initial and final state
SLTA SMPL MMC FILE 16 16 8 16 1
CNFG READ ASCI NOFX
!Equilibrate
RUNS 1000000 100000 10000 1000000 100000
RMCK
!Production run
RUNS 5000000 100000 10000 1000000 100000
!Evaluate the TI quadrature
TIQU 2 2
3 0 1 0 0 0
STOP SLFT FULL
Click HERE to view the output of this run
Files needed for run:
- pm.crd: initial configuration.
- pm.slt: description of the two solutes.
Files created by the run:
- pm.crd: initial configuration.
- pm.2.ckp, pm.4.ckp, pm.6.ckp: checkpoint files at the
quadrature points.
- pm.2.crd, pm.3.crd, pm.4.crd, pm.5.crd, pm.6.crd, pm.7.crd:
Coordinate files at the end of equilibration and production
runs.
IV.10. Widom insertion method
This input shows how to run a Widom insertion calculation
based on a pure water trajectory run earlier by CHARMM.
FILE methane
TITL Calculate the chemical potential of Methane in Water
TITL using Widom cavity insertion method
HRDW VC32 ! 32-bit vector
SVVC MINI 10.0
SUVC MIGC 10.0 ! MI on the solute
PBCN RECT 15.5 15.5 15.5 !Rectangular PBC
TEMP 300
NSLV 124 !no of solvent molecs in the CHARMM run
SVPT TIP4 TIP4 ! Solvent-solvent potential is TIP4P
SUPT CHRM !Solute-solvent potential is CHARMM
FREE WIDO 2.6 64 64 64 2 5
SLTA SMPL MMC READ 5 5
CT3 -0.13265 0.14073 0.03004 -0.36000 1 CSP3C1
HA 0.84914 0.15299 0.45527 0.09000 1 CSP3H11
HA -0.35123 -0.84076 -0.33574 0.09000 1 CSP3H12
HA -0.84914 0.40992 0.77775 0.09000 1 CSP3H13
HA -0.18078 0.84077 -0.77776 0.09000 1 CSP3H14
TRAJ CHRM RGFX !Open trajectory in CHARMM format
SCAN TRAJ 1000 5 00 0 500 2 1 2 3 1
STOP
Click HERE to view the output of this run
File needed for run:
- methane.hst: the CHARMM trajectory (DCD) file
Files created by the run:
- methane.ckp: checkpoint files
- methane.pxc: analysis checkpoint file
IV.11. Proximity analysis, trajectory filtering
This input show how to set up proximity analysis
of a trajectory given as a series of PDB files and
how to use this analysis to filter this trajectory file.
FILE 5ht1
TITL GCE run with fixed solute
HRDW VC32 ! 32-bit vector
SVVC SPCC 7.75 ! Solvent-solvent cutoff
SUVC MIGC 0.0 ! Group-cent based MI on the slt
PBCN RECT 70.0 75.0 58.0 !Rectangular PBC
TEMP 298
SVPT TIP3 TIP3
SUPT CHRM ! Solute-solvent potential is CHARMM
MOLD 1 3192 ! Bypass the topology code
SLTA SMPL MMC FILE 3192
!Solute description is read from 5ht1.slt in MMC format
TRAJ ALLP RGFX ALST !Open trajectory in PDB format
FILT SOLV TRAJ ALLP SHL1 RVDW 1.4 0 0 10
!filter out solvents outside the first shell
TRAJ ALLP RGFX ALST 0 1 11
!Open filtered trajectory
PXCR WDEN BISE
!select proximity criterion type
PXWR ASCI
!Create ASCII proximity information file
PXBE -100.0 2.0
!Calculate solute-solvent energy analysis
SCAN TRAJ 250000 1 5000 0 2000 1000 2 3 1
!Run analysis
FILE 5ht1 1 !Reset run number to one
TRAJ ALLP RGFX ALST 0 1 11 !Open filtered traj
FILT ENRG TRAJ ALLP -100.0 -1.0 10
!Filter solvent by the solute-solvent energy
TRAJ ALLP RGFX ALST 0 1 11
!Open filtered trajectory again
FILT ENRG TRAJ ALLP -1.0 100.0 11
!Filter solvent by solute-solvent energy using a different range
TRAJ ALLP RGFX ALST 0 1 21
!Open trajectory after 1st energy filter
PXWR NONE
DENF INSG 0 1 0
!Create single configuration aggregating all solvents
TRAJ ALLP RGFX ALST 0 1 22
!Open trajectory after 2nd energy filter
DENF INSG 0 1 0
!Create single configuration aggregating all solvents
STOP
Click HERE to view the output of this run
Files needed for run:
- 5ht1.hst: Trajectory file in PDB format
Files created by the run:
- 5ht1_11.hst: Filtered (1st shell only) trajectory file
- 5ht1.pxi: proximity information file
(see PXWR)
- 5ht1.ckp: checkpoint file
- 5ht1.pxc: proximity analysis checkpoint file
- 5ht1_21.hst: Filtered (1st energy filtering) trajectory
file
- 5ht1_22.hst: Filtered (2nd energy filtering) trajectory
file
- 5ht1_den_21.dat: Aggregated solvents (in Insight free format)
file
from 1st energy filtering
- 5ht1_2.dat: Aggregated solvents (in Insight free format) file
from 2nd energy filtering
IV.12. Conformation filtering
This example shows the elimination of waters that are not
internal to the solute according to their circular variance.
The filtering is done on a CHARMM CRD file.
FILE scednas
PRNT ECHO
SVVC SPCC 12.00
SUVC SPGC 20.0
PBCN RECT 99.00 99.00 99.00 ! Large box
NSLV 16426
SVPT TIP3 TIP3 ! Solvent-solvent potential is TIP3
SUPT ATNO ! No potential information
BRKB 32 ~
92 97 92 99 231 236 231 238 292 297 292 299 ~
323 328 323 330 404 409 404 411 593 3691 856 861 ~
856 863 1149 1154 1149 1156 1784 1789 1784 1791 1825 1830 ~
1825 1832 1882 2138 2030 2035 2030 2037 2090 2095 2090 2097 ~
2299 2304 2299 2306 2318 2323 2318 2325 2467 2472 2467 2474 ~
2792 2797 2792 2799
!Break spurious bonds that would confuse the routine establishing
!the molecular connectivities
!Alternative: MOLD 1 3697
SLTA SMPL CRD FILE 3697
CNFG READ CHRM NOFX
FILT SOLV CONF CHRM CRCV RSIG 6.0 0.5 0 11
CNFG READ CHRM NOFX
FILT SOLV CONF CHRM CRCV RSIG 6.0 0.4 0 12
STOP
Click HERE to view the output of this run
Files needed for run:
- scednas.slt: Solute portion of the CHARMM CRD file
- scednas.CRD: Full configuration (including the solute) in CHARMM CRD format
Files created by the run:
- scednas_11.CRD: Filtered conformation (keeping only internal waters)
- scednas_12.CRD: Filtered conformation (keeping only internal waters),
weakened filtering (keeping more waters)
IV.13. Generic site calculation
FILE GENSfile
SVVC SPCC 10.0 ! Solvent cutoff
SUVC MIGC 0.0 ! MI on the solute
PBCN RECT 56.0011 67.5172 81.0524 !Rectangular PBC
SVPT TIP3 TIP3
SUPT CHRM
SLTA SMPL MMC FILE 5402 0 0 0 1
TRAJ ALLH FLEX ALST
GENS CALC FIXI GETH SPDB FSRT ~
125 5000000 1 10000 6.5 0.01 0.10 6.0 0.2 100 10
!Number of sites requested (initially): 500
!Number of simulation steps to read: 5000000
!Use every 1-th config (counted in MC stepnumbers)
!Discard the first 10000 MC steps
!For the matching, consider only sites within 6.5 Å
!Convergence threshold for the Hungarian method matching: 0.01
!Convergence threshold for the RMSD between iterations: 0.1
!Add a new site if a match is farther than 6.0 Å
!Previous site estimate will be mixed in with a factor of 0.2
!Maximum number of iterations over the trajectory: 199
!Minimum number of solvents in a configuration to be of use: 10
STOP NOSF
Click HERE to view the output of this run
Files needed for run:
- GENSfile.slt: Solute portion of the CHARMM CRD file
- GENSfile.CRD: Full configuration (including the solute)
in CHARMM CRD format
- GENSfile.hst: Simulation trajectory in CHARMM CRD format
Files created by the run:
- GENSfile_gs.pdb: Solute plus the generic sites in PDB format
- GENSfile_gs_cmp.pdb: Solute plus a composite of solvents
that are nearest to sites, in PDB format
- GENSfile_gs_cmp.rep: Solute plus a the most representative configuration
from the simulation history, in PDB format
IV.14. Primary hydration shell calculation
FILE alaval
TITL compacted alaval
TITL PHS, 2 shells
HRDW VC32
NSLV 190 !190 waters
TEMP 298.0
SUUC SPGC 21.00
SVVC SPCC 21.00
SUVC SPGC 21.00
PBCN PHS RSIG UPML 0.15 3.0 8.0 1.0 21.0 0.997 100 1000.0 3.
!PHS boundary conditions
STEP 0.00 0.0 0.40 30.0 20
SUPT CHRM
SVPT TIP3 TIP3
MIXR ARGE
SLTA SMPL MMC FILE 56
SAMP METC MOVE SHCY !Plain Metropolis moves
PART UNIF SHCY CYCI 0.1 !Solute torsions active
PRMF CHRM par_all22_prot_na.inp
TORD ALL SING 30.0 ! All torsions moved with same stepsize
CNFG READ ASCI
RUNS 200000 5000 100000 90000 9000000 90000000 1
STOP
Click HERE to view the output of this run
Files needed for run:
- alaval.slt: Solute description file
- alaval.crd: Coordinate file of the initial structure
- par_all22_prot_na.inp: Charmm parameter file for torsion potential
description
File created by the run:
- alaval.ckp: Checkpoint file
IV.15. Cavity and pocket determination
FILE f8
PRNT ECHO
TITL 1tf_8_13
NSLV 0
TEMP 298.0
SVVC SPCC 9.05 ! Solvent cutoff
SUVC SPCC 12.0
PBCN RECT 70.0 70.0 70.0
STEP 0.00 00.0 0.55 40.0 30
SUPT CHRM
SAMP METC 0.5 ! Scaled force-biased sampling
MOVE RAND
SVPT TIP3 TIP3 ! Solvent-solvent potential is TIP4P
SLTA SMPL PDB FILE 3928
GCEN CAVB RSIG ! Grand-canonical ensemble (establishes grid)
2.0 2.5 1000.0 250 250 250 1000 1 1 010000 20000
CNFG READ PDB
PRTG PDB ALLG AGLS 0 0 0.8 10.0 0.2 1.0 3.3 0.9 0.5 10.0
STOP NOSF
Click HERE to view the output of this run
Files needed for run:
- f8.pdb: Solute description file
Files created by the run:
- f8_2.pdb: PDB file of solute plus cavity grid coordinates
- f8_3.pdb: PDB file of solute plus pocket grid coordinates
V. BRIEF DESCRIPTION OF THE OUTPUT.
Since the output of the program is largely self-explanatory,
only a relatively brief description is given.
The output starts with information about the program: self
identification, dimensions used - important for compiling other
programs that read the interrupt file of the program - the dates
the program and the common blocks were modified and the date of
the run.
By default, the command lines are echoed verbatim.
The command PRNT ECHO
allows for direct echoing of the formatted data read as well.
At the start of each run the the options selected are
explained, numbers read are printed with their interpretation and
unit assumed.
These explanations are preceded by the keyword that was the
source of the information, allowing the easy interpretation
of the messages.
The solute description consists of three parts:
- List of atoms (before cloning)
and their properties (see below)
- Summary of the properties of residues (charge, radius)
- Summary of the properties of molecules (charge, radius, number of rings)
Each atom record consists of the following items
(several of these may be blank):
- Sequence number
- Chemical symbol
- Hydrogen-bond donor/acceptor label:
- D1: >NH donor
- D2: -OH donor
- A1: >C=O acceptor
- A2: R-O-R acceptor
- Potenial code (label and number)
- Potential library (see key SUPT)
- Atom type label (p: polar; a: aromatic) used for desolvation energy
calculation (see key SVIN)
- Functional group type
- Marker for group (residue) center: G
- Marker for molecule (segment) center: M
- X, Y, Z coordinates in a representative conformation
- Partial charge
- Lennard-Jones epsilon
- Lennard-Jones sigma
- Molecule number
- Group number
- Mobility marker
- FIX: atom is not moved during the simulation
- MOV: atom is moved only by whole solute molecule translation/rotation
(see key PARD)
- TOR: atom is moved by torsion angle change
(see key PART)
- INV: Atom stays unchanged between the two topologies
during thermodynamic integration
(see key FREE TICA)
- SIM: Atom stays nearly unchanged between the two topologies
during thermodynamic integration
(see key FREE TICA)
- Residue name
- Atom name
- First shell radius for proximity analysis
(see key PXCR) or solute atom volume
(see key SVIN)
- Proximity group index or
solute atom surface
(see key SVIN)
- User-defined functional group name
(see key FCGD)
- User-defined functional group label
(see key FCGD)
A very important feature of the program is the automatic
self test on continuing a previously interrupted calculation
(RCKP) as well as at every 2.5 millionth
Monte Carlo step. It can also be called more frequently with the
help of
SLFT. If the
self test failed a message "Discrepancy found" appears and the
program stops (unless RCKP IGND or
RCKP FIXD was used).
Discrepancies may arise when the checkpoint file (.ckp)
is corrupted
or in case of a program error.
If the latter is the case,
contact the author of the program.
A relatively harmless error can be
an accumulation of round-off errors to unacceptable levels.
Such errors can be prevented using the
FIXD FIXD keys.
The initial total energy and its components, and virial
sum header precedes the periodic output (at each nmcrep-th
step) during the simulation. This periodic output (one to three lines
each) contains the stepnumber (N) the energy of the currently
accepted configuration (E), the cumulative total energy average
(<E>),
the minimum and maximum energy value sampled
(Emn and Emx),
with the configuration number/1000 or /1000000
where it occurred after them
in parenthesis, the solute binding energy (Us) the
acceptance rate (a) of the current move type
(for solute molecule move or torsion the acceptance rate printed
is for the solute molecule or torsion angle sampled in this step)
and the index of the molecule selected for move
(I) and the move type coded as follows.
- MOV: solvent or whole solute displacement
- CPL: coupling parameter change
- PSM: solute molecule displacement
- TOR: solute torsion change
- LP*: solute torsion change with loop move (* is replaced by
the loop type)
- SWC: solute molecule swap
- 2ST: 2-solute molecule correlated rotation
- SPC: special move (if implemented)
For torsion changes the torsion number is also printed.
For torsion on cloned solute, the torsion number corresponding to the
original torsion list is also given.
For molecule swap, the second molecule's index is also printed.
The line is ended by the symbol A or R
for accepted and rejected moves, resp.
The second line gives the solute intramolecular energy terms
when the solute is not rigid.
An additional line gives grand-canonical ensemble
information: current and average number of solvent molecules, smallest
and largest number of solvent molecules sampled (Nmn, Nmx), the
acceptance rate of the insertion/deletion steps (id acc), the
number of uncovered gridpoints (Ngf) and the probability of
finding a cavity (pcav).
Periodically (at each nmcplt-th step), and at the end of
each run a complete report of the averages and distributions is
computed and printed. This report starts with the thermodynamic
averages: energy, energy square, standard deviation and range for
energy, total virial sum, solute virial sum and the pressure.This
is followed by the error estimates using the method of batch
means for the total energy or pressure. The error estimates are
given as two standard deviations - corresponding to ca 95%
confidence intervals. Since he method of batch means assumes that
the actual batch means are statistically independent the program
performs a test for independence - the so-called run test. At the
and of this section there are two tables containing the critical
values for the run test: for independence, the number of runs
found should be between the two numbers corresponding to the
number of "ups" (n1) and "downs" (n2) in the two tables. The
program prints CORRELATED or UNCORRELATED when the run test was
failed or passed, respectively and print ??? or >>> when
the number of "ups" or "downs" were either too low or too high for
the table. The test is repeated with double the blocksize until
the actual run lengths prevent further doubling.
During the run, various informations are gathered relevant
to the convergence of the calculation: Number of times the solute
is attempted to move, accepted to move; average translational and
rotational displacements; the minimum, maximum and average
correlations between the initial and final orientations; the
total displacement of the system (square root of the molecular
displacement square sums), information on the average molecular
displacements, the correlation between two successive moves of a
molecule (averaged over all moves and all molecules) and
convergence indicators involving average molecular displacement
square, acceptance ratio and correlation between successive
moves; if preferential sampling was used
(MOVE PRS*) the relative
frequencies the molecules were attempted to move (perturbed),
acceptance rates for individual molecules (useful to detect
"stuck" molecules).
For the various free energy options the distributions and
free energy values calculated are printed here. Also,
calculations involving the perturbation method print the solute
binding energy estimate on the two limiting state ensembles to
be used for convergence test and the range of energy values
that went into the exponential averages - for reliability these
ranges should not exceed cca 10 kcal/mol.
For adaptive umbrella sampling, the range covered by each
iteration is plotted with letters U and D indicating that the
probability distribution is increasing or decreasing. In the plot
each line corresponds to an iteration and the position of the U's
and D's refer to the coupling-parameter range sampled.
The program calculates first order quantum corrections to
the intermolecular free energy with the method of
Powles and Rickayzen.
This calculation is based on the computed
configurational average of the force and torque components and on
the mass and moment of inertia of the molecule involved. The
solute contribution to the quantum correction is also computed and
printed, along with the computed force and torque component
squares, molecular mass and moments of inertia.
The various distribution functions are printed only
optionally and they are self-explanatory. Each distribution is
followed by a summary locating the extrema and other pertinent
characteristics.
Two more pieces of information relevant to the convergence
of the run are given near the end of the output: the closest
solvent-solvent distance and the number of steps elapsed during
which all molecules were moved at least once.
Proximity analysis results are printed by the atoms on which
the proximity analysis was requested.
The output may consist of several parts (depending on key
PXPR and, of course, on the keys
invoking the calculation of the various propeties).
- Listing of the calulated distribution functions,
followed by the position and value of their extrema.
- Tables of various calculated averages,
listed both in the original order of solute atoms and grouped by
residues
and listed by functional groups (either chemical or user-defined
- see key FCGD)
The table entries for each solute atom (one line) are as follows:
- Atom numbers in the current order
- Atom numbers in the original order
- Residue number
- Atom name (read as labslt(1) with
key SLTA)
- Residue name
(read as labslt(2) with
key SLTA)
- Chemical symbol
First shell properties:
- First shell radius, rfs
(specified by key RFSL)
in Å
- Volume of the first shell, vfs, calculated as the sum of
volume elements for shells within rfs
in Å3
(see key GENV)
- The volume of the first two shells, v2fs,
defined as the volume of the proximity region extending to a
distance
rfs+solvdia, where solvdia is the L-J sigma of
the
islvrep-th (see key SLTA)
solvent atom
in Å3
- First shell coordination number, <K>, calculated
as the
average number of solvents in the proximity region within
rfs
- First shell density, <K>/vfs, converted to
g/ml
- First two shell coordination number, <2K>,
calculated as the
average number of solvents in the proximity region within
rfs
solvdia
- First shell binding energy, <sltbe>, the sum of
interactions
of solvents in the first shell with the (whole) solute
(see key PXBE)
- First shell pair energy, <sltpe>, calculated as
<sltbe>/<K>
Total primary region properties:
- Total coordination number, <K>, calculated as the
total number of solvents in the proximity region
- Total binding energy, <sltbe>, the sum of
interactions
of all solvents in the proximity region
First shell solvent-solvent properties:
- Coordination number, <Kw> - see key
PXWW
- Pair energy, <nnwwpe> - see key
PXWW
- Binding energy, <bewwe> - see key
PXWW
- Tables of the calculated extrema of the proximity g(r),
total g(r), and mean dipole
(see key PXDP) function.
The output concludes with averages of the total energy and
solute binding energy over the run, as well as the coupling
parameter range sampled in the run, if the variable coupling
parameter option was used.
Throughout the output, lines with periods are used to
delineate certain segments of the output.
Warning messages (indicating potential problem with the input)
are preceded by 5 dashes,
strong warnings (likely, but not certain problems)
are preceded by 5 equal signs,
and error messages are preceded by 5 asterisks.
Messages announcing some
specific action (not taken all the time) are preceded by 5 plus
signs.
Messages announcing that an input setting is overriden by the
program are preceded with five 'less-than' signs.
Before stopping the program prints the run time and the number of
overrides, warnings, strong warnings and errors encountered.
It is always a good practice to find
these messages in the output file
to make sure the program was running as intended.
Minimum critical values for the run test:
n2: 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
n1 ----------------------------------------------------------
2 | 2 2 2 2 2 2 2 2 2 | 2
3 | 2 2 2 2 2 2 2 2 2 3 3 3 3 3 3 | 3
4 | 2 2 2 3 3 3 3 3 3 3 3 4 4 4 4 4 | 4
5 | 2 2 3 3 3 3 3 4 4 4 4 4 4 4 5 5 5 | 5
6 | 2 2 3 3 3 3 4 4 4 4 5 5 5 5 5 5 6 6 | 6
7 | 2 2 3 3 3 4 4 5 5 5 5 5 6 6 6 6 6 6 | 7
8 | 2 3 3 3 4 4 5 5 5 6 6 6 6 6 7 7 7 7 | 8
9 | 2 3 3 4 4 5 5 5 6 6 6 7 7 7 7 8 8 8 | 9
10 | 2 3 3 4 5 5 5 6 6 7 7 7 7 8 8 8 8 9 | 10
11 | 2 3 4 4 5 5 6 6 7 7 7 8 8 8 9 9 9 9 | 11
12 | 2 2 3 4 4 5 6 6 7 7 7 8 8 8 9 9 9 10 10 | 12
13 | 2 2 3 4 5 5 6 6 7 7 8 8 9 9 9 10 10 10 10 | 13
14 | 2 2 3 4 5 5 6 7 7 8 8 9 9 9 10 10 10 11 11 | 14
15 | 2 3 3 4 5 6 6 7 7 8 8 9 9 10 10 11 11 11 12 | 15
16 | 2 3 4 4 5 6 6 7 8 8 9 9 10 10 11 11 11 12 12 | 16
17 | 2 3 4 4 5 6 7 7 8 9 9 10 10 11 11 11 12 12 13 | 17
18 | 2 3 4 5 5 6 7 8 8 9 9 10 10 11 11 12 12 13 13 | 18
19 | 2 3 4 5 6 6 7 8 8 9 10 10 11 11 12 12 13 13 13 | 19
20 | 2 3 4 5 6 6 7 8 9 9 10 10 11 12 12 13 13 13 14 | 20
----------------------------------------------------------
2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Maximum critical values for the run test:
n2: 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
n1 ----------------------------------------------------------
2 | | 2
3 | | 3
4 | 9 9 | 4
5 | 9 10 10 11 11 | 5
6 | 9 10 11 12 12 13 13 13 13 | 6
7 | 11 12 12 12 14 14 14 14 15 15 15 | 7
8 | 11 12 13 13 14 15 15 16 16 16 16 17 17 17 17 17 | 8
9 | 13 14 14 15 16 16 16 17 17 18 18 18 18 18 18 | 9
10 | 13 14 15 16 16 17 17 18 18 18 19 19 19 20 20 | 10
11 | 13 14 15 16 17 17 18 19 19 19 20 20 20 21 21 | 11
12 | 13 14 16 16 17 18 19 19 20 20 21 21 21 22 22 | 12
13 | 15 16 17 18 19 19 20 20 21 21 22 22 23 23 | 13
14 | 15 16 17 18 19 20 20 21 22 23 23 23 23 24 | 14
15 | 15 16 18 18 19 20 21 22 22 23 23 24 24 25 | 15
16 | 17 18 19 29 21 21 22 23 23 24 25 25 25 | 16
17 | 17 18 19 20 21 22 23 23 24 25 25 26 26 | 17
18 | 17 18 19 20 21 22 23 24 25 25 26 26 27 | 18
19 | 17 18 29 21 22 23 23 24 25 26 26 27 27 | 19
20 | 17 18 20 21 22 23 25 25 26 26 27 27 28 | 20
----------------------------------------------------------
2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
VI. COMPILATION OF THE PROGRAM.
VI.1. Preprocessor
The program source file mmc.for contains a number of special
symbols allowing the customization of the program.
To produce a valid Fortran source code mmc.f
a preprocessor (also a Fortran program) pre.f
is provided.
The preprocessor performs two functions:
- It replaces the special symbols in mmc.for
related to the size of the problem (dimension of arrays, etc.);
quantities that depend on other values
(e.g., the maximum number of atoms)
are calculated by the preprocessor to ensure consistency.
- It selectively activates segments of code depending on
the computational environment (e.g., timing routines),
and/or features planned to be used (to save computation time
by eliminating operations superfluous in certain context).
If the generated mmc.f is inadequate for a particular input,
mmc will abort the run and give specific instructions what to change.
The preprocessor has to be compiled and executed
(after which you can compile the program):
- To compile, type f77 -o pre.bin pre.f
(replace f77 with the command invoking the Fortran compiler on
your system, if different)
- To run, type pre.bin,
followed by three <ENTER>'s to accept the default
filename mmc, default choices and values
and to give only a brief list of the options used.
The prepocessor as provided has default choices and values that
will create a valid source file. There are three ways to change the defaults:
- Running the preprocessor interactively
(by responding with 'y' to the second prompt of pre.f)
- Editing the source file to change the defaults.
This involves
- Changing the initialization of selected variables
named iopdef*** (specifying the activation or inactivation of
selected functionalities) from 'T' to 'F' (or vice versa)
- Changing the initialization of
selected variables id
(specifying the size of arrays corresponding to
the selection) to a new value.
The notation refers to the the name assigned to each
size-related symbol. The names and symbols used are lietse below.
The two quantities most likely to be modified are
maxmol and maxslt.
- If there is a running version of MMC the key
PRCO SAVE can print a data statement that
- copied to the end of pre.f - will initialize the defaults to be the same
as those of the running version.
Instructions for changing the source code of pre.f
are at the beginning of that file.
The part of the prepocessor that may be changed by the user is between the
comment lines 'START Customization' and 'END Customization'.
In the first segment, calls to the subroutine setopt
select functionalities to enable/disable.
In the second (longer) segment,
calls to the subroutine setdim set maximum values for the various
aspects of the system size. Some array sizes are generated automatically
depending on the code segments activated.
The following list gives the special symbols related to array sizes
appearing in mmc.for with the names used in the preprocessor
and the quantity they limit.
- #MO maxmol: Solvent molecules +1
(has to be even) (see the key NSLV)
- #MA maxmolslt: Number of atoms per single
solute molecule
- #SX mxpxslt: Number solute atoms for proximity
analysis
- #MM maxsltmol: Number of molecules in the
solute.
- #UW maxwnnu: Bit map for the neighbors of a
solute atom
- #TN maxnst: Solute nuclear centers
- #VN maxnsv: Solvent nuclear centers
- #TE maxest: Solute electron centers
- #VE maxesv: Solvent electron centers
- #VW maxwnnv: Bit map for the neighbors of a molecule
(maxwnnv/(Number of integer word bits -1) + 1.
The integer word size is 32 on most machines,
46 on the CRAY).
- #ST maxslt: Solute centers (see the key
SLTA)
- #GR maxgslt: Solute groups
- #TA maxtslt: Torsion-affected atoms' list
- #SV maxslv: Solvent centers
- #SV maxslv4: maxslv-4
- #VT maxss: Greater of #SV and #ST
- #NA maxat: Atoms
- #TL maxtrgrgr: Torsion-affected group' list
- #GT maxstg: Solute centers for general
solvent case
- #GV maxsvg: Solvent centers for general
solvent case
- #DT maxsst: Solute centers for parameter
derivative
(see key SENS)
and energy decomposition
(see key ENGL)
calc.
- #DM maxmst: Molecules for parameter
derivative calculation
- #RG maxgrid: Radial distribution function
gridpoints (< 505 !)
- #PG maxpfgr: Preferential weight list length
(RMAX/RI)+1
(RI from RECTYPE 32 - see key DSTC)
- #WG maxcggr: US weighing function gridpoints
- #OR maxorgr: Number of grid points in the
energy distribution function for overlap ratio method calculations
- #GX maxxgr: Number of x cavity gridpoints
(<128, even)
- #GY maxygr: Number of y cavity gridpoints
(<128, even)
- #GZ maxzgr: Number of z cavity gridpoints
(>31, <128, even)
- #CV maxcav: Number of 'free' gridpoints for
cavity-biased (if less then #GX*#GY*#GZ
then some test calculations can't be done)
- #W2 maxlin: Work space for n-step optimized
matchings
- #WS maxausp: Storage space size in adaptive US
- #WI maxauit: Number of adaptive US iterations + 1
- #MI maxavit: Number of block average entries
- #TR maxtors: Torsion list
- #AT maxatyp: Number of different atomtypes
used
- #UU maxatypu: Maximum no of atom types in a
given solute
- #UV maxstmol: Greater of #MO and #ST
- #TG maxtgrid: Number of total g(r) grid points
- #VG maxwrgrid: Number of grids for
solvent-solvent g(r)
- #ND maxgvv: Number of solvent-solvent g(r)'s
- #DG maxdrgrid: Number of radial grids for
dipole correl. QCDF
- #LG maxdagrid: Number of angular grids for
dipole correl. QCDF
- #GE maxpegrid: Number of energy grids for
solute-solvent PE QCDF
- #GQ mxpxgslt: Number different RDF's and
QCDF's
- #PP maxcavps: Number of 'free' gridpoints for
cavity-biased
- #PS maxpfsum: Preferential weight partial sum
list length
- #WM maxmatch: Number of AUS iterations to
match.
- #TD maxtgrid: Number of torsion angle
distribution grids
- #FE mxfeslt: Number of free-energy solute
atoms (all copies)
- #MH maxhunmat: Number of sites for generic
solvation site calculations
- #NH maxhmneig: Number of neighbors for
establising a complete match
- #LT mxtorloop: Number of torsion loops
- #LS maxloopslt:When torsion loops moves are active,
maxslt otherwise 1
- #MD mxdiffmol: Number of solvent molecules+1 for
diffusion constant calculations
- #MR mxrescmol: Number of solvent molecules+1 for
residence time calculations
- #DC mxdiffcr: Number of structures to use
in diffusion calc
- #RC mxrescr: Number of structures to use
in residence time calc
- #MW mxwidslt: Number of Widom solute molecules
- #MS mxphsmol: Number of solvent molecules+1 for
- #MG maxmolfg: Number of solvent molecules+1 to
calculate field gradients for
PHS calculations
- #HA maxath: Number of atoms for representing
all sites (maxslt+maxslv*maxhunsite)
- #GM maxmapgrid: Number of gridpoints in the energy map
- #GH maxhbgrid: Number of gridpoints in the hydrogen-bond map
- #NE maxmap: Number of atoms to save for minimum energy
configuration
- #AU maxtorauc: Number of angles to save for torsion angle
autocorrelation calculation
- #GC maxgrdclst: The larger of the number of grid clusters fo
cavity/pocket calculations and of the number of sites used for the various site
analysis (key GSAN) calculations
- #RN maxrandinp: The maximum number of random numbers
that can be read
- #NL maxnnlist: The maximum number of 1st + 2nd + 3rd neighbo
Since the increased memory usage taxes the
computer resources (both memory and disk-space),
it is usually worth recompiling the program
separately for significantly different system sizes
(see the key PRCO SVMN for
preparing a minimum-sized executable).
Remember,
however, that the checkpoint file (.ckp) is only readable by
a program that was compiled with the same dimensioning!
Code segment that can be selectively activated contains lines with
the comment C@**.
The selection and removal of such lines is controlled by a number of
character variables in pre.f that can be set during the customization
of the preprocessor. Except for the variable SYSTEM, the value is
either 'T' or 'F'.
The following variables can be set:
- SYSTEM {'IRIX'} - Choices:
- 'IRIX' - C@UG (SGI Irix)
- 'AIX ' - C@UR (IBM RS600)
- 'HP ' - C@HP (Hewlett-Packard)
- 'MPDM' - C@DM (MPI - Distributed Memory),
- 'G77 ' - C@G7 (Gnu Fortran-77),
- 'G95 ' - C@G9 (Gnu Fortran-95),
- 'EFC ' - C@EF (Intel Fortran),
- 'ABSF' - C@EF (Absoft Fortran90),
- 'UNIX' - C@UX (Generic Unix)
- CB {'T'} - C@CB:
Cavity-biased (T,V,mu) ensemble code is active (inactive when 'F')
- see key GCEN
- IB {'T'} - C@IB: (T,P,N) ensemble code is
active
- see key IBEN
- FE {'T'} - #FE : Free-energy simulation arrays are
active
- see key FREE
- MS {'T'}: Any intersolute move is
active - see key PARD
or PART
- TR {'T'} - #TR : Torsion move arrays are
active
- see key PART
- LO {'T'} - #LT : Torsion loop move arrays are
active - see key LOOP
- PX {'T'} - #SX : Proximity analysis code is
active - see key PXCR
- FG {'F'} - C@FG: Field-gradient calculation code is
inactive - see keys PRTG
- PG {'F'} - C@PG: Cavity grid analysis code is
inactive - see keys PRTG
or STVG
- HU {'F'} - #MH : Generic solvation site code is
inactive - see key GENS
- RF {'F'} - C@RF: Reaction field correction code is
inactive
- DD {'F'} - C@DD: Distance-dependent dielectric
code is inactive
- DD {'F'} - C@1R: eps(r)=r dielectric
code is inactive
- PH {'F'} - #MS : Primary hydration shell
(PHS ) code is inactive (active when 'T')
- see key RFCR
- ED {'F'} - #DT : Energy decomposition
(ENGL) key arrays are inactive (active when 'T')
- DM {'F'} - C@DM : MPI communication library
calls are inactive (active when 'T') - for now only a very limited
set of functions are available.
- DR {'F'} - #MD :
Diffusion constant and residence time calculation code is inactive
(active when 'T') - see keys DIFC
or RTIM
- FR {'T'} - C@FR: Solvent force calculation code is
active (inactive when 'F') (used for force-biasing and pressure
calculation) - see key SAMP.
Calculation of forces can be inhibited
to save time when only proximity analysis is
run or when force-biasing is not used and the pressure is of no
interest.
- TS {'T'} - C@TS: Solute torque calculation code is
active (inactive when 'F') (for whole solute rot)
If the solute is not going to be rotated or no force bias
is applied on the solute (NFBU) then
the solute torque calculation is superfluous
and can be left out, resulting in some savings (significant for
large solutes) in computer time.
- NN {'T'} - C@NN: Solvent-solvent near-neighbor map is
active
Generally, the near-neighbor code has to be left out is when (1)
the solvent-solvent cutoff is close to the inside sphere radius
of the central cell, or (2) minimum image convention is used for
solvent-solvent solvent interactions
or (3) there are no or few solvent molecules.
- TN {'T'} - C@TN: Solute-solvent near-neighbor map is
active
It can be set to 'F' when the solute is completely rigid
(i.e., neither PARD
nor PART) are active).
- NA {'T'} - C@NA: Near-neighbor map arithmetic code
(logical code when 'F') is active.
The bitmap operations used both for the near-neighbour table
and for keeping track of intramolecular interactions to be used
can be performed either by logical or arithmetis operations.
The relative efficiency (when there is a choice) varies
from machine to machine.
Also, the code involving logical operations uses equivalence statements
that are planned to be phased out in futore Fortran versions.
Furthermore, using logical operations works only if the compiler performs
the logical operation on all bits.
If it is not the case (e.g., the Silicon Graphics),
NA should be set to 'T'.
- VC {'T'} - C@VC: Vectorizable (scalar when F)
routine call code is active
- I2 {'F'} - C@I2:
Integer*2 array use for cavity grids is inactive
- see key GCEN.
For grand-canonical ensemble calculations requiring a
relatively small number of free grids
(< 215-1 ~ 30,000) but a
large number of gridpoints setting to 'T' can
save a few megabytes of memory.
- PS {'F'} - C@PS:
Code amenable to autoparallelization
is active (so far, if failed to yield speedup).
- QP {'F'} - #TN : QPEN/EPEN potential arrays are
inactive.
Activating some of the codes requires deactivating a complementary code.
Such code pairs are:
C@NL - C@NA
C@VC - C@NC
C@DM - C@ND
C@1R - C@NR
C@RF - C@1R
The preprocessor takes care of it automatically.
Besides the activating and inactivating lines prefixed with C@**,
the preprocessor provides the option to change all
double precision (real*8) variables to quadruple precision variables (real*16).
This option can be useful for calculations using the
LOOP key.
Note, however, that not all compilers support
quadruple precision variables.
The defaults for variables FG, PG, HU, and DR
are set to 'F' because the corresponding code requires
large arrays and RF is set to 'F'
because it adds code to the energy calculations.
The key PRCO produces an annotated list
of the values used for the currently executing version as well as
of the code segment selection choices made.
A quick listing on the terminal can be obtained by typing
mmc.bin, followed by the command
PRCO
at the MMC> prompt (followed by STOP
at the next prompt for a graceful exit).
If a compilation is needed to match a given checkpoint file
the command PRCO SAVE
will read the information of the current run's checkpoint file
and write Fortran data statements to be inserted into
pre.f
VI.2. Compilation
Once the mmc.f file is prepared it should be compiled
with the appropriate Fortran compilation
command, to produce an executable, say mmc.bin, e.g.,
f77 -o mmc.bin mmc.f.
Compiler options:
- For test runs use the -C
(to inhibit optimization and enable index boundary check)
- For production runs, high(est?) level optimization should be invoked,
e.g., -Ofast (SGI); -O4 (Compaq Alpha)
- For compatibility of binary trajectories note that the default
byte order on Linux and SGI is "big endian".
if your compiler's deafault
is "little endian" you can compile an executable with the
Linux/SGI byte order using the added directive "-byteswapio" on Linux .
- For runs requiring the MPI communication library, the compilation should
include linking instructions to the library and running the program
generally requires the use of a system-dependent script
- consult your system manager for the apropriate syntax.
Possible problems:
For some systems, the source code is too big to compile it in one step.
In this case, the program splitmmc.f (part of the distribution)
should be compiled and run.
This will result in five files: mmc1.f, ..., mmc5.f.
Each one should be compiled separately with the -c option, i.e.,
> f77 -c mmc1.f
followed by
> f77 -o mmc.bin mmc*.o
Some compilers fail due to a so-called 'relocation error' when optimizing
at levels higher than one is asked.
When using the Intel Fortran compiler (ifort), adding the compiler directives
-mcmodel=medium -share_intel
solved the problem. With some of the other compiler (but not the GNU compiler)
the compilation key
-fpic was found to solve the problem.
VI.3. Parallelization
There are a number of limitations in the parallelized version of MMC.
In particular, it is extremely inefficient (due to latency issues)
on distributed memory systems
so it should be run either on shared-memory systems, or
limit the number of CPU's to the nmumber of procesors on a single node
and make sure the the queuing system places each job on the same node.
The following has been parallelized: solvent-solvent energy calculations;
solute-solvent energy calculations; Widom-insertion free-energy calculations,
and grand-canonical ensemble (GCE) simulations.
However, force-calculations have to be disabled.
Also, use any solute move (keys
PARD or PART)
would reduce the parallel efficiency.
VIII. TESTING THE PROGRAM
A test suite has been compiled to allow a systematic check on the
integrity of the program.
The tests are located in the directory testsuite.
There is a c-shell script test.csh.
It runs all the tests and examines the output.
For each test, it prints different messages
if there were input errors, inconcistency errors or no errors.
For runs with special solute moves, it also prints the acceptance
rates.
The program is considered passing the test if there are no errors
and the acceptance rates printed are all positive.
Furthermore, if there is a subdirectory oldout containing
reference output files, the new output will be compared to the
old one. Differences should only show in CPU times and dates.
Clearly, this is only meaningful if the reference output was
generated by the same compiler and compilation options.
The table below gives a summary of the features tested by each test
system.
Nmolslt Nwat PARD PART SWAP TICA PRM PRM GCEN IBEN SUUCm CLON PMLI PROX WIDO
Ft Dr
1 1 215
2 1 215
3 3 0 x x x
4 7 0 x x
5 2 15 x
6 2 484 x x
7 1 0 x
8 4 484 x
9 6 484 x
10 1 9 x
11 6 0 x x x x
12 6 0 x x
13 2 0 x
14 2 0 x x
15 2 484 x x
16 3 484 x x
17 1 4 x
18 3 484 x
19 3 693 x
20 1 x
21 2 50 x x
22 2 50 x x
23 2 50 x
24 2 50 x x x
25 3 0 x x x x
26 3 484 x x x
27 3 500 x
28 2 484 x
29 4 215 x x x
30 1 0 x x
31 1 190 x
32 1 107 x
Nmlslt Nwat PARD PART SWAP TICA PRM PRM GCEN IBEN SUUCm CLON PMLI PROX WIDO
Ft Dr
IX. REFERENCES
Grand-canonical ensemble formalism:
D.J. Adams, Mol. Phys, 29, 307 (1975).
QCDF formalism:
A. Ben-Naim, Water and aqueous
solutions, Plenum, New York (1974).
Overlap ratio method:
Jacucci and N. Quirke, Mol. Phys., 40,
1005 (1980)
EPEN/QPEN potential:
F. Marchese, P.K. Mehrotra and D.L.
Beveridge, J. Phys. Chem., 85, 1 (1981).
Proximity criterion formalism:
P.K. Mehrotra and D.L. Beveridge, J. Am.
Chem. Soc., 102,4287 (1980).
Original description of the Metropolis method:
N. Metropolis et al., J. Chem. Phys.,
21, 1087 (1953),
Shuffled cyclic selection of molecules to be
moved:
M. Mezei, J. Comp. Phys., 39, 128 (1981)
Click HERE
to download a copy
Cavity-biased insertion - original formalism:
M. Mezei, Mol. Phys., 47, 1307 (1982)
Virial-biased volume change in the (T,P,N)
ensemble:
M. Mezei, Mol. Phys., 48, 1075 (1983)
Description of self tests:
par_all27_prot_na_crb.acly.crbn.prmM. Mezei, CCP5 Newsletter, Daresbury
Laboratory, No. 23, (1986)
Click HERE
to download a copy
Free energy methodology review:
M. Mezei and D.L. Beveridge, Ann. Acad.
NY Sci., 482, 1, (1986).
Click HERE
to download a copy
Cavity-biased insertion - grid-based formalism:
M. Mezei, Mol. Phys., 61, 565-582
(1987); Erratum, 67,1207-1208 (1989).
Click HERE
to download a copy
Adaptive Umbrella sampling:
M. Mezei, J. Comp. Phys. 68, 237 (1987)
Click HERE
to download a copy
Finite-difference thermodynamic
integration:
M. Mezei, J. Chem. Phys. 86, 7084 (1987)
Click HERE
to download a copy
Radical-plane based proximity criterion:
M. Mezei, Molecular Simulation, 1,
327 (1988).
Click HERE
to download a copy
Using a bitmap for keeping neighbour lists:
M. Mezei, Molecular Simulation, 1,
169 (1988).
Click HERE
to download a copy
Scaled force bias method:
M. Mezei, Molecular Simulation, 5,
405 (1990)
Click HERE
to download a copy
Polynomial Thermodynamic Integration
for the solvation free energy of liquid water:
M. Mezei, J. Comput. Chem., 13,
(1992) 651;
Iso-energy cutoff for ion-ion PMF:
M. Mezei, Int. J. Quantum Chem., 52,
147 (1994).
Preferential sampling near the solute:
J.C. Owicki, Computer Modelling of Matter,
P.G. Lykos, ed., ACS, Washington, D.C., (1978), p 159.
Force-bias sampling:
C. Pangali, Rao and B.J. Berne, Chem. Phys.
Lett., 55, 413 (1978), Mol. Phys., 37, 1773 (1979)
Quantum corrections:
Powles and Rickayzen, Mol. Phys., 38,
1875 (1979).
Polynomial TI
for the solvation free energy of the alanine dipeptede:
H. Resat and M. Mezei, J. Chem. Phys.,
99, (1993) 6052.
Click HERE
to download a copy
Smart MC (a force-biasing variant):
P.J. Rossky, J.D. Doll and H.L. Friedman, J.
Chem. Phys., 69, 4628 (1978).
Generic solvation sites:
M. Mezei and D.L. Beveridge,
J. Comput. Chem., 5, 523 (1984).
(1992) 651;
Click HERE
to download a copy
Primary Hydration Shell:
D. Beglov and B. Roux, Biopolymers., 35, 171 (1995).
A. Kentsis, M. Mezei, and R. Osman,
Biophys. J, 84, 805-815 (2003).
Sampling with Tsallis statistics:
I. Andricioaei and J.E. Straub, 53, 3055 (1996).
Cavity-biased Widom insertions:
P. Jedlovszky, and M. Mezei, J. Am. Chem. Soc.,
122, 5125-5131 (2000).
Extension-biased torsion moves:
P. Jedlovszky, and M. Mezei, J. Chem. Phys.,
111, 10770-10773 (1999).
Click HERE
to download a copy
Local torsion moves:
M. Mezei, J. Chem. Phys.,
118, 3874-3879 (2003).
Automatic tuning of the B-parameter:
J.A. Speidel, J.R. Banfelder, and M. Mezei, J. Chem. Theory and Comp.,
, (2006).
Automatic tuning of the stepsize parameters:
J.A. Speidel, J.R. Banfelder, and M. Mezei,
Algorithms, 2, 215-226 (2009).
Simulaid: a simulation facilitator and analysis program
J. Comput. Chem., 31, 2658-2668 (2010).
Return to contents page