MMC Monte Carlo program guide

Users' Guide to the Monte Carlo Program MMC

for Computer Simulation of Molecular Solutions.

Mihaly Mezei

Department of Pharmacological Sciences

Icahn School of Medicine at Mount Sinai

E-mail address: Mihaly.Mezei@mssm.edu

April 04, 2022

TABLE OF CONTENTS

I. General description
- I.1. Systems modeled
- I.2. Properties calculated during a simulation
- I.3. Potential functions
- I.4. Sampling techniques
- I.5. Free energy methods
- I.6. Solute move selection
- I.7. Initial configuration
- I.8. Information needed for a simulation
- I.9. Self test
- I.10. General analysis
II. File Organization
III. Input description
- III.01. File handling keywords
- III.02. System descriptor keywords
- III.03. Free-energy calculation keywords
- III.04. Molecule descriptor keywords
- III.05. Potential descriptor keywords
- III.06. Sampling-related keywords
- III.07. Calculation flow keywords
- III.08. Output related keywords
- III.09. General analysis keywords
- III.10. Proximity analysis keywords
- III.11. Numerical precision related keywords
- III.12. Developpers' keywords
- III.13. Keyword summary
IV. Examples
V. Brief description of the output
VI. Compilation of the program
VIII. Testing
IX. References

I. GENERAL DESCRIPTION

The present document describes the Force-Biased Metropolis Monte Carlo program for computer simulation studies of molecular liquids and solutions in the canonical, isobaric-isothermal and grand-canonical ensembles and the analysis of such calculations based on the Proximity Criterion. This section provides an overview of the capabilities of the program.

I.1. Systems modeled

The present program was originally written for simulations on infinitely dilute solutions (one solute in (numsol) solvent molecules). As a result of this origin, the program has developed two different treatments of molecules. Solvent molecules have to be rigid and be of the same type (this allows the use of special, rapid energy calculation code). The collection of all other molecules (i.e., the one(s) not declared to be solvents) are called by the program the solute. Thus the solute can consist of a single rigid molecule, a collection of freely translating and rotating rigid molecules or a collection of freely translating and rotating molecules with certain (user specified) torsion angles freely rotating.

The program establishes the connectivity of the solute molecule(s) based on interatomic distances and built-in threshold values. By default H-H bonds and bonds between an H and a heavy atom on different groups (residues) are ignored, but this can be changed (see the keys BDHH and BDRH). Also, the bond thresholds can be changed with the MODA key. This sometimes leads to missing or extra bonds, especally if the input structure (given by the SLTA key) is obtained from a molecular dynamics simulation. Such defects can be identified by using the FCGA and BNDL keys and corrected by the keys MAKB or BRKB or forced to be ignored (ok for molecules without active torsions) with the MOLD key.

Free energy simulations can be performed on selected molecules of the solute (see Sec. I.5.).

The torsion angles to be changed on the solute are specified by the two atoms defining the bond of the torsion (with the key TORD). The torsions can be grouped and it is the group of torsions that is selected for change. The main reason for this is that if a torsion affects atoms that are on an other rotatable bond then it may be efficient to move the affected bond as well. These moves, however, are not correlated (i.e., each is generated by a different random number). There is also an option of changing the torsion angles of a loop segment (with the key LOOP). The torsion potential parameters can be obtained from a CHARMM or AMBER parameter file.

It is possible to impose periodicity on the solute, either along the X axis (with the key SLTA POLY) i.e., with period equal to the edge of the cell in the X direction - an infinite polymer, or in all three directions (with the key SLTA XTAL) with its unit cell (or a multiple of a unit cell) coinciding with the periodic cell of the simulation (i.e., a crystal). The periodic extension of the solute outside the periodic cell is done automatically by the program.

The intermolecular interactions are assumed to be of the usual 12-6-1 type, i.e., Lennard-Jones terms for the repulsion and dispersion plus electrostatics based on partial charges on atoms (or pseudo atoms). The parameters can be obtained from libraries stores by the program or defined via input. The interactions can be truncated (see keys SVVC, SUVC, SUUC) either by a spherical cutoff (each atom interacts with all others within a certain distance) or by the minimum image convention (each atom interacts with one periodic image of all the other atoms). For the purpose of cutoff or image calculation, distances usually refer to molecule-molecule or residue-residue distances.

When spherical truncation is used, the long-range electrostatic energy can be estimated by the reaction field formula (provided all residues are neutral) - see key RFCR.

I.2. Properties calculated during a simulation

The major outputs of a Monte Carlo simulation (besides the configurations saved for further analysis by the TRAJ key) are as follows.

Macroscopic thermodynamic properties of the system: internal energy, pressure, heat capacity (constant-volume or pressure, depending on the ensemble used), the density far from the solute (see key MOND). Simulations in the grand-canonical ensemble (see key GCEN) also provide the density, excess chemical potential, isothermal compressibility, and expansivity. Simulations in the isothermal isobaric ensemble (see key IBEN) also provide the density and the thermal expansion coefficient.
Several methods to calculate free energy have been implemented (see key FREE) by relating it to either some computed energy values or to some distributions.
Bulk solvent-solvent and solute-solvent distribution functions (see key DSTC): radial distribution function based on center of mass (COM), quasi-component distribution functions for coordination number, binding energy and pair energy, and dipole correlation functions.
The derivative of the solute and total energy with respect to the solute potential parameters can also be calculated as an ensemble average (see key SENS). This may be used for the development of potential parameters and for the sensitivity of the calculated energy to the potential parameters.
Calculation of the mean forces exerted by the solvent on the solute atoms (see key SENS) and the electric field gradients at the solute and solvent sites (see key FLDG) can also be requested.
For systems with variable torsion angles the torsion angle distribution can be calculated (see key TAND)
Proximity analysis, i.e., distributions based on the original form of the Proximity Criterion or its enhanced version can also be performed. This analysis partitions the solvent according to the proximal solute atoms and calculates distributions for each partition. The solute atoms are partitioned into functional groups either defined by their chemical identity (see key FCGA) or into alternative functional groups specified by the user (see key FCGD). The proximity analysis results are presented for individual solute atoms, functional groups, and solute residues (see key SLTA) and also are averaged over functional groups of the same kind.

I.3. Potential functions

Several potential functions are built into the program. For the solute-solvent nonbonded interactions (specified with the SUPT key). Clementi et al.'s library, EPEN/QPEN, Kollman et al.'s libraries (AMBER-94 and AMBER-2002), CHARMM22 library, Berendsen-Van Gunsteren library (GROMOS87 and GROMOS96); and Jorgensen's OPLS library. Note that for the GROMOS force field only the combination rule can be used - the non-bonded parameters for pairs are not implemented. Except for the EPEN form, parameters are stored internally, but they can be modified from input and allowance is made for inputting new atom types besides the stored ones. As these parameter libraries are continually updated and extended, it is advisable to check if the value stored is the best one (using the PLBP key). For aqueous solutions, the solvent-solvent interaction (specified with the SUPT key) can be either in the form of MCY-CI type or TIPS/TIP3P/SPC type or TIPS2/TIP4PB-F type or TIP5P type potential. For the MCY-CI type the parameters of both the MCY-CI-II and the Yoon-Morokuma-Davidson parametrizations are built in. For TIPS2/TIP4P/B-F types, the A, B and q parameters may also be provided by the user, but the TIPS, SPC, TIP3P, Bernal-Fowler, TIPS2, TIP4P and TIP5P water model parameters are built in. For solvent other than water, 12-6-1 type potential can be used with parameters from the AMBER, GROMOS or OPLS libraries.

There is a framework for adding field-dependent potentials to the currently implemented atom-atom potentials, but this requires some additional coding.

I.4. Sampling techniques

Various sampling techniques can be selected with the MOVE key. Translations and rotations can be sampled with the original Metropolis Method or with the Force-Biased algorithm of Pangali, Rao and Berne (not implemented for multimolecular solute moves, though). Force biasing, of course, necessitates the recalculation of energies and forces at every accepted step, thereby significantly increasing the computer time required to perform a move. The improvement in convergence, however, more than outweighs the added cost. By default a lambda value of 1/2 is used but the scaled force-bias technique where lambda is scaled down near the solute has also been implemented. Novel torsion-angle sampling techniques include the extension-biased torsion moves and the local moves with the reverse proximity criterion.

Molecules to be moved can be selected randomly, in a cycle or in a shuffled cycle or using the preferential sampling scheme of J.C. Owicki, which leads to improved statistics for the local solution environment of the solute (see key MOVE PRF*). The cavity-biased insertion technique for grand-canonical ensemble simulations and the virial-biased volume-change for the isobaric ensemble have also been implemented.

I.5. Free energy methods

Free energy simulations can be performed using a coupling parameter lambda (CP in the followings) in the energy expression. There are two fundamentally different approaches to generate a path: (A) simultaneous creation/annihilation and (B) continuous deformation. In the first the Hamiltonians of the initial and final states are used in unchanged and the coupling parameter is used to combine the two; in the second case there is a single hybrid Hamiltonian that is based on a parametrization that is a mixture of the two systems. For multimolecular solute, the free energy simulation can be applied to just one or two solute molecules, depending on the type of coupling.

I.5.A. Creation/annihilation path

I.5.A.1. Creation/annihilation path to be used in thermodynamic integration (FREE TICA). Here

E(CP)=CP^k*E₁+(1-CP)^k*E_o

where E_o and E₁ represent the Hamiltonians initial and final state, respectively. k=1 represents linear coupling, k>1 has been referred to as or 'almost linear' coupling (e.g., k=4 for 1/r¹² repulsion). It is also possible to use different k values for the different types of energy terms (1/r¹², 1/r⁶, and 1/r) with the so-called polynomial TI. E_o and E₁ can differ in conformation, partial charges, and chemical identity (the present version requires equal number of atoms in the two states - this can always be achieved with he use of dummy atoms. It is also possible to provide two more coupling parameter values CP_o and CP₁ such that CP_o < CP < CP₁. The program will compute the free energy difference between states represented by E(CP_o) and E(CP₁) with the perturbation method as the sum of two free energy differences: A₁-A_o = -kT ln<exp((E(CP₁)-E(CP))/kT)>_CP + kT ln<exp((E(CP_o)-E(CP))/kT)>_CP. <E(CP_o)>_CPo and <E(CP₁)>_CP1 are also computed for consistency check. The two molecules have to contain the same number of residues (groups), but can have different number of atoms.

I.5.A.2. Perturbation method based on a sequence of intermediate states (FREE TILI). This option performs a perturbation method calculation between two states generated using the deformation-type path described below in I.5.B. The reference state will be (E(CP₁)-E(CP_o))/2. In this case, the inital and final states have to have the same number of atoms also.

I.5.B. Deformation path

I.5.B.1. The program also provides option for a fundamentally nonlinear coupling where the coupling parameter changes both the solute coordinates and the potential parameters to generate intermediate states (continuous deformation). Besides creating intermediate E_o and E₁ states for the linear or 'almost linear' couplings described above in I.5.A.2, this coupling can be used with the probability ratio method (FREE PMF1) or with the overlap ratio method (FREE PMNL), as described below in I.5.B.2 and I.5.B.3, respectively. The initial and final states may differ in position, partial charge or identity of atoms (there are limitations for EPEN).

In each of the following cases, the path involves one or two molecules of the solute. The following paths are supported:

I.5.B.1.1. Linear combination of two conformations of a single molecule (FREE PMF1 GENL). In this case all atoms of the molecule may change positions independently. The configurations at the beginning and at the end of the path are given on input and the intermediate configurations are convex linear combinations (i.e., R(CP)=CP*R₁+(1-CP)*R_o) of these two.

I.5.B.1.2. Moving two molecules of the solute in a correlated fashion (FREE PMF1 WRMM). In this case, the intramolecular distances within the two parts should remain constant. Note, that if the orientation of either part is different in the initial and final states then at the intermediate states the molecule will be distorted.

I.5.B.1.3. Varyig the distance between two solute molecules of the solute but keeping the one of them fixed (FREE PMF1 WRFM). In this case the configurations at the two ends of the path differ only in the conformation of the moving part of the solute. This option may be more economical when one part is large and the other is small.

I.5.B.1.4. Moving and rotating one molecule while keeping the other one fixed (FREE PMF1 WRFR). In this case the center of mass of the first part is moved between the center of masses at the two endpoints and the orientation of the moving part is also changed to transform it from the initial to the final orientation. This option differs from I.5.B.1.3 in that at each point of the path the moving molecule has the same intramolecular conformation.

I.5.B.1.5. Correlated rotation of a set of torsion angles (FREE PMF1 TORS). In this case a set of torsion angles are changed in a user-specified interval using a common CP parameter: CP=0 sets all the selected angles to the beginning of their interval and CP=1 sets all of them to the end of their interval.

I.5.B.2. The solvent contribution to the free energy change between two solute conformations can also be determined by computing the ratio of the Boltzmann probabilities as sampled along a line in the conformation space of the solute (described by any of the paths described in Sec. I.5.B.1, FREE PMF1 **** *USS). This approach is called the probability ratio method. Calculations of this type usually require umbrella sampling techniques. The program allows either the use of a harmonic potential to be given from input or the self-consistent determination of a tabulated weighing function using the Adaptive Umbrella Sampling Method.

I.5.B.2. Any of the paths descibed in Sec. I.5.B.1 can be used to estimate free energy changes simultaneosly using the overlap ratio method of Jacucci and Quirke] and the perturbation method (FREE PMNL). Here the calculations are run at a given fixed E(CP) and two more coupling parameter values, CP_o and CP₁ are specified, CP_o < CP < CP₁. For the overlap ratio method, the program will calculate the CP averaged distribution of <E(CP)-E(CP_o)>_CP and <E(CP₁)-E(CP)>_CP. Separate calculations at CP_o and CP₁ will give the needed CP_o and CP₁ averaged distributions to be used with the overlap ratio method. Simultaneously, the program also calculates the perturbation method differences between the states CP_o and CP as well as CP and CP₁. Notice that this kind of perturbation method calculation is different from the calculation described with the creation/annihilation path above: there the middle state E(CP) was a linear combination of the initial and final states, E(CP_o) and E(CP₁). For consistency check, (similarly to the perturbation method calculations described above) the CP_o average of E(CP_o) and the CP₁ average of E(CP₁) are also calculated.

I.5.C. Chemical potential.

Free energy can be also obtained from the chemical potential. The excess chemical potential can be obtained either from grand-canonical ensemble simulations (key GCEN), from calculating the excess free energy of changing a molecule to ideal gas (with one of the methods described above, e.g., with key FREE TICA) or by the Widom insertion method (key FREE WIDO). The program implements a version that is based on insertions into cavities.

I.6. Solute move selection

The sampling of the moves is specified as follows. The overall frequency of solute moves is specified by input provided with the STEP key. As discussed above, the whole solute can be a) rotated and displaced; b) one constituent molecule of the solute can be rotated and displaced; c) two constituent molecules can be swapped; d) a group of torsion angles can be changed; and e) the coupling parameter used in a potential of mean force calculation can be changed. If moves of type b, c, d, and e are required, then the program will request a relative frequency (weight) of making that move. Regular solute moves and coupling parameter changes are both given a weight of 1.0 . Thus giving a weight of 2.0 for type 3 moves, out of 4 solute move attempts, 1 will change the solute position, 1 will change the coupling parameter, and 2 will change one of the randomly selected torsion angle groups (on the average, of course). Clearly, the more types of moves on is using, the more frequently has the solute to be moved.

For the free energy options, the treatment of moves of type 2 or 3 depends of the type of path. For the creation/annihilation path using TI (FREE TICA) the torsions are independently sampled on both copies, since these can be, and generally are, different molecules. For the continuous path (FREE PMF1 or FREE PMNL) or the cration/annihilation PM (FREE PMLI), the torsions are treated identically in each copy since here it is assumed that the molecule is changing smoothly. This means, that for a creation/annihilation TI these moves have to be sampled twice as frequently.

I.7. Initial configuration

The starting configuration for the Monte Carlo (see key CNFG) is either read from a previously prepared file, generated from the solute's description (see key SLTA) and randomly placed solvents or from a configuration of another system through various transformations. Grand-canonical ensemble simulation (see key GCEN) is also an efficient way of obtaining a random initial configuration.

I.8. Information needed for a simulation

Several steps in the preparation of the input to and the analysis of the results form MMC can be helped by the program Simulaid. The list of its features are described in a J. Comp. Chem. paper.

I.9.1. T, temperature at which the simulations are to be run (specified with the TEMP key).

I.9.2. The coordinates of the nuclei, electrons and pseudoatoms (if any) in a local coordinate framework, and the charges associated with these centers for the solute and (optionally) for the solvent (specified with the SLTA key). The program Simulaid can help convert structures in different file formats (e.g., PDB, CHARMM .CRD) to MMC format and can optimize the orientation of a solute within a periodic box (to maximize the smallest inter-image distance and thereby minimize the artifactual effect of the solute images), within a boundig box, or optimally center a solute in a sphere (for droplet-type calculations).

I.9.3. Potential specifications. For each solute atom the potential type has to be specified from input (except for EPEN where for each atom and pseudoatom the A, B, and C parameters have to be specified individually). The program Simulaid can help to convert from PDB, CHARMM, AMBER or Macromodel input files to MMC format. Several water potentials are implemented with specific codes, but solvent can be defined similarly to the solute as well.

I.9.4. numsol, the number of solvent molecules used in the simulation, specified with the NSLV key.

I.9.5. Size and shape of the periodic cell (specified with the PBCN key).

I.9.6. When generating a new configuration the program can determine nmolec and the simulation cell size parameters from the partial molar volumes of the solute and solvent and the required number of solvent shells around the solute.

I.9.7. For partial solute displacements the groups of solute atoms (independent molecules) to be moved separately (specified with the PARD key).

I.9.8. For solute torsions (specified with the TORD key). the atoms defining the torsion angle and the torsion potential parameters.

I.9. Self testing

There is an extensive self testing routine in the program that tests the integrity/consistency of the data. It protects against both corruption of data and program errors. It is invoked at several ways:

Automatically at startup
Periodically, as specified by the user with the SLFT key.
Optionally at the end of the run, as specified by the user with the STOP key.
Automatically at every 2,500,000th step (but not at the last one).

I.10. General analysis

Except for the the bulk distributions and most thermodynamic variables, all the the properties discussed above in Sec. I.2., can be calculated from the history of a simulation as well. Note, that such analyses can be done on simulation trajectories generated by other programs, e.g., by CHARMM or AMBER. In addition to the analyses described above, several other types of analyses are implemented as summarized below.

Generic solvent sites can be calculated (see key GENS)
Two configurations can be compared and the changes in the solute molecules can be calculated (see key CMPC)
Cavities in the solute can be visualized by a grid technique (see key PRTG) and their volume be estimated (see key STVG)
The solute-solvent and solute solute energies can be separated into terms involving each solute atom (see key ENGL)
The distribution of the solvents around the solute can be visulaized by creating a configuration with the solvent coordinates from a set of selected configurations (see key DENF)
A simulation history can also be filtered according to various criteria, e.g., energetic, geometric or simply eliminating segments of the system (see key FILT)

II. FILE ORGANIZATION.

Beside then standard input (Fortran unit 5) - for the information needed for the run - and the standard output file (unit 6) - printed output, diagnostics, described in Section V. - the program uses a number of different files. In the followings, writing to the standard output will be refererred to as printing.

File names are limited to 80 characters and are structured the following way:

<jobname>[.<numrun>][_<version>].<file type>.

where

<jobname> is any legitimate filename, common to all files generated by the run (specified by the FILE key);
<numrun> is a number greater than 1 and less than 99 (i.e., the omit the run number from the file name for <numrun>=1 and limit the number of runs to 98); After each RUNS command, <numrun> gets incremented by one. This allows the execution of several different MC runs (equilibrations, free energy runs with different coupling parameters, etc.) within a single job. Also, <numrun> can be modified by the FILE command.
<version> is an integer between 2 and 9999 (i.e., the omit the version number from the file name for <version>=1. It is used whenever a given runnumber deals with several files of the same type during a run (e.g., FILT, TRAJ CHKP or the SCKP keys)
<file type>, as explained below, is one of
{ckp, crd, hst, dst, idl, pxc, pxp, pxi, slt, wmp, dat, grd, pxv, wmp, pdb, fgd, CRD}
- File type=ckp: Checkpoint file storing the complete program status (checkpoint). It is created automatically when the RUNS, WCKP or SCKP keys are used. It allows the resumption of the calculation with the RCKP command by restoring all common blocks to their value prior to the last write. It starts with the coordinates of the current configuration, in the same (binary) format as on file .crd (except for the coupling parameter that is stored differently).
  Checpoint file is written before a selftest (see key SLFT), at every nplt-th step (before printing the distributions - see key RUNS) and at every nrecd step. By default, nrecd is set to the smaller of nplt or 1000000, 750000, 500000, 200000, 100000 for systems with number of atoms <3000, <5000, <7500, <10000, >10000, respectively but it can be set to any value with the key CHKP.
- File type=crd: The coordinates of the initial configuration are stored here. Additional records may store the coupling parameter (see key FREE), the number of molecules (see key GCEN) and/or the simulation cell sizes (see key IBEN) It is read by the CNFG command, and can be written by the WCNF command
- File type=hst: Trajectory file where the simulation history is (to be) stored. It's use is activated by the TRAJ command.
- File type=dst: The bulk solvent distribution functions calculated during the simulation may be saved after every nplt steps (the g(R), various QCDF's (Quasi-component distribution functions) on this file. It can later be used with plot programs for graphical displays. Examples for QCDF's are the distribution of coordination numbers or binding energies.
- File type=idl: The log of insertions and deletions are stored here, as described for the command GCEN.
- File type=pxc: The checkpoint file for proximity analys is generated when keys PXAN or SCAN are used.
- File type=pxp: The file containing the proximity analysis distributions for further plotting, written when the PXPL key is present.
- File type=pxi: The file containing the proximity information (generated by PXWR) for each configuration analyzed.
- File type=slt: The file containing the solute description (read after the key SLTA).
- File type=wmp: The file of matched PMF segments prepared for plotting by the WMAT command.
- File type=dat: Configuration file written for InsightII input (generated by WCNF SVAN).
- File type=grd: Grid point file written for cavity analysis, generated by PRTG.
- File type=pxv: Proximity region visualization file, generated by GENV WRIT.
- File type=pdb: Configuration written in PDB format.
- File type=fgd: Field-gradient file, generated by the FLDG key.
- File type=CRD: Coordinate file in CHARMM CRD format (with or without the CHARMM headers).

III. INPUT DESCRIPTION

The program input is based on keyword driven commands. The structure of a command is as follows:

Leading keyword
Auxiliary keyword(s)
Unformatted data
Formatted data

Of these only the leading keyword appears all the time. Each keyword is a four-character, upper case string. Keywords and unformatted data have to be on the same line. Commands requiring more than one line require the continuation character "~" at the end of the line that is not the end of the command. Several commands can be put on the same line if they are separated by the separator character "|". Text after the comment character "!" is neglected (i.e., considered just comment).

The type of items required are defined by the leading keyword. Auxiliary keywords and data may have default values. The order of keywords and data is fixed, so if there is a needed auxiliary keyword or data, all other items have to be specified that precede the needed one. Formatted data (when required) always start in a line after the keywords and it can not be omitted, but for items with default value, can be secified as zero.

While there is no rigid order of the leading keywords, there are certain rules. Certain leading keywords have prerequisite keywords (i.e., the prerequisite keywords all have to be inputted earlier) and potential precursor keywords that, if used, must be read before the current one or after the next RUNS) or SCAN). If KEY1 is a potential precursor of KEY2 then Key2 is a potential successor of KEY1. If the same leading keyword is used more than once before a RUNS command, a warning is given and the last occurrence will be used (except for TITL).

During input and initializations the data read is cheked for consistency and for current program limits. This may result in messages prefixed by >>>>> OVERRIDE, ----- WARNING, ===== STRONG WARNING or ***** ERROR. At the conclusion of the run, the number of each type of messages is printed. ERROR messages prevent a simulation or an analysis run but don't abort immediately the input. When the input has more than one ERROR messages, some of the additional ERROR messages may only be the consequence of the first one. WARNING message indicates that under some circumstances the input may be incorrect and STRONG WARNING message indicates that the input is likely but not necessarily incorrect or that the error is not fatal. OVERRIDE messages are generally harmless, but they can indicate that the input was not what the user intended.

This section describes the input for each leading keyword and specifies their prerequisites and potential precursors. The description of the commands is given by leading keywords. For example CNFG Key1 [Key2 Key3 Data] Fdata means that the leading keyword CNFG has to be followed by at least one other auxiliary keyword and optionally two more ("Key 2 Key 3"), followed by free-format numerical data ("Data") (all on the line containing the leading keyword), followed by formatted data, started in a new line. Note also that the type of (or the need for) auxiliary keys, free-format and formatted data may depend on the actual choice of auxiliary keys. The possible keywords to be used for Key1, Key2 and Key3 and their effects are explained subsequently, as well as the nature and (for Fdata) the format of the data. Keywords are always given with upper case BOLD characters while variable (data) names are given in lower case bold characters. Input item that is optional is given in [ brackets ] and the default values are given in { braces }. The default value for an auxiliary keyword is the first one on the list.

A record type number (RECTYPE) is assigned to every formatted data record. It is used in error messages on the program's output to identifying the type of data read when the error was encountered.

III.1. File handling keywords

FILE: Job name
WCNF: Configuration saving
WCKP: Creation of a checkpoint file
SCKP: Periodic archiving of checkpoint files
RMCK: Checkpoint file removal
TRAJ: History file specification
MINE: Minimum energy extraction into history file
IDAC: Proximity insertion/deletion acceptance rate
IDLG: Insertion/deletion log file specification
PXPL: Proximity analysis distribution plot file writing
PXWR: Proximity information file specification
WTRA: Trajectory conversions
STIR: Contact elimination
LPAT: Path of local disk directory
BUFF: History file buffer save

III.01.01. Job name

FILE Data

Data: jobname[, numrun]

jobname - Each file name will start with jobname.
numrun - if present, the run number.

III.01.02. Configuration saving

WCNF

Prerequisite: CNFG, SLTA
Related key: PDBT.

Key1 (coordinate file format):

BNRY - Configuration is saved in binary
ASCI - Configuration is saved in ASCII, (3f15.5)
ASAN - Configuration is saved in ASCII, and in , each atom's coordinates are preceded by its atomic symbol and followed by the group number and charge (a4,1x,3f15.5i5,f10.5)
PDB - Configuration is exported to PDB format; last two colums contain the atomic radius and charge
PDBO - Like Key1=PDB , but the last two colums of solvent atoms contain the fractional occupancies and RMS's determined by GENS or read by CNFG. PDB format;
PDBQ - Save the configuration in GRASP PDB format #1 (last two colums contain the atomic radius and charge)
CHRM - Configuration is exported to CHARMM CRD format
Key2 (possible manipulations) - for any of the keys above:
- UNCH - Save the configuration unchanged
- FULL - When the FLXR key is present, this option will save the reassembled full structure since otherwise only the moving atoms are saved.
- CENT - The solute is centered in the simulation cell before saving.
- FIXC - Change all solute molecules to conform to the geometry defined by the SLTA key.
- SW23 - Switch between 2-copy and 3-copy free energy solutes before saving the configuration:
  - For FREE TICA or FREE PM** runs a third solute copy is added (to prepare an input for FREE PMF1 run)
  - For FREE PMF1 runs the third solute copy is removed
- MLCT - Extract from the configuration the centers of the solute molecules and discard the rest (including the solvent) before saving
- REPL - Extend the system with its periodic replicas as specified by the PBCN key. If extensions required only in a few directions then the required replica cell centers should be read with the PBCN READ (cell center coordinates can be printed with the PRPB key). All solvent molecules are printed after all the solute images and the solute atoms affected by the key PARD are also printed contiguously.
- REP0 - Extend the system with its periodic replicas but don't replicate the solute. This option is useful for enlarging an existing system.
- NOSV - Don't write the solvents
- PRET - Apply the pre-transformation that has been specified by the solute data header read with the SLTA key or defined by the CNFG RANC key
  Data: numrunr
  - numrunr - runnumber to use on the new file
- TRRT - Translate and rotate the system before saving
- RTTR - Rotate and translate the system before saving
  Data (for TRRT or RTTR): numrunr, dx, dy, dz, R11,R21,R31, R12,R22,R32, R13,R23,R33
  - numrunr - runnumber to use
  - dx, dy, dz - the components of the displacement vector
  - R - the rotation matrix, specified column-wise
- PXSR - Rearrange the solvents so that the ones non-proximal to selected solute atoms will be put first.
  Data (for PXSR): numrunr, nasltrange, iasltrange1, iasltrange2, ...
  - numrunr - runnumber to use
  - nasltrange, iasltrange1, iasltrange2, ... The selected solute atoms fall into nasltrange segments and the limits of each segment are iasltrangei and iasltrangei.
- SPSR - Rearrange the solvents so that the ones outside a user-defined spehere will be put first.
  Data (for SPSR): numrunr, x, y, z, rsph.
  - numrunr - runnumber to use
  - x, y, z - center of the sphere
  - rsph - radius of the sphere
PDBD - Save the configuration in PDB format (last two colums contain general data in free format - see Key2 and Key3 below and key KMNP)
PDBG - Save the configuration in Grasp PDB format #2 (last two colums contain general data in free format - see Key2 and Key3 below and key KMNP)
PDTD - Same as PDBD except the data in the two data colums will come from the difference between two datasets, generated by the PXDT key.
PDTG - Same as PDBG except the data in the two data colums will come from the difference between two datasets, generated by the PXDT key.
CHRD - Save the configuration in CHARMM CRD format. The content of the data column after the coordinates will be determined by Key2 as described below.
Key2 (for Key1=CHRD only); Key2, Key3 (for Key1=PD*D or Key1=PD*G only): Any (two) of the following keywords, specifying the two quantities to be printed in the last (two) data column(s) of the atom records
- VFSH Volume of the first solvation shell
- V2SH Volume of the first two solvation shells
- NFSH Number of solvents in the first solvation shell
- N2SH Number of solvents in the first and second solvation shells
- DFSH Solvent density in the first solvation shell
- DSSH Solvent density in the first and second solvation shell
  EBFS Solute-solvent binding energy in the first solvation shell
- EPFS Solute-solvent binding energy in the first solvation shell
- TOBE Total binding energy
- TOTN Total number of solvents
- NFVV Solvent-solvent coordination number
- EPVV Solvent-solvent pair energy
- EBVV Solvent-solvent binding energy
- MRDF First minimum of the primary RDF
- KRDF Coordination number using the actual first minimum of the primary RDF
- VRDF First shell volume using the actual first minimum of the primary RDF
- DRDF First shell density using the actual first minimum of the primary RDF
- CRCV Circular variance, based on all solute atoms
- CRCH Circular variance, based on the solute heavy atoms only
- INAC insertion acceptance rate (saved with the IDAC key)
- DLAC deletion acceptance rate
  Data: numrunr, rcrcv
  - numrunr {numrun} - The configuration is saved to the file <job name>.<numrunr>.dat when Key1 is SVAN, to the file <job name>.<numrunr>.pdb when Key1 is PDB*, otherwise to file <job name>.<numrunr>.crd When Key2 or Key3 = CRC*:
  - rcrcv {6.0} - The circular variance will be calculated using solute atoms within a radius of rcrcv. The circular variance for a point R is defined as
    CV = 1.0 - [ SUM (<R>-r_i) / SUM (|<R>-r_i| ]
    where r_i are the coordinates of solute atoms within rcrcv. When R is inside the solute, CV is near one, and when R is outside the solute, CV is near zero.

III.01.03. Creation of a checkpoint file

WCKP [Data]

Prerequisite: FILE

Data: numrunw

numrunw {numrun} - Checkpoint file <jobname>.numrun.ckp is to be written.

III.01.04. Periodic archiving of checkpoint files

SCKP [Data]

Data: nsavckpf

nsvckpf - Frequency of writing a separate checkpoint file.

Each checkpoint file will have the runnumber of the current run and consecutive version numbers, starting at 2. This is option allows the later calculation of proximity analysis averages over different segments of the run (see key CMPC), provides a backup procedure in case the program crashes while writing the checkpoint file, and is useful for debugging.

III.01.05. Checkpoint file removal

RMCK [Data]

Data: numrunp

numrunp {<current runnumber> -1} - if positive, the checkpoint file(s) for run number. numrunp will be removed (by default, the checkpoint file created by the last RUNS or SCAN will be removed). If numrunp is negative then the checkpoint file(s) for the <current runnumber> - |numrunp| will be removed.

This option is useful for preventing the disk to become too full when the calculation is broken up into several RUNS or SCAN steps.

III.01.06. History file specification

TRAJ Key1, [Key2,Key3] [Data]

Can not be followed by: MINE
Potential successors: RCKP
Prerequisite of: DENF, WTRA
Variable to set to 'T' in the preprocessor: ME
Related keys: MINE, WTRA, BUFF.

For MC runs this key sets up the saving or reading of the run history on file <jobname>.<runnum>_<runvers>.hst (For runnum=1 the run number part is omitted and for runvers=1 the version part is omitted). See also the key BUFF.

For analysis (SCAN) or filtering (FILT **** TRAJ) this command specifies the format of the history file. An input history can consist of several files with the same runnumber and with consecutive versionnumbers (i.e., the second, third, etc. piece should have names <jobname>.<runnum>_<runvers+1>.hst , <jobname>.<runnum>_<runvers+2>.hst , etc.). Note, that this naming convention also applies to non-MMC formats that have customary extensions (e.g., .pdb, .DCD) requiring either renaming these files or setting up aliases for them.

Key1 (trajectory file format - if any):

NONE - No saving of run history.
ALLE - The following information is stored at every accepted step in blocks of 250 when no variable coupling parameter is applied:
- estac - Total energy of the configuration
- cst - Coordinates of the first three atoms of the moved molecule istc.
- istc - Number (label) of the solvent molecule moved (0 when the whole solute was moved).
- nmcst - Step number in the MC walk.
- bngst - Binding energy of the molecule moved. When variable coupling parameter is used (i.e., FREE PMF1 run), the coupling parameter value is saved instead of the binding energy.
This option allows the full reconstruction of the run's history but works only when neither the key PARD nor the key PART is in use and only in the canonical ensemble.
ALLV - Same as ALLE, but the total viral sum is saved instead of the binding energy. Same limitations apply as for ALLE.
ALLC - Trajectory file contains full configurations in formatted (ASCII) records as follows:
- 1st line: number of atoms written (natomp), number of solvent molecules+1, total number of atoms, first solute atom written out, last solute atom written out, MC stepnumber, coupling parameter in format(i6,9x,i6,8x,i6,7x,2i6,5x,i9,4x,f8.0)
- Next natomp lines: x,y,z coordinates (in &%197;) (3f15.5) format
- Last -1 line: ins/del stepnumber, total energy, number of accepted insertions and deletions, umbrella sampling weight factor for the configuration
- Last line: solute-solvent energy and its contributions for each interaction term. Important: for water solvents only the first three atoms of each solvent molecule are written out.
  For (T,P,N) ensemble runs the three edge parameters (see key PBCN) are also saved in an additional record in (3f10.5) format.
ALLA - Trajectory file contains full configurations file in formatted (ASCII) records as follows:
- 1st line: number of atoms written (natomp), number of solvent molecules+1, total number of atoms, first solute atom written out,i last solute atom written out, MC stepnumber, coupling parameter in format(i6,9x,i6,8x,i6,7x,2i6,5x,i9,4x,f8.0)
- Next natomp lines: atomic symbol, x,y,z coordinates (in Å), group (residue) number, partial charge with (a4,1x,3f15.5,i5,f10.5) format
- Last -1 line: ins/del stepnumber, total energy, number of accepted insertions and deletions, umbrella sampling weight factor for the configuration
- Last line: solute-solvent energy and its contributions for each interaction term. Important: for water solvents only the first three atoms of each solvent molecule are written out.
- For (T,P,N) ensemble runs the three edge parameters (see key PBCN) are also saved in an additional record.
ALLB - Trajectory file contains full configurations in binary format:
- First record: number of solvent molecules, number of atoms, umbrella sampling weight factor for the configuration, MC stepnumber, insertion/deletion stepnumber, number of accepted insertions, number of accepted deletions, first atom written out, total energy, solute-solvent energy and its contributions for each interaction term
- Second record: the whole configuration
- For free energy runs the coupling parameter is written as an additional record. Again, for water solvents only the first three solvent atoms are written out.
- For (T,P,N) ensemble runs the three edge parameters (see key PBCN) are also saved in an additional record.
ALLP - Trajectory file contains full configurations in PDB format. The first record is a REMARK giving the same information as the first line of the ALLA format in format(9x,i6,3x,i6,5x,i6,7x,2i6,5x,i9,4x,f8.0) and the next REMARK may give the total energy and solute solvent energy in the format (9x,e16.8,17x,e13.6); additional REMARKs may follow.
ALLH - Trajectory file contains full configurations in CHARMM CRD format. The first title line gives the same information as the first line of the ALLA format in format(9x,i6,3x,i6,5x,i6,7x,2i6,5x,i9,4x,f8.0) and the next title line may give the total energy and solute solvent energy in the format (9x,e16.8,17x,e13.6); additional title lines may follow.
CHRM - Trajectory file contains full configurations in CHARMM binary trajectory format. For runnum=1 the .DCD extension is also allowed.
AMBR - Trajectory file contains full configurations in AMBER format. The program will check if the system is centered around (0,0,0) or (Ex/2,Ey/2,Ez/2).
Key2 (level of solute changes):
- RGFX - The solute is rigid and is not moved during the simulation
- RGMO - The solute is rigid and but it is allowed to move during the simulation
- FLEX - The solute is flexible
  Key3 after Key1=ALL* (level of solute atom saving):
  - ALST - All solute atoms are saved
  - MVST - Only the solute atoms affected by partial solute moves (PARD key) are saved
  - NOST - Solute atoms are not saved
  - NOSV - Solvent atoms are not saved
  Key3 after Key1=AMBR (PBC information):
  - NOBX - No box-size information after configurations
  - BOX - After each configuration the simulation cell edges are written in a new line
  - BOXX - Read the box description after each configuration but keep using the values inputted with the NSLV command
    Data: natskip, runnum, runvers, tfilename, incrun, alttrajext
    - natskip {0} - When natskip > 0 then the last natskip atoms of each frame will be skipped. When natskip < 0 then the first natskip atoms of each frame will be skipped. Ignored with MMC trajectories.
    - runnum {current run number} - The run number of the trajectory file.
    - runvers {1} - The version number of the (initial) trajectory file.
    - tfilename {jobname} - The trajectory file is named tfilename.runnum_runvers.hst
    - incrun {0} - If zero, additional trajectory segments will be searched with increasing the version number; if one, with increasing the run number.
    - alttrajext - the trajectory extension (instead of hst). It has to be exactly three characters.

When running a MC simulation, full configurations (Key1=ALL*) will be written on the history file at every nmcrec-th MC step (read with the RUNS key).

III.01.07. Minimum energy extraction

MINE Key1

Related key: TRAJ

Key1 (trajectory or overall):

NONE - Don't extract the minimum energy
TRAJ - extract the minimum energy over every nmcrep MC step segment and save it on the trajectory (.hst) file. Requires the TRAJ key. Also, save the overall lowest energy and write it on a PDB file at the end of the run defined by the RUNS key.
ALLC - Only extract the minimum energy and write it on a PDB file at the end of the run defined by the RUNS key.
WANN - Besides the function of ALLC the configuration at each temperature change is written to a file jobname_annh.runnum.pdb
WANN - Besides the function of ALLC the reassembled full configuration at each temperature change is written to a file jobname_annfh.runnum.pdb (valid only when the TRAJ key is present).

III.01.08. Proximity insertion/deletion acceptance rate

IDAC [Data]

Data: rpxidlim, nrange, irangestart, irangestopnrange times)

rpxidlim {4.0} - Insertion and deletion acceptance rates within rpxidlim of the nearest solute heavy atom will be calculated for each solute heavy atom.
nrange {0} - Number of solute atom index ranges
irangestart, irangestop (nrange times) - proximity index calculation will be restricted to heavy atoms in the rage(s) irangestart - irangestop

III.01.09. Insertion/deletion log file specification

IDLG Key1 [Data]

Prerequisite: GCEN.
Prerequisite of: IDAG
Related key: IDAG.

Key1 (ins/del log file creation):

OFF - No log is written.
ASCI - For each accepted insertion and deletion the program writes +1 or -1 (for insertion or deletion), the solvent molecule index, the number of solvent molecules after the insertion/deletion, the MC stepnumber and the coordinates of the center of mass of the molecule inserted/deleted on the .idl file.
ASPX - Same as ASCI, but the atom number of the nearest solute atom to the insertion/deletion site as well as the distance between them are also printed.
Data: rpxidlim
- rpxidlim {4.0} - Solute atoms with insertion and deletion sites within rpxidlim will be listed with the respective counts.

III.01.10. Proximity analysis distribution plot file writing

PXPL If this keyword is present, the program will write an ASCII file containing all the calculated proximity distributions to the .pxp file. The distributions written on the .pxp file can be plotted with the companion (Fortran-77) program pxps.f. pxps.f produces Postscipt files of plots, 8 plots per page. It can put two distributions on the same plot, but both have to use the same dependent variable. The right-side vertical axis will belong to the second plot.

A typical run of pxps.f is as follows (user input is displayed in bold face):

% pxps
 
Proximity plot generator Version:/07/22/98
Data file name root=wt1
 File wt1.pxp opened                             <-  the file wt1.pxp
                                                     was generated by
                                                     the PXPL key
Use @ for: @f=Greek fnt, @n=Normal fnt, @+=superscript
 @-=subscript, @m=newline, @#n#=font size n, @@=@symbol
Title of graph page:
GnRH Wild Type, Extended Conformation
Do you want grid lines drawn (y/n)? n   <- If yes, only one plot
                                                  can be plotted
Plot types:
 1  Solute-solvent radial distribution functions       <- from key PXGR
 2  Solute-solvent pair energy distribution functions  <- from key PXBE
 3  Mean solvent orientation as a function of R        <- from key PXDP
 4  Distribution of first shell solvent orientations   <- from key PXDP
 5  Solvent-solvent radial distribution functions      <- from key PXWW
Plot type number=1
Data colums: 1 R        2 gpx(R)   3 Kpx(R)   4 gt(R)    5 Kt(R)
Data column number=2
Do you want a second function also plotted (y/n)? y
Plot types:
 1  Solute-solvent radial distribution functions
 2  Solute-solvent pair energy distribution functions
 3  Mean solvent orientation as a function of R
 4  Distribution of first shell solvent orientations
 5  Solvent-solvent radial distribution functions
Plot type number=1
Data colums: 1 R        2 gpx(R)   3 Kpx(R)   4 gt(R)    5 Kt(R)
Data column number=3
Number of graphs to print (0: will ask for range)=0
First and last distribution function=1,18
Data set not found (skipped): gkr is=   1    <- empty proximity
regions
Data set not found (skipped): gkr is=  10
 File wt1_2_9_gpx_Kpx.ps opened
 File wt1_11_18_gpx_Kpx.ps opened
More plots (y/n)? n   <- allows to plot other distributions
%

The format of the .pxp file is as follows:

Header:
- nsltpx, nsltpxgr (2i5)
- ianslt (20i4)
- resnam,atnam (20a4)
  - nsltpx - Number of solute atoms analysed
  - nsltpxgr - Number of rdf's generated
  - ianslt - atomic numbers of all solute atoms analysed
  - resnam,atnam - 8-character label of all solute atoms analysed
Radial distribution functions (PXGR key was present). For each distribution function the program writes
- is, mxgrd (' is=',i4,' ngrids=',i4,' gkr')
- (grd(i), gr(i), rcoord(i), grt(i), rcoort(i), i=1,mxgrd) (5f8.3)
  - is - rdf number
  - mxgrd - Number of gridpoins
  - grd(i) - Radius of the i-th gridpoint
  - gr(i) - Primary rdf
  - rcoord(i) - Primary running coordination number
  - grt(i) - Total rdf
  - rcoort(i) - Total running coordination number
Primary solute pair energy distributions (PXBE key was present). For each distribution function the program writes
- is, mxgrd (' is=',i4,' ngrids=',i4,' xpe')
- (egr(i),sltep(i),rcoore(i),i=1,mxgrd) (10f8.4)
  - is - rdf number
  - mxgrd - Number of gridpoins
  - grd(i) - Energy of the i-th gridpoint
  - sltep(i) - Primary pair energy distribution value
  - rcoord(i) - Primary running coordination number
Primary solvent-solvent rdfs (PXWW key was present). For each distribution function the program writes
- is, mxgrd, nijgvvp (' is=',i4,' ngrids=',i4,' ngvv=',i2,' gvv')
- mxgrd lines: grd(i),(groo(i,iv),v=1,nijgvvp) (10f8.4)
  - is - rdf number
  - mxgrd - Number of gridpoins
  - nijgvvp - Number of rdf types (between different solvent centers)
  - grd(i) - Radius of the i-th gridpoint
  - groo(i,iv) - Primary solvent-solvent rdf
Solvent mean orientation (PXDP key was present). For each distribution function the program writes
- is, mxgrd (' is=',i4,' ngrids=',i4,' tav')
- mxgrd lines: (grd(i),tav(i),i=1,mxgrd) (10f8.4)
  - is - rdf number
  - mxgrd - Number of gridpoins centers)
  - grd(i) - Radius of the i-th gridpoint
  - tav(i) - Primary mean solvent-solute angle
Solvent orientation rdf (first shell) (PXDP key was present). For each distribution function the program writes
- is, mxgrd (' is=',i4,' ngrids=',i4,' xtd')
- mxgrd lines: (grd(i),xtd(i),i=1,mxgrd) (2f9.4)
  - is - rdf number
  - mxgrd - Number of gridpoins centers)
  - grd(i) - Angle THETA of the i-th gridpoint
  - xtd(i) - Primary solvent-solute angle distribution

III.01.11. Proximity information file specification

PXWR Key1 [Key2 Data]

Prerequisite: PXCR,

Key1 (file format - if any):

OFF - Proximity information writing is turned off.
ASCI - The proximity information file is an annotated ASCI file containing the following data for each configuration analyzed:
(nsv,nmc,ncnfpx, (iproxi(i),resname(i),resno(i),atnam(i),rmin(i),esvst(i),i=1,nsv)
where
- nsv is the number of solvent molecules
- nmc is the MC stepnumber
- ncnfpx is the number of configurations analyzed so far
- iproxi(i) is the solute atom nearest to solvent i
- rmins(i) is the corresponding minimum distance (in Å)
- esvst(i) is the interaction energy of solvent i with the solute (in kcal/mol).
BNRY - The proximity information file is a binary file containing one record for each configuration analyzed:
(nsv,nmc,ncnfpx,(iproxi(i),rmin(i)²,esvst(i),i=1,nsv)
Important: iproxi is stored as INTEGER*2 to save space. Note, that the size of a file created with the BNRY option is about the tenth of the size of a file created with the ASCI option.
BNRR - The proximity information file is a binary file containing one record for each configuration analyzed:
(nsv,nmc,ncnfpx,(iproxi(i),rmin(i)²,i=1,nsv) .
This file is about 2/3rd of the size of the file written with the BNRY option.
Key2 (read or write):
- WRIT - Write the proximity information
- READ - Read the proximity information from an existing .pxi file
  Data:
  - numrunpxi - The run number of the .pxi file to read from (only read with Key2=READ)

The proximity information file has extension .pxi.

III.01.12. Converting/shifting a trajectory

WTRA Key1 Key2 [Data]

Prerequisite: TRAJ

Key1 (transformation type):

UNCH - Don't change coordinates
STCT - Shift the system to put the solute's center of mass at the cell center
SHCT - Shift the system to move the origin from the lower left corner of the box to the center (update PBC images accordingly)
Key2 (trajectory format):
- One of ALLC, ALLA, ALLB, ALLP - see key TRAJ. Note, that ALLB is the most compact.
  Data: nmcmax, nmcfreq, nmcskip, incvers, nmcmaxseg
  - nmcmax {100000000} - gather configurations until the nmcmax-th step.
  - nmcfreq {1} - use configuration saved at every nmcfreq-th simulation step
  - nmcskip {0} - skip the first nmcskip simulation steps
  - incvers {10} - The converted trajectory will have a version number that is incremented from the initial by incvers
  - nmcmaxseg {0} - When > 0, after nmcmaxseg steps, open a new output file with version number incremented by one.

This key allows the conversion of the input trajectory to a new format and gives the option of centering it in two different ways.

III.01.13. Contact elimination

STIR [Data]

Prerequisite: CNFG

Data: eijmin, disp, dispa

eijmin {1000.0}, disp {3.0} - solute molecule pairs with energy above eijmin kcal/mol will be pulled apart by disp Å
dispa {90.0} - solute torsion group members with intramolecular energy exceeding eijmin will be incremented by dispa degrees.

This option is useful for 'cleanng up' a starting configuration generated with the RANC option.

III.01.14. Local disk directory

LPAT [Data]

Data: scratchpath

scratchpath - The full path to the directory where temporary files and checkpoint files are written during the run. The checkpoint files wil be copied to the originating directory only at the end of the run. This option allows the reduction of network traffic.

III.02. System descriptor keywords

NSLV: Number of solvent molecules
TEMP: Simulation temperature
TITL: Description
PBCN: Periodic boundary conditions
GCEN: Grand-canonical (T,V,mu) ensemble
STPX: Stop GCE run at nx
BTUN: Tune GCE B parameter
LIMG: Limit the cavity grid
IBEN: Isothermal isobaric (T,P,N) ensemble
LCMP: Virial biasing factor
CNFG: Initial configuration
SCRM: Scramble torsion angles
SCAL: Simulation cell rescaling
OVST: Overlap of the second solute copy with the first
SPRD: Spread out solute molecules
PBGR: Solute reset with grids
LIGA: Multi-copy ligand
STUN: Stepsize tuning
TAC0: Stepsize accumulator reset
SANN: Simulated annealing
SACP: Simulated annealing of the chemical potential

III.02.01. System size

NSLV Data

Can not be followed by: HRDW
Potential successors: CNFG, RCKP

Data: numsol, nsolfix

numsol - Number of solvent molecules. For grand-canonical ensemble runs (GCEN key) using CNFG RAN* will generate numsol randomly placed solvents to start with.
nsolfix {0} - The first nsolfix solvent molecules will be kept fixed during the simulations.

nsolfix > 0 requires that the solute does not undergo translation/rotation (cedslt and rtxslt should both be 0.0 - see the STEP key).

If the input has no NSLV key then the number of solvent molecules will be established from the .crd or .hst files.

III.02.02. Simulation temperature

TEMP Data

Prerequisite of: RUNS, SANN

Data: T

T - the simulation temperature in Kelvin

III.02.03. Title

TITL Data

Potential successor: RCKP

Data: description of the calculation.

The program is prepared to read at most two title lines.

III.02.04. Periodic boundary conditions and cell dimension.

PBCN Key1 [Key2 Key3] Data [Fdata]

Prerequisites: SUVC, SVVC
Potential successor: RCKP
Prerequisite of: SLTA, GCEN, MOND, LIMG, VORO, FREE, PRPB, GENS

Key1 (cell shape):

RECT - Rectangular (cubic) boundary conditions (SC).
Data: edgex [edgey, edgez] (in Å).
- edgex, edgey {edgex}, edgez {edgex} - The X,Y,Z edges of the rectangle.
- Vol=edgex*edgey*edgez Å³, R_insc=min(edgex,edgey,edgez)/2
FCC - Face-centered cubic boundary conditions (FCC).
Data: edg
- edg- The edge of the rombohedral dodecahedron is edg*sqrt(3)/2
- Vol=2*edg³Å³, R_insc=edg/sqrt(2)
TOCT - Truncated octahedron boundary conditions.
Data: edg
- edg - distance of the squrre face from the center
- Vol=4*edg³Å³, R_insc=edg(sqrt(3)/2)
  The coordinate axes go through the center of the square faces.
HCP - Hexagonal close-packed boundary conditions.
Data: edg
- edg - diameter of the inscribed sphere
- Vol=edg³/sqrt(2)Å³,
- R_insc=edg/2
HEXG - Hexagonal prism boundary conditions (HP).
Data: plen, pedge
- plen - The length of the prism
- pedge - The edge of the hexagon.
- Vol=plen*pedge²*3*sqrt(3) Å³, R_insc=min(plen, pedge*sqrt(3)/2.
- The axis of the prism is the X axis and the vertex of the hexagon goes through the Z axis. For analyzing trajectories with different axis conventions, use the PXYZ key.
SPHR - Sphere boundary conditions (SPH)
Data: rsph
- rsph-radius of the sphere.
PHS - Primary hydration shell (Beglov & Roux)
Variable to set to 'T' in the preprocessor: PH
Key2 (solute atom radius source):
- RVDW - Standard Van der Waals radii will be used for the solute atom radius.
- RSIG - Lennard-Jones SIG/2 values will be used for the solute atom radius.
  Key3 (target energy definition):
  - UTTO - uphsref read is the total restraining energy
  - UTPM - uphsref read is the per molecule restraining energy
  - UPML - uphsref read is the per molecule restraining energy, but solvents farther than a threshold are ignored for the reference shell energy calculation
READ - Input boundary conditions.
Fdata:
- RECTYPE 29: rinscr, vol, rmax (3F10.0)
- rinscr - Radius of the largest inscribed sphere into the cell.
- vol - Volume of the unit cell (in �³)
- rmax - For minimum image convention the cutoff radius used is rmax.
- RECTYPE 39: ncell (I5)
- ncell - Number of cell-center coordinates to be read.
- RECTYPE 38's: (cic(k,i),k=1,3) (3F15.0)
- ncell times repeated.
- cic - The negatives of the coordinates of center i (i.e., -X, -Y, -Z).

Note: Input boundary conditions work only if the image cell can be found by comparing the distances of a point from the various cell centers, the smallest distance will belong tot he wanted cell number. It works when the periodic system can be described by an orthogonal crystal coordinate system. For other cases, more complicated calculations are required - to start with, a transformation to non-orthogonal coordinate system - but it is not implemented yet.

For pure liquids, FCC is the best since this maximizes the distance of a molecule from its image. On the other hand, RECT is conceptually the simplest. Furthermore, for RECT the dimensions of the cell can be different along the three axes, allowing for various types of crystals. Solutes that are far from spherical can also be wrapped around by waters more efficiently. (In this case, however, the solute rotation should be prevented - see key STEP.) HEXG have been developed for long solutes or solutes representing an infinite polymer chain, to maximize the distance of the chain from its image. SPHR simply encloses the system into a sphere of inputted radius. Note that the FCC calculations are only partially vectorized.

For the rectangular, face-centered cubic, hexagonal prism and sphere boundary conditions there are special algorithms in the program, one for each. When boundary conditions are inputted, the cell number of the periodic image cell where a molecule is located is determined simply by comparing the distance of the molecule from the various cell centers read in.

The unit cell parameters are determined from the bulk density or molar volume of the solution, N_mol, and the choice of boundary conditions. If the density of the system is d g/ml , then

d (g/ml) = weight in g of N_mol / volume of N_mol in ml =

(N_mol/L)*Mol.wt / (Vol/10²⁴)

where L is Avogadro's number. The cell parameter(s) can be obtained from the resulting Vol value.

III.02.05. Grand-canonical ensemble selection

GCEN[ Key1 Key2 Key3 Key4 Key5 Data ] Fdata

Prequisite: PBCN
Can not be followed by: LIMG
Prerequisite of: PRTG, PRFI, BTUN, STVG, STPX, SACP
Related keys: IDLG, IDAG
Variable to set to 'T' in the preprocessor: CB

Grand-canonical ensemble simulation using the cavity-biased method with grid-insertion or the original, random insertion method.

Key1 (GCE insertion strategy):

OFF - Return to canonical ensemble.
CAVB - Cavity biased insertions at gridpoints
CVBF - Cavity biased insertions at a rectangle around the gridpoints
UNBI - Unbiased (random insertions)
Key2 (solute atom radius source):
- RVDW - Standard Van der Waals radii will be used for the solute atom radius.
- RSIG - Lennard-Jones SIG/2 values will be used for the solute atom radius.
- RSGH - Like RSIG, but hydrogens with SIGMA=0 will be assigned SIGMA=1.
  Key3 (insertion/deletion selection strategy):
  - ALTI - Alternating insertion and deletions attempts
  - RANI - Randomly decided insertion and deletions attempts
    Key4 (alternative Pcav definitions):
    - ALLG - Pcav is defined as the number of free grids / total number of grids
    - SLVG - Pcav is defined as the number of free grids / number of grids not covered by the solute
      Key5 (strategy to use electrostatic term for insertion/deletion steps):
      - LJQI - All potential terms are used for the I/D decision.
      - LJNC - The electrostatic contributions are neglected for the insertion/deletion decision. Ensemble averages for the thermodynamical quantities are not corrected for the applied approximation.
      - LJCN - As in LJNC, but the ensemble averages for the thermodynamic quantities are corrected for by utilizing the applied umbrella sampling.
      - LJCU - As in LJCN, but the umbrella sampling is applied to both the insertion/deletion and the regular moves, and thermodynamic quantities are corrected for the biasing.
        Data: ucspma, usmixp.
        
        ucspma - Average solute-solvent electrostatic energy(in kcal/mole). If the supplied average electrostatic solute-solvent energy is less than -1000 kcal/mole, then the program uses the starting configuration's value for the average solute-solvent electrostatic energy. Umbrella sampling bias is E_w = usmixp*(us1-n*ucspma) where us1=solute-solvent electrostatic energy, and n=number of solvent molecules at a given configuration.
        Fdata:
        
        RECTYPE 13: ba, diamslv, pmvslt, ngridx, ngridy, ngridz, nslvxp, nmolfx, idfreq, idrepf, nmcransh (3F10.0,6I5,2I10)
        ba - The B parameter of Adams. The larger it is, the larger will the chemical potential be.
        diamslv - The solvent diameter in �, to be used as the cavity radius.
        pmvslt - The molar volume of the solute (to be used in the density calculation).
        ngridx, ngridy, ngridz - The number of gridpoints in the X, Y and Z directions (not used for GCEN UNBI runs). By default, the whole simulation cell is covered by the grid, but a smaller portion can also be specified with the LIMG key.
        nslvxp - The targeted number of solvent molecules referred to the key STPX
        nmolfx {0}- The number of solvent molecules that are not allowed to be deleted.
        idfreq - The frequency of insertion/deletion attempts (in MC steps).
        idrepf - The frequency of summary line (in Ins/Del steps)
        nmcransh - The inital grid will be regenerated after every nmcransh-th MC step with a random shift in its position.

Other aspects of a GCE simulation are controlled by the keys PRFI, IDLG, IDAG, STPX, LIMG, and BTUN.

III.02.06. Stop GCE run at nx

STPX Data

Prerequisite: GCEN,

Data nslvxp

nslvxp {nslvxp read with the GCEN key} - target number of solvents to stop at.

When this key is present the run will stop if the number of solvent molecules reached nslvxp.

III.02.07. Tune the GCE B parameter

BTUN Key1 [Data]

Prerequisite: GCEN, MOND or BLKW

Key1 (tuning algortithm)

NONE - Turn off tuning, continue with the last B value (not recommended)
AVRG - Turn off tuning, continue with the average B value over the last run
ADPT - Adaptive algorithm
FLUC - Fluctuation-based algorithm
PICL - Process control algorithm
Data for Key1=ADPT and Key1=FLUC: targetden, nmctunave, nmctunskip, chabmax, dentol, inorout
- targetden {0.997} - The targeted density in the bulk region as defined by the MOND key.
- nmctunave {100000} - The number of MC steps to use for collecting data before each B parameter change.
- nmctunskip {nmctunave} - The number of MC steps to skip (as equilibration) after each B parameter change.
- chabmax {1.0} - The maximum absolute change allowed for the B parameter.
- dentol {0.01} - The density range around targetden within which the tuning is considered converged.
- inorout {2} - when =1, the region inside the cube defined by the MOND key will be assumed bulk; when =2, the region outside the cube is assumed bulk.
- targetn {-50.0} - the targeted number of solvent molecules in the region selected by inorout. If it is positive, it will override the targetden value read above.
Data for Key1=PICL: targetden, integraltime, gain, inorout
- targetden {0.997} - The targeted density in the bulk region as defined by the MOND key.
- integraltime {13000.0} - integraltime
- gain {-113.0} - gain
- inorout {2} - see above.
- targetn {-50.0} - see above.

The program will periodically change the GCE B parameter to achieve the targeted density in the 'bulk' region - defined as the outside region by the MOND key.

Note that the keys NONE and AVRG are only meaningful after having restored a checkpoint file (key RCKP).

III.02.08. Limit the cavity grid

LIMG [Key1,Data]

Prerequisite: PBCN
Potential successor: GCEN

Key1 (used only for GCE simulations to specify treatment of boundary crossings):

NOWA - Free movement of solvents in and out of the grid region
WARC - Free movement of solvents in and out of the grid region but counts of border crossings are maintained
WALL - Movement of solvents in and out of the grid region is prevented, counts of crossing attempts are maintained
Data: glimminx, glimmaxx, glimminy, glimmaxy, glimminz, glimmaxz
- The limited grid will extend from glimminx to glimmaxx, glimminy to glimmaxy, glimminz to glimmaxz in the x, y, and z directions, respectively. For any glim*value that reaches or exceeds the limits of the simulation cell (centered at (0,0,0), the periodic boundary conditions will also be invoked.

III.02.09. Isothermal isobaric ensemble simulation (TPN)

IBEN Key1, Key2, Data

Can not be followed by: VCHA
Potential successors: CNFG, VCHA
Prerequisite of: LCMP
Related key: LCMP.

Variable to set to 'T' in the preprocessor: IB

Key1 (volume sampling technique):

NONE - Turn off (TPN) ensemble sampling
UNIF - (TPN) ensemble sampling with uniformly distributed steps
VB3D - (TPN) ensemble sampling with virial-biased sampling of Mezei.
VB1D - (TPN) ensemble sampling with refinement of the virial-biased sampling of Mezei, using the components of the virial sum to generate cell size changes in the X, Y, and Z directions, respectively.
Key2 (volume change (un)isotropy):
- ISOT - Volume change involves all three axes
- IXYZ - Volume change alternates between changes in the X, Y or Z directions.
- ISYZ - Volume change alternates between changes in the X dimensions and in the Y-Z dimensions. directions
  Data:
  - press {1.0} - The external pressure specified (in atm)
  - vrange {100.0} - The volume change range (in Å³)
  - nvchfreq {nmolec} - The frequency of volume change attmpts (in MC steps)
  - nvchrep {nmolec} - The frequency of reports on volume change attmpts (in number of volume change attempts)
  - vlam {0.5} - The virial biasing lambda factor.
    For Key2=IXYZ:
  - ixyzfreeze(1), ixyzfreeze(1), ixyzfreeze(1) {0,0,0} - If ixyzfreeze(k) is > 0 then no change will be done along the k-th axis.

Note, that putting the key IBEN after CNFG will result in ignoring the cell size on the .crd file and the vaulues inputted with the PBCN key will be used instead.

III.02.10. Virial-biasig factor

LCMP Data

Prerequisite: IBEN

Data:

complfac {0.5} - The scaling factor to use for virial biasing (corresponding to the lambda factor for force biasing).

III.02.11. Initial configuration

CNFG Key1 [ Key2 Key3 Data ] [Fdata]

Prerequisites: SLTA, FILE
Can not be followed by: FREE, TORD, CMPC, NSLV, IBEN, VCHA, PXYZ
Potential successors: RCKP, SPST
Prerequisite of: RUNS, SCAL, PRTG, REGE, SPRD, VORO, STIR, SCRM, TORT, LPST, GENT
Related keys: GENT, SPRD, SCRM, STIR.

Key1 (data source):

READ : Initial configuration is read from user supplied .crd file. For Key2=PDB or Key2=CHRM, the extensions .pdb and .CRD, resp., will be assumed first.
RDEX : When part of the system is only used for grid generation (see key FLEX) the full configuration is read from a PDB or Charmm CRD file and the moving atoms will be extracted and saved in ASCI format on a .crdfile.
Key2 (file format):
- ASCI - Coordinate file will be read/written in ASCII.
- BNRY - Coordinate file will be read/written in binary.
- ASAN - Coordinate file is an annotated file written with WCNF SVAN.
- PDB - Coordinate file is imported from PDB format
  If the program is dimensioned for generic sites (nmolec < #MH) then the solvent fractional occupancies and RMS's are also read from the last two data columns.
- CHRM - Coordinate file is imported from CHARMM CRD format
  Key3 (configuration check procedure):
  - NOFX - Abort if solute on the .crd file differs form the input definition
  - FXCR - If solute on the .crd file differs form the input definition by more than 0.01 Å, use the input definition for both
  - FXCC - If solute on the .crd file differs form the input definition, by more than 0.001, use the input definition for both
  - FXIN - If solute on the .crd file differs form the input definition, use one on the .crd file for both
  - IGND - Ignore if solute on the .crd file differs form the input definitition
    Data: numrunr, irunvers, jobnamer
    - numrunr {numrun}, irunvers {1}, jobnamer {jobname} - The program will look for the file <jobnamer>.<numrunr>.crd (<jobnamer>.crd when numrunr=1) to read the coordinates from. If the filename is different from the default coordinate file name, the program will open a new coordinate file name using numrun as well, and copy the coordinates there.
RANC - The starting configuration is obtained by
- Placing the solute (as defined by the SLTA key) at the cell center, with its COM shifted to the cell center. This shift is added to the information that was read by the SLTA key.
- Inserting the number of solvents requested by the NSLV key at random location and orientation. Overlap with the solute van der Waals envelop is prevented.
- When solute molecules are cloned, the cloned copies will be placed at randomly selected points with randomly selected orientations in the cell, with their COM outside the rest of the solute. No tests for overlap will be performed.
- Force bias and the calculation of distribution functions will be turned off.
See also the key STIR for the elimination of persistent contacts.
Key2 (file format): see CNFG READ
RANI - Same as RANC, except that the solute will be placed into the simulation cell unchanged (i.e., as defined by the SLTA key).
TRAJ : The initial configuration is gathered from the trajectory file opened previously.
Key2(file format): see CNFG READ
RPSU : A built-in procedure is used. The first molecule of an old system is replaced by the solute. The COM of the old and new solute will coincide. Solvents overlapping with the Van der Waals envelope of the new solute will be discarded. The resulting configuration will be written on the .crd file. Also, DSTC NONE and SAMP METC will be set.
Key2(file format): see CNFG READ
RPUV : The starting configuration is obtained from an old system by replacing both the solute and solvent molecules in such a way that the orientation matrices are preserved. Both the number of solute and solvent atoms can change.
Key2: see CNFG READ
PMFN - Create an input file for conformational transition from a file with a fixed solute conformation. It takes the conformation given after SLTA (RECTYPEs 14) as R_o reads in the R₁ conformation (together with new charges) and prepares the linear combination of R_o and R₁. All three solute conformations will be stored as a single solute on the .crd file, along with the value of the parameter CP (read from input). Note, that it assumes that the solute in the input configuration is in the R_o conformation.
Key2: see CNFG READ
RPUU - Same as RPSU, but no solvent molecule is discarded.

III.02.12. Scramble torsion angles

SCRM

Prerequisite: CNFG

When this key is present, all torsion angles are given a random value and the solute is regenerated in this random conformation.

III.02.12. Simulation cell rescaling

SCAL Data

Prerequisite: CNFG

Data: e1, e2, e3

e1, e2, e3 - Same meaning as edgex, edgey, edgez for the PBCN key, defining the new cell. The COM's of the solvents will be scaled to fill in the new box evenly.

III.02.13. Overlap of the second solute copy with the first

OVST Data

Data: i1, i2, i3

i1, i2, i3 - The second copy of the solute will be shifted and rotated to best overlap with the first. The overlap is based on the coordinates of atoms i1, i2, i3. If any of them are zero, the first three atoms will be chosen. This is useful for free-energy calculations to reduce the change in the solute that the calculation has to model.

III.02.14. Spread out the solute molecules for display

SPRD [Data]

Prerequisite: CNFG

Data:

numrunw {numrun} - the runnumber of the PDB file to be written.

This key generates a PDB file containing the solute molecules spread out in the x-y plane so that they don't overlap.

III.02.15. Force solute reset with grids

PBGR

When this key is present, the grid calculations will reset each solute atom to the periodic cell before calculating the cover list. This is a safety option, since it is automatically set when the inital structure contains solute atoms outside the simulation cell.

III.02.16. Multi-copy ligand

LIGA Data

Data:

ligands - The number of solute molecules that are different copies of the same ligand (they will not see each other).

III.02.17. Stepsize tuning

STUN Key1, Key2, Key3, Data

Prerequisite: MVRT
Can not be followed by: MVRT
Prerequisite of: TAC0

Related key: TAC0.

Key1 (move type):

TRAN - Tune solute molecule translation stepsize
ROTA - Tune solute molecule rotation stepsize
TORS - Tune solute molecule torsion stepsize
LOOP - Tune solute molecule loop move stepsize
Key2 (tune type):
- PICL - Process control algorithm for tuning.
- PCLG - Process control algorithm for tuning; tuning is done on the logarithm of the acceptance rate.
- OFFW - Turn of tuning, keep current average stepsize(s),calculated with the weight wstepsum
- OFFC - Turn of tuning, keep current cumulative average stepsize(s)
- OFF - Turn of tuning, keep current (tuned) stepsize(s)
Data: targetacc, wsum, timeint, gain, wstepsum, tunedstepmax
- targetacc {0.3} - target acceptance rate (< 1.0)
- wsum {0.999} - current acceptance rate Pacc_i for tuning purposes is
  Pacc_i - wsum*Pacc_i-1 + (1-wsum)*iacc_i
  where iacc_i is 1 for accepted and 0 for rejected moves.
- timeint {2000.0} - integraltime of tuning
- gain {-50.0} - gain of tuning
- wstepsum {wsum} - weight factor used to calculate the average tuned stepsizes, analogously to the calculation of the acceptance rates.
- tunedstepmax - the maximum value the tuned stepsize(s) can take. Default value is 2 � for Key1=TRAN, 180^o otherwise.

Note that the solute sampling should be set to alternating translation and rotation with the MVRT key. For cloned solute molecules (see the key CLON) the tuning will generate identical stepsizes for each instance of a torsion angle.

III.02.19. Stepsize accumulator reset

TAC0 Key1, Data

Prerequisite: STUN

Key1 (move type): same as Key1 of STUN

Data: nmc_zeroacc

nmc_zeroacc - the stepsize accumulator for the step type specified by Key1 will be reset at the nmc_zeroacc-th MC step. nmc_zeroacc has to be a multiple of the frequency of printing full results (nplt read by the RUNS key). This option is designed to work with STUN **** OFFC.

III.02.19. Simulated annealing

SANN Key1, Data

Prerequisite: TEMP

Key1 (annealing schedule):

NONE - turn off simulated annealing
LINE - temperature is decreased by constant increments
EXPO - temperature at the nstep-th step is set as T=T₀*exp(-nstep*expfac)
LOG - temperature at the nstep-th step is set as T=coef_log/ln(nstep+2)
LIST - temperature values at each step are specified from input
Data for Key1=LINE: nmctempstep, tempinc
- nmctempstep - temperature is changed at every nmctempstep MC step
- tempinc - temperature decrement at each step
  - If tempinc=0.0 then tempfin, the final temperature; MMC will calculate tempinc
Data for Key1=EXPO: nmctempstep, expfac
- nmctempstep - temperature is changed at every nmctempstep MC step
- expfac - temperature decrement exponent at each step
  - If expfac=0.0 then tempfin, the final temperature; MMC will calculate expfac
Data for Key1=LOG: nmctempstep, coef_log,
- nmctempstep - temperature is changed at every nmctempstep MC step
- coef_log - prefactor of the logarithmic formula
  - If coef_log=0.0 then tempfin, the final temperature; MMC will calculate coef_log
Data for Key1=LIST: nmctempstep, ntemplist, templist(1),...,templist(ntemplist)
- nmctempstep - temperature is changed at every nmctempstep MC step
- ntemplist - number of temperature values to read (< 101)
- templist(i) - the temperature at the i-th temperture step. After 100 steps, the temperature stays at the last list value. The run stops if a negative value is reached.

III.02.20. Simulated annealing of the chemical potential

SACP Key1, Data

Prerequisite: GCEN

Key1 (iteration length)

FIXS - Iteration lengths are fixed, independent of the number of molecules
SCLS - Iteration length is proportional to the number of molecules

Data: bincr, nmcsacpstep, fract_lim

bincr - B parameter increment
nmcsacpstep - for Key1=FIXS the B parameter will decreased by bincr at every nmcsacpstep MC step; for Key1=SCLS the B parameter will decreased at every nmolec*nmcsacpstep MC step.
fract_lim - Cavity biasing will be turned off if the fraction of the number of free grids exceeds fract_lim

When this option is used and the trajectory writing (key TRAJ) is not used, at each B parameter change the solvent coordinates will be saved on the history file as PDB MODELs.

III.3. Free-energy calculation keywords

FREE: Free energy options
RAUS: AUS parameter change
WMAT: PMF matching
WPLT: AUS iteration plot
TIQU: TI quadrature
OVRA: Overlap ratio
FETK: Temperature scaling for TI evaluation

III.03.01. Free energy options

FREE Key1, [Key2-4 Data Fdata]

Prerequisite: PBCN
Potential successors: CNFG, SLTA, TORD, RCKP, PARD
Related keys: TIQU, WMAT, WPLT.

Key1 (free-energy method choice):

WIDO - Excess chemical potential is calculated using the Widom insertion method with cavity modification. This option requires either a subsequent SCAN key or the PXAN key before the RUNS key
Data: rspha, ngridx, ngridy, ngridz, ngtry, nitry, ishftx, ishfty, ishftz, nmolwid; nmolwid times (ifdummy, ew0)
- rspha - Cavity radius (solute atom sizes are defined by the vdw radii see key PRAT)
- ngridx, ngridy, ngridz - Number of grid points in the x, y, z directions, respectively. By default, the whole simulation cell is covered by the grid, but a smaller portion can also be specified with the LIMG key (preceding the FREE key)
- ngtry {1} - Number of different grids (different random shifts) to use for each configuration
- nitry {1} - Number of insertion trys per grid
- ishftx {0}, ishfty {0}, ishftz {0} - When isftx=1, no grid shift is applied in the x direction; same for y and z. When isftx=2, succesive grids will be centered at i/ngtry.
  nmolwid {1} - Number of different solutes to calculate free energy for.
  For each solute:
  - ifdummy {0} - When ifdummy=1, the first atom of the inserted molecule is assumed to be non-interacting (used only to ensure the best centering into the cavity).
  - ew0 {0.0} - expected energy value (will be subtracted from all calculated energies before forming exponentials, added back at the end)
ADDW - An excess chemical potential-like quantity is calculated for the mutation of a solute atom into a new group, using the Widom insertion method formalism. In the figure below, atom X will be replaced by different groups. These groups have to be defined as separate molecules, counted as free-energy molecules and they have to include at least the atoms that are in 1-4 position to the atom X. In the figure below they are labeled as A.
```
             L    A             A
   L-L       |    |             |
     |  L    L----A      D      A
     |  |   /      \    /        \
   L-L--L--L        A--R          A--R
     |  |   \      /    \        /    \
     |  L    L----A      X      A      G(X)--G
   L-L       |    |             |      |
     |       L    A             A      G
 
```
This option requires either a subsequent SCAN key or the PXAN key before the RUNS key
Key2 (ligand flexibility):
- FIXL - the conformation of the ligand (atoms L, A, R, D and X) is unchanged during the simulation
- FLEX - the conformation of the ligand varies during the simulation
  Data: nwgtry, nwitry, nmolwid, nmcransh, nghanchor; nghanchor+1 times ighanchor; nmolwid times ew0
  - nwgtry {1} - Number of different orientations of the replacement group to try (evenly spaced)
  - nwitry {1} - Number of different orientations of the atom to be replaced
  - nmolwid {1} - Number of different groups to use as a replacement.
  - nmcransh {0} - At each nmcransh-th simulation step a new random increment is generated for the orientational sampling.
  - nghanchor {1} - Number of anchor atoms
  - ighanchor - the atomindex of the anchor atoms - negative index means to drop that atom from the ligand (atom D above); the (nghanchor)th is the atom R bonded to the atom X to be replaced and the (nghanchor+1)th is the atom X to be replaced. Thus the nganchor-1 atoms of type A and D should be followed by the atom R, the atom G(X) and the rest of the G atoms (if any)
    For each of the nmolwid group:
    - ew0 {0.0} - expected energy value (will be subtracted from all calculated energies before forming exponentials, added back at the end)
CHIM - the excess Helmholtz free energy of changing a solvent molecule into a new substance is calculated using the perturbation formula without a coupling parameter. Each solvent molecule is changed to its new form and the energy of the new substance is evaluated and fed into the perturbation method formula. Clearly, it is a low-accuracy technique. This option requires either a subsequent SCAN key or the PXAN key before the RUNS key
Key2 (Grid direction):
- XGRD - free energy profile aling the X axis will be calculated
- YGRD - free energy profile along the Y axis will be calculated
- ZGRD - free energy profile along the Z axis will be calculated
Data: ngrid, nwitry, nmolchim, nmolchim times (ifdummy, ew0)
- ngrid - Number of grid points in the direction chosen by Key2.
- nwitry - Number of insertion trys in the place of each deleted molecule nmolchim {10} - Number of different solutes to calculate free energy for.
  For each solute:
  - ifdummy {0} - When ifdummy=1, the first atom of the inserted molecule is assumed to be non-interacting (used only to ensure the best centering into the cavity).
  - ew0 {0.0} - expected energy value (will be subtracted from all calculated energies before forming exponentials, added back at the end)
TICA - Energy coupling is used to change from one solute to an other:
E(CP) = CP^tiexp*E₁ + (1-CP)^tiexp*E_o
The control function of the TI integrand will be plotted at the end of the run and error estimates will be provided for it.
PMLI - Same as TICA except that the program will calculate the free energy difference between the states precombined with the CIMX or CCMX key using the perturbation method. The control function of <exp(-(E(uscp1)-E(cplpar))/kT)>_cplpar will be plotted at the end of the run and error estimates will be provided for it.
PMNL - The calculation is performed at a fixed coupling parameter value CP (atomic coupling) to get free energy dfferences between states CP and CP_o and states CP and CP₁ with both the perturbation method and overlap ratio method.
Key2 for Key1=TICA, PMLI, or PMNL (pre-combining):
- NOMX - No combining of states. The (excess) free energy difference will be calculated between the two states represented by the two free energy solutes defined with the SLTA key.
- CIMX - Prepare the initial and final states as the linear combinations of the inputted ones (after the SLTA key) with coupling parameter values uspar0, uspar1. The potential coefficients will be similarly combined. During the run, these states will then be considered to correspond to coupling parameters 0 and 1, respectively. The coupling parameter cplpar will transformed accordingly to maintain the ratio of the forward and backward coupling parameter change.
- CCMX - The same linear combinations will be done as for CIMX but both for the conformation pair given after SLTA and for the conformation on the .crd file. In addition to that, the two solute conformations on the .crd file are assumed to have been be obtained previusly as linear combinations of the two inputted ones with coupling parameter values uscpo0, uscpo1 (read with formatted input, see below). This allows the preparation of a new 'window' from a previously used (and equilibrated) 'window'.
  Data (for Key1=TICA only): ngquadp, igquadp, epstol, sigtol, qtol, xyz2tol
  - ngquadp {0}, igquadp {0} - if not zero, the coupling parameter will be the igquadp-th point of the ngquadp-point Gaussian quadrature
  - epstol {0.0} - When the LJ sigma values of all atoms in a group differ by less than epstol then that group will be coupled linearly, i.e., tiexp = 1 will be used.
  - sigtol {0.0}, qtol {0.0}, xyz2tol {0.0} - similar tolerance values for the LJ epsilon, atomic charge and distance square.
  Data for key1=PMNL: gmor0k, gmor1k, gdvork (X20,5F10.0)
  - gmor0k, gmor1k - the energy difference distributions will be accumulated for the overlap ratio method calculation in a grid starting at gmor0k, gmor1k for the initial and final states, respectively. Each distribution contains 100 grids with a grid width of gdvork.
    Fdata (for Key1=TICA, PMLI, or PMNL):
    - RECTYPE 5: tiexp, cplpar, uspar0, uspar1, uscpo0, uscpo1 (3F5.2,5X,5F10.0)
    - tiexp - Coupling parameter exponent array. Its elements give the coupling parameter exponents for the 1/r¹², 1/r⁶ and 1/r potential energy terms (Polynomial TI). When tiexp(i)>0, in using TI (FREE TICA), the CP and (1-CP) factors for the corresponding potential energy terms are raised to the tiexpi(i)-th power).
    - cplpar - Coupling parameter value to be used in the energy expression. Ignored when ngquadp > 0 !!
    - uspar0, uspar1 - when Key2 = CIMX or CCMX the free energy difference will be determined between states represented by these two coupling parameter values (instead of between the two states defined by the key SLTA), using the perturbation method (Key1 = PMLI or PMNL) and the overlap ratio method (Key1 = PMNL).
    - uscpo0 {0}, uscpo1 {1} - The coupling parameter values that had been used to prepare the solute previously on the .crd file. It is used when a starting configuration is generated for an intermediate coupling parameter value from a configuration that itself was already an intermediate one (Key2 = CCMX).
PMF1 - The conformational coupling parameter CP will be sampled with umbrella sampling using either harmonic or adaptive weighing.
Key2 for Key1=PMF1 (coupling type):
- GENL - Linear coupling in Cartesian space
- WRMM - Linear coupling in Cartesian space, but the solute has two rigid molecules that move together as the coupling parameter changes.
- WRFM - Like WRMM, but the second molecule solute is kept fixed. The second molecule is not considered part of the free energy solute and should be specified among the non-free energy solutes.
- WRFR - Like WRFM but the first molecule is not only displaced but also rotated.
- TORS - Coupling parameter is defined along torsional coordinate(s). Requires also the TORD key.
- WRTR - Linear coupling of the center of mass of two positions of a single molecule, followed by translation and rotation in certain directions.
  Key3 (solute change type):
  - GEOC - Only the solute geometry changes
  - GEQC - Only the solute geometry and partial charges change
  - GDQC - The solute geometry, partial charges and atom types all change
    Key4 (umbrella sampling type):
    - HUSS - Umbrella sampling is based on a harmonic potential.
    - AUSL - Adaptive umbrella sampling is used with linear interpolation for the AUS weighths
    - AUSE - Adaptive umbrella sampling is used with exponential interpolation for the AUS weighths
      Data: (for FREE PMF1 WRF* **** **** nmolcnt2
      - nmolcnt2 - The second molecule of the PMF calculation that is kept fixed. This information is used to generate the common center of the two PMF molecules to make sure that all molecules interact with the same images of both PMF molecules
      Data: (for FREE PMF1 WRTR **** **** cdpmfx, crpmfx, cdpmfy, crpmfy, cdpmfz, crpmfz
      - cdpmfx, cdpmfy, cdpmfz - Center of mass displacement step size limits (see key STEP) in the X, Y, and Z directions, respectively. Zero value in any direction will restrict the motion to the other, nonzero directions.
      - crpmfx, crpmfy, crpmfz - Rotation step size limits (see key STEP) around the X, Y, and Z axes, respectively.
        Fdata for FREE PMF1 **** **** **SS):
        
        RECTYPE 6: nsweep, cplmin, cplmax, delcpl, c0cplh, p0cplh, wcplcha (15x,i5,6F10.0)
        nsweep {0}. If > 0, an adaptive umbrella sampling iteration will continue if the interval was traveled at least nsweep times.
        cplmin, cplmax - minimum and maximum of the coupling parameter CP to be sampled in this run.
        delcpl - Stepsize parameter for CP.
        c0cplh, p0cplh - With HUSS, the umbrella sampling weights are obtained as exp(c0cplh*(CP-p0cplh)²).
        wcplcha {1.0} - Relative weight of coupling parameter moves (compared to other types of solute moves).
        Fdata (additional) for FREE PMF1 **** **** AUS*):
        
        RECTYPE 7: iopnrm, iopeql, iopenc, nitssk, faclim, fcenc1,
        smplmx, ratmax, rldvmx, diffmx (4I5,6F10.0)
        RECTYPE 8: ngovmn, nsubmn, ngrcor, nwtst, fcenc2, encexp, tolera (4I5,6F10.0)
        iopnrm - Option to select matching algorithm in adaptive US.
        
        iopnrm < 0: linear n-step optimized:
        iopnrm = -1: matching is done on the probabilities (not recommended).
        iopnrm = -2: matching is done on the potential of mean force.
        iopnrm = -3: matching is done on the potential of mean force, with each increment matched separately (Dutch method).
        iopnrm = 0: nonlinear n-step optimized.
        iopnrm > 0: nonlinear n-step optimization, with regenerating initial guess from successive 1-step optimization at every iopnrm-th iteration (after resorting the iterations based on the range covered).
        
        iopeql - When >0, self test for equilibration in the adaptive umbrella sampling is performed by screening previous iterations at every iteration and dropping iterations 'contradicted' by subsequent ones.
        iopenc - Controls the strategy to force the adaptive umbrella sampling to extend the sampled range of the coupling parameter by temporarily modifying the weights.
        
        iopenc < 0 : Instead of Boltzmann probabilities, the US weights will be calculated from their sampling frequencies
        iopenc = 0 : No weight modification.
        iopenc = 1 : Weights are modified gradually as the number of iterations during which a grid is unsampled grows. ("global encouraging")
        iopenc = 2 : Suddenly abandoned regions will get higher weight:
        iopenc = 3 : 1 and 2 combined.
        
        nitssk - Number of iterations to be skipped when equilibration is found to be necessary during adaptive US.
        faclim {0.5} - K factor used to limit the sampling to the requested interval.
        fcenc1 {2.0} - C factor used to encourage sampling of new, undersampled region (iopenc = 1 or 3 or < 0),
        smplmx, ratmax {2.0} - The allowed ratio between W(CP) over neighboring grid points is limited to the range [1/ratmax, ratmax] when the frequency of sampling of that grid point is under smplmx.
        rldvmx {0.0001} - Maximum relative deviation allowed in nonlinear optimization.
        diffmx {0.9} - Threshold value for the iteration difference indicator.
        ngovmn {4} - Minimum number of grids to overlap for two iterations to be considered in the screening.
        nsubmn {5} - Iteration is rejected if one out of every nsubmn subsequent overlapping iterations give difference indicator > diffmx.
        ngrcor {5} - Number of gridpoints over which the weighting is to be modified when sampling encouragement is used.
        nwtst - Partial results of adaptive step is printed for iterations until nwtst. If nwtst=#WI (see Section VI.), stop the run after saving the data on the iteration.
        fcenc2 {1.1} - For global encouraging, unsampled grid weight will be multiplied by fcenc2 in each iteration.
        
        encexp {0.75} - for global encouraging (iopenc = 1 or 3), sampled grids modifying weights will be replaced by their encexp-th power.
        encexp {1.00} - for sampling-frequency based US weights
        (iopenc < 0), the sampling frequencies will be raised to their encexp-th power.
        tolera {0.06} - The maximum distance of the coupling parameter from the interval defined by cplmin and cplmax.

III.03.02. Adaptive Umbrella Sampling parameter change

RAUS Key1, Fdata

Key1 (initialization option):

NOIN - No accumulator/counter is changed
CTIN - Initialize the sampling frequency accumulator
ALIN - Initialize all AUS-related accumulators, counters
Fdata:
- RECTYPE's 6-8 as described with the FREE PMF1 **** **** AUS* keywords followed by
- RECTYPE 9: iterw0, newlim (2i5)
- iterw0 - When iterw0 > 0, use only iterations from iterw0 for the matching. When iterw0 < 0, restore dropped iterations.
- newlim - If iterw0 = 1, iterations are discarded until the newly inputted region is reached and further nitskp iterations are performed (as equilibration).

Note that cplmin and cplmax have to stay at their original value.

III.03.03. PMF matching

WMAT Key1, Key2, Key3, Key4, Data, Fdata

This function joins the adaptive umbrella sampling estimates of the potential of mean force obtained from the checkpoint files of calculations on overlapping coupling parameter ranges (windows).

Key1 (checkpoint list specification):

REGL - checkpoint file runnumbers are evenly spaced
LIST - checkpoint file runnumbers are irregularly spaced
Key2 (plot option)
- NONE - matched distributions will not be put on .wmp files
- PLPS - matched distributions and sampling frequency values will be written on the file <jobname>.<numrun>.wmp (for further plotting) and a Postscript plot will be written on the file <jobname>.<numrun>.wmp.ps.
- NOPS - only the .wmp file will be written
- NOPL - only the Postscript plot will be written
  Key3 (plot cropping option)
  - NOCR - distribution beyond match points will be printed/plotted on the .wmp file(s)
  - CROP - distribution beyond match points will be croppped (to produce a contiguous plot)
    Key4 (matching algorithm)
    - MATW - Match at the best sampled gridpoint in the overlap range
    - MATD - Find the matching factor by minimizing the weighted deviation square sum in the overlap range
    - MATC - Find the matching factor by first minimizing the weighted deviation square sum in the overlap range and then select the gridpoint where the matched curves are the closest and match at that point
    Data for Key1=REGL: nruns, numrunr, incrnumrun
    - nruns - Number of runs to match
    - numrunr {numrun} - Run number for the checkpoint file containing the first segment (window)
    - incrnumrun {1} - Run number increment, i.e., the i-th window's checkpoint file was generated by run number numrunr+incrnumrun*(i-1).
    Data for Key1=LIST: nruns, numrun1, numrun2 ...
    - nruns - Number of runs to match
    - numrun* - Run numbers of the checkpoint files to match (nruns number)
      Fdata:
      - RECTYPE 48: delpl1, delpl2, r0, cx, pmn, imatread, cplzero, xlab (5F10.0,I5,A)
        
        delpl1, delpl2 - To be added to the values of w(CP) and p(CP) (to shift plots).
        r0, cx - The CP values on the .crd file will be transformed into r0+cx*CP.
        pmn - Minimum sampling frequency for a grid point to be written on the .wmp file (to keep noise out of the plots).
        imatread - Number of match points to read (if any)
        cplzero - when not zero, the match PMF will be shifted in such a way that the PMF at that coupling parameter value will be zero.
        xlab - The X axis label when r0>0 or cx is not one.
      - RECTYPE 49: (read if imatread > 0): (match(i),i=1,nruns+1) (11I5)
        
        match - matching point specification. match(1) and match(nruns+1) specify the beginning and the end point of the complete coupling parameter interval. match(2) is the gridpoint used to match runs 1 and 2. In general, match(i+1) is the matching point between runs i and i+1. Zero at any place results in default values. The default matching points are determined based on the combined sampling frequencies and the interval defaults to the [0.0,1.0] coupling parameter interval. The whole record can be omitted.

III.03.04. AUS iteration plot

WPLT Key1, Data, [Data]

Key1 (scale for data points):

LIN - circles with radius proportional to the extent of sampling will be drawn at each data point
SQRT - radius will be proportional to the square root of the sampling extent
LOG - radius will be proportional to the logarithm of the sampling extent
Data: PSfile, nfstplt, maxit, nfrplt, nrad
- PSfile - name of the file the plot will be written in Postscript format
- nfstplt {1}, maxit {number of iterations}, nfrplt {1} - plot data for every nfrplt -th iteration from iteration nfstplt to iteration maxit
- nrad {5} - the radius of the largest circle (in pixels)

This key will produce a family of curves showing the PMF calculated from each iteration as lines, with circles representing the extent of sampling at each coupling parameter grid. The matched PMF is shown with large filled circles.

III.03.05. TI quadrature

TIQU Key1,[Key2] Data, [Fdata]

Key1 (data source specification):

DATA - Just read the integrand values
REGL - checkpoint file runnumbers are evenly spaced
LIST - checkpoint file runnumbers are irregularly spaced
Key2 (contribution type, only for Key1=REGL or Key1=LIST)
- ALL - use both solute-solvent and solute solute (if any) contributions
- STSV - use only the solute-solvent contribution
- STST - use only the solute-solute contribution
Data for Key1=DATA: nquad, integrand values
- nquad - Number of quadrature points. Currently allowed values: 3, 5 and 8.
- integrand values - nquad numbers
Data for Key1=REGL: nquad, numruns, incrnumrun
- nquad - Number of quadrature points.
- numruns {numrun} - Run number for the checkpoint file containing the result of the first quadrature point
- incrnumrun {1} - Run number increment, i.e., the i-th quadrature point checkpoint file was generated by run number numrunr+incrnumrun*(i-1).
Data for Key1=LIST: nquad, numrun1, numrun2 ...
- nquad - Number of quadrature points.
- numrun* - Run numbers of the quadrature point checkpoint files (nquad number)
  Fdata: (only for Key1=REGL or Key1=LIST)
  - RECTYPE 41: ncread, nfac, iskipread, itrapread, nendread (6I5)
    - ncread - If > 0, the cumulative averages of the TI integrands will be read from STDIN (unit 5) (RECTYPE's 43-45), ncread numbers per line. Maximum value: 8.
    - nfac - The TI integrands read from STDIN (unit 5) will be divided by nfac if nfac > 0 (ignored if ncread=0).
    - iskipread{0} - if > 0, allow for a-posterior equilibration (read RECTYPE 42 below). ncntin long segments) to skip from the beginning of the run (a posteriori equilibration).
    - itrapread {0} - If > 0, perform trapezoid rule integration (read RECTYPE 46 below).
    - nendread {0} - If > 0, calculate the TI integral over different intervals (read RECTYPE 46 below).
  - RECTYPE 42: (read if iskipread > 0) (nskp(iq),iq=1,nquad) (8I5)
    - nskp(iq) - Number of control function blocks (i.e., ncntin long) to skip from the beginning for quadrature point iq.
  - RECTYPE 43: (read if ncread > 0) ident (a80,/,a80)
    - ident - 2-line description of the data.
    - If iread > 0 then nquad times (one data set for each quadrature point):
  - RECTYPE 44: nbl, ncntin, cplpr (2I10,F10.0)
    - nbl - The number of control function blocks to read in.
    - ncntin - The length of a control function block.
    - cplpar - The coupling parameter value.
  - RECTYPES 45: ticums(ib,iq),ib=1,nbl, ncread numbers per line (6E12.6)
    - ticums - The cumulative average of the integrand at each control function block.
  - RECTYPE 46: (read if itrapread > 0 ): idef, ymin, ymax (I5,2F10.0)
    - idef,- If idef=0, the fitted polynomial will be used to calculate the integrand at the endpoints.
    - ymin, ymax - The value of the TI integrand at coupling parameters 0 and 1 (if available) for trapezoid rule integration (to be used when idef > 0).
  - RECTYPE 47: (nendread diffferent lines): xmin, xmax (6F12.6)
    - xmin, xmax - New coupling parameter limits. The program will calculate the integral of the Gaussian approximating polynomial between these limits. This line is only read if RECTYPE 43 was present.

This function performs the numerical (Gaussian) quadrature on thermodynamic integration runs. It prepares a plot of the polynomial fitting the quadrature ponts and, optionally, performs other numerical integrations.

III.03.06. Overlap ratio evaluation

OVRA Data

This function obtains the free energy from two overlap ratio method runs.

Data: numrun1, numrun2

Run number

numrun1, numrun2 - The run numbers that generated the two checkpoint files containing the energy distributions the method requires.

III.03.07. Temperature scaling for TI

Prerequisite: TEMP

FETK Data

This option scales the temperature by dividing it with cplpar^kexp to represent a run where the total system is made to turn into ideal gas

Data: kexp, nquad, iquad

kexp {4} - the coupling parameter exponent
nquad - the number of Gaussian quadrature points (3, or 5 or 8)
iquad - the quadrature index (≤ nquad)

III.4. Molecule descriptor keywords

SLTA: Solute description
SLVA: Solvent description
CLON: Solute cloning
MAKB: Make bonds
BRKB: Break bonds
BDHH: Forcing H-H bonds
BDRH: Forcing bonds of an H to a different group
MOLD: Define solute molecule limits
RDBD: Bond list read
TORD: Torsion angle definitions
CSEG: CHARMM segment id handling

III.04.01. Solute description

SLTA Key1 Key2 Key3, Data, [Data, Fdata]

Prerequisites: FILE, PBCN, SVPT, SUPT
Can not be followed by: SLVA, PMOD, SLVA, PFRD, FREE, PROT, p FLXR, CNST, CHRG, CLON, MAKB, BRKB, RFSL, MIXR, SETC, MODA, MOLD, STEP, RDBD
Potential successors: RCKP, PRMF
Prerequisite of: CNFG, RUNS, TORD, WCNF, FCGA, PARD, FCGD, PMFM, AROM, ENGL, STVG, CMPC, BNDL, HBMO, SVDP, GENV, PXCR, SCAN, HBBR
Related keys: CLON, MAKB, BRKB, MOLD, RDBD.

Key1 (possible solute periodicity):

SMPL - Simple solute.
POLY - Periodic solute along the X axis. Displacement not made along the X axis.
XTAL - Solute is periodic along all 3 axes (crystal). Global solute moves are forbidden.
Key2 (input format):
- MMC - MMC format - specific to this program (the program Simulaid can help convert structures in different file formats (e.g., PDB, CHARMM .CRD) to this format).
- MMC4 - Older version of MMC format (4-character PF labels)
- PDB - PDB format input - atomnames will be used to deduce atomic numbers only
- CRD - CHARMM CRD format input - atomnames will be used to deduce atomic numbers only
  Key3 (data source):
  - READ - Read the solute description data (the formatted input below describing each solute atom) from the input file, starting from the next line
  - FILE - Read the solute description data from a file with extension .slt . For Key2=PDB, .pdb is also allowed, For Key2=CRD, .CRD is also allowed.
    Data: nslt, nsltfe, nsltfe0, nsltpr, numrunr, jobnamer
    - nslt - Number of solute atoms. Note that free energy calculations require multiple copies and nslt includes the atoms in all the copies.
    - nsltfe {0} - The number of solute atoms in all copies of the free-energy molecule(s).
    - nsltfe0 {nsltfe/<number of free energy copies (1,2 or 3)>} The number of solute atoms in the first copy of the free-energy molecule(s).
      - For the Widom ((FREE WIDO) or the chimera (FREE CHIM) there is one copy of the free energy solute, the atoms in the nmolwid solutes for which the free energy is calculated. Each molecule has to be a single (and different) residue.
      - For creation/annihilation path calculations (FREE TICA) or (FREE PMLI) the free energy solute consists of two copies of the same molecule in its two different states.
      - For calculations over a path of continuous deformation, (FREE PMF1 or FREE PMNL ) the program requires three copies of the solute: the initial and final states and an intermediate state. This third copy is essentially a placeholder for the actula, continually changing intermediate copy used in the calculation of energies, etc.
    - nsltpr {nslt} - Number of solute atoms to echo
    - numrunr {numrun} - Run number of the solute description file (only read when Key3=FILE).
    - jobnamer {jobname} - File name root of the solute description file (only read when Key3=FILE).
      Fdata: Description of the solute atoms
      When the solute description is read from a file then the title/comment lines can contain information about how to translate/rotate (or vice versa) to a previous position and orientation (to be used by, e.g., the WCNF **** PRET key). This information is provided automatically by Simulaid's orientational optimization features. The program will scan the first 100 lines for labels 'TRRT' or 'RTTR', indicating translation+rotation or rotation+translation. A letter 't' after the label indicates that the next three (free format) numbers are the components of the translation vector and a letter 'r' followed by 1 or 2 or 3 indicates that the next three numbers are the corresponding rows of the rotation matrix. For exemple the header of an .slt file may contain the lines
```
! Translation first  TRRTt  1.0  2.0  3.0
! Rotation next      TRRTr1 0.0 1.0 0.0
!                    TRRTr2 1.0 0.0 0.0
!                    TRRTr3 0.0 0.0 1.0
```
      indicates that translation by (1,2,3) should be followed by a multiplication (from the left) by the matrix whose rows are (0,1,0), (1,0,0,) an (0,0,1).
      - RECTYPE 14's: nslt records - one for every atoms of the solute for MMC format: tslt(i),(cslt(j,i),j=1,3),qslt(i), igrslt(i), gcent, mcent, labslt1(i), labslt2(i), indxrdf(i), pf(i), spgroup(i)
        Format for Key2=MMC: (1x,A6,4F10.0,I5,2a1,2A4,I5,1x,A4,A8)
        Format for Key2=MMC4: (1x,A4,4F10.0,I5,2a1,2A4,I5,1x,A4,A10)
      - tslt(i) - Atom type specification for atom i (not EPEN) for solute-solvent potential - either its code number or its 4-character label (AMBER or CHARMM only) - or atomic number for atom i for EPEN solute-solvent potential.
      - cslt(k,i) - X,Y,Z coordinates in � of the atom i.
      - qslt(i) - charge on atom i. For Jorgensen OPLS it is disregarded and internally stored values will be used, unless this feature is overridden by CHRG INPT.
      - igrslt - group (residue) index of the solute atoms. Used only with SUVC **GC. Important: The atoms have to be listed by increasing group index.
      - gcent - if gcent='G' then this atom will be the group center, overriding the built in topological algorithm
      - mcent - if mcent='M' then this atom will be the molecular center, overriding the built in topological algorithm
      - labslt1(i) { }, labslt2(i) { } - Optional atom labels label used in proximity analysis output. Whenever applicable, they are interpreted as residue name and atom names (four characters each, residue name left adjusted).
      - indxrdf(i) {i} - optional index pointing to the proximity distributions this solute atom contributes to. If all indxrdf(i)'s are zero then individual proximity distributions will be calculated for each solute atom. If at least one indxrdf(i) is nonzero then the atoms with indxrdf(i)=0 will not contribute to any distribution.
      - pf(i) - For Key2=MMC: if not blank the potential function type for the whole residue containig this atom (overriding the type specified by the SUPT key);
        For Key2=MMC4: two possible uses: If pf(i) is a valid potential function type then the same as for Key2=MMC otherwise if pf(i) is not blank then it will be added to the potential label tslt(i) allowing the use of more than four characters (maximum six).
      - spgroup(i) { } If not blank, atoms with the same spgroup may be considered a single functional group for the proximity analysis (see key FCGD). For the Widom insertion (FREE WIDO) or the chimera method (FREE CHIM) spgroup in the record of the atom belonging to the molecule's center will be used as the name of the solute.
    - RECTYPE 51's: nslt records - one for every atoms of the solute for PDB format:
      - mcent,atnam,resnam,(cslt(k,i),k=1,3) (11x,a1,a4,1x,a4,10x,3f8.0)
      - resnam,atnam - the analysis label labslt will be a concatenation of resnam and atnam.
      - mcent - if mcent='M' then this atom will be the molecular center, overriding the built in topological algorithm
    - RECTYPE 50's: nslt records - one for every atoms of the solute for CRD format:
      - resnam,mcent,atnam,(cslt(k,i),k=1,3) (11x,a4,a1,a4,1x,3f10.0)
      - resnam,atnam - the analysis label labslt will be a concatenation of resnam and atnam.
      - mcent - if mcent='M' then this atom will be the molecular center, overriding the built in topological algorithm Fdata: RECTYPEs 19-21 are only needed for SUPT EPEN (EPEN potentials). The location and the type of each center have already been defined above. The program finds their breakdown into electrons and nuclei automatically thus here only the repulsion/dispersion term parameters are to be inputted.
        
        RECTYPE 19's: nslte records - for every solute electronic center, in the same order as the centers were defined: as(i), bs(i), ds(i) (3F20.10)
        as - A parameter for solute electron i
        bs - B parameter for solute electron i
        ds - C parameter for solute electron i
        RECTYPE 20's: nslve records - for every solvent electronic center:
        asv(i), bsv(i), dsv(i) (3F20.10)
        asv - A parameter for solvent electron i
        bsv - B parameter for solvent electron i
        dsv - C parameter for solvent electron i
        The units of the various parameters are discussed in Marchese et al.
      - RECTYPE 21: cutin, escale, iopict (2F20.10,I5)
      - cutin,iopict - The radius of the hard sphere placed at the COM of the solute when iopict = 1 or 2. (Use it with some caution! There might be some discontinuity problems.)
      - escale {1.0} - Energy scaling parameter for the solute-solvent potential function. All the solute-solvent pair energies are multiplied by this constant scale factor. The majority of the potential functions, including EPEN, break down at very close distances leading to very high negative energies in certain orientations. Fortunately, these spurious minima are followed by large barriers as the molecule moves outward. If the starting configuration has such close contacts and regular potential is used, then the total energy of this configuration will be very negative and it is highly unlikely that the system will ever get out of this bottleneck. This problem can be circumvented by redefining the potential function as repulsive in the inner Van der Waals region.
        
        escale=0: Regular EPEN potential without any inner cutoff.
        escale=1: A hard sphere of radius cutin is placed at the COM of the solute. This feature is included to expel any solvent molecule which are in close contact with the solute molecule. Beyond the cutin distance (see RECTYPE 21), regular EPEN potential is used.
        escale=2: Van der Waals envelopes are created around the solute and solvent molecules to check whether any solvent molecule is in close contact with the solute.

The program is currently prepared to handle atoms up to atomic number 86. The atomic number for lone-pair is 89 (in case of EPEN, it is disregarded) and atomic number 90 is an electron. Atomic numbers 91-93 have been reserved for the united atoms CH₁-CH₃, respectively. The various atomtypes for the different potential libraries implemented and the actual functional form of the potentials are specified in a separate document.

Important: For EPEN, the different types of centers have to be grouped together. Lone-pairs should follow nuclei and electrons should follow nuclei and lone-pairs.

When the coupling parameter is used to determine conformational transition (FREE PMF1), three sets of solute coordinates are used: the two conformations R_o and R₁ and their combination CP*R₁+(1-CP)*R_o. The solute description thus has to contain the coordinates, charges of the two conformations as well as room for the mixture. In this third set, only the atomtypes have to be given. When CNFG PMFN, however, only the R_o conformation is required here since the R₁ conformation will be given at a different part of the input. It is important that the different descriptions of the conformation give the solute atoms in identical order. Solute atom type change during the transition can not involve EPEN.

III.04.02. Solvent description

SLVA [Data Fdata]

Prerequisite: SVPT
Potential successors: SLTA, RCKP
Prerequisite of: SVDP

Data: nslv, islvrep, label, islv4, filename

nslv - Number of solvent atoms per solvent molecule.
islvrep {1} - The central atom to be used for representing the solvent molecule when only one center is needed, e.g., for proximity analysis, solute-solvent distances will be calculated from the islvrep-th solvent atom
label {HOH } - The residue label to use with this molecule.
islv4 - {0} When islv4=1 the atom records use only four characters for the potential label
filename - when present, the solvent desription (see Fdata below) will be read from a file filename.slv, in the same format as the Fdata for the solute atoms (key SLTA)
Fdata:
- RECTYPE 15's: nslv records - one for every atoms of the solvent: itypslv(i), (cslv(j,i),j=1,3), qslv(i), labslv(i)
  - For nslv > 0:
    For islv4=0:(1x,a6,4F10.0,2x,a4) or (1x,i6,4F10.0,2x,a4)
    For islv4=1:(1x,a4,4F10.0,2x,a4) or (1x,i4,4F10.0,2x,a4)
  - For nslv = 0: (1x,a6,4f10.0,11x,a4)
    Also, a character G in colum4 53 can override islvrep read above.
- itypslv - for general solvent (SVPT GENL) it is the atom type (either a potential label, left-adjusted, or number, right adjusted - see key SLTA), otherwise the atomic number of atom i.
- cslv - X,Y,Z coordinates in � of the atom i.
- qslv - charge on atom i to be used in the SLT-SLV interactions.
- labslv - label of atom i

Pseudo-atoms are to be placed after the regular atoms.

Note: This key is ignored if the solvent is not GENL type.

Note: the dipole of the solvent molecule should be parallel to the X axis for the purpose of the near-neighbor solvent dipole correlation function computation (DSTC ALL).

III.04.03. Solute molecule cloning

CLON Data

Potential successor: SLTA

Data: nclone, iclonef(1), iclonel(1), ncopy(1), ... iclonef(nclone), iclonel(nclone), ncopy(nclone)

nclone - Number of solute molecules to clone
iclonef(1), iclonel(1) - First and last atoms of the first molecule to be cloned (atom numbers refer to the input list read after the key SLTA)
ncopy(1) - Number of copies required (> 1).
This operation takes parts of the input solute description read with the SLTA key, makes duplicate descriptions. This way a 'solute' that consists of several identical molecules can be specified without needless repetition. Note that free-energy molecules can not be cloned.

III.04.04. Making additional bonds

MAKB Data

Potential successor: SLTA

Data: nmake, imake1(1), imake2(1), ..., imake1(nmake), imake2(nmake)

nmake - Number of bonds to make
imake1(i), imake2(i) - The i-th new bond connects atoms imake1(i) and imake2(i)

Note that this key is not implemented for the case when cloning (CLON) is in effect.

III.04.05. Breaking bonds

BRKB Data

Potential successor: SLTA

Data: nbreak, ibreak1(1), ibreak2(1), ... ibreak1(nbreak), ibreak2(nbreak)

nbreak - Number of bonds to break
ibreak1(i), ibreak2(i) - The i-th bond to be broken connects atoms ibreak1(i) and ibreak2(i)

Note that this key is not implemented for the case when cloning (CLON) is in effect.

III.04.06. Forcing H-H bonds

BDHH

By default, the topology search excludes bonds between hydrogens. If this key is present, these bonds will be kept.

III.04.07. Forcing bonds of an H to a different group

BDRH

By default, the topology search excludes bonds of hydrogens to heavy atoms of a different group/residue. If this key is present, these bonds will be kept.

III.04.08. Defining solute molecule limits

MOLD Data

Potential successor: SLTA

Data: molslt, ilastm(1), ilastm(2), ...

molslt - Number of molecules in the solute
ilastm(i) - Last atom of molecule i.

This key overrides the molecular topology for defining the molecules within the solute (or just a part of it). This is useful when the molecule is distored and too many incorrect bonds are found or too many bonds are missing. When applied to cloned (CLON) solute atoms, the definitions have to be repeated for all clones. The solute atoms not covered by this operation have to be placed contiguously at the end of the solute definition read with the (SLTA) key.

III.04.09. Bond list input instead of calculating

RDBD Key1 [Data]

When this key is present, the solute bond list is read from a file.

Key1 (bond list type):

MAKB - Bond list is read from a file whose syntax is the same as the input for the MAKB key (including the key itself). Simulaid can prepare this file along with the .slt file
CPSF - Bond list is read from a file that is a Charmm topology (.psf) file.
ATOP - Bond list is read from a file that is an Amber topology file.

Data file name

file name - bond list is a file called file name.mkb, file name.psf, and file name.top for Key1 = MAKB, CPSF, and ATOP, resp. Default file name: jobname.

III.04.10. Torsion angle definitions

TORD Key1 Key2 Data Fdata

Prerequisite: SLTA
Can not be followed by: PRMF, SETC, LOOP, FREE, PARD, SKWT
Potential successors: CNFG, SPST
Prerequisite of: REGE, TORT, LPST, GENT, TAND
Related keys: LOOP, PART, TORT, LPST.

Key1 (torsion definiton source):

INPT
Data: ntorsinp, nsltcop, rngfac, rngfacl, inrcat
- ntorsinp - Number of torsion angles on the solute.
- nsltcop {1} - number of solute copies to apply the inputted torsions. Applicable for free energy simulations where two or three solute copies are needed (see the key FREE for details.
- rngfac {1.0} (nsltcop numbers, one for each solute copy) - The torstep values read in RECTYPE 24's will be all multiplied with rngfac.
- rngfacl {1.0} (nsltcop numbers, one for each solute copy) - The steploop values read in RECTYPE 24's will be all multiplied with rngfacl. The different solute copies may have different factors.
- incrat {0} - the atom indices read in below will be incremented by incrat (to allow the insertion of new molecules into the solute).
  Fdata (when ntorsinp > 0):
  - RECTYPE 24's (for each torsion) ik, il, steploop, torstep, itorgrp, isense, wtors, deldih targetang, iskew (2i5,2f10.0,2i5,3f10.0,i5)
  - ib, ik - The two (bonded) atoms defining the bond of torsion. The program will assign automatically one neighbour each of ik and il to define the actual torsion angle. For solute molecules that were sepcified with the help of the CLON key, ik and il refer to the atoms in the original (uncloned list). Torsions on cloned molecules will also be cloned automatically. For free-energy calculations using three solute copies (FREE PMF1 or FREE PMNL ) the atom numbers should relate to the third copy.
  - steploop - The torsional displacement stepsize range for loop moves. It is analogous to wpslt* (given with the PARD key). Extension biasing is not applied to loop moves.
  - torstep - The torsional displacement stepsize range. It is analogous to wpslt* (given with the PARD key) and its meaning also varies depending on Key 1 of PART.
  - itorgrp - The torsion group number. For FREE PMF1 TORS the first group is the group of torsions moved by the coupling parameter. Torsions belonging to the same torsion group will be changed simultaneously. All torsions of a group must refer to the same solute molecule.
  - isense - The direction of rotation for deldih. +1: clockwise, -1: counterclockwise.
  - wtors {1.0} Weight for selecting this torsion.
  - deldih - For torsions involving the coupling parameter the program will generate the second conformation by a rotation of deldih degrees (in the direction specified by isense) whenever it is not zero. This saves the user the need to actually generate the coordinates of the transformed solute conformation.
    For torsions not involved in the coupling parameter, the program will keep this torsion within deldih degrees of its initial value (as defined by the coordinates read by the SLTA key) whenever it is not zero.
    deldih can not be given as negative.
    targetang and iskew are read only when the SKWT key is present:
    - targetang - angle toward which the sampling may be skewed
    - iskew - when it is not zero, the sampling of the corresponding torsion will be skewed
ALL
Key2 (torsion group definition):
- MOLG - Each solute molecule will be a torsion group
- SING - Each torsion will be a torsion group
- BRAG - Each branch from the root atom will be a torsion group
  Data: defrot, deflooprot, maxtorchain, nfixbondtyp, atypx1, atypy1, atypx2, atypy2, ...
  - defrot {10.0 deg or 0.5 for EXB*} - The torstep value to be used for all torsions
  - deflooprot {10.0} - The steploop value to be used for all torsions
  - maxtorchain {nslt} - Simple torsions moving chains longer than maxtorchain atoms will be disabled
  - atypx1, atypy1 - list of nfixbondtyp pairs of atomtypes. Torsion angles around bonds between such pairs of atoms will be held fixed.
>LI>RDBB
Same input as Key1=INPT, but in addition the program will ensure that protein backbones (if any) use backbone atoms for the four atom defining each backbone torsion angle. >LI>ALBB
Same input as Key1=ALL , with the same added function as for Key1=RDBB

III.04.11 CHARMM segment id handling

CSEG

When this key is present, the last 4 characters of spgroup (read with SLTA **** MMC) will be used for the segment id when CHARMM CRD format is written with the WCNF CHRM or DENF CHRM keys.

III.5. Potential descriptor keywords

SUPT: Solute potential type
SVPT: Solvent potential type
MIXR: LJ parameter mixing rule
PRMF: Torsion potential specification
PMOD: Potential library modifications
MODA: Element modification
PFRD: Potential library input
SUVC: Solute-solvent cutoff
SUUC: Solute-solute cutoff
SVVC: Solvent-solvent cutoff
INCT: Solvent inner cutoff
PROT: Minimum repulsion for general solvent
RFCR: Reaction-field correction
SETC: Set various constants
CHRG: Solute charge manipulations
SPST: Field-dependent potentials
SPPS: Solute molecule-dependent potentials
AROM: List of aromatic carbons
SVIN: Solvation parameter input
ENHB: Hydrogen-bond potential
HBMO: Non-default hydrogen-bond
FLXR: Residues to sample
CNST: Constraint potential
VVNE: No solvent-solvent elecrostatic in Solvation parameter input
IGTT: Solute-solute potential is set to zero

III.05.01. Solute-solvent potential

SUPT : Key1 [Key2]

Potential successor: RCKP
Prerequisite of: SLTA, PROT, RFSL

Related keys: PMOD, MODA, PFRD, MIXR.

Key1 (solute PF library source):

CLMG - Clementi et al.'s library.
EPEN - EPEN/QPEN potential (parameters to be read in)
AM02 - Kollman et al.'s new library (AMBER).
AM94 - Kollman et al.'s DB94 library (AMBER).
CHRM - CHARMM (Parm 22)
GR87 - Berendsen/Van Gunsteren library (GROMOS87).
GR96 - Berendsen/Van Gunsteren library (GROMOS96).
OPLS - Jorgensen et al.'s library (OPLS). See also key CHRG
HSPH - Hard spheres - atom types (i.e., HS diameters) have to be defined by the PMOD key
ATNO - Atomtype is just atomic number (only used for proximity analysis)
DUMM - Dummy atom (no interactions)
Key2 (solute PF library source): if present it will apply to the second solute copy for FREE TICA and FREE PMFL calculations.

III.05.02. Solvent-solvent potential

SVPT Key1 Key2 [Data]

Potential successor: RCKP
Prerequisite of: SLTA, SLVA

Key1 (solvent PF type):

MCY - Solvent is MCY-CI water
Key2 (solvent PF parameters):
- MCY - Use the stored MCY-CI-II parameters.
- YMD - Use the stored Yoon-Morokuma-Davidson parameters (YMD).
TIP3 - Solvent is TIPS/SPC/TIP3P (type) water.
Key2 (solvent PF parameter source):
- TIPS - Use the stored parameters for Jorgensen TIPS
- SPC - Use the stored parameters for SPC.
- TIP3 - Use the stored parameters for Jorgensen TIP3P.
- SPCE - Use the stored parameters for SPC/E.
- READ - Input water-water interaction parameters.
  Data: c6, c12, qh (in kcal, � and electron)
  - c6 - Coefficient of the -1/r⁶ term in kcal �⁶.
  - c12 - Coefficient of the 1/r¹² term in kcal �¹².
  - qh - Charge on the hydrogen
TIP4 - Solvent is TIPS2/TIP4P/B-F (type) water.
Key2 (solvent PF parameter source):
- BRFW - Use the stored parameters for Bernal-Fowler
- TIP2 - Use the stored parameters for Jorgensen TIPS2
- TIP4 - Use the stored parameters for Jorgensen TIP4P.
- READ - See above
TIP5 - Solvent is TIP5P (type) water. Currently implemented only for HRDW SCLR
Key2 (solvent PF parameter source):
- TIP5 - use the stored parameters for TIP5P water.
- READ - See above
GENL - Solvent is general 1-6-12 molecule.
Key2: The parameter library used (see SUPT for the list).

Key2=GENL also requires the key SLVA READ

III.05.03. Lennard-Jones parameter mixing rules

MIXR Key1

Potential successors: SLTA, RCKP

Key1:

GEGE - Geometric mean for both the SIG's and the EPS's
ARGE - Arithmetic mean for the SIG's and geometric mean for the EPS's.

E_LJ(r)=4*EPS[(SIG/r)¹²-(SIG/r)⁶] = c12/r¹²-c6/r⁶. Relevant only for AMBER, CHARMM, GROMOS and OPLS potentials.

III.05.04. Torsion potential parameter input

PRMF Key1 Data

Can not be followed by: SLTA
Potential successor: TORD
Related key: PFRD

Key1: Same as Key1 of SUPT - the name of the potential library.
Data: Name of the file containing the parameters.

The input syntax for the parameter files is defined by the respective libraries. At present, only AMBER, CHARMM, OPLS (in CHARMM format) and GROMOS libraries can be read. Furthermore, for GROMOS input the file name has to be of the form <name>bon.itp and the file <name>.rpt also has to be present.

III.05.05. Potential (re)definition

PMOD Key1 Data Fdata

Potential successor: SLTA

Key1:

Key of the potential function library to be modified (see key SUPT). All parameters will be echoed on the output listing.
Data: ntyps
- ntyps - Number of new types
  Fdata
  - RECTYPE 17's: (ntyp times) ityp, ifcg, rfshl, pflab (2I5,F10.0,1X,A4) or (1X,A4,F10.0,1X,A4)
  - ityp - Atom type number (see key SLTA). For new types, it is preferable to give here an unused type number (the PLBP key can give a list of the type numbers used). For modifying existing types it can be given as zero - in this case the type number will be determined from the potential label pflab (see below).
  - ifcg - Functional group to which this atom type belongs to. It is used only for display and data checking. The following functional groups are coded:
```
 1:>C<    11:>N-     21:-OH    31:PO4-   41:  Li   51:  Ca
 2:>CH-   12:>NH     22:>PO2   32:HPO3   42: Li+   52:Ca2+
 3:>CH2   13:-NH2    23:>N+<   33:       43:  Be   66:  F-
 4:-CH3   14:=N-     24:N+H1   34:       44:Be2+   67: -Cl
 5:=C<    15:=NH     25:N+H2   35:       45:  Na   69: Cl-
 6:=CH-   16:-=N     26:N+H3   36:       46: Na+   73: -Br
 7:=CH2   17:C=O     27:COO-   37:       47:  Mg   76: Br-
 8:=C=    18:H/CO    28:-SH    38:       48:Mg2+   77:  -I
 9:-=CH   19:-O-     29:-S-    39:       49:   K   78:  I-
10:==C    20:P-O-    30:MNH+   40:       50:  K+
```
  - rfshl - First shell radius for proximity analysis (used only with RFSL STOR).
  - pflab - Potential label corresponding to potential type ityp (see key SLTA). If left blank and ityp refers to an existing type, the label corresponding to this existing type will be used.
  - Non-bonded parameters. If the first number, iatnos(ityp), is zero then no more data has to be provided for this atomtype (and the non-bonded parameters are left unchanged).
    - RECTYPE 18A's for CLEMENTI potentials: (ntyp times) iatnos(ityp), a(ityp,o), a(ityp,h), b(ityp,o), b(ityp,h), c(ityp,o), c(ityp,h) (I5,4E15.6,/,2F10.0)
      - iatnos(ityp) - Atomic number of type ityp.
      - a, b, c - The A,B,C parameters of the
      Clementi potential in kcal/mol and � units for a and b and in a.u. and � units for c.
    - RECTYPE 18B's for Kollman et al.(AMBER) potentials (both old and new) and CHARMM: iatnos(ityp), rmin(ityp), emn(ityp), rmn14(ityp), emn14(ityp) (I5,4E15.6)
      - iatnos(ityp) - Atomic number of type ityp.
      - rmin, emin - c6=2.0*emn*(rmin*2.0)⁶ and c12=emn*(rmin*2.0)¹² in � and kcal/mol units. Note, that rmin=(2*c12/c6)^(1/6)/2 and emin=(c6²/c12)/4.
      - rmin14{rmin}, emin14{emin} - non bonded parameters for 1-4 interactions.
    - RECTYPE 18C's for Berendsen-Van Gunsteren (GROMOS 87 or 96) potentials: iatnos(ityp), a(ityp), b(ityp), a14(ityp), b14(ityp) (I5,4E15.6)
      - iatnos(ityp) - Atomic number of type ITYP.
      - a, b: The A,B parameters (c6 and c12) in kcal/mol and � units.
      - a14{a}, b14{b} - The A,B parameters (c6 and c12) for the 1-4 interactions.
    - RECTYPE 18D's for Jorgensen (OPLS) potentials: iatnos(ityp), sig(ityp), eps(ityp), q(ityp) (I5,3E15.6)
      - iatnos(ityp) - Atomic number of type ityp.
      - sig, eps - The LJ SIG and EPS parameters in � and kcal/mol units.
      - q - The charge associated with atomtype ityp.
    - RECTYPE 18E's for Hard Spehere potentials: iatnos(ityp), sig(ityp) (I5,E15.6)
      - iatnos(ityp) - Atomic number of type ityp.
      - sig - The hard sphere diameter in �.

III.05.06. Atom (element) descriptor modifications

MODA Data Fdata

Potential successor: SLTA

Data: nmod

nmod - Number of element descriptors to modify
Fdata:
- RECTYPE 10 (nmod records): ian, ianpr, ifgr, namr, awr, vdwr, ramaxr, nv (3i5,1x,a4,3f10.0,i5)
  - ian - atomic number to modify
  - ianpr - new atomic number for print (to be used with united atom types)
  - ifgr - new functional group index (see key PMOD)
  - namr - new name
  - awr - new atomic weight
  - vdwr - new Van der Waals radius
  - ramaxr - new covalent radius
  - nv - new valence

Using this key the built in constants for each chemical element can be modified or new ones can be defined. Fields left blank or given as zero will retain their original value

III.05.07. Potential library input

PFRD Key1 Data

Potential successor: SLTA
Related key: PRMF

Key1 (library type):

CHRM - Read the NB parameters from a Charmm parameter file
OPLS - Read the NB parameters from a Charmm-formatted parameter file for OPLS types
Data: paramfile iswitchcase
- paramfile - the name of the parameter file
- iswitchcase {0} - when =1 the potential labels read will be converted from lower case to upper case

III.05.08. Solute-solvent cutoff

SUVC Key1 [Data]

Potential successor: RCKP
Prerequisite of: PBCN

Key1 (cutoff type):

SPCC - Spherical cutoff based on the solute COM
MICC - Minimum image convention based on the solute COM
ISOE - Isoenergy cutoff
SPGC - Spherical cutoff based on solute group COM's
MIGC - Minimum image convention, based on solute group COM's
Data: Cutoff radius {R_insc - see PBCN}

For the particular case of a single solute molecule in nmolec-1 solvent it is advisable to use the minimum-image cutoff for solute-solvent interaction to avoid the single discontinuity at the boundary of the cutoff sphere. Multimolecular solutes or for solutes with torsional degree of freedom require group-based cutoffs.

In the minimum-image cutoff for each pair of molecules only the interaction between the nearest images are taken into account, irrespective of the intermolecular separation. This is equivalent to the use of a cutoff cube, prism, etc., identical in size, shape and orientation with the periodic cell.

The cutoff radius may be based on the centers of solute groups (see SPGC or MIGC above) - useful for large solutes. Also, interionic PMF calculations can use the iso-energy cutoff introduced by Mezei.

III.05.09. Solute-solute cutoff

SUUC Key1

Potential successor: RCKP
Prerequisite of: PARD, PART, ENGL

Key1 (cutoff type):

NONE - No cutoff, no periodic images considered
MIMC - Minimum image convention based on solute molecule centers
SPMC - Spherical cutoff based on solute molecule centers
MIGC - Minimum image convention based on solute molecule group (residues) centers
SPGC - Spherical cutoff based on solute molecule group (residue) centers
Data: rc
- rc {R_insc} - Cutoff radius (see PBCN)

Note, that the **GC options will update the energies of all atoms that are in a residue when a torsion move moves the group center atom. To reduce the extra calculations, a) you should not use too large groups when this option is active and b) whenever possible, group torsions so that they don't leave too many group members unchanged.

III.05.10. Solvent-solvent cutoff

SVVC Key1 [Data]:

Potential successors: RCKP, PART
Prerequisite of: PBCN

Key1 (cutoff type):

SPCC - Spherical cutoff
MINI - Minimum image convention
Data: rc, exprc, maprc
- rc {R_insc - see PBCN} - Cutoff radius
- exprc {5.5 �} - Special cutoff for the exponential term in the MCY potential.
- maprc {rc+2�} - Cutoff for inclusion to the solvent-solvent near-neighbor bitmap

III.05.11. Inner cutoff on the solvent

INCT Key1 [Data]

Potential successor: RCKP

Key1:

NONE - None used
EXPL - Exponential repulsion at short distances
Data: rwwina
- rwwina {2.0 �} - For water-water distance rww < rwwina, replace the energy by 10.0*exp(rwwina²-rww²) + 1000.0 kcal/mol.

III.05.12. Minimum repulsion.

PROT Data

Prerequisite: SUPT
Potential successor: SLTA

Data: c12prot

c12prot - For general solvent only, solute-solvent interactions where no LJ interaction is specified by the potential library are modified ('protected') with a repulsion term of c13prot/r12¹² kcal/�¹² . This can be used to avoid numerical instabilities during creation-annihilation TI (FREE TICA) that can result from unlike charges getting too close to each other.

III.05.13. Reaction-field correction

RFCR [Data]

Variable to set to 'T' in the preprocessor: RF

Data: epsrf

epsrf {80.0} - The dielectric constant in the reaction field correction formula. epsrf=1.0 represents vacuum (i.e., no correction); epsrf=80.0 represents water.

The use of the reaction field correction requires the use of identical cutoffs for the solvent-solvent (SVVC), solute-solvent (SVVC) and solute-solute (SUUC) interactions. Also, the cutoffs are to be group based and each group has to be electrically neutral.

III.05.14. Set various constants

SETC Key1 Data

Potential successors: SLTA, TORD

Key1=DIEL:
- dielc - An overall dielectric constant, applied (as a divisor) to all electrostatic interactions. If a distance-dependent dielectric function code is activated it multiplies whichever (Mehler-Solmayer or epr(r)=r) distance-dependent dielectric function is calculated (see the preprocessor).
Key1=DDIE: Constants for the Mehler-Solmayer dielectric function
eps=A +(eps0-A)/[1+ k * exp(-lambda*(eps0-A)*r)]
- eps0_ddd {78.4} - eps0
- a_ddd {6.0294} - A
- rlambda_ddd {0.18733345} - lambda
- rk_ddd {213.5782} - k
- eps1_dd_ddd {8.0} - when eps1_dd_ddd > 0 then the dielectri constant will be set to eps1_dd_ddd for distances that are less than the distance at which the Mehler-Solmayer function reaches eps1_dd_ddd
NOTE: This dielectric constant is not applied to interactions involving any of the explicit water models, i.e., only to interactions involving the solute and a general solvent.
Key1=FC14: Data: esf14, vdwf14
- esf14 {0.833333} Factor to apply to the 1-4 electrostatic interactions
- vdwf14 {0.5} Factor to apply to the 1-4 Lennard-Jones (i.e., Van der Waals) interactions.
  If FC14 is not used then the program sets the default values that are potential dependent: (1.0, 1.0) for CHARMM, (0.5, 0.5) for OPLS, (0.833333, 0.5) for AMBER and all the others as well.
Key1=FCIN: Data: fcintra
- fcintra Factor to apply to the solute intramolecular energies when summing them into the total energy
Key1=BLKS: Data: nblskip, nbllim
- nblskip {0} - Number of control function blocks to skip when calculating error estimates (to drop equilibration part).
- nbllim {#MI} - Number of the last control function blocks to use calculating eror estimates.

III.05.15. Solute charge manipulations

CHRG Key1

Potential successor: SLTA

Key1:

ZERO - Set the solute and solvent charges to zero. This is advisable when a solvent configuration is generated randomly since it will avoid solvents getting stuck to the solute due to unphysically close distances involving atoms that have no repulsive core assigned to them.
INPT - Use inputted charges (after SLTA) for OPLS

III.05.16. Field-dependent solute potential

SPST Key1 Data

Prerequisite: FLXR

Can not be followed by: AROM, SVIN, ENHB, PARD, TORD, CNFG, HBMO
Related keys: AROM, ENHB
Variable to set to 'T' in the preprocessor: GM

Key1:

NONE - No field-dependent potential
EMAP - Additional solute potential depending on energy grid maps are read from files (see below).
CMAP - Energy grid maps are calculated.
CMPW - Energy grid maps are calculated and written to the map files.
Data: ningridmap, ewvdmax
- ningridmap {131} - Number of gridpoints in the energy maps - has to be an odd number and has to be the same as the dimension symbol #GM
- ewvdmax {1000.0} - VdW energies above ewvdmax will be truncated to ewvdmax
- scalemap {1.0} - All maps read will be multiplied by scalemap
Additional data for Key1=CMAP or CMPW: edgemap, gridcent_x, gridcent_y, gridcent_z, gridbuffer
- edgemap - the edge of the (cubic) grid map.
- gridcent_x, gridcent_y, gridcent_z - the coordinates of the center of the grid map.
- gridbuffer - atoms in a box extended by gridbuffer will contribute to the energy maps.

For Key1=EMAP, after establishing the grid constants the program reads the energy maps from the files prot.pi, lp.map, pa.map, pc.map, pd.map, ph.map, pn.map, po.map, pp.map, ps.map. for the electrostatic energy, van der Waals energy for C, A, N, O, P, S and H atoms, respectively (A is aromatic C) and, whenever needed, the file lp.map containing the contribution of the (Autodock 4) solvation term and the file ov.map containing the O-H VdW term. These are binary files containig the 3-dimensional array e(ningridmap,ningridmap,ningridmap) in a single record. The electrostatic map (read first) also contains a second record containig title, nx, ny, nz, ep, es, micro, stern, radd, ho, hi2, akp, aks where

title: 40-character description
nx,ny,nz: number of grids in the x, y,and z directions (unless nx=ny=nz=ningridmap the program aborts)
ep,es: dielectric constant inside and outside the solute
hi2: grid spacing
akp,akps: Debey screening constant inside and outside the solute

map(hi2*[(ningridmap-1)/2+1], hi2*[(ningridmap-1)/2+1], hi2*[(ningridmap-1)/2+1]) is the energy at <0.0,0.0,0.0>; map(1,1,1) is the energy at <-hi2*(ningridmap-1)/2, -hi2*(ningridmap-1)/2, -hi2*(ningridmap-1)/2>

The solvation term E_solv(i,j) contribution from atoms i and j is of the form
E_solv(i,j) = (V_iS_j + V_jS_i)*exp{r(i,j)/gaussdist2}
S_i=A_i + qk_par*abs(q(i))
where gaussdist2, and qk_par are constants V_i, and A_i are atom type-dependent constants and r(i,j) is the distance between atoms i and j and q(i) is the partial charge on atom i. All constants have built-in values that can be changed with the SVIN key (the built-in values serve as defaults).

Furthermore, if the map file hb.map is present then the program will treat all fixed oxygens as hydrogen bond donors and include a 12-10 hydrogen-bond potential. Hydrogen-bond acceptors can not be included in such a map since the H-X bond's orientation is conformation-dependent. It can be calculated explicitly during the simulation with the ENHB key

Note that on restart the energy maps have to be read again the same way as for startup.

For Key1=CMAP or Key1=CMPW, the program needs the description of the atoms contributing to the grid potentials. These are assumed to be in a file <jobname>_2.slt that is in the same format as the records read by the key (SLTA, but only the fields tslt(i), (cslt(j,i),j=1,3), qslt(i), labslt2(i), indxrdf(i) are read. There should be a record for each atom that is not explicitly represented.

III.05.17. Molecule-dependent solute potential

SPPS Key1 Data

Key1:

NONE - No Molecule-dependent potential
FIEL - An additional potential depending on the solute atoms' positions only is used.
Data: nint, int1, ..., int(nint), The nint integer values and the nreal real values will be stored in a common block and made available to the subroutine esltfield.
POSI - An additional potential depending on the solute molecules' position is used.
Data: nint, int1, ..., int(nint), nreal, real1, ..., real(nreal)
The nint integer values and the nreal real values will be stored in a common block and made available to the subroutine esltmolec.

This key allowed the use of potential terms depending on the field or on the molecule's center, orientation, etc. To use this option, the user also have to provide the corresponding code to the body of the subroutine esltmolec.

III.05.18. List of aromatic carbons

AROM Key1 Data

Prerequisite: SLTA
Potential successor: SPST
Prerequisite of: SVIN
Related key: SPST.

Key1: Solute potential type see key SUPT.

Data

narom

pflab(1)

pflab(narom)

narom - number of labels to read
pflab - potential label of aromatic carbon (narom labels)

This information is needed for deciding what type of energy maps to use with these atoms - see, e.g., SPST EMAP.

III.05.19. Desolvation parameter input

SVIN Data

Prerequisite: AROM
Potential successor: SPST
Related keys: SPST, AROM.

Key1:

ALLA - Adds a desolvation term (Autodock 4 parametrization) acting between all solute atoms. Aliphatic carbons will be treated as united atoms (the charges of hydrogens on carbons are added to the carbon charge).
CARB - same as ALLA but the desolvation term is only applied between carbon atoms.
CARA - same as CARB but the desolvation term is only applied between non-polar carbon atoms. A carbon is considered polar if its charge is > polarcarblim
NOHP - same as ALLA but the desolvation term is not applied to pairs involving a hydrogen atom.
NOPP - same as ALLA but the desolvation term is not applied between polar atoms: N,O,S, polar H and polar C. Data: gaussdist2, gausslimfac, qp_par, 6*[A_solv(i), V_solv(i)], polarcarblim
- gaussdist2 {24.50}
- gausslimfac {4.0} - desolvation terms will not be calculated for distances that are larger than gausslimfac*gaussdist2.
- qk_par {0.01097}
- A_solv(1) {-0.00143} Carbon
- V_solv(1) {33.5103 } Carbon
- A_solv(2) {-0.00052} Aromatic carbon
- V_solv(2) {33.5103 } Aromatic carbon
- A_solv(3) {-0.00162} Nitrogen
- V_solv(3) {22.4493 } Nitrogen
- A_solv(4) {-0.00251} Oxygen
- V_solv(4) {17.1573 } Oxygen
- A_solv(5) {-0.00214} Phosphorus
- V_solv(5) {33.5103} Phosphorus
- A_solv(6) {-0.00214} Sulphur
- V_solv(6) {33.5103} Sulphur
- A_solv(7) {-0.00051} Hydrogen
- V_solv(7) { 0.0 } Hydrogen
- polarcarblim {0.2} - the carbon polarity threshold

III.05.20. Hydrogen bond potential

ENHB Key1 Key2 Data

Prerequisite: SLTA
Potential successor: SPST
Prerequisite of: HBMO
Related keys: SPST, HBMO, SSVIN.

Key1 (solute-solute HB potential type):

NONE - Hydrogen-bond terms will be not be used for the solute-solute interactions.
SMPL - Hydrogen-bond putential is of the form
(c12/rHB¹²-c10/rHB¹⁰)*cos(A...H-X)
2ANG - Hydrogen-bond putential is of the form
(c12/rHB¹²-c10/rHB¹⁰) * cos^iexptheta(A...H-X) * cos^iexpomega(LP-A...H)
Key2 (solute-map HB potential type):

When this key is used a hydrogen-bond term will be used between polar hydrogens and oxygens as specified by Key1 and Key2, subject to the filters specified by rmaxhb, rminhb, qmaxhbdon, qminhbacc. This term may be used between non-bonded solute atoms and - if used - the grid-based energy term (SPST EMAP).

When the grid energy is used, and there are hydrogen bonds between the explicitly represented atoms and the atoms contributing to the grid then the atoms (that are otherwise fixed) involved in hydrogen bonds have to be specified. The hydrogen-bond donors and acceptors will be extracted from the non-felxible part of the solute as specified by the FLXR key.

III.05.21. Non-default hydrogen bond

HBMO Key1 Data

Prerequisites: ENHB, SLTA
Potential successor: SPST
Related key: SPST, ENHB.

Key1 (type of action):

ADDT - Add new hydrogen bonding type(s). Default types are:
- "D>NH": H on a nitrogen
- "D-OH": H on an oxygen
- "DNH+": H on a protonated nitrogen
- "AC=O": Carbonyl oxygen
- "AH-O": Non-carbonyl oxygen
- "AC-O": Ester or ether oxygen
New types are defined by their potential label:
Data for Key1=ADDT: pflib, ntype, pflab(1), ..., pflab(ntype)
- pflib - Potential library of the labels (see key SUPT)
- ntype - Number of new types
- pflab(i) - Potential type label of the ith new type.
ADDC - Add new hydrogen bonding coefficients:
Data for Key1=ADDC: hbtyp1, hbtyp2, c12, c10, rminhbheavy
- hbtyp1, hbtyp2 - hydrogen bond type labels for the donor and acceptor if this type of bydrogen bond (either the default labels listed above or the potential labels read with the key HBMO ADDT)
- c12, c10 the c12 and c10 coefficients of this type of bydrogen bond. If both are given as zero then this type of bydrogen bond will not be used.
- rminhbheavy {0.0} - For Key*=2ANG, hydrrogen bond between types hbtyp1, hbtyp2, oxygen acceptor with one bond only, the lower threshold of distance between the donor hydrogen's heavy-atom and the atom the oxygen is bonded.
EXCL - Exclude atom types from hydrogen bonding
Data for Key1=EXCL: pflib, nhbexclude, pflab(1), ...,pflab(nhbexclude)
- pflib - Potential library of the labels (see key SUPT)
- nhbexclude - number of labels to read
- pflab - potential label to exclude (nhbexclude labels)

III.05.22. Residues to sample

Potential successor: SLTA

FLXR Key1 Data

Related key: TORD

Prerequisite of: SPST

Key1 (input syntax):

INDX - Ranges of flexible residues are listed as pairs of integers
PDB - PDB-formatted ATOM records, one for each flexible residue

Data for Key1=INDX: n_residue_range, nresidue_range times: irf, i2f
Data for Key1=PDB: n_residue, nresidue times: PDB ATOM records specifying the residue numbers of the flexible residues

For calculations using the energy map this key defines the residues to be sampled. All other residues will contribute to the energy maps.

III.05.23. Constraint potential

Potential successor: SLTA

CNST Key1 Key2 Data

Key1 (new or modify)

NONE - No constraints
INIT - New constraint list
UPDT - Modifying constraint list
Key2 (constraint bond creation)
- NOCT - No CONECT records will be written on PDB files
- WCNT - CONECT records will be written on PDB files representing the constraints as bonds
  Data for Key1=INIT: nconstr_pair, nconstr_pair times: iatom1, iatom2, d_target, force_target, tol_target
  Data for Key1=UPDT: nconstr_pair, nconstr_pair times: d_target, force_target, tol_target
  - nconstr_pair: The number of solute atom pairs whose distance is to be constrained
  - iatom1, iatom2: The two atoms whose distance is constrained with this constraint.
  - d_target: The target distance for this constraint
  - force_target: The constraint force constant. The constraint energy will be
    force_target*(d_target-actual distance)²- force_target*tol_target²
  - tol_target: Distance tolerance - the constraint energy is set to zero if the actual distance is within tol_target of d_target

III.05.24. No solvent-solvent electrostatic interaction

Potential successor: SLTA

VVNE

When this key is present solvent-solvent interactions will mot include the electrostatic terms.

III.05.25. Ignore solute-solute energy

IGTT

When this key is present all solute-solute interactions are set to zero.

III.6. Sampling-related keywords

MOVE: Move selection strategy
SAMP: Sampling technique
NFBU: Solute force bias
STEP: Stepsizes
STPS: Stepsize scaling
PARD: Solute molecule displacement
SWAP: Solute molecule swap
MV2S: Correlated 2-solute move
PART: Solute torsion
MVRT: Alternating translation and rotation
PRFI: Insertion/deletion pref. sampl.
SEED: Random number seed input
NONB: Non-Boltzmann solute samplings
STSC: Temporary stepsize scaling
LOOP: Loop torsion moves
IGJA: Ignore Jacobian
SKWT: Skewed torsion sampling
IGSV: Non-interactive solvent
RNDG: Random number generator type

III.6.01. Molecule selection strategy specification

MOVE Key1 [Data] [Fdata]

Potential successor: RCKP
Prerequisite of: PRFI

Key1 (molecule selection strategy):

RAND - Molecules are selected randomly (uniformly) for move
CYCL - Cyclic selection - increasing order
CYCI - Cyclic selection - decreasing order
SHCY - Shuffled cyclic procedure of Mezei is used.
PRSL - Preferential sampling with piecewise linear weight function. Preferential sampling (PS) systematically selects solvents near the solute more frequently to be moved to improve the satatistics of the solute-related properties.
PRSP - Preferential sampling with polynomial weigth function
PRSE - Preferential sampling with exponential weigth function
Key2 (distance choice for PS):
- COM - PS is based on (solvent's COM's) distance from solute COM
- NRST - PS is based on (solvent's first atom's) distance from the nearest solute heavy atom. Not valid for EPEN solute.
- CXYZ - PS is based on distance from inputted center
  center
  For Key2=CXYZ Data:
  - CXYZ - The coordinates of the center from where the PS distance is calculated.
  Fdata for Key1=PRSL:
  - RECTYPE 1: npf (I5)
  - RECTYPE 2: (rpf(i),apf(i),i=1,-npf) (10F7.0)
  - rpf, apf - The weight function value w(r) at each rpf(i) is apf(i). For r < rpf(1) w(r)=apf(1) while, for r > rpf(npf) w(r)=apf(npf). For intermediate r's (rpf(1) < r < rpf(npf)) w(r) obtained by linear interpolation betwen the two nearest rpf's.
  Fdata for Key1=PRSP:
  - RECTYPE 1: npf (I5)
  - npf = 1+ the smallest exponent having non-zero coefficient.
  - RECTYPE 3: (apf(i),i=1,npf) (10F7.0)
  - apf - The coefficients in the weighing polynomial. Units are in �. The weighing function is constant for r < apf(1) and If r > apf(1) then w(r)=apf(2)+apf(3)/r +apf(4)/r²+ ... . w(apf(1)) gives the value of the constant.
  Fdata for Key1=PRSE:
  - RECTYPE 4: stiff, stvar(2F7.0)
  - stiff,stvar - The weighting function is exp(-(stiff*/stvar²)*r²) where r is the solute-solvent distance as specified by Key2.

III.6.02. Sampling technique selection

SAMP Key1 [Data Fdata]

Variable to set to 'T' in the preprocessor for Key1=FBPR or FBSC: FR

Key1 (possible displacement biasing strategy):

FBPR - Force biased.
Data: fblam {0.5} .
- fblam - the lambda factor for force biasing (the strength of the bias)
METC - Standard Metropolis, moves in a cube.
METS - Standard Metropolis, moves in a sphere.
METX - Standard Metropolis, fixed length moves (half of cedslt or cedslv as read by the key STEP
METR - Standard Metropolis, length of the moves should fall within a range
FBSC - Force biased, scaled depending on the slt-slv distance.
For Key1=METR: Data: rngminfac
- rngminfac {0.8} - the step magnitude will be between rngminfac*ceds** and ceds**
For Key1=FB**: Data: fblam
- fblam {0.5} - the lambda factor for force biasing (the strength of the bias)
For Key1=FBPR: Fdata:
- RECTYPE 27: nfblam (I5)
- nfblam - Number of data points in the table specifying the force-bias scaling.
- RECTYPE 28: (rfblam(i),afblam(i),i=1,nfblam) (10F7.0)
- rfblam(i), afblam(i) - The force-bias lambda parameter will be multiplied by a scale factor that is interpolated between the values afblam(i) at solute-solvent distance rfblam(i).

III.6.03. Solute sampling technique selection

NFBU

If present, the Metropolis method will be used for for moves translating/rotating the whole solute (se key STEP), even when the key SAMP specifies force-biased sampling. In this case leaving the C@TS lines comments saves some computer time.

III.6.04. Solute and solvent stepsizes

STEP Data

Potential successor: SLTA
Prerequisite of: RUNS

Data: cedslt, rtxslt, cedslv, rtxslv, nsltfreq

cedslt - Shift interval in � for the COM of the solute: The displacement of the solute COM will be limited into the interval [-cedslv/2,cedslv/2].
rtxslt - Rotation interval in degrees in which the solute is to be rotated.
cedslv - Shift interval in � for the COM of the solvent.
rtxslv - Rotation interval in degrees by which the solvent is to be rotated.
nsltfreq - The frequency of all solute moves. Solute move may be any of the following type: displacement-rotation (cedslt > 0.0 and nmolfx=0 ); coupling parameter change (FREE PMF1); partial solute move (PARD); partial solute torsion (PART) or solute group swap (SWAP).

The above variables are used to perturb a chosen molecule, solute or any of the solvent during the Metropolis walk. As a simple rule of thumb, these variables are chosen by trial and error to obtain 30-40% acceptance rate. For water the recommended values are: 0.30 � for the COM of the solvent and 30 degrees for the rotation angle for standard Metropolis and 0.55 � and 40 degrees for Force Bias. The stepsize of the solute should be adjusted to provide similar acceptance rate as the solvent. In general, the larger the solute the smaller stepsizes are to be used.

III.6.05. Stepsize scaling

STPS Data

Data: notmovcyc

notmovcyc {1} - Solvent stepsizes are scaled down by 10% when a solvent is found that was not moved during the last 10*notmovcyc cycles. One cycle is nmolec MC steps.

III.6.06. Partial displacement of the solute

PARD Key1, Key2, Data

Prerequisites: SLTA, SUUC
Can not be followed by: FREE
Potential successors: TORD, SPST
Prerequisite of: SWAP, MV2S
Related key: SWAP.

Key1 (rotation stepsize strategy):

UNIF - Rotation stepsize is kept constant
EXBI - Rotation stepsize is scaled inversely with the maximum extension of the molecule in the plane perpendicular to the rotation axis.
EXBQ - Rotation stepsize is scaled inversely with the square root of the maximum extension
EXBA - Rotation stepsize is scaled inversely with the square root of the root mean square extension
Key2 (molecule selection strategy):
- RAND - Molecules are selected randomly (uniformly) for move
- CYCL - Cyclic selection - increasing order
- SHCY - Shuffled cyclic procedure of Mezei is used.
- CYCI - Cyclic selection - decreasing order
  Data: wsltpdis, cedpsltx, rtxpsltx, wrotx, cedpslty, rtxpslty, wroty, cedpsltz, rtxpsltz, wrotz, nsltpardis
  - wsltpdis - Relative frequency of partial solute displacements
  - cedpslt* - Displacement size parameters corresponding to the x,y, and z directions, resp.
  - rtxpslt* - For Key1=UNIF: Rotation size parameters corresponding to the x,y, and z directions, resp.
  - rtxpslt* - For Key1=EXBI: Rotation size parameter will be calculated as rotmax=rtxpslt* / extmax where extmax is the maximum extension of the molecule to be rotated in the plane perpendicular to the rotation axis.
  - rtxpslt* - For Key1=EXBQ: Rotation size parameter will be calculated as rotmax=rtxpslt* / (2.0 * sqrt(extmax))
  - wrot* - Weight for selecting an axis for rotation.
  - nsltpardis - When >0, the molecule(s) containing the first nsltpardis atoms after the free energy molecule (if any) will be subjected to independent moves (translation and rotation). When negative, the molecules containing the last -nsltpardis atoms will be moved (This allows their treatment in subsequent analysis as solvents). Note, that nsltpardis refers to the explicite solute atom list, and not the full list of solute atoms after cloning (if any).

III.6.07. Solute molecule swap

SWAP Data

Prerequisite: PARD

Data: wswap

The relative frequency of attempting to swap two randomly selected solute molecules

III.6.08. Correlated 2-solute molecule move

MV2S Data

Prerequisite: PARD

Data: wmv2s, r2scut, rtxcslt, iaxis2s

Key1 (stepsize selection strategy): See Key1 of key PART

wmv2s - The relative frequency of attempting to make a correlated move of two solute molecules
r2scut - The distance threshold for selection of the solute molecule pairs
rtxcslt - The solute rotation stepsize (see key PARD)
iaxis2s - The axis of correlated rotation

III.6.09. Partial torsion of the solute

PART Key1, Key2, Data

Prerequisites: SLTA, SUUC
Can not be followed by: SVVC
Prerequisite of: TAUC
Related keys: LOOP, TORD.

Key1 (stepsize selection strategy):

UNIF - Rotation stepsize is kept constant
EXBI - Rotation stepsize is scaled inversely with the maximum extension of the part of the molecule that is affected by this torsion, calculated from the line of the torsion bond
EXBQ - Rotation stepsize is scaled inversely with the square root of the maximum extension
EXBA - Rotation stepsize is scaled inversely with the square root of the root mean square extension
Key2 (strategy for selecting the solute molecule for torsion):
- RAND - Molecules are selected randomly (uniformly) for move
- CYCL - Cyclic selection - increasing order.
- SHCY - Shuffled cyclic procedure of Mezei is used for selecting the molecule for torsion change.
- CYCI - Cyclic selection - decreasing order.
  Key3 (strategy for selecting the torsion on a solute molecule):
  - RAND - Torison groups are selected randomly (uniformly) for change
  - CYCL - Cyclic selection - increasing order.
  - CYCI - Cyclic selection - decreasing order.
    Data: wsltptor
    - wsltptor - Relative frequency of partial solute torsion

III.6.10. Alternating translation and rotation

MVRT Key1

Potential successor: STUN
Prerequisite of: STUN

Key1 (solute and/or solvent moves)

BOTH - Both the solute and the sovent will be translated and rotated simulatneously
SVAL - Translations and rotations will alternate for the solvent
STAL - Translations and rotations will alternate for the solute
TVAL - Translations and rotations will alternate for both the solute and the solvent
Data: wsltdisp, wsltrot, wslvdisp, wslvrot
- wsltdisp, wsltrot - Relative frequences of solute translatons and rotations
- wslvdisp, wslvrot - Relative frequences of solvent translatons and rotations

III.6.11. Insertion/deletion preferential sampling

PRFI Key1 Key2 [Data] [Fdata]

Prerequisites: GCEN, MOVE

The syntax is the same as for the preferential sampling for the solvent moves. Thus, except for not allowing RAND, CYCL, SHCY, CYCI for Key1, the input is the same as for the key MOVE.

III.6.12. Random number seed initialization

SEED Data

Data: ix1, ix2, ix3, ix4, ixscr

ix1, ix2, ix3, ix4 - the new random number seed will be ix1*2^48+ix2*2^32+ix3*2^16+ix4
ixscr - the new scrambler seed will be ixscr

The random number generator will be reinitialized with these seeds.

III.6.13. Non-Boltzmann solute samplings

NONB Key1 Data

Key1 (sampling type):

SCTR - sampling will use reduced solute torsion terms
SCRP - sampling will use reduced solute-solvent repulsion term
SCNB - sampling will use reduced intra-solute non-bonded terms
SCTN - sampling will use reduced torsion and intra-solute non-bonded terms
TMOD - solute energy will be scaled with a different temperature
TSAL - sampling follows Tsallis statistics (REF.) for the solute energy
Data: param(s), ew0, signincfac
- param(s) - the extent of non-Boltzmann sampling:
  - For Key1=SCTR, the scale factor (< 1) to apply on the torsion energy.
  - For Key1=SCRP, the scale factor (< 1) to apply on the solute-solvent repulsion.
  - For Key1=SCNB, the scale factor (< 1) to apply on the intra solute non-bonded term.
  - For Key1=SCTN, the scale factor (< 1) to apply on the torsion energy, followed by the scale factor (< 1) to apply on the intra solute non-bonded term.
  - For Key1=TMOD, the temperature for the solute energy (> T (see key TEMP))
  - For Key1=TSAL, the q parameter
- ew0 {value at start} - the value to subtract from energy factor of the weight factor's exponent
- signincfac {1.1} - for Key1=TSAL only: Add to the exponential argument -signincfac times its initial value if it turns out to be negative (to avoid taking the log of a negative number)

III.6.14. Temporary stepsize scaling

STSC Key1 Data

Key1 (move type to scale):

SVTR - Scale solvent translation stepsize
SVRO - Scale solvent rotation stepsize
STTR - Scale whole solute translation stepsize
STRO - Scale whole solute rotation stepsize
SMTR - Scale solute molecule translation stepsizes
SMRO - Scale solute molecule rotation stepsizes
NTOR - Scale normal torsion stepsizes
LTOR - Scale loop torsion stepsizes
Data: scalefac
- scalefac {1.0} - the factor to use for this type move

The scaled stepsizes will only be used for this run - the next calculation (initiated by a new RUNS key) will use the originally inputted values.

III.06.15. Loop torsion moves

LOOP Key1, Key2, [ Data]

Potential successor: TORD
Prerequisite of: LPST

Variable to set to 'T' in the preprocessor: LO

Related key: IGJA

When this key is present the program will analyze the torsion angle list and determine which torsions can be the first torsion of a loop move. Loop moves change six consecutive torsions in such a way that the bond of the first and last torsion remain fixed, thereby limiting the change in the molecular geometry due to the change in the torsionangle(s).

The formulae giving the loop-closing torsion angles were obtained based on a distance geometry approach. The current version is limited to loops where the torsions are on consecutive bonds (-------), or on consecutive bonds broken by a rigid bond (-=-----, --=----, ---=---, ----=--, -----=-) or on a peptide backbone with rigid peptide bonds (-[=--]₃, -[-=-]₃).
Key1 (solution selection algorithm):

PROX - Search only in the vicinity of the reference conformation for the nearest solution.
SCAN - Scan the whole solution space to obtain all solutions and select the nearest. Currently it is implemented only for consecutive torsions.
JACW - Scan the whole solution space to obtain all solutions and select one with Jacobian weight. Currently it is implemented only for consecutive torsions.
Key2 (iteration direction(s))
- FORW - Loop-closing iteration is in the 'forward' direction only
- BCKW - Loop-closing iteration is in the 'backward' direction only
- TRNG - Loop-closing iteration direction is chosen based on the closeness of the triangle inequlities
- BOTH - Loop closing iterations are run in both direction and the solution with better precision is chosen
  Data: wloop, d3grid, d3tol, dijtol, d24tol, drevtol, dextol, dihpmtol
  - wloop {1.0} - Probability of selecting a loop torsion move whenever possible (instead of a simple rotation)
  - d3grid {0.1} - r3 range will be searched by d3grid increments (or smaller when a root appears near).
  - d3tol {0.000001} - Convergence tolerance of the distance between putative r3 positions in the loop-closing algorithm
  - dijtol {0.01} - Tolerance of the bond length squares for the no-torsion debug option
  - d24tol {0.1} - Tolerance of (r2-r4)^2 in the loop-closing algorithm
  - drevtol {0.2} - Distance square tolerance for the check of the reverse proximity criterion - only used for Key2=PROX
  - dextol {0.2} - Tolerance of the filtering distance squares
  - dihpmtol {5.0} - Frozen bond torsion angle signs will be conserved for angles that deviate from planarity by less than dihpmtol degrees.

III.06.16. Ignore Jacobian

IGJA

Related key: LOOP,

When this key is present, the acceptance probability of a local move (see key LOOP) will ignore the Jacobian. This will allow skipping the calculation of the loop-closure problem for the reverse move and thereby speeding up the calculation. The speedup, however, comes at the expense of the loss of Boltzmann sampling and thus this option is to be used for search or minimization runs.

III.06.17. Skewed Torsion sampling

SKWT Key1 Data

Related key: TORD,

Key1 (type of skewing)

DRIV - for selected bonds, torsion attempts will be made wskew times more frequently in the target direction than away from it.
LOOP - for loops involving selected bonds, the loop solution closest to the target angles will be chosen wskew times more likely than a randomly chosen solution.
DRLP - a combination of both options described by the keys DRIV and LOOP
PROX - for loops involving selected bonds, the loop move will be rejected with 1-1/wskew probability whenever the Rloop torsions moved away from their target (in the least-square sense).
DRPR - a combination of both options described by the keys DRIV and PROX
DLPR - a combination of the options described by the keys DRIV, LOOP, and PROX .
Data: wskew, wskewstepmin, resumefac
- wskew - the weight used for skewing the sampling for the torsions selected for skewed sampling.
- wskewstepmin {1.0) - for stepsize tuning runs, the skewed sampling is suspended whenever the stepsize falls below wskewstepmin degrees.
- resumefac {1.2} - for stepsize tuning runs, the skewed sampling is resumed whenever the stepsize is raised above wskewstepmin*resumefac degrees.

When this key is present then the formatted data after the key TORD will specify a target angle for some of the torsions.

III.06.18. Non-intractive solvent

IGSV

When this key is present then solvent-solvent interactions will be ignored.

III.06.19. Random number generator information

RNDG Key1 Data

Key1 (generator type)

MMC - MMC RNG: 64-bit congruantial RNG; scrambled in 64 number segme
LCG - Linear congruential generator
MRTW - Mersenne twister generator
INPT - A list of random numbers is read from a file; before reusing they are scrambled
INNS - A list of random numbers is read from a file; they are reused unchanged when the list is exhausted
OUTP - A list of random numbers generated by MMC RNG is written to a the run will stop after writing the file

Data for Key1=LCG : lcg_fac, lcg_add, lcg_exp, seed

lcg_fac {314159269} - factor C (see below)
lcg_add {453806245} - additive constant A (see below); default values from Forsythe.
seed {1357} - First random number
lcg_exp {31} - modulus exponent E (see below)

Data for Key1=MRTW: seed

seed {4357} - random number seed

Data for Key1=INPT or Key1=OUTP: rndgfile, nrand

rndgfile - name of the file the random numbers are read from or written to (Key1 = INPT or OUTP)
nrand - number of random numbers to write (Key1 = OUTP)

III.7. Calculation flow keywords

RUNS: Run MC simulation
SCAN: Scan trajectory for proximity analysis
PXAN: Proximity analysis frequencies
RCKP: Checkpoint file read
STOP: Stop the run

III.07.01. Execution of a MC simulation

RUNS Data

Prerequisites: SLTA, CNFG, TEMP, FILE, STEP

Data: nmcmax, nmcrep, nplt, nmcrec, ncntin, nmcadp

nmcmax - Number of MC steps to run.
nmcrep {nmcmax/100} - After every nmcrep steps certain useful intermediate output, e.g., energy, energy-minimum, molecule moved, fraction of accepted steps, solute binding energy is printed.

After reading a checkpoint file (see key RCKP) the rest of the data is ignored.

nplt - After every nplt steps, various statistics and distribution functions, such as the radial distribution functions g(R), binding energy distributions QCBE, coordination number distributions QCCN, pair-energy distributions QCPE, etc. are printed and are optionally stacked on the .dst plot file see DSTC **** FILE) (for later analysis). nplt must be an integer multiple of nmcrep. If it is not, the program will automatically change it to the nearest multiple.
nmcrec {2*nmolec} - Frequency of saving a full configuration on the .hstfile (see key TRAJ)
ncntin {greater of 10000 and nmcmax/100} - Length of the interval for control-function (block average) calculations. Some of the control functions thus obtained will be plotted on the printer at the end of the output. (Recommended value: 100000.)
nmcadp {100000} - Frequency of adaptive umbrella sampling weight updates. nmcadp must be an integer multiple of nrecd (see also key CHKP. If it is not, the program will automatically change it to the nearest multiple.

III.07.02. Proximity analysis from history

SCAN [Key1] Data

Prerequisite: SLTA

Key1 (source of data):

TRAJ - analyze a trajectory
TRNC - analyze a trajectory that is the continuation of the previous trajectory (but with different runnumber), i.e., don't initalize the analysis accumulators. This key assumes that a previous SCAN TRAJ was already executed in the same run. TRAJ ALLV.
CONF - analyze the current configuration

For Key1=TRAJ or TRNC:
Data: nmcpxmax, navgpx, nranpx, nmcpxdsc, npxres, npxcntin, lumppr, nsymdim, ifrsdim

nmcpxmax - The program will scan the history file until the nmcpxmax-th configuration and perform proximity analysis. If no CNFG command reading in a configuration preceded SCAN, the initial coordinates will be gathered from the history file.
navgpx - frequency of selecting a structure for the proximity analysis contributions
nranpx {0} - number of random points to generate (for volume-element estimation) after the analysis of each configuration
nmcpxdsc {0} - Analysis to be started after the first nmcpxdsc-th configuration
npxres {npxmax} - Proximity analysis results printed after each npxres-th configuration
npxcntin {ncntin} - Block size (number of configurations to average) for proximity analysis error estimates
lumppr {2} - Printed error estimates are based on 2^lumppr*npxctn long blocks
nsymdim {3} - For each random point symmetry-related points will be generated in nsymdim dimension
ifrsdim {1} - Start symmetry generation at the ifrsdim-th dimension

For Key1=CONF:
Data: nranpx, lumppr, nsymdim, ifrsdim - see above.

Structures in history files in CHARMM (TRAJ CHRM) or AMBER (TRAJ AMBR) trajectory syntax are labeled 1,2,... while structures in all other syntaxes are labeled by the MC stepnumber they were saved at.

III.07.03. Proximity analysis frequency parameters (concurrent to MC run)

PXAN Data

Can not be followed by: PXGR, PXDP, PXBE, PXWW, TAND, ATFR, FLDG, RTIM, DIFC, VORO, CHKP
Related keys: PXLM, PCPA, PXYZ.

Data: navgpx, nranpx, nmcpxdsc, npxres, npxcntin, lumppr, nsymdim, ifrsdim

These variables are the same as defined for the keyword SCAN.

Using this keyword the proximity analysis will be performed concurrently to the MC simulation (eliminating the need of saving the history file).

III.07.04. Continue from a checkpoint file

RCKP [Key1 Key2 Data]

Prerequisite: FILE
Can not be followed by: TITL, PBCN, SUPT, SVPT, TRAJ, INCT, MIXR, SUVC, SVVC, SUUC, NSLV, MOVE, FREE, SLTA, SLVA, CNFG
Potential successor: VOLE
Related keys: CHKP, WCKP, RMCK, SCKP.

The program will read the content of the current checkpoint file, .ckp and restore from there the state of the variables, including the file unit numbers and the run number numrun (!), allowing the continuation of an earlier run as if it never stopped. If you want to reinput some of the data used by the run the run may abort as certain inputs are forbidden in different context. In this case either use the RUNS key with zero steps to run or the FILE key. Use with caution, though!

Key1 (checkpoint file checking procedure):

STPD - Stop whenever the self test discovers a discrepancy.
IGND - Ignore whenever the self test discovers a discrepancy.
FIXD - Attempt to fix discrepancies, but stop if fails to.
FIXD - Attempt to fix discrepancies, but don't stop if fails to.
Key2 (trajectory checking procedure):
- NOFX - Abort if trajectory file is damaged
- FXTR - Run with damaged trajectory file.
  Data: numrunr, jobnamer
  - numrunr {numrun}, jobnamer {jobname} - The program will look for the file <jobnamer>.<numrunr>.ckp to restore the data from. If the filename is different from the default checkpoint file name, the program will open a new checkpoint file name with the default name, i.e., <jobname>.<numrun>.ckp and save subsequent checkpoint data there.

If the checkpoint file was found too short, the program will assume that the last configuration was still properly read (since that is the first item in the file) and write it on a new file <jobname>.99.crd in binary form (to be read with CNFG READ BNRY NOFX 99).

III.07.05. Termination of the calculation

STOP [Key1, Key2, Data]

Key1 (possible exit self test request):

SLFT - Perform a self-test before stopping.
Key2 (Same as Key1 for SLFT see details there):
- BASC - Basic test
- FULL - Full test (This is the default here)
- ALL - FULL test plus BASC test on selected molecules:
  Data for Key2=ALL : nmolck
  - nmolck {nmolec} - BASC tests are performed on molecules 1 - nmolck
NOSF - Don't perform a self-test before stopping.

III.8. Output related keywords

OUTP: Print various system charateristics
PRNT: Input echo level
DSTP: Bulk solvent distribution print
PRAC: Acceptance rate print detail
BNDL: Listing of bonds found
PRCO: Compilation option listing
PLBP: Potential parameter echo
KMNP: PDB file property threshold
EPLT: Switch between C_v and U_tot in convergence plot
PRPL: Print plotted data
NMVP: 'Not-moved' threshold to print solvents
PXPA: Empty proximity region print
PXPR: Proximity analysis printing options
PDBT: PDB atomname convention
PLCV: Postscript convergence plots

III.08.01. Print various system characteristics

OUTP Key1

Key1 (type of output):

REPT - The program prints a full description of the system (normally printed before each simulation run).
PRAT - The program prints a list of atomic constants stored (atomic weights, radii, valences). The vdw radii stored can be used for cavity search (see keys GCEN and FREE WIDO) or solute size definition (see keys FILT and PBCN PHS ).
PRPB - The program prints the centers of the periodic cells.
PRFC - The program prints the list of functional groups.
PROP - The program prints the internal option and debug option arrays. If PRNT ECHO was present, for each option number that is defined by a main key MMC lists these keys with their option number.

III.08.02. Input echo level selection

PRNT Key1

Key1 (level of echoing input):

NECH - The lines with unformatted input will only be echoed
ECHO - Each individual unformatted datum will be echoed, use of default values will be stated (note, that in some instances these inital defaults may be overridden), opening and closing of files will be announced, extra information about the cavity grid will be printed.
DETL - Besides the output described with Key1=ECHO, the formatted input will be also echoed labeled with the corresponding record type number.

III.08.03. Bulk solvent distribution function print level specification

DSTP Key1

Related key: DSTC.

Key1:

NONE - Do not print distribution functions at all.
GRKC - Print only the overall solvent-solvent and solute-solvent g(R), coordination number distributions. When torsions are active and cloned, print the individual torsion angle distributions. The functions are printed in a compact form.
ALL - Print output for all the properties, including g(R) and coordination number distribution. The functions are printed in a compact form.
GKCC - same as GRKC but the functions are printed in a longer format to allow easy plotting
ALLC - same as ALL but the functions are printed in a longer format to allow easy plotting

III.08.04. Acceptance rate listing detail

PRAC Key1[,Key2]

Key1:

SUMR - Only the mean,minimum and maximum acceptance rates are printed.
INDI - Individual acceptance rates are also printed (wherever collected)
NTRY - Individual number of trials (or rate of trials) are also printed.
Key2:
- USED - The periodic output prints the acceptance rate for the last steps movetype
- ALLA - The periodic output prints the acceptance rates for all types of moves

III.08.05. Bond list print

BNDL [Key1 Data]

Prerequisite: SLTA

Key1:

DIST - Print only the neighbor list and bond lengths
ANGL - Print also the bond angles
TORS - Print also the torsion angles
ANRS - Print also both the bond and the torsion angles
CNCT - Print PDB-style CONECT records for the bonds found
Data: nf, nl
- nf {1} - Start neighbour list at atom nf
- nl {nslt}- End neighbour list at atom nl

Note: atoms with one neighbor will not be listed separately, only as neighbors.

III.08.06. Compilation option print

PRCO [Key1 Data]

Key1:

NOSA - don't create a preprocessor
SAVE - create a preprocessor reproducing the compilation
SVMN - create a preprocessor with minimal size, tailored to the currently running system
Data for Key1=SAVE: numrunr
- numrunr {numrun} - the run number of the checkpoint file to use. If numrunr=0, then the currently running program options will be printed.

The program prints all compilation options and array sizes as well as version dates used either by the currently running ptogram or the program that created the checkpoint file with jobname.numrunr.ckp. If requested by Key1, a file pre_jobname.f will be generated that can be inserted into the preprocessor pre.f to compile the program the same way. Furthermore, if the file pre.f is present in the current directory then the file pre_jobname.f will contain the full source code of the modified preprocessor.

III.08.07. Potential parameter listing

PLBP Key1 [Key2]

Key1:

USED - Print the parameters in the potential libraries used. If the solute was not read yet, the list will be printed with the run description
ALL - Print the parameters of all potential libraries
COMU - Like Key1=USED, but for general solvent, the combined solute-solvent parameters will also be listed.
COMU - Like Key1=ALL , but for general solvent, the combined solute-solvent parameters will also be listed.
SNGL - Print the parameters of a selected single library
- PPPP - the potential code that can be specified with the SUPT key

III.08.8. Coordination number threshold for property output

KMNP [Data]

Data: rkpdbmin

rkpdbmin {0.0} - When writing a PDB file with the WCNF PDB* command, properties (other then first and second shell coordination numbers and first shell volume) belonging to solute atoms whose primary coordination number is below rkpdbmin are set to zero.

III.08.9. Switch between C_v and U_tot in the convergence plot

EPLT Key1

Key1:

UICV - Energy block averages and C_v are plotted
UIUT - Energy block averages and cumulative total energy are plotted.

III.08.10. Listing of function values for plots

PRPL Key1

Key1:

NONE - Draw the plots only on the output file
CPLD - List the plotted Umbrella Sampling distribution values.
ENRG - List the plotted energy and heat-capacity values.
CONV - List the data plotted with the PLCV key
ALL - List all plotted data

III.08.11. 'Not-moved' threshold

NMVP Data

Data: notmoved

notmoved {2} - Solvents with acceptance rate < notmoved percent and their distance from the nearest solute atom and their energy with the solute will be listed when a) the run is longer than 200*nmolec steps and b) more than 10 attempts were made to move that solvent. However, for PRNT DETL the list will be produced anyway. The list is sorted by the acceptance rate.

III.08.12. Empty proximity region print

PXPA Key1

Key1:

ALL - All proximity regions will be printed
NOEM - Proximity regions not containing solvent will not be printed (neither to the standard output nor to the .pxp file).

III.08.13. Proximity analysis print levels

PXPR [Key1 Key2 Key3 Key4]

Key1 (distribution function print level):

DSTR - Print all calculated distribution functions
NDST - Don't print the calculated distribution functions
NDSF - Don't even print the calculated forces on the solute atoms
Key2 (volume element function print level):
- NVLE - Don't print the radial volume element estimates
- VOLE - Print the radial volume element estimates
  Key3 (functional group analysis):
  - FULT - Print the full functional group analysis table before the condensed list
  - NFLT - Don't print the full functional group analysis table before the condensed list
  - NFGT - Don't print functional group analysis table at all
  - FFGT - Print (only) the full functional group analysis table
    Key4 (parameter echo):
    - NECH - Don't echo the run parameters before each proximity analysis table
    - FECH - Echo the run parameters before each proximity analysis table

The proximity analysis results consist of two parts: distribution functions (and their characteristics) and tables of calculated avareges. The distribution printing is governed by Key1 and Key 2. The program can print the the averages arranged in three different tables:

In the original order, further averaged by residues
Ordered by functional groups (chemical or user defined - see the key FCGD)
Condensed in the same way as the RDF's are (as governed by indxrdf - see key SLTA). Key3 governs which of these tables will be printed.

III.08.14. PDB atomname convention

PDBT Key1

Key1 (name conversion):

INPT - no conversion
REGU - convert to PDB convention
LEFT - left adjust atomnames

When printing a structure in PDB format (e.g., keys WCNF, GENS, FILT) this option allows the choice of PDB conventions.

III.08.15. Postscript convergence plots

PLCV Key1, Data

Key1 (Property):

ENRG - energy
VOLU - volume (after a run with IBEN)
NUMB - Number of molecules (after a run with GCEN)
BPAR - GCE B parameter (after a run with BTUN PICL)
FETI - Free energy (after TIQU) and the polynomial fitting to the TI integrand
Data: xmn, xdv, nx, ymn, ydv, ny, linwid
- xmn {0} - the beginning of the X-axis
- nx {10} - the number of ticks on the X axis
- xdv {Nmc(max)/10} - the distance between ticks on the X axis
  Note: Default values will be used for all when nx=0
- ymn {Y(min)} - the beginning of the Y-axis
- ny {10} - the number of ticks on the Y axis
- ydv {(Y(max)-Y(min))/10} - the distance between ticks on the Y axis
  Note: Default values will be used for all when ny=0
- linwid {1} - the plot line width

This key can be used to generate a Postscript plot of the block averages and cumulative averages of the energy, volume, number of molecules or the free energy calculated with thermodynamic integration. The name of the plot file will be of the form <jobname>.<ext>.<runnumber<.ps where <ext> is eng, vol, nml, bpr, fen for Key1= ENGR, VOLU, NUMB, BPAR, FETI, resp. If the .ps file already is present, a new file will be opened vith an added (or incremented) version number.

III.9. General analysis keywords

DSTC: Bulk solvent distribution calculation
FCGA: Functional group analysis
ATFR: Calculation of mean force on solute atom
FLDG: Field gradient calculation on solvents
SENS: Sensitivity analysis
ENGL: Energy contribution calculation
FILT: Solvent filtering
HBBR: Calculate hydrogen-bond bridges
CMPC: Configuration comparison
DENF: Combined solvent conformation
PRTG: Print cavity grids
STVG: Estimate solute and cavity volumes with multiple grids
CPOC: Calculate cavity/pocket occupancie
GENS: Calculate generic solvent sites
IWSL: Prepare site assignment file, entropy input
GSAN: Generic site analysis choice
GSAO: Generic site analysis options
GABF: Generic site graph analysis by frame
GENT: Configuration with input torsion angles
TAND: Calculate torsion angle distribution
TORT: Energy analysis along a torsion
LPST: Energy analysis of backbone loop solutions
IDAG: Aggregate insertion and deletion sites
SUPI: Trajectory superimposition
SVDP: Solvent dipole distribution
MOND: Bulk density monitoring
BLKW: Bulk density monitoring

III.09.01. Bulk solvent distribution function calculation level specification

DSTC Key1 [Key2]

Related key: DSTP.

Key1 (level of distribution function calculations):

NONE - Do not calculate any bulk solvent distribution function (to save time, particularly in vector mode).
SOME - The pair properties, such as the pair energy distribution and dipole-dipole correlation functions are not calculated.
ALL - All distribution function calculations are done.
Key2 (creation of plot file):
- NOFI - Nothing is written on the .dst file.
- FILE - g(r)s, QCDFs are written on the .dst file for plot.
  Fdata:
  - RECTYPEs 32-33 are read only for DSTC SOME or DSTC ALL.
  - RECTYPE 32: rdmx, ri, rfsslv, empslv, egrslv (5F15.5)
    - rdmx {ct, the solvent cutoff} - Maximum range in � for the COM g(R) for the solute an the solvent. Its default (and maximum) value is the same as that of CUTOF. Beyond this distance the g(R) computation would have to include more than one image of a molecule, leading to both the inclusion of unwanted correlations and additional computational complications.
    - ri {0.05} - Grid interval in � for g(R) computation. In the program, g(R) is computed in tabular form at each grid interval starting from 0 to rdmx.
    - rfsslv {3.3} - First solvation shell radius (in �) of the solvent molecule. This quantity is used in conjunction with coord. no. calculations. All molecules which lie inside a sphere of radius rfsslv (centered at the COM of a given solvent molecule) are said to be coordinated to that solvent molecule. It is usually set at the location of the first minimum of the g(R).
    - empslv - The midpoint of the solvent binding energy distribution (in kcal/mol) for the quasi-component of the binding energy calculation (QCBE).
    - egrslv {0.25} - Grid interval in kcal/mol for the above QCBE calculation. The QCBE is again tabulated for 100 gridpoints starting from (-50*egrslv+empslv) to (50*egrslv+empslv).
  - RECTYPE 33: rssmin, rssmax, rfsslt, empslt, egrslt, ksltmn (5F15.10,I5)
    - rssmin {rfsslt}, rssmax {rdmx} - The solvent-solvent distributions (g(R), coordination number distributions, near-neighbor pair energy and orientational correlation distributions; solute-solvent pair energy distributions and dipole correlation functions are computed only for solvent molecules in the concentric shell around the solute between rssmin and rssmax.
    - rfsslt {5.0} - First solvation shell radius in � for the solute. Used in conjunction with the coordination number calculation.
    - empslt - Midpoint (in kcal/mol) for the QCBE of solute.
    - egrslt {0.5} - Grid interval (in kcal/mol) for the QCBE of solute.
    - ksltmn {0} - Solute coordination number will be collected from k=ksltmn.
  - RECTYPE 34: (only for DSTC ALL) dmnslt, dmxslt, dmnslv, dmxslv (4F15.10)
    - dmnslt {0}, dmxslt {rfsslt} - solute-solvent pair energy distributions and solute-solvent dipole correlation functions are computed only for solvent molecules in the concentric shell around the solute between dmnslt and dmxslt.
    - dmnslv {0}, dmxslv {rfsslv} - near-neighbour pair energy and solvent-solvent dipole correlation functions are computed only for solvent-solvent distances between dmnslv and dmxslv.
  - RECTYPE 35: (only for DSTC ALL) emnslt,egpslt,emnslv,egpslv (4F15.10)
    - emnslt - Minimum of the solute-solvent pair-energy distribution scale (in kcal/mol).
    - egpslt {0.5} - Grid interval (in kcal/mol) for the above pair-energy distribution.
    - emnslv - Minimum of the solvent-solvent pair-energy distribution scale (in kcal/mol).
    - egpslv {0.25} - Grid interval (in kcal/mol) for the above pair-energy distribution. The pair-energy distributions are very helpful to locate any surprise (overlooked) minimum in the potential function.

III.09.02. Functional-group analysis of the solute.

FCGA [Key1]

Prerequisite: SLTA
Potential successor: FCGD

Key1 (specifies groups to be printed):

ALL - All functional groups will be listed

OERR - Only the ???? and VERR functional groups will be listed. Atoms in these groups are likely to have missing or superfluous bonds.

The solute is broken up into functional groups and a list of bonded atoms is given (based on atom-atom distances). It is also useful for the detection of errors in the connectivity generated by the program. The errors found can be corrected with the keys MAKB or BRKB

III.09.03. Calculation of average force on the solute atoms

ATFR

Potential successor: PXAN

Also requires the PXAN or the SCAN key.

III.09.04. Calculation of average field gradient at the solvents

FLDG Key1, Key2, Data

Prerequisite: PXCR
Potential successor: PXAN

Variable to set to 'T' in the preprocessor: FG

Key1 (specifies the type of file to write the gradients):

NONE - Turn of field gradient calculation
BNRY - Binary file, with all contributions
ASCI - Ascii file, with all contributions
BNTO - Binary file, only the total field gradients
ASTO - Ascii file, only the total field gradients
Key2 (do or don't do solvent sites)
- ALLG - Calculate field gradients at solute and solvent sites
- NOSV - Don't calculate field gradients at the solvent sites
  Data: fgruucut, fgruvcut, fgrvvcut, rproxmax
  - fgruucut - solute-solute cutoff
  - fgruvcut - solute-solvetn cutoff
  - fgrvvcut - solvent-solvent cutoff
  - rproxmax - field gradients will be calculated only at solvents nearer than rproxmax to their proximal solute atom

Only works for general solvent. Also requires the PXAN or the SCAN key.

III.09.05. Sensitivity coefficent calculation

SENS Data

Data: logfreq

logfreq - The sensitivity coefficient contributions w.r.t. the LJ potential parameters and the solute site charges will be calculated at every 10^logfreq step. Valid only for three-site water solvents.

III.09.06. Calculate energy contributions

ENGL Key1 Key2 [Data]

Prerequisites: SLTA, SUUC

Key1 (source):

AVRG - Print only the averages over a trajectory
INDI - Print results for each frame of a trajectory
Key2 (sum exclusion):
- ALLI - Include all interactions
- NOGI - Exclude interactions within each solute group/residue
- NOMI - Exclude interactions within each solute molecule
  Key3 (limit type):
  - ARNG - Interaction range specification refers to atom numbers
  - GRNG - Interaction range specification refers to group (residue) numbers.
    Data: iafrst, ialast, iafrst2, ialast2, ctonnb
    - iafrst {1}, ialast {nslt} - Restrict analysis to solute atoms/groups from iafrst to ialast
    - iafrst2 {1}, ialast2 {nslt} - Restrict interactions of the atoms/groups selected for analysis with atoms/groups from iafrst2 to ialast2.
    - ctonnb {cutslt} - switching function cutoff will be used from ctonnb to cutslt (read with key SUVC)

For a configuration analyzed, the program prints

ia, ig,E(es), E(d), E(r), E14(es), E14(d), E14(r) (2i10,3e12.5,3e12.5)
ia, Esw(es), Esw(d), Esw(r)(i10,3e12.5)
ia, ER(es), ER14(d), ERsw(r), sigma, Nnb, N14, Nw (i10,3e12.5,3i6)

where E(*) refers to different terms of energy between solute atom ia and the rest with (es), (d) and (r) indicating electrostatic (1/r), dispersion (1/r⁶) and repulsion (1/r¹²) terms, respectively; 14 refers to 1-4 interactions; w refers to solute-water interactions; R refers to the sensitivity coefficient of the energy with respect to the Lennard-Jones sigma of solute atom ia; Nnb, N14, Nw are the number of non-bonded, 1-4 and solute-water terms contributing, respectively.

The calculated contributions are also calculated for each group (residue) as defined in the .slt file.

For trajectory analysis,a the calculated properties are averaged over the trajectory; the standard deviations are also calculated.

Also requires the PXAN or the SCAN key.

III.09.07. Solute and/or solvent filtering

FILT Key1 Key2 Key3 Key4 Key5 Data :

Prerequisite: CNFG or TRAJ

Key1 (basis of filtering):

SOLV - Filter solvents by solvation state (inside, invagination or shell)
GEOM - Filter solvents by position, system size and coupling parameter
NMBR - Filter out solute and/or solvent segments
ENRG - Filter solvents by solute-solvent energy
FRAM - Filter frames from a trajectory
Key2 (target of filtering), skip when Key1=FRAM :
- CONF - Filter the current configuration
- TRAJ - Filter a trajectory
- TRAC - Filter a trajectory, center each frame
- TRAO - Filter a trajectory, and write the file of original solvent numbers
- TRCO - Filter a trajectory, center each frame and write the file of original solvent numbers
  Note: there is some (optional) additional data input described at the end of the section.
  Key3 (format of filetered result):
  - For Key2=CONF: see Key2 of CNFG
  - For Key2=TRAJ or Key1=FRAM: see Key1 of TRAJ
    Note: each data input can be followed by the file version increment, incvers

For Key1=SOLV:

Key4

INSD - Discard all solvents that are not 'inside' the solute, based on solvent neighborhood analysis.
INSV - Discard all solvents that are not 'inside' the solute or in invaginations of the solute, based on solvent neighborhood analysis.
INVG - Discard all solvents that are not in invaginations of the solute, based on solvent neighborhood analysis.
CRCV - Discard solvent based on the circular variance of the vectors from the solvent to its neighboring solute atoms, as specified by the data items rslv and cvlim.
CRCH - Same as CRCV above, except the circular variance is calculated using the solute heavy atoms only.
CVLS - Similar to CRCV above, but a specified number (leavemol) of innermost (cvlim > 0) or outermost (cvlim < 0) solvents
SHL1 - Discard all solvents that are outside the first solvent shell.
SHL2 - Discard all solvents that are outside the second solvent shell.
SHL3 - Discard all solvents that are outside the third solvent shell.
NCLS - Keep a specified number (leavemol) solvents that are nearest to the solute.
Key5: Like Key2 of GCEN (Sec. III.2.5) to specify the solute radius source (i.e., RVDW, RSIG, RSGH) to be used for determining solvent connectivity.
For Key4 = INSD, INSV, or INVG:
- rslv {1.4} - Solvent radius (in �) to be used for determining solvent connectivity.
- nnoffset {0} - when filtering for solvents in invaginations, solvent will be considered to be in an invagination if the number of its solvent neighbors is not less than the number of its solute atom neighbors minus nnoffset and is part of an 'outside' cluster. Thus increasing nnoffset will retain solvents that are deeper in invaginations.
- minmem {100} - When > 0, all solvent clusters having at least minmem members will be considered 'outside' (instead of the largest one).
When Key1=SOLV, the program determines connected solvent clusters and considers either the largest or all clusters having at least minmem members to be 'outside' solvent and the rest to be 'inside' (unless the nearest solute atom to the inside candidate solvent is outside the first two solvent layer). Two solvents are connected if their COM's are within 2*rslv. A solute atom is neighbor to a solvent if the solvent's COM is within R_VdW+rslv to that solute atom.
For Key4 = NCLS:
- leavemol - The number of solvent molecules nearest to the solute to be kept
- nafltrange {0} - the solute atoms to solvate will be restricted to nafltrange ranges
- iafltrange1, iafltrange2 (nafltrange times) - the first and last solute atoms of each range to solvate
For Key4 = SHL1, SHL2, or SHL3:
- rslv {1.4} - Solvent radius (in �) to define a shell (1st shell width: rslt+rslv; 2nd shell width: 2*rslv)
- nafltrange {0} - the solute atoms to solvate will be restricted to nafltrange ranges
- iafltrange1, iafltrange2 (nafltrange times) - the first and last solute atoms of each range to solvate
For Key4=CRCV or CRCH
- rcrcv {10.0} - Circular variance is calculated with solute atoms within rcrcv + R(solute atom). It can be considered as the radius of the test sphere rolled over the surface used to calculate solvent accessibility.
- cvlim {0.5} - When cvlim>0 then solvents with circular variance > cvlim are kept (inside the solute); when cvlim<0 then solvents with circular variance < -cvlim are kept (outside the solute).
- nsltcvfst {1} - the first solute atom to use for the circular variance calculation
- nsltcv {nslt} - the last solute atom to use for the circular variance calculation
For Key4 = CVLS:
- rcrcv, cvlim, nsltcvfst, nsltcv - See above
- leavemol - The number of innermost/outermost solvent molecules to be kept

For Key1=GEOM:

Data

rminx

rmaxx

rminy

rmaxy

rminz

rmaxz

centx

centy

centz

rmin

rmax

cplmin

cplmax

nmolmin

nmolmax

isltfilt

rminx, rmaxx, rminy, rmaxy, rminz, rmaxz {box limits}- Solvents whose COM falls between (rminx,rminy,rminz) and (rmaxx,rmaxy,rmaxz) will be kept (default leaves all solvents)
rmin {0}, rmax {outer box radius} - solvents whose COM's distance from cent is betwen rmin and rmax will be kept (default leaves all solvents).
cplmin {0.0}, cplmax {1.0} - Whenever applicable, configurations where the coupling parameter is outside the range [cplmin,cplmax] will be dropped.
nmolmin {1}, nmolmax {maxmol} - Whenever applicable, configurations where the number of solvent molecules is outside the range [nmolmin,nmolmax] will be dropped.
isltfilt [0] - If =1 then the solute atoms will also be filtered.

For Key1=NMBR:

Data

nsltseg

ifsltseg1

ilsltseg1

ifsltseg2

ilsltseg2

nslvseg

ifslvseg1

ilslvseg1

ifslvseg2

ilslvseg2

nsltseg, ifsltseg1, ilsltseg1, ... - There will be nsltseg solute atom segments to be deleted, and their limits (inclusive) are ifsltseg1, ilsltseg1, etc. (up to a maximum of 250 segments). i*sltseg* values refer to atom numbers.
nslvseg, ifslvseg1, ilslvseg1, ... - There will be nslvseg solvent molecule segments to be deleted, and their limits (inclusive) are ifslvseg1, ilslvseg1, etc. (up to a maximum of 250 segments). The solvent number does NOT include the solute; i.e., solvent No 1 is molecule No 2.

For Key1=ENRG:

Data

esusvmin

esusvmax

esusvmin, esusvmax - Drop solvents with solute-solvent energy outside the range [ esusvmin, esusvmax] kcal/mol.

For Key1=FRAM:

Key4

ETRA - Use the total energy on the trajectory file
ECAL - Calculate the total energy for each frame on the trajectory file

ADDITIONAL INPUT:
Data: incvers

incvers {10} - The filtered trajectory will have a version number that is incremented from the initial trajectory file version number (runvers, see key FILE) by incvers.

ADDITIONAL INPUT for Key2=TR**:
Data: nfrmmax, nfrmfreq, nfrmskip

nfrmmax {100000000} - gather configurations until the nfrmmax-th frame (snapshot).
nfrmfreq {1} - use every nfrmfreq-th frame only
nfrmskip {0} - skip the first nfrmskip frames

III.09.08. Calculate hydrogen-bond bridges

HBBR Key1 Data

Prerequisite: SLTA

Key1 (anchor atoms specification)

ALST - Solute atoms that can anchor a hydrogen-bond bridge are listed
BBON - Hydrogen-bond bridges are anchored by protein backbone atoms only
TLST - Solute atom names that can anchor a hydrogen-bond bridge are listed
CHRG - Solute atoms with partial charge above a threshold can anchor a hydrogen-bond bridge
Data rxhmax, hbanglemin

III.09.09. Configuration comparison

CMPC Key1, Key2, Key3, Data

Prerequisite: SLTA
Potential successor: CNFG

Key1 (input file format):

ASCI, BNRY, ASAN, PDB , CHRM - File format. See Key2 of Key CNFG
Key2 (comparison operation):
- SHRT - Shift and overlay the second conformation's molecules on the first (valid only when the PART key is active
- SHON - Shift the second conformation's molecules on the first
- ALLD - Don't shift or rotate the molecules relative to each other
  Key3 (possible special treatment for lipids):
  - GENL - General solute molecules
  - LIPI - Lipid bilayer: 2nd half is in mirror image orientation to the first.
    Data: numrun0, numrun1, numrund, ix1, dx1, ix2, dx2, numshow
    - numrun0 - first conformation's run number
    - numrun1 - second conformation's run number
    - numrund - output files' run numbers will be incremented by numrund
    - ix1, dx1 - molecules will be dx1 Å apart along the ix1 axis
    - ix2, dx2 - molecules will be dx2 Å apart along the ix2 axis
    - numshow {all} - show only the top (most different) numshow molecules
      If Key3=LIPI: ixflip, lipfst, liplst
    - ixflip - invert the sign of the ixflip-th coordinate for the second half of the lipid molecules
    - lipfst {1} - the first lipid is the lipfst-th solute molecule
    - liplst {all} - the last lipid is the liplst-th solute molecule

This operation reads in two configurations, calculates the average, minimum and maximum RMS between the solute molecules (possibly subject to the operation specified by Key2) and spreads out the compared molecules of the solute in the ix1-ix2 plane for comparison. The centers of the molecules will be shifted to this plane in a rectangular arrangement.

III.09.10. Combining solvents from a trajectory

DENF Key1 [Data]

Prerequisite: TRAJ

Key1 (output format):

PDB - Combined file will be in PDB format
CHRM - Combined file will be in CHARMM CRD format
INSG - Combined file will be in Insight free (sxyzrq) format

Data: nmcmax, nmcfreq, nmcskip

nmcmax {100000000} - gather configurations until the nmcmax-th step.
nmcfreq {1} - use configuration saved at every nmcfreq-th simulation step
nmcskip {0} - skip the first nmcskip simulation steps

This operation scans the trajectory specified and puts the first atom of all the solvent molecules into a single configuration onto a file <jobname>_den.dat (with the run and version number taken from the trajectory file's name) in InsightII free format) as He atoms If the output file already exists, a file with incremented version number will be created. If parts of the solute were allowed to move, the affected solute atoms are also collected. The resulting configuration is a simple approximation to the three-dimensional density.

III.09.11. Cavity grid listing

PRTG Key1 Key2 Key3 [Data]

Prerequisites: CNFG, GCEN
Potential successor: MOND or BLKW
Variable to set to 'T' in the preprocessor: PG

Grid points representing cavities around the solute will be enumerated and separated into mutually disconnected clusters. The first cluster is the set of gridpoints surrounding the solute and the remaining ones (if any) represent cavities inside the solute. The program writes out the solute and the internal cavities to the file <jobname>.<numrunw>.grd and, if requested, the solute and external gridpoints (in the same format) to <jobname>.<numrunw>_2.<ext> iwhere <ext> is pdb, CRD or grd for PDB, CHARMM, Insight free format, resp.. The gridpoints are written as helium atoms with zero charge and the group (residue) number is the group number of the last solute atom plus the cluster number. Note, that only a layer of gridpoints near the solute atoms are written providing a bounding shell for each cavity.

Key1 (grid file format):

NOLP - No grid file is written
PDB - Grid files are in PDB format
CHRM - Grid files are in CHARMM CRD format
ASAN - Grid files are in annotated ASCII format (see WCNF SVAN)
Key2 (need for grids in the interior of a cavity)
- ALLG - All gridpoints in a cavity will be written
- SRFG - Only gridpoints on the surface of a cavity will be written
  Key3 (controls printing of cavity/pocket environment)
  - NOLS - No atom or residue list lining a cavity/pocket will be given
  - GRLS - List of residues lining a cavity/pocket will be printed
  - ATLS - List of heavy atoms lining a cavity/pocket will be printed
  - AGLS - Both list of heavy atoms and of residues lining a cavity/pocket will be printed
    Data: nnexmin, nnexmax, cvlim, rcutcv, volcavmin, volpckmin, rnearlim, rsltfac, cvlimcav, randsh, diamslvnew, nlshave, maxnnsurf, noputbacksfg, numrunw
    - nnexmin, {1} nnexmax {0} - external gridpoints will be written only if the number of their grid neighbor is between nnexmin, nnexmax (inclusive). Can be a huge file!
    - cvlim {0.0} - When cvlim > 0.0 external grid points will be dropped when their circular variance is < cvlim. The remaining points will be clustered and printed, filtered by volcavmin or volpckmin.
    - volcavmin - {0.0} minimum volume estimate of a cavity to be printed
    - volpckmin - {0.0} minimum volume estimate of a pocket to be printed
    - rcutcv - {10.0} the cutoff for all XV calculations
    - rnearlim - {10^1/2} atoms lininig a cavity/pocket have to be closer then rnearlim Å to a cavity grid.
    - rsltfac - {0.9} Solute atom radii will be multiplied by rsltfac for the calculation of grid coverings.
    - randsh {0.5} - grid will be shifted by randsh*gridsize
    - cvlimcav - {0.0} circular variance threshold for the cavity centers: cavites whose centers circular variance is below cvlimcav will be dropped from consideration
    - diamslvnew {0.0} - When diamslvnew > 0.0 the grids are also generated with cavity radius diamslvnew. If it is significantly smaller than the original rsph value inputted after GCEN then the second set might partition into more discrete clusters, indicating the presence of invaginations, in addition to real cavities. These new gridpoints will be written to files <jobname>.<numrunw>_3.<ext>, <jobname>.<numrunw>_4.<ext>.
    - nlshave {0} - When nlshave > 0 both the cavities and the pockets will be subjected by a sub-cluster search by removing nlshave layers from each cluster and check if the resulting set of gridpoints fall apart. If yes, these sub-clusters will be also written to the corresponding PDB files with chain id 'S'.
    - maxnnsurf {3} - Grids with maxnnsurf or less neighbors will be considered surface grids and will be shaved off.
    - noputbacksfg {0} - When noputbacksfg=1 the shaved gridpoints will be left out from the subclusters
    - numrunw {numrun} - the runnumber of the PDB files to be written.

III.09.12. Estimate solute and cavity volumes with multiple grids

STVG [Data]

Prerequisites: CNFG, GCEN, SLTA
Variable to set to 'T' in the preprocessor: PG

Data: diamslvnew, ngridrep

diamslvnew {diamslv} - Calculate the number of grids covered by the solute using diamslvnew as the solvent radius and estimate the solute volume from it. If diamslvnew is different from diamslv (read by the GCEN key) then the solute radii will be also scaled by diamslvnew/diamslv.
ngridrep {10} - repeat the grid cover calcculation with ngridrep randomly shifted grid and print the average volumes.

The progam estimates the volume of the solute and the volume of each cavity based on the number of gridpoints covered/left uncovered, resp. The first 'cavity' is the volume outside the solute. The averaging assumes that the cavities occur in the same order in each sample.

III.09.13. Calculate cavity occupancies

CPOC Key1 Key2 Key3

Prerequisite: CNFG,PRTG, GCEN

This key calculates the (average) number of solvents in each cavity as determined by the key PRTG
Key1 (select cavity/pocket)

CVTY - calculate the occupancies of internal cavities
PCKT - calculate the occupancies of surface pockets
Key2 (input type specification)
- CONF - calculate occupancies for a single structure (read by the CNFG key)
- TRAJ - calculate average occupancies for a trajectory (specified by the TRAJ key). Note that the pockets/cavities are defined by the conformation read at the start of the run thus it is assumed that the solute is rigid
  Key3 (saving cavity solvents)
  - NOSA - cavity/pocket solvents will not be saved
  - SAVE - When Key2=CONF, a PDB file containing the solute and solvents found to be in a cavity/pocket will be saved

III.09.14. Calculate generic solvent sites

GENS Key1 Key2 Key3 Key4 Key5 Data

Prerequisite: PBCN
Can not be followed by: IWSL Related keys: GABF, GSAN, GSAO, IWSL Variable to set to 'T' in the preprocessor: PG

Key1 (type of calculation)

CALC - Calculate generic solvation sites from a trajectory with the algorithm of Mezei and Beveridge
CCAL - Continue the generic solvation site calculation. This option requires a preceding RCKP key.
COMP - Compare two site estimates, read from two files, by performing a Hungarian method matching. If the solute is also present, write a PDB file containing the sites, sorted by their circular variance (inside sites first).
CLST - Cluster sites that are within a threshold distance in a a single site and combine their occupancies.

For Key1=CALC or Key1=CCAL:

Key2

FIXI - Except for the first scan (iteration) of the history file, keep the site estimate fixed during an iteration (recommended).
CONT - Continually update the site estimate during an iteration.
Key3: (inital site estimate source)
- GETC - Obtain initial sites from a system read previously by the CNFG key.
- GETH - Obtain initial sites from the trajectory file
- RDAS - Read initial sites from a file specified here in ASCI format (see key CNFG)
- RDBI- Read initial sites from a file specified here in BNRY form (see key CNFG)
- RDAN - Read initial sites from a file specified here in ASAN format (see key CNFG)
- RDPD - Read initial sites from a file specified here in PDB format (see key CNFG)
- RDCH - Read initial sites from a file specified here in CHRM format (see key CNFG)
  Key4: (output file format - see key CNFG) CNFG
  - WPDB - Output file format is PDB, the file <jobname>_gs. will contain the sites only
  - WASC - Output file format is ASCI, the file <jobname>_gs. will contain the sites only
  - WANN - Output file format is ASAN, the file <jobname>_gs. will contain the sites only
  - SPDB - Output file format is PDB, the file <jobname>_gs. will include the solute of the last frame
  - SASC - Output file format is ASCI, the file <jobname>_gs. will include the solute of the last frame
  - SANN - Output file format is ASAN, the file <jobname>_gs. will include the solute of the last frame
  - RPDB - Output file format is PDB, the file <jobname>_gs. will include the solute read at the start
  - RASC - Output file format is ASCI, the file <jobname>_gs. will include the solute read at the start
  - RANN - Output file format is ASAN, the file <jobname>_gs. will include the solute read at the start
    Key5: (order of generated sites)
    - FSRT - Generated sites will be sorted in the order of decreasing occupancy, fully occuped sites also sorted in the order of increasing site RMS
    - DSRT - Generated sites will be sorted in the order of increasing site RMS
    - CSRT - Generated sites will be sorted in the order of increasing site convergence
    - PSRT - Generated sites will be sorted in the order of increasing distance from the solute
    - CVSR - Generated sites will be sorted in the order of decreasing site CV wrt the solute (going from inside to outside)
    - PXSR - Generated sites will be sorted in the order of increasing proximity solute index
      Data: ns, nmcmax, nmcfreq, nmcdisc, rcutxst, rmateps, epsgsite, rmaxmatch, rmix, maxiter, nsmin, nsmax, cplpmin, cplpmax, cvfilt_traj, cvfilt_trajcut, froccmin, cvfilt, distsltmax, rmsdlim, froccminrep, clstmerge_dmax, froccpairmax, rmaxrematch
      - ns - Generate ns sites to start with. If ns=0, the initial number of sites will be the number of solvents in the input structure. When a structure assigns a solvent to each site and there are solvents left over those solvents will added as new sites.
      - nmcmax - trajectory file will be scanned until simulation step number nmcmax
      - nmcfreq - Use every nmcfreq-th configuration from the trajectory file
      - nmcdisc - Discard the first nmcdisc configurations. Note: for MC history files, nmcmax, nmcfreq, nmcdisc should refer to MC stepnumbers, not structure numbers.
      - rcutxst {8.0} - For the matching, consider only sites within rcutxst Å from each solvent (just to speed up the matching procedure)
      - rmateps {0.01} - Convergence threshold for optimizing the Hungarian method matching
      - epsgsite {0.1} - Convergence threshold for the RMSD between subsequent site estimates
      - rmaxmatch {1000.0} - If a solvent is farther than rmaxmatch Å from its assigned site, it will be added as a new site to the site list and ns will be increased accordingly.
      - rmix {0.0} - When rmix > 0.0 the previous site estimate will be mixed with the new site estimate: new=rmix*old+(1-rmix)*new . This might help the iteration to converge.
      - maxiter {100} - maximum number of iterations over the trajectory. If it is less than the iterations already performed, only the results will be printed.
      - nsmin {0}, nsmax {#MH} - Use only configurations where the number of solvents is in the (inclusive) range [nsmin,nsmax] (meaningful only for analyzing grand-canonical ensemble simulations - see the GCEN key).
      - cplpmin {-0.1}, cplpmax {2.0} - Use only configurations where the coupling parameter is in the range (meaningful for analyzing PMF calculations, i.e., FREE PMF1) [cplpmin,cplpmax]
      - cvfilt_traj - When >0.0 the solvents read from the trajectory with circular variance (w.r.t the solute) < cvfilt_traj will be dropped.
      - cvfilt_trajcut - The CV will be calculated using a distance cutoff of cvfilt_trajcut Å.
      - froccmin {0.0} - Sites with fractional occupancy < froccmin will be eliminated.
      - cvfilt - When >0.0 sites whose CV (w.r.t. the solute) is < cvfilt will be eliminated.
      - distsltmax {0.0} - When >0.0 sites farther than distsltmax from the nearest solute atom will be eliminated.
      - rmsdlim {0.0} - When >0.0 sites with site rmsd > rmsdlim will be dropped.
      - froccminrep {0.0} - When >0.0 a representative configuration where all sites with occupancy > froccminrep (must be > froccmin) will also be generated.
      - clstmerge_dmax {0.0} - When >0.0 sites that are closer to each other than clstmerge_dmax will be combined into a single site.
      - froccpairmax {2.0} - Site pairs whose sum of occupancies exceed froccpairmax will not be paired for clustering
      - rmaxrematch {0.0} - When > 0.0 the assigments of solvents to sites will be repeated using the filtered sites only (see froccmin, cvfilt, distsltmax, above). For this match only solvents whose distance from the site is < rmaxrematch Å will be accepted.
      When Key3=RD** additional Data:
      - file, numrun, irunvers, nslt, nslv, islvrep - information specifying the content of the input file as described above with Key1=COMP

COMP

Key2

Key3

ASCI or BNRY or ASAN or PDB or CHRM - the formats of the two input files as described with the CNFG key.
Key4 (site distance measure M)
- DIST - M = site-site distance
- ENVN - M = <# of neigbours> - # of identical neighbors (both within a user defined threshold).
- ENVI - M = (<# of neighbors>+1)/(# of identical neighbors+1)
- ENVM - M = (site-site distance) * (<# of neighbors>+1)/(# of identical neighbors+1)
- TANI - M = (1 - Tanimoto coef of the neighbor lists)
  If nexp > 1 then M => M^N
  If nexp < -1 then M => M^(1/N)
  Data: ns1, ns2, file1, numrun1, numvers1, nslt1, nslv1, islvrep1, file2, numrun2, numvers2, nslt2, nslv2, islvrep2, rcutxst, rmateps, rcutcrcv, rmsdgrid, rneigsite, nexp
  - ns1, ns2 - Number of sites in systems 1 and 2, resp. If either of them is zero, the number of sites will be the number of solvents in the corresponding structure.
  - file1,file2 - jobnames of the files for systems 1 and 2, resp.
  - numrun1,numrun2 - runnumbers of the files for systems 1 and 2, resp.
  - numvers1,numvers2 - version numbers of the files for systems 1 and 2, resp.
  - nslt1,nslt2 - number of solute atoms in the files <jobname>.<runnumber>.crd for systems 1 and 2, resp.
  - nslv1,nslv2 - number of solvent atoms (per molecule) on the files for systems 1 and 2, resp.
  - islvrep1,islvrep2 - index of the solvent atom representing the site for systems 1 and 2, resp.
  - rcutxst {3.0} - For the matching, consider only sites within rcutxst Å from each solvent
  - rmateps {0.01} - Convergence threshold for optimizing the Hungarian method matching
  - rcutcrcv {8.0} - Cutoff for the circular variance calculation
  - rmsdgrid {0.15} - The grid size for the matching distance distribution
  - rneigsite {5.0} - The neighbor distance threshold when Key4 is not DIST
  - nexp {1} - the exponent to use with Key4=TANI

CLST

Key2

ASCI or BNRY or ASAN or PDB or CHRM - the format of the input file as described with the CNFG key.
Data: ns, file, numrun, nslt, nslv, islvrep, clstmerge_dmax
- ns - Number of sites in the system If ns=0, the number of sites will be the number of solvents in the input structure.
- file - jobname of the files containing the generic sites to be clustered
- numrun - runnumber of the file
- numvers - version number of the file
- nslt- number of solute atoms in the file
- nslv - number of solvent atoms (per molecule) in the file
- islvrep - index of the solvent atom representing the site
- clstmerge_dmax {2.0} - Sites that are closer to each other than clstmerge_dmax will be combined into a single site.
- froccpairmax {2.0} - Site pairs whose sum of occupancies exceed froccpairmax will not be paired for clustering

Both GENS CALC and GENS COMP are based on using the so-called Hungarian method to obtain an optimal matching of solvent coordinates to (generic) site coordinates. With GENS CALC the calculation scans a simulation trajectory and dedudeces iteratively a set of generic sites. For each configuration, the solvent coordinates are matched to the site estimates, allowing for the same solvent to contribute to different sites during the simulation.

When GENS CALC is used, the program will write several files:

<jobname>_gs.: the generated sites. The PDB data columns give the fractional occupancy and site RMS.
<jobname>.gs_rep.pdb: The configuration found to be closest to the configuration represented by the sites. The PDB data columns give the fractional occupancy and distance between the representative solvent and the site CV w.r.t the solute.
<jobname>.pdb_gs_rep_all.pdb: An other representative configuration selected with the additional criterion that all sites with occupancy > froccminrep are present
<jobname>.pdb_gs_cmp.pdb: a composite configuration where for each site (with occupancy > froccmin) the solvent closest to it is selected. The PDB data columns give the fractional occupancy and site RMS. Note that these solvents may be in incompatible conformations, especially when low occupancy sites are included.
<jobname>._gs_avg.pdb: a composite configuration where the waters are in their average orientation (prepared for water solvent only). The PDB data columns give the fractional occupancy and the site CV w.r.t the solute.
<jobname>._gs_cls.pdb: the sites after clustering those that are closer than clstmerge_dmax to each other. The PDB data columns give the fractional occupancy and the site RMSD.
<jobname>._gs_cla.pdb: only for water solvent - waters in their average orientation at the clustered sites. The PDB data columns give the fractional occupancy and the site CV w.r.t the solute.
<jobname>._gs_fcl.pdb: the sites after filtering by CV and clustering by distance The PDB data columns give the fractional occupancy and the site CV w.r.t the solute.
<jobname>._gs_fcl.inf: Detailed information (coordinates, occupancy, CV, RMSD, energy, proximal solute atom) on the sites in each cluster and their averages over the cluster members

Depending on Key4, the sites or solvents may be preceded by the full solute. Representative and the composite configurations list the solvents in the order of the sites they match to, but may be offset due to missing matches. Also, the standard output has a combined list of the sites and their representatives with their properties.

The run and version number for these files are taken from the trajectory file and extension defined by Key3. If the output file already exists, a file with incremented version number will be created.

III.09.15. Prepare site assignment file, entropy calculation, clustering sites

IWSL Key1, Data

Potential successor: GENS

Key1 (ask for entropy input)

WSTO - Write solvent site assignment file (.wsl) and an input file (.sto) for the program STOW to calculate entropy.
NOST - Write only solvent site assignment file (.wsl).
Data clstfin_dmax, nclust_clstfin, cvfilt_stow, froccmin_stow, rmsdlim_stow, maxclstmem_stow, subreground
- clstfin_dmax {0.0} - When > 0.0 the filtered sites will be clustered (using single-link clustering) with a distance cutoff of clstfin_dmax. If Key1=WSTO than the default value is 3.5 A.
- nclust_clstfin {0} - When > 0 the filtered sites will be clustered (using single-link clustering) by varying the distance threshold to yield nclust_clstfin clusters.
- cvfilt_stow {0.0} - When >0.0 the clusters where the lowest CV value is less then cvfilt_stow will be skipped from the STOW input.
- occfilt_stow {0.0} - When >0.0 the clusters where the lowest occupancy value is less then occfilt_stow will be skipped from the STOW input.
- rmsdlim_stow {0.0} - When >0.0 the clusters where the highest site rmsd exceeds rmsdlim_stow will be skipped from the STOW input.
- maxclstmem_stow {10} - Clusters with more than maxclstmem_stow members will be skipped from the STOW input.
- subreground {0.7} - When calculating the number of neighbour sites, the mean number of waters in the neighborhood of a site will be rounded up if the fractional part is > subreground

III.09.16. Generic site analyses

GSAN Key1 [Key2 [Data]]

Prerequisite: IWSL
Can not be followed by: GENS
Related key: GSAO

Key1 (analysis type)

DPCV - the covariance matrix among the water dipoles corresponding to each site will be written to a file with extension .cov.
DPCR - the correlation matrix among the water dipoles corresponding to each site will be written to a file with extension .cor.
ENRG - the matrix of the average energies between waters belonging all site pairs will be written to a file with extension .eng.
HBND - the matrix of the fraction of time waters belonging to each site are hydrogen bonded will be written to a file with extension .hbd and site-solute hydrogen bond statistics will be calculated.
HBTT - in addition to the site-site hydrogen-bond occupancy matrix, the matrix of site-solute and solute solute hydrogen bond occupancies will be printed. The solute hydrogen bonds are condensed by residues and residues not forming hydrogen bond with sites are omitted.
OCCC - the matrix the joint and alternative occupancies will be written to a file with extension oro.
JMPA - the average number of frames between solvent jumps will be calculated
Key2 (eigenvalue/vector calculation, normalization, order)
- NOEV - Don't calculate the eigenvalues/eigenvectors of this matrix
- CAEV - Calculate the eigenvalues/eigenvectors of this matrix and write it to a file with extension .eiv
  For Key1=OCCC only:
- ONRM - Normalize the occupancies calculated with the occupancy expected from uniform distribution
  For Key1=JMPA only:
- RDOR - Read in the original solvent numbers from a file created by FILT TR*O
  Data for Key1=HBND: rxhmax, hbanglemin, occmin_at, occmin_res
  - rxhmax {3.0} - maximum X...H distance for a hydrogen bond
  - hbanglemin {90.0} - minimum X...H-Y angle for a hydrogen bond
  - occmin_at {0.0} - minimum occupancy to show in the solute atom - site hydrogen bond list
  - occmin_res {0.0} - minimum occupancy to show in the solute residue - site hydrogen bond list
  Data for Key1=JMPA and Key2=RDOR: numrunorig, iversorig
  - numrunorig {numrun}, iversorig {1} - the run number and version number of the original solvent number file

This key activates the selected analysis options after a generic site calculation (with the key GENS)

III.09.17. Generic site analysis options

GSAO Key1 Key2 Key3 [Data]

Prerequisite: IWSL
Can not be followed by: GENS
Related key: GSAN

Key1 (matrix sort option)

NOSR - print the matrix in the original site order
SXYZ - print the matrix sorted by one of the site coordinates
RXYZ - print the matrix sorted by the site-origin distance * sign of the one of the coordinates
LEXI - print the matrix sorted by a quasi-lexicographical order Key2 (output format specification)
- ASCI - formatted output, five numbers per line (text file)
- ASCL - formatted output, one line per site (text file)
- BNRY - unformatted output (binary file)
  Key3 (vector to use for covariance when Key1 of GSAN =DPCV or DPCR)
  - DIPL - Covariances are calculated for the HOH bisector
  - HHVC - Covariances are calculated for the H-H vector
  - DPHH - Covariances are calculated for the vector normal to the HOH plane
    Data for Key1=SXYZ or RXYZ: iaxis
    - iaxis sorting wil be based on the x, y, or z coordinates for iaxis = 1, 2, or,3, resp.
    Data for Key1=LEXI: w, iaxis1 iaxis2 iaxis3
    - w, iaxis1 iaxis2 iaxis3 - Sites will be sorted based on w^2*c(iaxis1) +w*c(iaxis2) +c(iaxis2)

III.09.18. Generic site graph analysis by frame

GABF Key1 [Data]

Key1 (print option)

NODP - No frame-by-frame printout
DTPR - Frame-by-frame printout
DLPR - Frame-by-frame printout with list of hydrogen bonds
Data rxhmax, hbanglemin, hbmatmin
- rxhmax {3.0} - maximum X...H distance for a hydrogen bond
- hbanglemin {90.0} - minimum X...H-Y angle for a hydrogen bond
- hbmatmin {0.0} - the clustering will use an adjacency matrix whose elements are 1 when the frame-averaged connectivity probability is > hbmatmin

Calculate the hydrogen-bond matrix for each frame, find the connected clusters and accumulate (and average) the matrix where all cluster members are connected. The matrix of the connectivity probabilities will be written to a file with extension .hbf.

III.09.19. Configuration with input torsion angles

GENT Data

Prerequisites: CNFG, TORD.

Data ntorch

ntorch - Number of torsion angles to specify

Fdata: ntorch records

ia1, ia2, ang (2i5,f10.0)
- ia1, ia2 - Atoms of torsion bond to change
- ang - New torsion angle

If the atoms ia1 and ia2 are listed in the reverse order on the torsion list, the torsion angle used will be -ang.

III.09.20. Calculate torsion angle distribution

TAND

Prerequisite: TORD
Potential successor: PXAN

When this key is present, analysis of a configuration or trajectory (activated by the SCAN key) will include the calculation of torsion angle distributions for torsions defined by the TORD key. The number of angular grids will be determined by the value of #TD set by the preprocessor. Besides the distribution, the program prints the standard deviation (meaningful for not too wide distribution only), the circular variance (on a scale of 0 to 1) and the number of grids sampled.

III.09.21. Energy analysis along a torsion

TORT Key1 Data

Prerequisites: CNFG, TORD.

Key1 (reference energy)

ABSE - The actual energy terms will be printed
RELE - Each energy term represents the difference between the current and initial conformation Data ntsamp times (ia1, ia1), anginc, PDBfile
- ia1, ia1 - atomindices of the atoms forming the torsion bond to be sampled
- anginc {5.0} - angle increment for the scan (in degrees)
- engmax {99999.9} - maximum energy to print
- PDBfile - if present, each structure generated will be written to PDBfile.pdb

This key calculates the various non-bonded and torsion terms as the selected torsion angle is driven through the full circle. The values printed are relative to the initial torsion angle.

If a conformation was generated previously with the GENT key then the torsion scan will start at those angles (otherwise at 0).

III.09.22. Energy analysis of backbone loop solutions

LPST Key1 Data

Prerequisites: CNFG, LOOP, TORD

Key1 (reference energy)

ABSE - The actual energy terms will be printed
RELE - Each energy term represents the difference between the current and initial conformation
Data ia1, ia1, PDBfile
- ia1, ia1 - atomindices of the atoms forming the torsion bond
- PDBfile -if present, each structure generated will be written to PDBfile.pdb

This key calculates all possible backbone solutions for the six torsions following the selected torsion angle, and calculates the same energy contributions as the TORT key.

HBBR Key1 Data

III.09.23. Aggregate insertion and deletion sites

IDAG Key1, Key2, Data

Prerequisite: IDLG
Related key: GCEN

Key1 (select insert/delete/both)

BOTH - Both insertion and deletion sites will be extracted
INSR - Only insertion sites will be extracted
DELE - Only deletion sites will be extracted
Key2 (filtering option)
- NOFT - All sites will be considered
- FILT - Sites will be subject to a proximity filter:
  Data rmax, nslt1, nslt2, nmcscan
  - rmax, nslt1, nslt2 - Only present when Key2=FILT. Sites selected should be closer than rmax to one of the solute atoms in the range nslt1-nslt2
  - nmcscan {0} - when not zero, the log file will be scanned until the nmcscan-th MC step.

This option will prepare a PDB file containing the solute and the sites of insertions and/or deletions as read from the insertion deletion log file (created by the IDLG key during a grand-canonical ensemble run (see key GCEN)). Insertion sites will be represented as oxygens while deletion sites as nitrogens.

For solute atom records, the occupancy and temperature factor columns will contain the distance of the closest deletion and/or insertion sites (as specified by Key1) and the number of such sites, resp. For the deletion and/or insertion site records, the occupancy and temperature factor columns will contain the index of the proximal solute atom and the distance from it, resp.

III.09.24. Trajectory superimposition

SUPI Key1, Key2, Data

Prerequisite: TRAJ

When filtering a trajectory, the filtered configuration will be superimposed to reference configuration.
Key1 (reference structure)

CONF - Reference structure is the structure read with the CNFG key
TRAJ - Reference structure is the first structure read with the TRAJ key
Key2 (atoms to be used)
- RNGE - Solute atoms to be used will be defined as index range(s)
- PROX - Solute atomd to be used will be defined as the vicinity of a selected atom in the reference structure
  Data for Key2=RNGE: nsltseg, ifsltseg1, ilsltseg1, ifsltseg2, ilsltseg2, ..., nslvseg, ifslvseg1, ilslvseg1, ifslvseg2, ilslvseg2
  Data for Key2=PROX: iaslt_cent, range

III.09.25. Solvent dipole distribution

SVDP Data

Prerequisites: SLVA, SLTA

Data exa, eyb, ezc

exa, eyb, ezc - unit vectors in the direction of coodinate axes (default: Cartesian axes)

The program will calculate the distributions of the angles between the solvent's dipole vector and the three coordinate axes.

III.09.26. Monitoring bulk solvent density

MOND [Data]

Prerequisite: PBCN
Prerequisite of: BTUN
Can not be followed by: PRTG

Data: glimminx, glimmaxx, glimminy, glimmaxy, glimminz, glimmaxz

The solvent density will be monitored inside and outside the rectangle defined by the ranges glimminx to glimmaxx, glimminy to glimmaxy, glimminz to glimmaxz in the x, y, and z directions, respectively. For non-rectangular simulation cells the center of the rectangle has to be the origin of the coordinate system.

III.09.27. Monitoring bulk solvent density

BLKW [Data]

Prerequisite: PBCN
Prerequisite of: BTUN
Can not be followed by: PRTG

Data: wlayer

The solvent density will be monitored inside and outside the rectangle defined by the ranges glimminx to glimmaxx, glimminy to glimmaxy, glimminz to glimmaxz in the x, y, and z directions, respectively, where these limits of the inside box are determined to be wlayer away from the edges of the simulation rectangle. It is only applicable with rectangular or cubic bundary condition and the center of the rectangle has to be the origin of the coordinate system.

III.10. Proximity analysis keywords

PXYZ: Permute coordinates read
PXCR: Proximity criterion choice
FCGD: Functional group definition
PXLM: Omitting atoms from proximity considerations
RFSL: Set first shell radii
RMOD: First shell radius modifications
DBLG: Double proximity radial gridsize
DIPC: Dipole fluctutations
GENV: Proximity volume element generation
GRAV: Proximity RDF averageing
PXTD: Proximity table difference calculation
PXDP: Solute-solvent proximity dipole distr. calculation
PXGR: Solute-solvent proximity g(r) and coordination number calculation
PXBE: Solute-solvent proximity binding energy calculation
PXWW: Solvent-solvent proximity distribution calculation
VOLE: Volume element regeneration
VORO: Voronoi analysis
RTIM: Solvent residence time
DIFC: Solvent diffusion constant
TAUC: Torsion angle autocorrelation

III.10.01. Permute coordinates read from history file

PXYZ Data

Potential successor: CNFG

Data: ix, iy, iz

ix, iy, iz - the X, Y, and Z coordinates will be permuted to the ix-th, iy-th and iz-th place before analysis or before writing out a new trajectory with the key WTRA. For example, to switch to the convention used for CHARMM (Z axis is the prism axis, hexagon vertex on the Y axis) use PXYZ 1 3 2 .

The coordinates will be permuted before analysis or before writing the new trajectory. This functionality allows the use of a trajectory file where the hexagonal prism's orientation is different from the convention MMC is using. Note, that it is only meaningful to use this key with hexagonal prism cell.

Note also that the change is applied only to the configurations analyzed. Thus, depending on whether the last configuration read was analyzed, the configuration current after a SCAN may or may not have the coordinates permuted.

III.10.02. Proximity criterion choice

PXCR Key1 Key2 Key3 [Data]

Prerequisite: SLTA
Prerequisite of: GENV, PXGR, PXWW, PXBE, PXLM, TIMM, FLDG, RTIM, DIFC, PXDP, PXTD, PXWR

Key1 (solvent density source):

TDEN - Solvent density is obtained form the cell volume and solute p.m.v.
PDEN - Solvent density is obtained from the considered primary region volume
WDEN - Solvent density is obtained from the solvent's p.m.v.
MDEN - Solvent density is obtained form the cell volume and the solute volume (including the layer of solvent radius), as calculated from onsets of the primary RDF's and the Monte Carlo estimates of the shell volumes.
Key2 (tesselation strategy):
- BISE - Original proximity criterion based on bisectors
- RPVW - Radical plane-based proximity criterion, using Van der Waals radii
- RPCH - Radical plane-based proximity criterion, using atomic charges
  Key3 (ignoring atoms):
  - ALL - Use all atoms on the solute for proximity index calculation
  - NOHD - Ignore hydrogens on the solute for proximity index calculation
  - URBG - Use only solute atoms in the ranges (specified below) for proximity index calculation
  - URNH - same as Key3=URBG but, in addition, ignore all hydrogens
    Solute atoms ignored by Key3 will not contribute to any rdf either.
    Data:
    - For Key3=RNG* only: nrange; irangestart, irangestop (nrange times) - only atoms in the ranges irangestart - irangestop will be used for proximity index calculation
    - For Key2=RPCH only: rqwmin, rqwmax - the atomic charges will be mapped (linearly onto the [rqwmin,rqwmax] interval to be used as radii in defining the radical planes. Default values: rqwmin {0.0}, rqwmax {1.5}.
    - pmvslt, pmvslv
      - pmvslt {0.0} - solute partial molar volume in ml/mol
      - pmvslv {18.12004} - solvent partial molar volume in ml/mol

III.10.03. Functional group definition

FCGD Key1 Data Fdata

Prerequisite: SLTA
Can not be followed by: FCGA

Key1 (definition type):

STND - Standard (chemically defined) functional groups will be used.
INTO - Functional groups will be defined by the user. The information will only be used for tabulating proximity results.
INTG - Functional groups will be defined by the user. The information will be used for tabulating proximity results and for grouping the calculated distributions, overriding the indxrdf information given after the SLTA key.
Data:
- ninpgrp - Number of functional groups to be defined.
- ninpspecd - Number of descriptors to be read.
  Fdata:
  - RECTYPE 11: (ninpgrp times): short name, long name, scope (a4,1x,a8,i5)
    - short name, long name - The names of the newly defined functional groups.
    - scope {0} - The scope of a definition.
      - scope=0: All atoms satisfying the definition(s) will form a single functional group
      - scope=1: Atoms satisfying the definition(s) and belonging to different residues (solute groups) will form different functional groups
      - scope=2: Atoms satisfying the definition(s) and belonging to different solute molecules will form different functional groups
  - RECTYPE 12: (ninpspecd times): resnam, atnam, labfg8, ifgrp (a4,1x,a4,1x,a10,i5)
    - resnam, atnam, labfg8, ifgrp - Solute atoms with residue name resnam, atom name atnam, (see labslt for key SLTA) long atom label (spgroup, see key SLTA) of labfg8 are defined to belong to the ifgrp-th functional group. Blank field for any of these three variables will match everything. Asterisk in any of these fields will match any character.

Note, that care must be exercised when this option is used after restoring a checkpoint file. If the INTG key was used before resulting in the non-default grouping of rdf's then using it again or using the STND key (to revert to the default) will abort the run since the original atom by atom information is lost.

III.10.04. Omitting atoms from proximity considerations

PXLM Data

Prerequisite: PXCR

Data: nfadel, nladel

nfadel {0} - The first nfadel solute atoms will be omitted from proximity considerations (i.e., when assigning solvents to solute atoms, these omitted atoms will be ignored).
nladel {0} - The last nladel solute atoms will be omitted from proximity considerations.

III.10.05. First shell radius definition

RFSL Key1 [Data]

Prerequisite: SUPT
Potential successor: SLTA

Can not be followed by: SLTA
Key1 (source of radii):

STOR - Use stored first shell radii (shown on the list obtained by PLBP.
Data: rfsfac, rfsadd
- rfsfac {1.0} - All first shell radii will be multiplied by rfsfac.
- rfsadd {0.0} - All first shell radii will be incremented by rfsadd (after multiplication by rfsfac).
SIGM - First shell radii will be obtained as the mean of the L-J sigma parameters of the solvent and solute atoms. The solvent's sigma is the sigma of the islvrep-th atom (see key SLVA) of a general solvent or the sigma of the water oxygen.
Data: rfsfac, rfsadd, sigmah
- rfsfac {1.05} - All first shell radii will be multiplied by rfsfac.
- rfsadd {0.0} - All first shell radii will be incremented by rfsadd (after multiplication by rfsfac).
- sigmah {0} When Key1=SIGM and sigmah is not zero, sigmah will be the sigma value used for the solute hydrogen atoms.
RDFM - Replace the first shell radii with the calculated minima of the primary RDF's.

III.10.06. First shell radius modification

RMOD Data

Data: nmod, nmod times : pfll, pfl, rfsl

nmod - number of potential types to modify
pfll - potential library label (see key SUPT)
pfl - atom type label or atom type number to modify
rfsl - new first shell radius for potential type pfl in the potential library pfll

III.10.07. Radial gridsize doubling

DBLG

This key will double the gridsize for the proximity analysis radial grid, modify all the accumulators involved and print the full result pages with the modified gridsize. If the PXPL key was used earlier, the proximity distributions will also be written on the .pxp file. Note, that the change is permanent for the remainder of the calculations.

III.10.08. Dipole fluctuation calculation

DIPC

The fluctuation of the dipole moments will be calculated in the proximity regions to give estimates of the dielectric constant.

III.10.09. Monte Carlo volume element estimation

GENV Key1 Data [Data]

Prerequisites: SLTA, PXCR, PXGR

Key1 (possible visualization):

NOWR - Don't write .pxv file
WRIT - Write file .pxv to visualize proximity regions
Data: nransn, iwritefs, iwritels
- nransn - The number of (additional) random points to generate this time. This will increase the precision of volume estimates, but only when the solute conformation remains fixed during the simulation.
- iwritefs {1}, iwritels {nslt} - points in the proximity of solute atoms from iwritefs to iwritels will be written to the file <jobname>.<numrun>.pxv
- maxprint {100} - maximum number of points written per proximity region

The calculation of the proximity RDF's require the volume element of shells in the various proximity regions. This volume is calculated by a Monte Carlo method: random points of uniform distributions are generated and the volume of each shell is assumed to be proportional to the number falling into them.

Generating a certain number of random points after each snapshot analyzed (see keys SCAN or PXAN), instead of generating them in one step at the beginning or end, allows for an approximate accounting of the variation of these volume element as a result of conformational changes in the solute.

III.10.10. Calculating RDF extrema averages

GRAV Key1 [Data] Prerequisite: PXGR

Key1 (averaging strategy):

ELEM - Averages calculated over chemical elements
PTYP - Averages calculated over potential types
Data: rkgrmin, r2kgrmin, rtkgrmin
- rkgrmin {0.5} - Minimum first shell coordination number to contribute to averages
- rkgrmin {2.5} - Minimum first+second shell coordination number to contribute to averages
- rkgrmin {10.0} - Minimum total coordination number to contribute to averages

If primary or total RDF's were calculated, the program averages the location and values of the first and second maxima, and the first minima for atoms that are sufficiently solvated as defined by the threshold values inputted above.

III.10.11. Proximity table difference generation

PXTD Data Prerequisite: PXCR

Data: filename1, runnum1, filename2, runnum2

filename1 - File name root of the first checkpoint file
runnum1 - Run number of the first checkpoint file
filename2 - File name root of the second checkpoint file
runnum2 - Run number of the second checkpoint file

The program reads both checkpoint files and calculated the differences of the table entries (shell volumes, coordination numbers, densities, energies, etc.). The two checkpoint files should refer to the same system.

III.10.12. Solute-solvent dipole distribution function calculation

PXDP

Prerequisites: PXCR, PXGR
Potential successor: PXAN

This key will result in the calculation of two functions for the solute atoms, based on the angle THETA between the line drawn from the solute atom to the first solvent atom and the bisector of the angle S2-S1-S3 (i.e., assuming that the solvent is water and the first atom is the oxigen, the dipole direction):

The distribution of THETA for waters in the first shell around the solute atom.
The average of THETA as a function of the waters' distance from their proximal solute atom.

III.10.13. Solute-solvent proximity radial distribution function and coordination number calculation

PXGR Key1 Key2 Key3 Data

Prerequisite: SLTA
Prerequisite of: GENV, PXGR, PXWW, PXBE, PXLM, TIMM, FLDG, RTIM, DIFC, PXDP, PXTD, PXWR

Key1 ('total' solute-solvent g(r) calc.):

NOTO - Don't calculate total solute-solvent g(r)'s
PXTO - Calculate total solute-solvent g(r)'s (centered at each solute atom)
Key2 (proximity solute-solvent g(r) calc.):
- NOPR - Don't calculate solute-solvent proximity g(r)'s or proximity Kc's
- PRKC - Calculate proximity coordination numbers
- PRIM - Calculate solute-solvent proximity g(r)'s, coordination numbers and their distributions
  Key3 (density weighting for proximity regions):
  - NOWT - No density weighting is used for the primary solute-solvent g(r)'s
  - TDWT - Primary solute-solvent g(r)'s are weighted by the total primary area solvent density
  - PDWT - Primary solute-solvent g(r)'s are weighted by the density of the primary area used for the g(r) calculation
    Data: rgpxmax, rpxdiv, rpxnear
    - rgpxmax {10.0} - Range of the solute-solvent g(r)'s in Å
    - rpxdiv {0.1} - Grid size to be used with the solute-solvent g(r)'s in Å
    - rpxnear {1.0} - When a solvent gets closer than rpxnear Å to the solute, a strong warning is issued and the configuration is saved

The total g(r) is calculated for the solute atoms by considering all solvents. The proximity g(r) is calculated for solute atoms by considering only solvents proximal to it. Coordination numbers (and their distributions) will be calculated if either total g(r) or proximity g(r) calculation was requested.

III.10.14. Solute-solvent proximity binding energy calculation

PXBE Data

Prerequisite: PXCR
Potential successor: PXAN

Data: epmink, epdivk

epmink - Minimum of the solute-solvent pair energy distributions (in kcal/mol)
epdivk - Grid size of the solute-solvent pair energy distributions (in kcal/mol), assuming that there are 100 grid points.

This calculation will generate the distribution of the solute solvent energies for solvents in the first shell, as well as the sum of solute solvent energies, summed both for all solvent is the first shell and for all solvents in the whole proximity region.

III.10.15. Solvent-solvent proximity distribution calculation

PXWW Key1 Key2 Data

Prerequisite: PXCR
Potential successor: PXAN

Key1 (coordination number and pair energy distr. calc.):

NOPE - No proximity solvent-solvent coordination number and pair-energy averages will be calculated
PEWW - Calculate proximity solvent-solvent coordination number and pair-energy averages
Key2 (g(r) calculation):
- NOGR - No proximity solvent-solvent g(r) will be calculated
- GRWW - Calculate proximity solvent-solvent g(r)
  Data: rsolmin, rsolmax, [rfsww, rceww,] [rgpxmax, rpxdiv, nijgvv, nijgvv times: islvirep(i), islvjrep(i) ...]
  - rsolmin, rsolmax: solvent-solvent proximity analysis will use only solvents whose distance from their proximal solute atom is in the range [rsolmin,rsolmax]
    If Key1 = PEWW:
    - rfsww: the solvent first shell radius to be used in the coordination number and pair energy calculations.
    - rceww: the solvent binding energy calculations will include all solvents within a sphere of radius rceww.
    If Key2 = GRWW:
    - rgpxmax {10.0} - Range of the solvent-solvent g(r)'s
    - rpxdiv {0.1} - g(r) grid size to be used. If the key PXGR is also present, the larger of the rgpxmax values will be used and the two rpxdiv values have to be the same.
    - nijgvv {0} - The number of different solvent-solvent RDF's to calculate.
    - islvirep(i), islvjrep(i): the i-th solvent-solvent RDF will be calculated between solvent centers islvirep(i) and islvjrep(i).

The solvent-solvent coordination number is the mean number of solvents within a sphere of radius rfsww, averaged over all solvents considered. The solvent-solvent pair energy is the average of water-water energies, calculated for waters within a distance of rfsww and at least one of the pair being considered for the solvent-solvent analysis. The solvent-solvent binding energy is the sum of water-water energies, averaged fover all solvents considered.

III.10.16. Volume element preservation/regeneration

VOLE Key1

Can not be followed by: RCKP Key1:

DROP - Volume element estimates are kept for successive runs using the same solute
KEEP - Volume element estimates are regenareted for each run.

III.10.17. Voronoi polyhedra analysis

VORO Key1, Key2, Key3, Data

Prerequisites: CNFG, PBCN
Potential successor: PXAN

Key1 (use of PBC):

PBC - Analyze with periodic images included
NPBC - Analyze with periodic images ignored
Key2:
- CONF - Analyze single configuration
- TRAJ - Analyze trajectory
  Key3:
  - ALL - Use all atoms in Voronoi construction
  - NOHD - Ignore hydrogens in Voronoi construction

NOT IMPLEMENTED YET

III.10.18. Solvent residence time calculation

RTIM Key1, Data

Prerequisite: PXCR
Potential successor: PXAN
Variable to set to 'T' in the preprocessor: DR

Key1 (shell boundary)

FRST - Residence time within first shell
SCND - Residence time within first two shells
READ - Residence time within user-defined shell radius
Data: timestep,
aucmin,
nzerosignore, rtimlim
- timestep - The simulation timestep used
- aucmin - The exponential fit will use the autocorrelation function until its value drops below aucmin
- nzerosignore {0} - In the list of 0's and 1's representing out and in the solvation shell, resp., set to one zeros when there are at most nzerosignore zeros are between ones.
- rtimlim {10.0} - Residence times will be calculated for staying within rtimlim Å of the solute (only when Key1=READ)

The program will calculate the autocorrelation function of the residence function (0: out; 1: in) and save it on a file <jobname>.rtm to be processed into actual times by a seperate program.

III.10.19. Solvent diffusion constant calculation

DIFC Data

Prerequisite: PXCR
Variable to set to 'T' in the preprocessor: DR

Data: timestep, dcgrid, rdc1, rdc2

timestep - The simulation timestep used
dcgrid - The gridsize (in Å) for the diffusion calculation
rdc1 {0.0}, rdc2 {10.0}, - Diffusion constants will be calculated for solvents whose distance from the solute is in the range of [rdc1,rdc2] Å (in addition to calculations for the first and second solvation shell).

The program will calculate the conformational average of the solvent displacements in each proximity region; within the first shell, the first two shells and the range defined by rdc1 and rdc2.

III.10.20. Torsion angle autocorrelation calculation

TAUC Data

Prerequisite: PART

Data: nmctorauc,tauc_timestep, tauc_min, ntaucprint

nmctorauc - The torsion angles are saved at every nmctoraucth step for autocorrelation calculation.
tauc_timestep {1} - The time elapsed between two samples
tauc_min {0.0} - The minimum of the autocorrelation function to be used for the fit
ntaucprint {20} - the number of autocorrelation function values to print (evenly distributed)

III.11. Numerical precision related keywords

HRDW: Computer architecture
SLFT: Self test
SLFT: Self test tolerances
FIXD: Round-off error elimination
REGE: Regenerate from torsion angles

III.11.01. Computer architecture-dependent algorithm selection

HRDW Key1

Potential successor: NSLV

Key1 (code-determining hardware selection):

SCAL - 'Conventional' scalar machine (32-bit words)
VC32 - Vector machine, 32 bits word, requiring split loops to preserve argument order (e.g., SGI, IBM)
VC64 - Vector machine, 64 bits word (e.g., Cray)

III.11.02. Periodic self test

SLFT Key1 [Key2] [Data]
Related key: PDBT.

At each nrep-th (see key RUNS) step the program performs a consistency check as specified by Key1.

Key1 (self test level):

DEFL - Only default testing, i.e., tests at the start of a run and the compulsory test (to turn off previously invoked self testing or just allow to read in Key2 or Data)
BASC - Basic test (equivalent to one move's worth of computations) on moved molecule: consistency check on energy and forces, center of mass, orientation.
FULL - Full test (equivalent to initial setup calculation): check energy, virial sum and coordination numbers by recomputing them from scratch; check symmetry of near-neighbor bitmap; check random number generator; for vector hardware algorithms (HRDW VC**), check energy with nonvectorized routines.
ALL - FULL test plus BASC test on all molecules.
Key2 (action when test fails):
- STPD, IGND, FIXD, or FXDI - described above for the keyword RCKP.
- STWD - If no inconsistency was found, save a copy of the checkpoint file to <jobname>.99.ckp; if any inconsistency was found stop (like for STPD).
  Data: nmcslftst
  - nmcslftst {5000000} Self tests performed at every nmcslftst-th MC step.

III.11.03. Input self-test tolerances

STOL Data

Prerequisite: CNFG

Data: engtol, virtol, tortol, comtol, zmattol, cslttol, d12tol, d13tol, wsumtol
Self test tolerances for various quantities:

engtol {0.0001} - for relative deviation in energies
virtol {0.001} - for relative deviation in virial sums
tortol {0.1} - for torsion angles
comtol {0.001} - for centers of masses
zmattol {0.0001} - for orientation matrix orthonormality
cslttol {0.001} - for solute atom positions (on distance squares)
d12tol {0.1} - for bond lengths (see LOOP LIMM)
d13tol {0.3} - for 1-3 distances (see LOOP LIMM)
wsumtol {0.0001} - preferential sampling weight sums

The default of engtol is relaxed by a factor of 10 for (TPN) ensemble run (see key IBEN), for runs with more than 1000 solute atoms or 1000 solvent molecules, for systems with active torsions (see key PART), or when the C@G7 lines are active; the default of tortol is relaxed to 20.0 when loop moves don't even touch the rest of the chain (see LOOP LIMM); the default of cslttol is relaxed to 0.001 for systems with more than 200 solute atoms.

III.11.04. Energy contribution recalculation to eliminate round-off errors

FIXD Key 1 Data

Key1: (action on self-test failure):

STPD - Stop run if self-test run before energy recalculation fails
IGND - Ignore if self-test run before energy recalculation fails
FIXD - Attempt to fix if self-test run before energy recalculation fails, stop if fix is unsuccesfull
FXDI - Attempt to fix if self-test run before energy recalculation fails, but run no matter how the fix worked
Data: nmcacord, nmcgcord
- nmcacord - Recalculate from scratch all the current energy contribution values and solute geometries after every nmcacord MC step.
- nmcgcord - Recalculate from scratch only the solute geometries after every nmcacord MC step.

There are several quantities that the program calculates by accumulating the changes to them. Round-off errors may cause some self tests to fail. This option eliminates periodically these round-off errors.

III.11.05. Regenerate solute from torsion angles to eliminate round-off errors

REGE

Prerequisite: TORD, CNFG

When this keyword is invoked, the program takes the torsion angles read from the .crd file and applies them to the solute molecule given as input after the SLTA key. The rigid core of the molecule will be taken from the .crd file. This way the input molecule's bond lengths and bond angles will be made to agree with the input definition.

III.12. Developpers' keywords

TEST: Special code-testing
OPTN: Direct option input
DBUG: Input debug keys
listing
CHKP: Checkpoint file saving frequency

See also the key OUTP PROP.

III.12.01. Special code testing

TEST Key1 [Data]

Key1: (test type):

PBCT - Periodic boundary condition test by system translation
Data: xcn, ycn, zcn
- xcn {10.0} - x coordinates of the new center
- ycn {20.0} - y coordinates of the new center
- zcn {30.0} - z coordinates of the new center
The program shifts the system to (xcn,ycn,zcn) and applies the PBC shifts to the part of the system that is found to be outside the cell. This will result in a system that gives a different representation of the same periodic system as before. Comparison of the total energy (and its components) calculated on the system with and without the shift should not show any difference (apart from round-off related deviations). Any difference should be indicative of incorrect PBC handling.
CPLW - Test coupling parameter weight calculation
FSLT - Test solute force calculation by finite difference quotients
Data: nfrq, dx
- nfrq {1} - every nfrq-th solvent molecule (starting with the first) will be tested
- dx {0.1} - the delta x value for the finite difference calculation
FSLV - Test solvent force calculation by finite difference quotients
Data: nfrq, dx
- nfrq {1} - every nfrq-th solute atom (starting with the first) will be tested
- dx {0.1} - the delta x value for the finite difference calculation

III.12.02. Direct option input

OPTN Data

Data: nopt, nopt times: index, value

index, value - Set the internal option array iop as iop(index)=value.

This key allows to set the internal option arrays without the keyword mechanism normally used.

III.12.03. Input debug keys

DBUG Data

Data: nkeys, nkeys times: index, value

index, value - Set the internal debug option array (idebug) as idebug(index)=value.

numbers and the file unit numbers will be also listed. If

III.12.04. Checkpoint file saving frequency

CHKP Data

Potential successor: PXAN

Data: nrecd

nrecd - After every nrecd steps, information in all of the common blocks are saved on the .ckp file. This option overrides the default checkpoint file saving frequency allowing the localization of a program failure (detected usually by a failed self test). See also the key SCKP for additional options.
III.01.15. History file buffer save
BUFF
When this key is present, the buffers of the open history files are written out whenever a new checkpoint file is saved. This is currently achieved by rewindig the file and skipping to the end.

III.13. Keyword summary

AROM 05.18 Aromatic carbon list
ATFR 09.03 Calculation of mean force on solute atoms
BLKW 09.27 Bulk density monitoring
BNDL 08.05 Listing of bonds found
BRKB 04.05 Break bonds
BTUN 02.07 Tune GCE B parameter
BUFF 01.15 History file buffer save
CPOC 09.13 Cavity/pocket occupancy
CHKP 12.04 Checkpoint file saving frequency
CHRG 05.15 Solute charge manipulations
CLON 04.03 Solute cloning
CMPC 09.09 Configuration comparison
CNFG 02.11 Initial configuration
CSEG 04.11 CHARMM segment id handling
CNST 05.23 Constraint potential
DBLG 10.07 Double proximity radial gridsize
DBUG 12.03 Input debug keys
DENF 09.10 Combined solvent conformation
DIFC 10.19 Solvent diffusion constant
DIPC 10.08 Dipole fluctutations
DSTC 09.01 Distribution calculation
DSTP 08.03 Distribution print
ENGL 09.06 Energy contribution calculation
ENHB 05.20 Hydrogen-bond term
EPLT 08.09 Energy convergence plot form
FCGA 09.02 Functional group analysis
FCGD 10.03 Functional group definition
FILE 01.01 File name root
FILT 09.07 Solvent filtering
FIXD 11.04 Round-off error elimination
FLDG 09.04 Field gradient calculation on solvents
FLXR 05.22 Residues to sample
FREE 03.01 Free energy options
GABF 09.18 Generic site graph analysis by frame
GCEN 02.05 Grand-canonical ensemble
GENS 09.14 Generic solvation site calculation
GENT 09.19 Configuration with input torsion angles
GENV 10.09 Proximity volume element
GRAV 10.10 Proximity RDF averageing
GSAN 09.17 Generic site analyses
HBBR 09.08 Hydrogen-bond bridges
HBMO 06.21 Non-default hydrogen-bond
HRDW 11.01 Computer architecture
IBEN 02.09 (TPN) ensemble
IDAC 01.08 Proximity
IDAG 09.23 Aggregate insertion and deletion sites
IDLG 01.09 I/D log file
IGSV 06.18 Non-interactive solvent
IGJA 06.16 Ignore Jacobian
INCT 05.11 Solvent inner cutoff
IWSL 09.15 Prepare site assignment file, entropy input
KMNP 08.08 PDB file property threshold
LCMP 02.10 Virial biasing factor
LIGA 02.16 Multi-copy ligand
LIMG 02.08 Limit the cavity grid
LOOP 06.15 Loop torsion move
LPST 09.22 Loop solution list
MAKB 04.04 Make bonds
MINE 01.07 Minimum energy structure extr into history file
MIXR 05.03 LJ parameter mixing rule
MODA 05.06 Element modification
MOLD 04.08 Define solute molecules
MOND 09.26 Bulk density monitoring
MOVE 06.01 Move selection
MVRT 06.10 Alternate translation and rotation
MV2S 06.08 Correlated 2-solute molecule move
NFBU 06.03 Solute force bias
NMVP 08.11 Print solvents not moved
NONB 06.13 Non-Boltzmann samplings
NSLV 02.01 Number of solvent moleculess
OPTN 12.02 Direct option input
OUTP 08.01 Print various system characteristics
OVRA 03.06 Overlap ratio evaluation
OVST 02.14 Solute copy overlay
PARD 06.06 Solute molecule displacement
PART 06.09 Solute torsion
PBCT 11.01 Periodic boundary condition test by system translation
PBCN 02.04 Periodic boundary conditions
PBGR 02.16 Solute reset with grids
PDBT 08.14 PDB atomname convention
PFRD 05.07 Potential library input
PLBP 08.07 Potential parameter echo
PLCV 08.15 Postscript convergence plots
PMOD 05.05 Potential library modifications
PRAC 08.04 Acceptance rate print detail
PRCO 08.06 Compilation option listing
PRFI 06.11Insertion/deletion pref. sampl.
PRMF 05.04 Torsion potential specification
PRNT 08.02 Input echo level
PROT 05.12 Minimum repulsion for general solvent
PRPL 08.10 Plotted data print
PRTG 09.11 Print cavity grids
PXAN 07.03 Proximity analysis frequencies
PXBE 10.14 Solute-solvent proximity
PXCR 10.02 Proximity criterion choice
PXDP 10.12 Solute-solvent proximity dipole distr. calculation binding energy calculation
PXTD 10.11 Proximity table difference calculation
PXGR 10.13 Solute-solvent proximity g(r) calculation
PXLM 10.04 Omitting atoms from proximity considerations
PXPA 08.12 Empty proximity region print
PXPL 01.10 Proximity analysis distribution plot file writing
PXPR 08.13 Proximity analysis printing options
PXWR 01.11 Proximity information file
PXWW 10.15 Solvent-solvent proximity distribution calculation
PXYZ 10.01 Permute coordinates read
RAUS 03.02 AUS parameter change
RCKP 07.04 Checkpoint file read
RDBD 07.04 Bond list input
REGE 11.05 Regenerate from torsion angles
RFCR 05.14 Reaction-field correction
RFSL 10.05 Set first shell radii
RMCK 10.05 Checkpoint file removal
RMOD 10.06 First shell radius modifications
RNDG 06.19 Random number generator information
RTIM 10.18 Solvent residence time
RUNS 07.01 Run MC simulation
SAMP 06.02 Sampling technique
SANN 02.19 Simulated annealing
SACP 02.30 Simulated annealing of the chemical potential
SCAL 02.13 Scale simulation cell
SCAN 07.02 Scan trajectory for proximity analysis
SCKP 01.04 Archive checkpoint files
SCRM 02.12 Scramble torsion angles
SEED 06.11 Random number seed input
SENS 09.05 Sensitivity analysis
SETC 05.14 Set various constants
SKWT 06.17 Skewed torsion sampling
SLFT 11.02 Self test
SLTA 04.01 Solute description
SLVA 04.02 Solvent description
SPPS 05.17 Special potential
SPRD 02.15 Spread out solute molecules
SPST 05.16 Field-dependent potentials
STEP 06.04 Stepsizes
STIR 01.13 Contact elimination
STOL 11.03 Self test
STOP 07.05 Stop the run
STPS 06.05 Stepsize scaling
STPX 02.06 Stop GCE run at nx
STSC 06.14 Temporary stepsize scaling
STUN 02.17 Stepsize tuning
STVG 09.12 Estimate solute volume
SUPI 09.24 Trajectory superimposition
SUPT 05.01 Solute potential type
SUVC 05.08 Solute-solvent cutoff
SUUC 05.09 Solute-solute cutoff
SVDP 09.25 Solvent dipole distribution
SVIN 05.19 Solvation parameter input
SVPT 05.02 Solvent potential type
SVVC 05.10 Solvent-solvent cutoff
SWAP 06.07 Solute molecule swap
TAC0 02.18 Stepsize accumulator reset
TAND 09.20 Torsion angle distribution
TAUC 10.20 Torsion angle autocorrelation
TEMP 02.02 Simulation temperature
TITL 02.03 Description
TIQU 03.05 TI quadrature
TORD 04.10 Torsion angle definitions
TORT 09.21 Energy analysis along a torsion angle
TRAJ 01.04 History file specification
VOLE 10.16 Volume element regeneration
VORO 10.17 Voronoi analysis
VVNE 05.24 Solvent-solvent electrostatic
WCKP 01.03 Write checkpoint file
WCNF 01.02 Configuration save
WMAT 03.03 PMF matching
WPLT 03.04 AUS iteration plot
WTRA 01.12 Trajectory conversions

IV. EXAMPLES

In this section a number of commented input files are presented illustrating several different options.

Glycine in water - minimal input; restart from checkpoint
Ethanol in ethanol (neat liquid)
Grand-canonical ensemble
Constant pressure ensemble
Dimethylphosphate-Sodium complex with freely moving sodium
Torsion angle sampling
Potential of mean force between dimethyl phosphate and sodium ions with adaptive umbrella sampling
Creation/annihilation polynomial path thermodynamic integration
Finite difference thermodynamic integration
Widom insertion method
Proximity analysis, trajectory filtering
Conformation filtering
Generic site calculation
Primary hydration shell calculation
Cavity and pocket determination

IV.1. Glycine in water - minimal input; restart from checkpoint file

This is a simple canonical ensemble MC run of a simple solute in water.

FILE glycine ! File names start with glycine.
! Run # = 1
NSLV 215      ! One glycine and 215 waters
TEMP 298.0     ! 298 Kelvin
SVVC SPCC 7.75 ! Solvent-solvent cutoff=7.75 �
SUVC MICC 0.0  ! MI on the solute
PBCN FCC  14.81526   ! Face-centered cubic cell
STEP 0.15 10.0 0.55 40.0 40
! Solute and solvent stepsizes, slt move frequency
SUPT AM94   ! Solute-solvent potential is AMBER
SVPT TIP4 TIP4!Solvent-solvent potential: TIP4P
SLTA SMPL MMC READ  10       ! 10 solute atoms
   35     2.131    -1.202    -0.377    -.278623
   34     2.096     1.058     0.348    -.260774
   33    -0.549    -1.370     0.0      -.385556
    1     1.526     0.0       0.0       .295148
   18     0.0       0.0       0.0      -.040730
   20    -1.602    -1.374     0.044     .147624
   20    -0.249    -1.903    -0.836     .158476
   22    -0.351     0.514    -0.895     .061766
   22    -0.381     0.536     0.868     .086565
   23     3.068    -0.947    -0.337     .216103
CNFG RANC ASCI SIZE !Random initial configuration
! Run 100000 steps
RUNS 100000 10000 25000 0100000  100000  100000
! Run  an other 100000 steps, starting from the end of the first run.
! Run # = 2
RUNS 100000 10000 25000 0100000  100000  100000
STOP

Click HERE to view the output of this run

Files needed for run: none
Files created by the run:

glycine.crd: initial configuration;
glycine.2.crd: final configuration after the first run
glycine.3.crd: final configuration after the last run
glycine.ckp: checkpoint file after the first run ;
glycine.2.ckp: checkpoint file after the last run ;

Suppose we decide to extend the run for an other 100000 steps. It can be done by reading in the checkpoint file created after the run:

! Extend the previus run with an other 100000 steps
FILE glycine 2! Set Run # to 2
RCKP ! Restore the status from the checkpoint file
! glycine.2.ckp
RUNS 100000 10000 25000 0100000  100000  100000
STOP

Click HERE to view the output of this run

Files needed by the run:

glycine.2.crd: starting configuration of the second run
glycine.2.ckp: checkpoint file after the second run

Files modified by the run:

glycine.3.crd: final configuration after the last run
glycine.2.ckp: checkpoint file after the last run

IV.2. Ethanol in ethanol (neat liquid).

This input runs a MC simulation in liquid ethanol. One ethanol molecule is considered the 'solute'

FILE etnl
PRNT ECHO       ! Echo formatted input
TITL Ethanol
HRDW SCLR       ! Scalar machine
SVVC SPCC 10.0  ! Solvent-solvent cutoff
SUVC SPGC 10.0  ! Solute-solvent cutoff
PBCN RECT 26.0
! Unit cell is a cube with 26 � egdes
NSLV 181     ! One solute and 181 solvents
TEMP 298.0   ! Simulation temperature: 298 K
MOVE SHCY
! Molecules are selected by the shuffled cyclic scheme
STEP  0.50 30.0   0.50  30.0 40
! Solute and solvent stepsize params
SUPT AM94        ! AMBER solute
SVPT GENL AM94 ! General solvent, AMBER library
SAMP METC        ! Metroplis sampling
SLVA 9      ! Read 9 solvent atoms
   18    3.108016  0.653187 -8.526237 -0.198000
   22    2.814817 -0.348721 -8.761003  0.066000
   22    2.517394  1.015322 -7.710808  0.066000
   22    2.956134  1.278341 -9.381230  0.066000
   18    4.596542  0.674197 -8.131964  0.131000
   22    4.748424  0.049042 -7.276970  0.066000
   22    5.187164  0.312062 -8.947392  0.066000
   36    4.988388  2.013194 -7.818209 -0.566000
   23    5.982338  2.026292 -7.548412  0.303000
SLTA  SMPL MMC READ 9   !read 9 solute atoms
   18    3.108016  0.653187 -8.526237 -0.198000    1
   22    2.814817 -0.348721 -8.761003  0.066000    1
   22    2.517394  1.015322 -7.710808  0.066000    1
   22    2.956134  1.278341 -9.381230  0.066000    1
   18    4.596542  0.674197 -8.131964  0.131000    1
   22    4.748424  0.049042 -7.276970  0.066000    1
   22    5.187164  0.312062 -8.947392  0.066000    1
   36    4.988388  2.013194 -7.818209 -0.566000    1
   23    5.982338  2.026292 -7.548412  0.303000    1
DSTC NONE ! No distribution function calc.
CNFG READ ASCI NOFX
! Read initial coordinates in ASCII from etnl.crd
RUNS 1000000 10000 500000 100000 10000
!Run 1M steps
RUNS 1000000 10000 500000 100000 10000
!The second run will start at the end of the first.
STOP

Click HERE to view the output of this run

Files needed for run:
etnl.crd: initial configuration.
Files created by the run:

etnl.ckp: checkpoint file after the first run ;
etnl.2.ckp: checkpoint file after the second run;
etnl.2.crd: final configuration after the first run;
etnl.3.crd: final configuration after the second run.

IV.3. Grand-canonical ensemble

This input runs MC simulation in the grand-canonical ensemble. The solute is the protein BPTI (from the PDB).

FILE bpti
PRNT DETL !Echo formatted data, print extra inf
HRDW VC32          ! 32-bit vector
SVVC SPCC 7.75     ! Solvent cutoff
SUVC MIGC 0.0      ! MI on the solute
PBCN RECT 24.87  40.39 66.2 !Rectangular PBC
MOVE RAND          ! Random selections
TEMP 298
NSLV 10   !10 solvent molecules to start with
STEP  0.00   00.0   0.55    40.0  60
! slt, slv stepsize, slt move freq.
SVPT TIP4 TIP4
SUPT AM94
SAMP FBSC 0.5  ! Scaled force-biased sampling
    2
 4.00   0.2   7.0     1.0
PMOD AM94 1
   75   27
    8            1.6     0.2
SLTA  SMPL MMC READ 892
 N3      -2.55400   7.05800  12.80100  -0.37640    1  ARG N
   .....
   .....   891 more lines describing solute atoms ....
   .....
DSTC NONE
GCEN CAVB RSIG ALTI
! Grand-canonical ens., cavity biased,
! LJ sigma-based cavity radii, alternating i/d
    1.0    2.5      1000.0       30   30   30 1000    1    1   0000100     50000
CNFG RANC ASCI SIZE
!Random configuration of 10 (from NSLV) waters
! using the given cell size and writing ASCII file
RUNS 10000  100 25000 100000 100000 100000
STOP

Click HERE to view the output of this run

Files needed for run: none.
Files created by the run:

bpti.ckp: checkpoint file after the run;
bpti.crd: initial configuration.
bpti.2.crd: final configuration after the run.

IV.4. Constant pressure ensemble run

This input runs MC simulation in the isothermal-isobaric ensemble. The solute is the protein a Na+ - DMP- ion pair.

FILE dmpna
TITL run 2 slt molecs in water, only moves
TITL NA+DMP-: KOLLMAN-STRAATSMA(NA+)
HRDW VC32          ! 32-bit vector
SVVC SPCC 7.75     ! Solvent cutoff
SUVC MIGC 0.0      ! MI on the solute
PBCN RECT 32.031  21.354 21.354
TEMP 298
NSLV 484
STEP  0.10   00.0   0.55    40.0 05
SVPT TIP3 TIP3
SUPT AM94
SAMP FBSC 0.5  ! Scaled force-biased sampling
    2
 4.00   0.2   7.0     1.0
PMOD AM94 1   ! Potential library modification
   49   46
   11  1.59955E+00     0.478833E-01
SLTA  SMPL MMC READ 8
   49     2.70000   0.0       0.00000   1.00000    1
   38     0.0       0.0       0.0       0.91200    4
   35     0.74000   0.0       1.28171  -0.65500    4
   35     0.74000   0.0      -1.28171  -0.65500    4
   37    -0.99164  -1.25560   0.00000  -0.41000    4
   37    -0.99164   1.25560   0.00000  -0.41000    4
    4    -1.92075   1.43300   1.07251   0.10900    4
    4    -1.92075  -1.43300  -1.07251   0.10900    4
IBEN UNIF ISOT 1.0 250.0 500 100 !(TPN) ensemble
CNFG READ ASCI
RUNS 2000000 50000 1000000 0100000 100000
STOP SLFT

Click HERE to view the output of this run

Files needed for run:

dmpna.crd: initial configuration.

Files created by the run:

dmpna.ckp: checkpoint file after the run;
dmpna.crd: initial configuration.
dmpna.2.crd: final configuration after the run.

IV.5. Dimethylphosphate-Sodium complex with freely moving sodium

This input show how to allow tranlational/rotational degrees of freedom for molecules considered 'solute' by the program.

FILE dmpna
TITL Na+DMP-: Kollman-Straatsma(Na+)
TITL SPC water. H-R PBC
HRDW VC32          ! 32-bit vector algorithm
SVVC SPCC 7.75     ! Solvent spherical cutoff
SUVC MIGC 0.0
! MI on the solute based on group center distances
SUUC MIMC          ! Molec-based MI for intrasolute
PBCN RECT 32.031  21.354 21.354
! Rectangular PBC, long box
TEMP 298   |  NSLV 484
!Simulation at 298 Kelvin with 485 solvent molecules
STEP 0.1 0.0 0.55 40.0 10
! 0 slt stepsize, 0.55 Å, 40 deg solvent steps
SVPT TIP3 TIP3 ! Solvent-solvent pot: TIP3P
SUPT AM94      ! Solute-solvent pot: AMBER
SAMP FBSC 0.5  ! Scaled force-biased sampling
    2
 4.00   0.2   7.0     1.0
MOVE PRSP NRST
! Preferential sampling based on nearest slt atom dist
    3
 3.00   3.5   4.0    03.0    7.0    1.0
PMOD AM94 1    ! Potential library modification
   49   46
   11  1.59955E+00     0.478833E-01
! Solute's "own" solvent will be used for slt-slv
SLTA  SMPL MMC READ 8
! 8 solute atom - regular solute, read from input stream
   38     0.0       0.0       0.0       0.91200    1
   35     0.74000   0.0       1.28171  -0.65500    1
   35     0.74000   0.0      -1.28171  -0.65500    1
   37    -0.99164  -1.25560   0.00000  -0.41000    1
   37    -0.99164   1.25560   0.00000  -0.41000    1
    4    -1.92075   1.43300   1.07251   0.10900    1
    4    -1.92075  -1.43300  -1.07251   0.10900    1
   49     2.70000   0.0      -0.00000   1.00000    2
PARD UNIF SHCY 1.0 0.3 0.0 1.0 0.3 0.0 ~
1.0 0.3 0.0 1.0 -1 ! Last slt group is independently moved
DSTC NONE   ! No distribution function calc
SLFT BASC   ! Basic self tests performed
CNFG READ ASCI ! Input from dmpna.crd in ASCII
RUNS 2000000 50000 1000000 0100000 100000
! Run 2M steps
STOP

Click HERE to view the output of this run

Files needed for run:
dmpna.crd: initial configuration.
Files created by the run:

dmpna.ckp: checkpoint file after the first run;
dmpna.2.crd: final configuration after the run.

IV.6. Torsion angle sampling

This input show how to sample torsional degrees of freedom for molecules considered 'solute' by the program.

FILE aa3
TITL Three amino acids in vacuum
TITL Solute molecules move, swap and torsions
HRDW VC32          ! 32-bit vector
SVVC SPCC 12.00    ! Solvent cutoff
SUVC MIGC 0.0      ! MI on the solute
SUUC MIMC          ! Molec-based MI for intrasolute
PBCN HEXG 99.00  99.00  99.00    !hexagonal PBC
TEMP 998
NSLV 0   ! Only solute
STEP  0.00   00.0   0.55    40.0 10
SVPT TIP3 TIP3 ! Solvent-solvent pot: CHARMM
SUPT CHRM
SLTA  SMPL MMC READ 34
 NH1      1.24900   2.31500  -0.79400  -0.47000    1  ILE N
 CT1      1.55500   3.73000  -0.91700   0.07000    1  ILE CA
 C        1.31700   4.34000   0.45300   0.51000    1  ILE C
 O        2.09800   5.17100   0.91200  -0.51000    1  ILE O
 CT1      0.73100   4.43400  -2.05000  -0.09000    1  ILE CB
 H        0.48500   2.01200  -1.35900   0.31000    1  ILE HN
 HB       2.61200   3.83400  -1.12700   0.09000    1  ILE HA
 HA      -0.33600   4.20200  -1.80500   0.09000    1  ILE HB
 NH1      1.49900   2.78700   3.28800  -0.47000    2  LEU N
 CT1      2.55800   2.23900   4.11700   0.07000    2  LEU CA
 C        3.78000   3.10300   3.86100   0.51000    2  LEU C
 O        4.51100   3.44400   4.79000  -0.51000    2  LEU O
 CT2      2.81700   0.74400   3.78200  -0.18000    2  LEU CB
 CT1      3.92200   0.01400   4.58300  -0.09000    2  LEU CG
 CT3      4.04200  -1.44600   4.12400  -0.27000    2  LEU CD1
 CT3      3.68900   0.07700   6.10000  -0.27000    2  LEU CD2
 H        1.16000   2.16100   2.58800   0.31000    2  LEU HN
 HB       2.28700   2.35700   5.15800   0.09000    2  LEU HA
 HA       1.86200   0.19700   3.95500   0.09000    2  LEU HB1
 HA       3.03300   0.64900   2.69400   0.09000    2  LEU HB2
 HA       4.90300   0.49900   4.36300   0.09000    2  LEU HG
 HA       4.86700  -1.95800   4.66500   0.09000    2  LEU HD11
 HA       3.09800  -1.99800   4.32300   0.09000    2  LEU HD12
 HA       4.25600  -1.49900   3.03500   0.09000    2  LEU HD13
 HA       4.49200  -0.47200   6.63700   0.09000    2  LEU HD21
 HA       3.69100   1.12200   6.47100   0.09000    2  LEU HD22
 HA       2.71400  -0.38500   6.36400   0.09000    2  LEU HD23
 NH1      3.65600   6.57500   5.52300  -0.47000    3  GLY N
 CT2      3.90500   6.38700   6.93600  -0.02000    3  GLY CA
 C        5.38300   6.32500   7.16600   0.51000    3  GLY C
 O        5.89600   6.89700   8.12500  -0.51000    3  GLY O
 H        3.16200   5.83500   5.07000   0.31000    3  GLY HN
 HB       3.47900   5.44100   7.23900   0.09000    3  GLY HA1
 HB       3.51900   7.24100   7.47600   0.09000    3  GLY HA2
PRMF CHRM par_all22_prot_na.inp
!Torsion pot param file
PARD UNIF RAND 1.0 0.5 30.0 1.0 0.5 30.0 ~
 1.0 0.5 30.0 1.0 34 ! Note the continuation character ~
PART UNIF RAND RAND 1.0
!Torsion strategy and frequency
TORD INPT 7  !Definiton of active torsions
    5    2     180.0      30.0    1
    2    3     180.0      30.0    1
   10    9     180.0      30.0    2
   10   11     180.0      30.0    2
   14   15     180.0      30.0    3
   13   14     180.0      30.0    4
   14   16     180.0      30.0    4
CNFG RANC ASCI
SWAP 0.2 !Specify swapping frequency
RUNS 100000 100  50000 10000 5000
STOP SLFT

Click HERE to view the output of this run

Files needed for run:

aa3.crd: initial configuration.
par_all22_prot_na.inp: torsion potential parameters

Files created by the run:

aa3.ckp: checkpoint file after the run;
aa3.crd: initial configuration.
aa3.2.crd: final configuration after the run.

IV.7. Potential of mean force between dimethyl phosphate and sodium ions with adaptive umbrella sampling

This input performs one type of free energy simulation: it calculates the potential of mean force between two solute molecules using adaptive umbrella sampling.

FILE dmpna_pmf     ! SPC water. H-R PBC
TITL Na+DMP-: Kollman-Straatsma(Na+),
TITL #1   4 Å range  MI/RC G/A  Run A
HRDW VC32          ! 32-bit vector hardware
SVVC SPCC 7.75     ! Solvent-solvent cutoff
SUVC MIGC 0.0      ! MI on the solute
PBCN RECT 32.031 21.354 21.354 !Rectangular PBC
MOVE SHCY          ! Shuffled-cyclic selections
TEMP 298   |  NSLV 485
STEP  0.10 0.0 0.55  40.0  10
! 0.1 Åslt stepsize, no slt rotation, 0.55 Å,
!40 deg solvent steps; solute moves at every 10th MC step
SVPT TIP3 TIP3 ! Solvent-solvent pot: TIP3P
SUPT AM94      ! Solute-solvent pot: AMBER
SAMP FBSC 0.5  ! Scaled force-biased sampling
    2
 4.00   0.2   7.0     1.0
FREE PMF1 WRMM GEOC AUSE 2
!2-part PMF both parts of the slt moves
                      0.00      0.25      0.050     100.0     0.1
    2    0    3    1   0.01     17.      0.0       0.0      0.00002    0.9
    4    5    5   05    2.43      0.75
PMOD AM94 1    ! Potential library modification
   49   46
   11  1.59955E+00     0.478833E-01
SLTA  SMPL MMC READ 24 16 8
!3*8 solute atom - regular solute, last 1 atom is in the 2nd part
   38     0.0       0.0       0.0       0.91200    1
   35     0.74000   0.0       1.28171  -0.65500    1
   35     0.74000   0.0      -1.28171  -0.65500    1
   37    -0.99164  -1.25560   0.00000  -0.41000    1
   37    -0.99164   1.25560   0.00000  -0.41000    1
    4    -1.92075   1.43300   1.07251   0.10900    1
    4    -1.92075  -1.43300  -1.07251   0.10900    1
   49     2.70000   0.0       0.00000   1.00000    2
   38     0.0       0.0       0.0       0.91200    1
   35     0.74000   0.0       1.28171  -0.65500    1
   35     0.74000   0.0      -1.28171  -0.65500    1
   37    -0.99164  -1.25560   0.00000  -0.41000    1
   37    -0.99164   1.25560   0.00000  -0.41000    1
    4    -1.92075   1.43300   1.07251   0.10900    1
    4    -1.92075  -1.43300  -1.07251   0.10900    1
   49     6.70000   0.0      -0.00000   1.00000    2
   38     0.0       0.0       0.0       0.91200    1
   35     0.74000   0.0       1.28171  -0.65500    1
   35     0.74000   0.0      -1.28171  -0.65500    1
   37    -0.99164  -1.25560   0.00000  -0.41000    1
   37    -0.99164   1.25560   0.00000  -0.41000    1
    4    -1.92075   1.43300   1.07251   0.10900    1
    4    -1.92075  -1.43300  -1.07251   0.10900    1
   49     2.70000   0.0       0.00000   1.00000    2
DSTC NONE
CNFG READ ASCI NOFX
! input from dmpna_pmf.crd in ASCII
RUNS 5000000 20000 2500000 0  100000 200000
STOP

Click HERE to view the output of this run

Files needed for run:
dmpna_pmf.crd: initial configuration.
Files created by the run:

dmpna_pmf.ckp: checkpoint file after the first run;
dmpna_pmf.2.crd: final configuration after the run.

IV.8. Creation/annihilation polynomial path thermodynamic integration

This is an example of a different free energy simulation type: calculation the solvation excess Helmholtz free energy of NO in water with polynomial thermodynamic integration over a creation/annihilation path.

FILE NO_ti 10
TITL Excess Helmholtz free energy of solvation
of NO in water
TITL Calculated by three-point Gaussian
quadrature
PRNT ECHO
HRDW VC32          ! 32-bit vector
SVVC MINI         ! Solvent cutoff
SUVC MIGC 0.0      ! MI on the solute
PBCN RECT 14.74    !Rectangular PBC
MOVE RAND          ! Random selections
!Set creation/annihilation TI using the first quadrature point
!Lambda exponents: 4, 3, 2 (for 1/r^12, 1/r^6, 1/r terms, resp.)
FREE TICA  NOMX
 4.0   3.0  2.0     0.112702
TEMP 298
NSLV 108
STEP  0.00   00.0   0.40    35.0 50
SVPT TIP3 TIP3 ! Solv-solv potential is TIP3P
SUPT CHRM
!Add NO parameters
PMOD CHRM 2
  150   70    3.3    N/NO
    7 1.824            0.17
  151   70    3.3    O/NO
    8 1.751            0.159
SAMP METC 0.5
!Make sure the dummy atoms in the first copy are treated as one molecule
MOLD 2 2 4
!Solute has 4 atoms, 4 free energy atoms, and the first copy has 2 atoms
SLTA SMPL MMC READ 4 4 2    !Dummy solute
 DUM      -.570      000.0    0.00000   0.00000    1
 DUM      0.58       000.0    0.00000   0.00000    1
  150     0.57000    000.0    0.00000   0.02800    2
  151     -.58000    000.0    0.00000  -0.02800    2
DSTC NONE
CNFG READ ASCI NOFX
FIXD 2500000
!Equilibration
RUNS 2000000 100000 100000 500000 100000
RMCK !Remove unneded checkpoint file
!Production run, runnumber=11
RUNS 10000000 100000 100000 500000 100000
FILE  NO_ti  20
!Set lambda to the 2nd quadrature points
FREE TICA  NOMX
 4.0   3.0  2.0     0.500000
CNFG READ ASCI NOFX  12
RUNS 2000000 100000 100000 500000 100000
RMCK !Remove unneded checkpoint file
!Production run, runnumber=21
RUNS 10000000 100000 100000 500000 100000
FILE  NO_ti  30
!Set lambda to the 3rd quadrature points
FREE TICA  NOMX
 4.0   3.0  2.0     0.887298
CNFG READ ASCI NOFX  22
RUNS 2000000 100000 100000 500000 100000 00000
RMCK !Remove unneded checkpoint file
!Production run, runnumber=31
RUNS 10000000 100000 100000 500000 100000
!Evaluate the quadrature from the data on the three checkpoint files
TIQU REGL ALL  3 11 10
    0  001    0
STOP SLFT

Click HERE to view the output of this run

Files needed for run:

NO_ti.10.crd: initial configuration.

Files created by the run:

NO_ti.11.crd: configuration after equilibration
NO_ti.11.ckp: checkpoint file with the data for the 1st quadrature point
NO_ti.12.crd: configuration after 1st quadrature run
NO_ti.21.crd: configuration after equilibration
NO_ti.22.crd: configuration after 2nd quadrature run
NO_ti.21.ckp: checkpoint file with the data for the 2nd quadrature point
NO_ti.31.crd: configuration after equilibration
NO_ti.32.crd: configuration after 3rd quadrature run
NO_ti.31.ckp: checkpoint file with the data for the 3rd quadrature point

IV.9. Finite difference thermodynamic integration

This is a example combines an other two free energy simulation types. It calculates the Helmholtz free energy difference between ethanol and acetone in water, using the finite differenece thermdynamic integration. The integrand at each quadrature point is obtained by the popular perturbation method.

FILE pm 1
TITL Perturbation method test ethane ->
acetaldehyde
TITL 3-point Gaussian quadrature (probably
inadequate)
HRDW VC32
SVVC SPCC 10.0
SUVC MICC
PBCN FCC    19.57563
TEMP 298 | NSLV 500
STEP    0.10   10.0  0.55      30.0  30
SVPT TIP3 TIP3
SUPT CHRM
!Lambda=0.112702 run
FREE PMLI CCMX
                    0.112702  0.102702  0.122702         0.0      1.0
SLTA SMPL MMC FILE 16 16 8 16 1
CNFG READ ASCI NOFX 50
!Equilibrate
RUNS 1000000 100000 10000 1000000 100000
RMCK !Remove equilibration checkpoint file to
save disk space
!Production run
RUNS 5000000 100000 10000 1000000 100000
!New coupling parameter - lambda=0.5
FREE PMLI CCMX
                    0.50      0.49      0.51        0.102702 0.122702
!Restore solute to original initial and final state
SLTA SMPL MMC FILE 16 16 8 16 1
!Read starting config from disk to prepare the new
!initial and final states
CNFG READ ASCI NOFX
!Equilibrate
RUNS 1000000 100000 10000 1000000 100000
RMCK
!Production run
RUNS 5000000 100000 10000 1000000 100000
!New coupling parameter - lambda=0.887298
FREE PMLI CCMX
                    0.887298  0.877298  0.897298    0.49      0.51
!Restore solute to original initial and final state
SLTA SMPL MMC FILE 16 16 8 16 1
CNFG READ ASCI NOFX
!Equilibrate
RUNS 1000000 100000 10000 1000000 100000
RMCK
!Production run
RUNS 5000000 100000 10000 1000000 100000
!Evaluate the TI quadrature
TIQU 2 2
    3    0    1    0    0    0
STOP SLFT FULL

Click HERE to view the output of this run

Files needed for run:

pm.crd: initial configuration.
pm.slt: description of the two solutes.

Files created by the run:

pm.crd: initial configuration.
pm.2.ckp, pm.4.ckp, pm.6.ckp: checkpoint files at the quadrature points.
pm.2.crd, pm.3.crd, pm.4.crd, pm.5.crd, pm.6.crd, pm.7.crd: Coordinate files at the end of equilibration and production runs.

IV.10. Widom insertion method

This input shows how to run a Widom insertion calculation based on a pure water trajectory run earlier by CHARMM.

FILE methane
TITL Calculate the chemical potential of Methane in Water
TITL using Widom cavity insertion method
HRDW VC32          ! 32-bit vector
SVVC MINI 10.0
SUVC MIGC 10.0      ! MI on the solute
PBCN RECT 15.5  15.5 15.5  !Rectangular PBC
TEMP 300
NSLV 124 !no of solvent molecs in the CHARMM run
SVPT TIP4 TIP4 ! Solvent-solvent potential is TIP4P
SUPT CHRM      !Solute-solvent potential is CHARMM
FREE WIDO 2.6 64 64 64 2 5
SLTA SMPL MMC READ 5 5
 CT3     -0.13265   0.14073   0.03004  -0.36000    1  CSP3C1
 HA       0.84914   0.15299   0.45527   0.09000    1  CSP3H11
 HA      -0.35123  -0.84076  -0.33574   0.09000    1  CSP3H12
 HA      -0.84914   0.40992   0.77775   0.09000    1  CSP3H13
 HA      -0.18078   0.84077  -0.77776   0.09000    1  CSP3H14
TRAJ CHRM RGFX  !Open trajectory in CHARMM format
SCAN TRAJ 1000 5 00 0 500 2 1  2 3 1
STOP

Click HERE to view the output of this run

File needed for run:

methane.hst: the CHARMM trajectory (DCD) file

Files created by the run:

methane.ckp: checkpoint files
methane.pxc: analysis checkpoint file

IV.11. Proximity analysis, trajectory filtering

This input show how to set up proximity analysis of a trajectory given as a series of PDB files and how to use this analysis to filter this trajectory file.

FILE 5ht1
TITL GCE run with fixed solute
HRDW VC32        ! 32-bit vector
SVVC SPCC 7.75  ! Solvent-solvent cutoff
SUVC MIGC 0.0 ! Group-cent based MI on the slt
PBCN RECT 70.0 75.0 58.0  !Rectangular PBC
TEMP 298
SVPT TIP3 TIP3
SUPT CHRM  ! Solute-solvent potential is CHARMM
MOLD 1 3192 ! Bypass the topology code
SLTA SMPL MMC FILE  3192
!Solute description is read from 5ht1.slt in MMC format
TRAJ ALLP RGFX ALST !Open trajectory in PDB format
FILT SOLV TRAJ ALLP SHL1 RVDW 1.4 0 0 10
!filter out solvents outside the first shell
TRAJ ALLP RGFX ALST 0 1 11
!Open filtered trajectory
PXCR WDEN BISE
!select proximity criterion type
PXWR ASCI
!Create ASCII proximity information file
PXBE -100.0  2.0
!Calculate solute-solvent energy analysis
SCAN TRAJ 250000 1 5000 0 2000 1000 2  3 1
!Run analysis
FILE 5ht1 1    !Reset run number to one
TRAJ ALLP RGFX ALST 0 1 11 !Open filtered traj
FILT ENRG TRAJ ALLP -100.0 -1.0 10
!Filter solvent by the solute-solvent energy
TRAJ ALLP RGFX ALST 0 1 11
!Open filtered trajectory again
FILT ENRG TRAJ ALLP -1.0 100.0 11
!Filter solvent by solute-solvent energy using a different range
TRAJ ALLP RGFX ALST 0 1 21
!Open trajectory after 1st energy filter
PXWR NONE
DENF INSG 0 1 0
!Create single configuration aggregating all solvents
TRAJ ALLP RGFX ALST 0 1 22
!Open trajectory after 2nd energy filter
DENF INSG 0 1 0
!Create single configuration aggregating all solvents
STOP

Click HERE to view the output of this run

Files needed for run:

5ht1.hst: Trajectory file in PDB format

Files created by the run:

5ht1_11.hst: Filtered (1st shell only) trajectory file
5ht1.pxi: proximity information file (see PXWR)
5ht1.ckp: checkpoint file
5ht1.pxc: proximity analysis checkpoint file
5ht1_21.hst: Filtered (1st energy filtering) trajectory file
5ht1_22.hst: Filtered (2nd energy filtering) trajectory file
5ht1_den_21.dat: Aggregated solvents (in Insight free format) file from 1st energy filtering
5ht1_2.dat: Aggregated solvents (in Insight free format) file from 2nd energy filtering

IV.12. Conformation filtering

This example shows the elimination of waters that are not internal to the solute according to their circular variance. The filtering is done on a CHARMM CRD file.

FILE scednas
PRNT ECHO
SVVC SPCC 12.00
SUVC SPGC 20.0
PBCN RECT 99.00  99.00  99.00 ! Large box
NSLV 16426
SVPT TIP3 TIP3 ! Solvent-solvent potential is TIP3
SUPT ATNO ! No potential information
BRKB  32 ~
    92    97    92    99   231   236   231   238   292   297   292   299 ~

   323   328   323   330   404   409   404   411   593  3691   856   861 ~
   856   863  1149  1154  1149  1156  1784  1789  1784  1791  1825  1830 ~
  1825  1832  1882  2138  2030  2035  2030  2037  2090  2095  2090  2097 ~
  2299  2304  2299  2306  2318  2323  2318  2325  2467  2472  2467  2474 ~
  2792  2797  2792  2799
!Break spurious bonds that would confuse the routine establishing
!the molecular connectivities
!Alternative: MOLD 1 3697
SLTA SMPL CRD FILE 3697
CNFG READ CHRM NOFX
FILT SOLV CONF CHRM CRCV RSIG 6.0 0.5 0 11
CNFG READ CHRM NOFX
FILT SOLV CONF CHRM CRCV RSIG 6.0 0.4 0 12
STOP

Click HERE to view the output of this run

Files needed for run:

scednas.slt: Solute portion of the CHARMM CRD file
scednas.CRD: Full configuration (including the solute) in CHARMM CRD format

Files created by the run:

scednas_11.CRD: Filtered conformation (keeping only internal waters)
scednas_12.CRD: Filtered conformation (keeping only internal waters), weakened filtering (keeping more waters)

IV.13. Generic site calculation

FILE GENSfile
SVVC SPCC 10.0  ! Solvent cutoff
SUVC MIGC 0.0   ! MI on the solute
PBCN RECT 56.0011  67.5172  81.0524 !Rectangular PBC
SVPT TIP3 TIP3
SUPT CHRM
SLTA SMPL MMC FILE 5402  0 0 0 1
TRAJ ALLH FLEX ALST
GENS CALC FIXI GETH SPDB FSRT ~
 125 5000000 1 10000 6.5 0.01 0.10 6.0 0.2 100 10
!Number of sites requested (initially): 500
!Number of simulation steps to read: 5000000
!Use every 1-th config (counted in MC stepnumbers)
!Discard the first 10000 MC steps
!For the matching, consider only sites within 6.5 Å
!Convergence threshold for the Hungarian method matching: 0.01
!Convergence threshold for the RMSD between iterations: 0.1
!Add a new site if a match is farther than 6.0 Å
!Previous site estimate will be mixed in with a factor of 0.2
!Maximum number of iterations over the trajectory: 199
!Minimum number of solvents in a configuration to be of use: 10
STOP NOSF

Click HERE to view the output of this run

Files needed for run:

GENSfile.slt: Solute portion of the CHARMM CRD file
GENSfile.CRD: Full configuration (including the solute) in CHARMM CRD format
GENSfile.hst: Simulation trajectory in CHARMM CRD format

Files created by the run:

GENSfile_gs.pdb: Solute plus the generic sites in PDB format
GENSfile_gs_cmp.pdb: Solute plus a composite of solvents that are nearest to sites, in PDB format
GENSfile_gs_cmp.rep: Solute plus a the most representative configuration

from the simulation history, in PDB format

IV.14. Primary hydration shell calculation

FILE alaval
TITL compacted alaval
TITL PHS, 2 shells
HRDW VC32
NSLV 190       !190 waters
TEMP 298.0
SUUC SPGC 21.00
SVVC SPCC 21.00
SUVC SPGC 21.00
PBCN PHS RSIG UPML 0.15 3.0 8.0 1.0 21.0 0.997 100 1000.0 3.
!PHS boundary conditions
STEP    0.00   0.0  0.40      30.0  20
SUPT CHRM
SVPT TIP3 TIP3
MIXR ARGE
SLTA SMPL MMC FILE 56
SAMP METC MOVE SHCY   !Plain Metropolis moves
PART UNIF SHCY CYCI 0.1          !Solute torsions active
PRMF CHRM  par_all22_prot_na.inp
TORD ALL SING 30.0  ! All torsions moved with same stepsize
CNFG READ ASCI
RUNS 200000 5000  100000  90000  9000000  90000000  1
STOP

Click HERE to view the output of this run

Files needed for run:

alaval.slt: Solute description file
alaval.crd: Coordinate file of the initial structure
par_all22_prot_na.inp: Charmm parameter file for torsion potential description

File created by the run:

alaval.ckp: Checkpoint file

IV.15. Cavity and pocket determination

FILE f8
PRNT ECHO
TITL 1tf_8_13
NSLV 0
TEMP 298.0
SVVC SPCC 9.05     ! Solvent cutoff
SUVC SPCC 12.0
PBCN RECT  70.0 70.0 70.0
STEP    0.00   00.0  0.55      40.0  30
SUPT CHRM
SAMP METC 0.5  ! Scaled force-biased sampling
MOVE RAND
SVPT TIP3 TIP3 ! Solvent-solvent potential is TIP4P
SLTA SMPL PDB FILE  3928
GCEN CAVB RSIG ! Grand-canonical ensemble (establishes grid)
    2.0    2.5      1000.0      250  250  250 1000    1    1  010000    20000
CNFG READ PDB
PRTG PDB ALLG AGLS 0 0 0.8 10.0 0.2 1.0 3.3 0.9 0.5 10.0
STOP NOSF

Click HERE to view the output of this run

Files needed for run:

f8.pdb: Solute description file

Files created by the run:

f8_2.pdb: PDB file of solute plus cavity grid coordinates
f8_3.pdb: PDB file of solute plus pocket grid coordinates

V. BRIEF DESCRIPTION OF THE OUTPUT.

Since the output of the program is largely self-explanatory, only a relatively brief description is given.

The output starts with information about the program: self identification, dimensions used - important for compiling other programs that read the interrupt file of the program - the dates the program and the common blocks were modified and the date of the run.

By default, the command lines are echoed verbatim. The command PRNT ECHO allows for direct echoing of the formatted data read as well. At the start of each run the the options selected are explained, numbers read are printed with their interpretation and unit assumed. These explanations are preceded by the keyword that was the source of the information, allowing the easy interpretation of the messages.

The solute description consists of three parts:

List of atoms (before cloning) and their properties (see below)
Summary of the properties of residues (charge, radius)
Summary of the properties of molecules (charge, radius, number of rings)

Each atom record consists of the following items (several of these may be blank):

Sequence number
Chemical symbol
Hydrogen-bond donor/acceptor label:
- D1: >NH donor
- D2: -OH donor
- A1: >C=O acceptor
- A2: R-O-R acceptor
Potenial code (label and number)
Potential library (see key SUPT)
Atom type label (p: polar; a: aromatic) used for desolvation energy calculation (see key SVIN)
Functional group type
Marker for group (residue) center: G
Marker for molecule (segment) center: M
X, Y, Z coordinates in a representative conformation
Partial charge
Lennard-Jones epsilon
Lennard-Jones sigma
Molecule number
Group number
Mobility marker
- FIX: atom is not moved during the simulation
- MOV: atom is moved only by whole solute molecule translation/rotation (see key PARD)
- TOR: atom is moved by torsion angle change (see key PART)
- INV: Atom stays unchanged between the two topologies during thermodynamic integration (see key FREE TICA)
- SIM: Atom stays nearly unchanged between the two topologies during thermodynamic integration (see key FREE TICA)
Residue name
Atom name
First shell radius for proximity analysis (see key PXCR) or solute atom volume (see key SVIN)
Proximity group index or solute atom surface (see key SVIN)
User-defined functional group name (see key FCGD)
User-defined functional group label (see key FCGD)

A very important feature of the program is the automatic self test on continuing a previously interrupted calculation (RCKP) as well as at every 2.5 millionth Monte Carlo step. It can also be called more frequently with the help of SLFT. If the self test failed a message "Discrepancy found" appears and the program stops (unless RCKP IGND or RCKP FIXD was used). Discrepancies may arise when the checkpoint file (.ckp) is corrupted or in case of a program error. If the latter is the case, contact the author of the program. A relatively harmless error can be an accumulation of round-off errors to unacceptable levels. Such errors can be prevented using the FIXD FIXD keys.

The initial total energy and its components, and virial sum header precedes the periodic output (at each nmcrep-th step) during the simulation. This periodic output (one to three lines each) contains the stepnumber (N) the energy of the currently accepted configuration (E), the cumulative total energy average (<E>), the minimum and maximum energy value sampled (Emn and Emx), with the configuration number/1000 or /1000000 where it occurred after them in parenthesis, the solute binding energy (Us) the acceptance rate (a) of the current move type (for solute molecule move or torsion the acceptance rate printed is for the solute molecule or torsion angle sampled in this step) and the index of the molecule selected for move (I) and the move type coded as follows.

MOV: solvent or whole solute displacement
CPL: coupling parameter change
PSM: solute molecule displacement
TOR: solute torsion change
LP*: solute torsion change with loop move (* is replaced by the loop type)
SWC: solute molecule swap
2ST: 2-solute molecule correlated rotation
SPC: special move (if implemented)

For torsion changes the torsion number is also printed. For torsion on cloned solute, the torsion number corresponding to the original torsion list is also given. For molecule swap, the second molecule's index is also printed. The line is ended by the symbol A or R for accepted and rejected moves, resp. The second line gives the solute intramolecular energy terms when the solute is not rigid. An additional line gives grand-canonical ensemble information: current and average number of solvent molecules, smallest and largest number of solvent molecules sampled (Nmn, Nmx), the acceptance rate of the insertion/deletion steps (id acc), the number of uncovered gridpoints (Ngf) and the probability of finding a cavity (pcav).

Periodically (at each nmcplt-th step), and at the end of each run a complete report of the averages and distributions is computed and printed. This report starts with the thermodynamic averages: energy, energy square, standard deviation and range for energy, total virial sum, solute virial sum and the pressure.This is followed by the error estimates using the method of batch means for the total energy or pressure. The error estimates are given as two standard deviations - corresponding to ca 95% confidence intervals. Since he method of batch means assumes that the actual batch means are statistically independent the program performs a test for independence - the so-called run test. At the and of this section there are two tables containing the critical values for the run test: for independence, the number of runs found should be between the two numbers corresponding to the number of "ups" (n1) and "downs" (n2) in the two tables. The program prints CORRELATED or UNCORRELATED when the run test was failed or passed, respectively and print ??? or >>> when the number of "ups" or "downs" were either too low or too high for the table. The test is repeated with double the blocksize until the actual run lengths prevent further doubling.

During the run, various informations are gathered relevant to the convergence of the calculation: Number of times the solute is attempted to move, accepted to move; average translational and rotational displacements; the minimum, maximum and average correlations between the initial and final orientations; the total displacement of the system (square root of the molecular displacement square sums), information on the average molecular displacements, the correlation between two successive moves of a molecule (averaged over all moves and all molecules) and convergence indicators involving average molecular displacement square, acceptance ratio and correlation between successive moves; if preferential sampling was used (MOVE PRS*) the relative frequencies the molecules were attempted to move (perturbed), acceptance rates for individual molecules (useful to detect "stuck" molecules).

For the various free energy options the distributions and free energy values calculated are printed here. Also, calculations involving the perturbation method print the solute binding energy estimate on the two limiting state ensembles to be used for convergence test and the range of energy values that went into the exponential averages - for reliability these ranges should not exceed cca 10 kcal/mol.

For adaptive umbrella sampling, the range covered by each iteration is plotted with letters U and D indicating that the probability distribution is increasing or decreasing. In the plot each line corresponds to an iteration and the position of the U's and D's refer to the coupling-parameter range sampled.

The program calculates first order quantum corrections to the intermolecular free energy with the method of Powles and Rickayzen. This calculation is based on the computed configurational average of the force and torque components and on the mass and moment of inertia of the molecule involved. The solute contribution to the quantum correction is also computed and printed, along with the computed force and torque component squares, molecular mass and moments of inertia.

The various distribution functions are printed only optionally and they are self-explanatory. Each distribution is followed by a summary locating the extrema and other pertinent characteristics.

Two more pieces of information relevant to the convergence of the run are given near the end of the output: the closest solvent-solvent distance and the number of steps elapsed during which all molecules were moved at least once.

Proximity analysis results are printed by the atoms on which the proximity analysis was requested. The output may consist of several parts (depending on key PXPR and, of course, on the keys invoking the calculation of the various propeties).

Listing of the calulated distribution functions, followed by the position and value of their extrema.
Tables of various calculated averages, listed both in the original order of solute atoms and grouped by residues and listed by functional groups (either chemical or user-defined - see key FCGD) The table entries for each solute atom (one line) are as follows:
- Atom numbers in the current order
- Atom numbers in the original order
- Residue number
- Atom name (read as labslt(1) with key SLTA)
- Residue name (read as labslt(2) with key SLTA)
- Chemical symbol
  First shell properties:
  - First shell radius, rfs (specified by key RFSL) in �
  - Volume of the first shell, vfs, calculated as the sum of volume elements for shells within rfs in �³ (see key GENV)
  - The volume of the first two shells, v2fs, defined as the volume of the proximity region extending to a distance rfs+solvdia, where solvdia is the L-J sigma of the islvrep-th (see key SLTA) solvent atom in �³
  - First shell coordination number, <K>, calculated as the average number of solvents in the proximity region within rfs
  - First shell density, <K>/vfs, converted to g/ml
  - First two shell coordination number, <2K>, calculated as the average number of solvents in the proximity region within rfs solvdia
  - First shell binding energy, <sltbe>, the sum of interactions of solvents in the first shell with the (whole) solute (see key PXBE)
  - First shell pair energy, <sltpe>, calculated as <sltbe>/<K>
  Total primary region properties:
  - Total coordination number, <K>, calculated as the total number of solvents in the proximity region
  - Total binding energy, <sltbe>, the sum of interactions of all solvents in the proximity region
  First shell solvent-solvent properties:
  - Coordination number, <Kw> - see key PXWW
  - Pair energy, <nnwwpe> - see key PXWW
  - Binding energy, <bewwe> - see key PXWW
Tables of the calculated extrema of the proximity g(r), total g(r), and mean dipole (see key PXDP) function.

The output concludes with averages of the total energy and solute binding energy over the run, as well as the coupling parameter range sampled in the run, if the variable coupling parameter option was used.

Throughout the output, lines with periods are used to delineate certain segments of the output. Warning messages (indicating potential problem with the input) are preceded by 5 dashes, strong warnings (likely, but not certain problems) are preceded by 5 equal signs, and error messages are preceded by 5 asterisks. Messages announcing some specific action (not taken all the time) are preceded by 5 plus signs. Messages announcing that an input setting is overriden by the program are preceded with five 'less-than' signs. Before stopping the program prints the run time and the number of overrides, warnings, strong warnings and errors encountered. It is always a good practice to find these messages in the output file to make sure the program was running as intended.

Minimum critical values for the run test:

  n2:  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20
 n1  ----------------------------------------------------------
  2 |                                2  2  2  2  2  2  2  2  2 | 2
  3 |              2  2  2  2  2  2  2  2  2  3  3  3  3  3  3 | 3
  4 |           2  2  2  3  3  3  3  3  3  3  3  4  4  4  4  4 | 4
  5 |        2  2  3  3  3  3  3  4  4  4  4  4  4  4  5  5  5 | 5
  6 |     2  2  3  3  3  3  4  4  4  4  5  5  5  5  5  5  6  6 | 6
  7 |     2  2  3  3  3  4  4  5  5  5  5  5  6  6  6  6  6  6 | 7
  8 |     2  3  3  3  4  4  5  5  5  6  6  6  6  6  7  7  7  7 | 8
  9 |     2  3  3  4  4  5  5  5  6  6  6  7  7  7  7  8  8  8 | 9
 10 |     2  3  3  4  5  5  5  6  6  7  7  7  7  8  8  8  8  9 | 10
 11 |     2  3  4  4  5  5  6  6  7  7  7  8  8  8  9  9  9  9 | 11
 12 |  2  2  3  4  4  5  6  6  7  7  7  8  8  8  9  9  9 10 10 | 12
 13 |  2  2  3  4  5  5  6  6  7  7  8  8  9  9  9 10 10 10 10 | 13
 14 |  2  2  3  4  5  5  6  7  7  8  8  9  9  9 10 10 10 11 11 | 14
 15 |  2  3  3  4  5  6  6  7  7  8  8  9  9 10 10 11 11 11 12 | 15
 16 |  2  3  4  4  5  6  6  7  8  8  9  9 10 10 11 11 11 12 12 | 16
 17 |  2  3  4  4  5  6  7  7  8  9  9 10 10 11 11 11 12 12 13 | 17
 18 |  2  3  4  5  5  6  7  8  8  9  9 10 10 11 11 12 12 13 13 | 18
 19 |  2  3  4  5  6  6  7  8  8  9 10 10 11 11 12 12 13 13 13 | 19
 20 |  2  3  4  5  6  6  7  8  9  9 10 10 11 12 12 13 13 13 14 | 20
     ----------------------------------------------------------
       2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20

Maximum critical values for the run test:

  n2:  2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20
 n1  ----------------------------------------------------------
  2 |                                                          | 2
  3 |                                                          | 3
  4 |           9  9                                           | 4
  5 |        9 10 10 11 11                                     | 5
  6 |        9 10 11 12 12 13 13 13 13                         | 6
  7 |          11 12 12 12 14 14 14 14 15 15 15                | 7
  8 |          11 12 13 13 14 15 15 16 16 16 16 17 17 17 17 17 | 8
  9 |             13 14 14 15 16 16 16 17 17 18 18 18 18 18 18 | 9
 10 |             13 14 15 16 16 17 17 18 18 18 19 19 19 20 20 | 10
 11 |             13 14 15 16 17 17 18 19 19 19 20 20 20 21 21 | 11
 12 |             13 14 16 16 17 18 19 19 20 20 21 21 21 22 22 | 12
 13 |                15 16 17 18 19 19 20 20 21 21 22 22 23 23 | 13
 14 |                15 16 17 18 19 20 20 21 22 23 23 23 23 24 | 14
 15 |                15 16 18 18 19 20 21 22 22 23 23 24 24 25 | 15
 16 |                   17 18 19 29 21 21 22 23 23 24 25 25 25 | 16
 17 |                   17 18 19 20 21 22 23 23 24 25 25 26 26 | 17
 18 |                   17 18 19 20 21 22 23 24 25 25 26 26 27 | 18
 19 |                   17 18 29 21 22 23 23 24 25 26 26 27 27 | 19
 20 |                   17 18 20 21 22 23 25 25 26 26 27 27 28 | 20
     ----------------------------------------------------------
       2  3  4  5  6  7  8  9 10 11 12 13 14 15 16 17 18 19 20

VI. COMPILATION OF THE PROGRAM.

VI.1. Preprocessor

The program source file mmc.for contains a number of special symbols allowing the customization of the program. To produce a valid Fortran source code mmc.f a preprocessor (also a Fortran program) pre.f is provided. The preprocessor performs two functions:

It replaces the special symbols in mmc.for related to the size of the problem (dimension of arrays, etc.); quantities that depend on other values (e.g., the maximum number of atoms) are calculated by the preprocessor to ensure consistency.
It selectively activates segments of code depending on the computational environment (e.g., timing routines), and/or features planned to be used (to save computation time by eliminating operations superfluous in certain context).

If the generated mmc.f is inadequate for a particular input, mmc will abort the run and give specific instructions what to change.

The preprocessor has to be compiled and executed (after which you can compile the program):

To compile, type f77 -o pre.bin pre.f (replace f77 with the command invoking the Fortran compiler on your system, if different)
To run, type pre.bin, followed by three <ENTER>'s to accept the default filename mmc, default choices and values and to give only a brief list of the options used.

The prepocessor as provided has default choices and values that will create a valid source file. There are three ways to change the defaults:

Running the preprocessor interactively (by responding with 'y' to the second prompt of pre.f)
Editing the source file to change the defaults. This involves
- Changing the initialization of selected variables named iopdef*** (specifying the activation or inactivation of selected functionalities) from 'T' to 'F' (or vice versa)
- Changing the initialization of selected variables id (specifying the size of arrays corresponding to the selection) to a new value. The notation refers to the the name assigned to each size-related symbol. The names and symbols used are lietse below. The two quantities most likely to be modified are maxmol and maxslt.
If there is a running version of MMC the key PRCO SAVE can print a data statement that - copied to the end of pre.f - will initialize the defaults to be the same as those of the running version.

Instructions for changing the source code of pre.f are at the beginning of that file. The part of the prepocessor that may be changed by the user is between the comment lines 'START Customization' and 'END Customization'. In the first segment, calls to the subroutine setopt select functionalities to enable/disable. In the second (longer) segment, calls to the subroutine setdim set maximum values for the various aspects of the system size. Some array sizes are generated automatically depending on the code segments activated.

The following list gives the special symbols related to array sizes appearing in mmc.for with the names used in the preprocessor and the quantity they limit.

#MO maxmol: Solvent molecules +1 (has to be even) (see the key NSLV)
#MA maxmolslt: Number of atoms per single solute molecule
#SX mxpxslt: Number solute atoms for proximity analysis
#MM maxsltmol: Number of molecules in the solute.
#UW maxwnnu: Bit map for the neighbors of a solute atom
#TN maxnst: Solute nuclear centers
#VN maxnsv: Solvent nuclear centers
#TE maxest: Solute electron centers
#VE maxesv: Solvent electron centers
#VW maxwnnv: Bit map for the neighbors of a molecule (maxwnnv/(Number of integer word bits -1) + 1. The integer word size is 32 on most machines, 46 on the CRAY).
#ST maxslt: Solute centers (see the key SLTA)
#GR maxgslt: Solute groups
#TA maxtslt: Torsion-affected atoms' list
#SV maxslv: Solvent centers
#SV maxslv4: maxslv-4
#VT maxss: Greater of #SV and #ST
#NA maxat: Atoms
#TL maxtrgrgr: Torsion-affected group' list
#GT maxstg: Solute centers for general solvent case
#GV maxsvg: Solvent centers for general solvent case
#DT maxsst: Solute centers for parameter derivative (see key SENS) and energy decomposition (see key ENGL) calc.
#DM maxmst: Molecules for parameter derivative calculation
#RG maxgrid: Radial distribution function gridpoints (< 505 !)
#PG maxpfgr: Preferential weight list length (RMAX/RI)+1 (RI from RECTYPE 32 - see key DSTC)
#WG maxcggr: US weighing function gridpoints
#OR maxorgr: Number of grid points in the energy distribution function for overlap ratio method calculations
#GX maxxgr: Number of x cavity gridpoints (<128, even)
#GY maxygr: Number of y cavity gridpoints (<128, even)
#GZ maxzgr: Number of z cavity gridpoints (>31, <128, even)
#CV maxcav: Number of 'free' gridpoints for cavity-biased (if less then #GX*#GY*#GZ then some test calculations can't be done)
#W2 maxlin: Work space for n-step optimized matchings
#WS maxausp: Storage space size in adaptive US
#WI maxauit: Number of adaptive US iterations + 1
#MI maxavit: Number of block average entries
#TR maxtors: Torsion list
#AT maxatyp: Number of different atomtypes used
#UU maxatypu: Maximum no of atom types in a given solute
#UV maxstmol: Greater of #MO and #ST
#TG maxtgrid: Number of total g(r) grid points
#VG maxwrgrid: Number of grids for solvent-solvent g(r)
#ND maxgvv: Number of solvent-solvent g(r)'s
#DG maxdrgrid: Number of radial grids for dipole correl. QCDF
#LG maxdagrid: Number of angular grids for dipole correl. QCDF
#GE maxpegrid: Number of energy grids for solute-solvent PE QCDF
#GQ mxpxgslt: Number different RDF's and QCDF's
#PP maxcavps: Number of 'free' gridpoints for cavity-biased
#PS maxpfsum: Preferential weight partial sum list length
#WM maxmatch: Number of AUS iterations to match.
#TD maxtgrid: Number of torsion angle distribution grids
#FE mxfeslt: Number of free-energy solute atoms (all copies)
#MH maxhunmat: Number of sites for generic solvation site calculations
#NH maxhmneig: Number of neighbors for establising a complete match
#LT mxtorloop: Number of torsion loops
#LS maxloopslt:When torsion loops moves are active, maxslt otherwise 1
#MD mxdiffmol: Number of solvent molecules+1 for diffusion constant calculations
#MR mxrescmol: Number of solvent molecules+1 for residence time calculations
#DC mxdiffcr: Number of structures to use in diffusion calc
#RC mxrescr: Number of structures to use in residence time calc
#MW mxwidslt: Number of Widom solute molecules
#MS mxphsmol: Number of solvent molecules+1 for
#MG maxmolfg: Number of solvent molecules+1 to calculate field gradients for PHS calculations
#HA maxath: Number of atoms for representing all sites (maxslt+maxslv*maxhunsite)
#GM maxmapgrid: Number of gridpoints in the energy map
#GH maxhbgrid: Number of gridpoints in the hydrogen-bond map
#NE maxmap: Number of atoms to save for minimum energy configuration
#AU maxtorauc: Number of angles to save for torsion angle autocorrelation calculation
#GC maxgrdclst: The larger of the number of grid clusters fo cavity/pocket calculations and of the number of sites used for the various site analysis (key GSAN) calculations
#RN maxrandinp: The maximum number of random numbers that can be read
#NL maxnnlist: The maximum number of 1st + 2nd + 3rd neighbo

Since the increased memory usage taxes the computer resources (both memory and disk-space), it is usually worth recompiling the program separately for significantly different system sizes (see the key PRCO SVMN for preparing a minimum-sized executable). Remember, however, that the checkpoint file (.ckp) is only readable by a program that was compiled with the same dimensioning!

Code segment that can be selectively activated contains lines with the comment C@**. The selection and removal of such lines is controlled by a number of character variables in pre.f that can be set during the customization of the preprocessor. Except for the variable SYSTEM, the value is either 'T' or 'F'. The following variables can be set:

SYSTEM {'IRIX'} - Choices:
- 'IRIX' - C@UG (SGI Irix)
- 'AIX ' - C@UR (IBM RS600)
- 'HP ' - C@HP (Hewlett-Packard)
- 'MPDM' - C@DM (MPI - Distributed Memory),
- 'G77 ' - C@G7 (Gnu Fortran-77),
- 'G95 ' - C@G9 (Gnu Fortran-95),
- 'EFC ' - C@EF (Intel Fortran),
- 'ABSF' - C@EF (Absoft Fortran90),
- 'UNIX' - C@UX (Generic Unix)
CB {'T'} - C@CB: Cavity-biased (T,V,mu) ensemble code is active (inactive when 'F') - see key GCEN
IB {'T'} - C@IB: (T,P,N) ensemble code is active - see key IBEN
FE {'T'} - #FE : Free-energy simulation arrays are active - see key FREE
MS {'T'}: Any intersolute move is active - see key PARD or PART
TR {'T'} - #TR : Torsion move arrays are active - see key PART
LO {'T'} - #LT : Torsion loop move arrays are active - see key LOOP
PX {'T'} - #SX : Proximity analysis code is active - see key PXCR
FG {'F'} - C@FG: Field-gradient calculation code is inactive - see keys PRTG
PG {'F'} - C@PG: Cavity grid analysis code is inactive - see keys PRTG or STVG
HU {'F'} - #MH : Generic solvation site code is inactive - see key GENS
RF {'F'} - C@RF: Reaction field correction code is inactive
DD {'F'} - C@DD: Distance-dependent dielectric code is inactive
DD {'F'} - C@1R: eps(r)=r dielectric code is inactive
PH {'F'} - #MS : Primary hydration shell (PHS ) code is inactive (active when 'T') - see key RFCR
ED {'F'} - #DT : Energy decomposition (ENGL) key arrays are inactive (active when 'T')
DM {'F'} - C@DM : MPI communication library calls are inactive (active when 'T') - for now only a very limited set of functions are available.
DR {'F'} - #MD : Diffusion constant and residence time calculation code is inactive (active when 'T') - see keys DIFC or RTIM
FR {'T'} - C@FR: Solvent force calculation code is active (inactive when 'F') (used for force-biasing and pressure calculation) - see key SAMP. Calculation of forces can be inhibited to save time when only proximity analysis is run or when force-biasing is not used and the pressure is of no interest.
TS {'T'} - C@TS: Solute torque calculation code is active (inactive when 'F') (for whole solute rot) If the solute is not going to be rotated or no force bias is applied on the solute (NFBU) then the solute torque calculation is superfluous and can be left out, resulting in some savings (significant for large solutes) in computer time.
NN {'T'} - C@NN: Solvent-solvent near-neighbor map is active Generally, the near-neighbor code has to be left out is when (1) the solvent-solvent cutoff is close to the inside sphere radius of the central cell, or (2) minimum image convention is used for solvent-solvent solvent interactions or (3) there are no or few solvent molecules.
TN {'T'} - C@TN: Solute-solvent near-neighbor map is active It can be set to 'F' when the solute is completely rigid (i.e., neither PARD nor PART) are active).
NA {'T'} - C@NA: Near-neighbor map arithmetic code (logical code when 'F') is active.
The bitmap operations used both for the near-neighbour table and for keeping track of intramolecular interactions to be used can be performed either by logical or arithmetis operations. The relative efficiency (when there is a choice) varies from machine to machine. Also, the code involving logical operations uses equivalence statements that are planned to be phased out in futore Fortran versions. Furthermore, using logical operations works only if the compiler performs the logical operation on all bits. If it is not the case (e.g., the Silicon Graphics), NA should be set to 'T'.
VC {'T'} - C@VC: Vectorizable (scalar when F) routine call code is active
I2 {'F'} - C@I2: Integer*2 array use for cavity grids is inactive - see key GCEN. For grand-canonical ensemble calculations requiring a relatively small number of free grids (< 2¹⁵-1 ~ 30,000) but a large number of gridpoints setting to 'T' can save a few megabytes of memory.
PS {'F'} - C@PS: Code amenable to autoparallelization is active (so far, if failed to yield speedup).
QP {'F'} - #TN : QPEN/EPEN potential arrays are inactive.

Activating some of the codes requires deactivating a complementary code. Such code pairs are:
C@NL - C@NA
C@VC - C@NC
C@DM - C@ND
C@1R - C@NR
C@RF - C@1R
The preprocessor takes care of it automatically.

Besides the activating and inactivating lines prefixed with C@**, the preprocessor provides the option to change all double precision (real*8) variables to quadruple precision variables (real*16). This option can be useful for calculations using the LOOP key. Note, however, that not all compilers support quadruple precision variables.

The defaults for variables FG, PG, HU, and DR are set to 'F' because the corresponding code requires large arrays and RF is set to 'F' because it adds code to the energy calculations.

The key PRCO produces an annotated list of the values used for the currently executing version as well as of the code segment selection choices made. A quick listing on the terminal can be obtained by typing mmc.bin, followed by the command PRCO at the MMC> prompt (followed by STOP at the next prompt for a graceful exit).

If a compilation is needed to match a given checkpoint file the command PRCO SAVE will read the information of the current run's checkpoint file and write Fortran data statements to be inserted into pre.f

VI.2. Compilation

Once the mmc.f file is prepared it should be compiled with the appropriate Fortran compilation command, to produce an executable, say mmc.bin, e.g.,

f77 -o mmc.bin mmc.f.

Compiler options:

For test runs use the -C (to inhibit optimization and enable index boundary check)
For production runs, high(est?) level optimization should be invoked, e.g., -Ofast (SGI); -O4 (Compaq Alpha)
For compatibility of binary trajectories note that the default byte order on Linux and SGI is "big endian". if your compiler's deafault is "little endian" you can compile an executable with the Linux/SGI byte order using the added directive "-byteswapio" on Linux .
For runs requiring the MPI communication library, the compilation should include linking instructions to the library and running the program generally requires the use of a system-dependent script - consult your system manager for the apropriate syntax.

Possible problems:

For some systems, the source code is too big to compile it in one step. In this case, the program splitmmc.f (part of the distribution) should be compiled and run. This will result in five files: mmc1.f, ..., mmc5.f. Each one should be compiled separately with the -c option, i.e.,
> f77 -c mmc1.f
followed by
> f77 -o mmc.bin mmc*.o

Some compilers fail due to a so-called 'relocation error' when optimizing at levels higher than one is asked. When using the Intel Fortran compiler (ifort), adding the compiler directives
-mcmodel=medium -share_intel
solved the problem. With some of the other compiler (but not the GNU compiler) the compilation key -fpic was found to solve the problem.

VI.3. Parallelization

There are a number of limitations in the parallelized version of MMC. In particular, it is extremely inefficient (due to latency issues) on distributed memory systems so it should be run either on shared-memory systems, or limit the number of CPU's to the nmumber of procesors on a single node and make sure the the queuing system places each job on the same node.

The following has been parallelized: solvent-solvent energy calculations; solute-solvent energy calculations; Widom-insertion free-energy calculations, and grand-canonical ensemble (GCE) simulations. However, force-calculations have to be disabled. Also, use any solute move (keys PARD or PART) would reduce the parallel efficiency.

VIII. TESTING THE PROGRAM

A test suite has been compiled to allow a systematic check on the integrity of the program. The tests are located in the directory testsuite. There is a c-shell script test.csh. It runs all the tests and examines the output. For each test, it prints different messages if there were input errors, inconcistency errors or no errors. For runs with special solute moves, it also prints the acceptance rates. The program is considered passing the test if there are no errors and the acceptance rates printed are all positive. Furthermore, if there is a subdirectory oldout containing reference output files, the new output will be compared to the old one. Differences should only show in CPU times and dates. Clearly, this is only meaningful if the reference output was generated by the same compiler and compilation options. The table below gives a summary of the features tested by each test system.

 Nmolslt Nwat  PARD PART SWAP TICA PRM PRM GCEN IBEN SUUCm CLON PMLI PROX WIDO
                                       Ft  Dr
1   1     215
2   1     215
3   3      0     x   x     x
4   7      0     x         x
5   2      15                   x
6   2     484    x   x
7   1      0         x
8   4     484                            x
9   6     484                            x
10  1      9                                 x
11  6      0     x   x     x                                 x
12  6      0     x                                           x
13  2      0         x
14  2      0         x          x
15  2     484    x   x
16  3     484    x         x
17  1      4                                 x
18  3     484    x
19  3     693                        x
20  1                x
21  2      50    x                           x
22  2      50    x                                x
23  2      50                                     x
24  2      50    x   x                            x
25  3      0     x   x     x                           x
26  3     484    x   x                     x
27  3     500                                                    x
28  2     484                                                         x
29  4     215    x   x         x
30  1      0     x   x
31  1     190        x
32  1     107                                                              x
  Nmlslt Nwat  PARD PART SWAP TICA PRM PRM GCEN IBEN SUUCm CLON PMLI PROX WIDO
                                       Ft  Dr

IX. REFERENCES

Grand-canonical ensemble formalism: D.J. Adams, Mol. Phys, 29, 307 (1975).

QCDF formalism: A. Ben-Naim, Water and aqueous solutions, Plenum, New York (1974).

Overlap ratio method: Jacucci and N. Quirke, Mol. Phys., 40, 1005 (1980)

EPEN/QPEN potential: F. Marchese, P.K. Mehrotra and D.L. Beveridge, J. Phys. Chem., 85, 1 (1981).

Proximity criterion formalism: P.K. Mehrotra and D.L. Beveridge, J. Am. Chem. Soc., 102,4287 (1980).

Original description of the Metropolis method: N. Metropolis et al., J. Chem. Phys., 21, 1087 (1953),

Shuffled cyclic selection of molecules to be moved: M. Mezei, J. Comp. Phys., 39, 128 (1981)

Click HERE to download a copy

Cavity-biased insertion - original formalism: M. Mezei, Mol. Phys., 47, 1307 (1982)

Virial-biased volume change in the (T,P,N) ensemble: M. Mezei, Mol. Phys., 48, 1075 (1983)

Description of self tests: par_all27_prot_na_crb.acly.crbn.prmM. Mezei, CCP5 Newsletter, Daresbury Laboratory, No. 23, (1986)

Click HERE to download a copy

Free energy methodology review: M. Mezei and D.L. Beveridge, Ann. Acad. NY Sci., 482, 1, (1986).

Click HERE to download a copy

Cavity-biased insertion - grid-based formalism: M. Mezei, Mol. Phys., 61, 565-582 (1987); Erratum, 67,1207-1208 (1989).

Click HERE to download a copy

Adaptive Umbrella sampling: M. Mezei, J. Comp. Phys. 68, 237 (1987)

Click HERE to download a copy

Finite-difference thermodynamic integration: M. Mezei, J. Chem. Phys. 86, 7084 (1987)

Click HERE to download a copy

Radical-plane based proximity criterion: M. Mezei, Molecular Simulation, 1, 327 (1988).

Click HERE to download a copy

Using a bitmap for keeping neighbour lists: M. Mezei, Molecular Simulation, 1, 169 (1988).

Click HERE to download a copy

Scaled force bias method: M. Mezei, Molecular Simulation, 5, 405 (1990)

Click HERE to download a copy

Polynomial Thermodynamic Integration for the solvation free energy of liquid water: M. Mezei, J. Comput. Chem., 13, (1992) 651;

Iso-energy cutoff for ion-ion PMF: M. Mezei, Int. J. Quantum Chem., 52, 147 (1994).

Preferential sampling near the solute: J.C. Owicki, Computer Modelling of Matter, P.G. Lykos, ed., ACS, Washington, D.C., (1978), p 159.

Force-bias sampling: C. Pangali, Rao and B.J. Berne, Chem. Phys. Lett., 55, 413 (1978), Mol. Phys., 37, 1773 (1979)

Quantum corrections: Powles and Rickayzen, Mol. Phys., 38, 1875 (1979).

Polynomial TI for the solvation free energy of the alanine dipeptede: H. Resat and M. Mezei, J. Chem. Phys., 99, (1993) 6052.

Click HERE to download a copy

Smart MC (a force-biasing variant): P.J. Rossky, J.D. Doll and H.L. Friedman, J. Chem. Phys., 69, 4628 (1978).

Generic solvation sites: M. Mezei and D.L. Beveridge, J. Comput. Chem., 5, 523 (1984). (1992) 651;

Click HERE to download a copy

Primary Hydration Shell: D. Beglov and B. Roux, Biopolymers., 35, 171 (1995).
A. Kentsis, M. Mezei, and R. Osman, Biophys. J, 84, 805-815 (2003).

Sampling with Tsallis statistics: I. Andricioaei and J.E. Straub, 53, 3055 (1996).

Cavity-biased Widom insertions: P. Jedlovszky, and M. Mezei, J. Am. Chem. Soc., 122, 5125-5131 (2000).

Extension-biased torsion moves: P. Jedlovszky, and M. Mezei, J. Chem. Phys., 111, 10770-10773 (1999).

Click HERE to download a copy

Local torsion moves: M. Mezei, J. Chem. Phys., 118, 3874-3879 (2003).

Automatic tuning of the B-parameter: J.A. Speidel, J.R. Banfelder, and M. Mezei, J. Chem. Theory and Comp., , (2006).

Automatic tuning of the stepsize parameters: J.A. Speidel, J.R. Banfelder, and M. Mezei, Algorithms, 2, 215-226 (2009).

Simulaid: a simulation facilitator and analysis program J. Comput. Chem., 31, 2658-2668 (2010).

Return to contents page