Description of the program CHAM: search for chameleon sequences in the Protein Data Bank (PDB).

Mihaly Mezei

Department of Pharmacological Sciences,
Icahn Sinai School of Medicine at Mount Sinai,
New York, NY 100102

Mihaly.Mezei@mssm.edu

Aug. 03, 2018.

Reference: M. Mezei, Revisiting chameleon sequences in the Protein Data Bank Algorithms 11, (2018). DOI:

 
 
I.  DESCRIPTION OF THE FUNCTIONS OF THE PROGRAM
II.  RUNNING THE PROGRAM - INPUT
III.  THE OUTPUT ON THE TERMINAL
IV.  FILE FORMATS
V.  INSTALLATION
VI.  CHANGING DIMENSION (ARRAY SIZES)

I. DESCRIPTION OF THE FUNCTIONS OF THE PROGRAM

II.RUNNING THE PROGRAM - INPUT

The program can be run either interactively or from the command line.

When the program is run interactively, the user will be prompted for the following items

Running from the command line involves issuing the following command:

cham -tc argument -tc argument ...
where tc one of mn, mx, ro, db, ff, mp. The respective arguments and their default values are listed below.

III. THE OUTPUT ON THE TERMINAL

Besides the results printed on the <root>.res file the program also prints information on the progress of the database scan and the summary of the results (e.g., number of sequences checked, number of chameleons found). The results printed on the <:output file root>.res file include the following items:

IV. FILE FORMATS

The program can read the protein information is three different formats:

V. INSTALLATION

While the distribution includes an executable (compiled with Intel Fortran under Linux) for other architecture it may be necessary to compile it. The program has to be compiled with Fortran, e.g.,:
f77 -o cham cham.f
to obtain the executable cham

Some compilers fail due to a so-called 'relocation error' when optimizing at levels higher than one is asked. When using the Intel Fortran compiler (ifort), adding the compiler directives
-mcmodel=medium -share_intel
solved the problem. With some of the other compilers (but not the GNU compiler) the compilation key -fpic was found to solve the problem.

  • If the program is to be compiled with Fortran95 then there is a preprocessor in the Simulaid distribution f77tof95.f and f77tof95_f95.f (the second one is the Fortran95 version of the first one) that changes the syntax to conform to Fortran95 requirements.

    VI. CHANGING DIMENSIONS

    The sizes of the arrays are established with parameter statements throughout the code. Several symbols user used for this purpose. There are certain relations between these symbols, so changing one of them is likely to require changes in some others. Below is a list of these symbols (the program checks for violations).

    IMPORTANT: Parameter statements for most symbols occur several places in the program. When a change is required, it has to be carried out at ALL occurences!