Description of the program Compligset:
Further analysis of the data provided by the program Dockres

Mihaly Mezei

Department of Pharmacological Sciences,
Icahn School of Medicine at Mount Sinai,
New York, NY 10029

Dec. 16, 2019.

The program Compligset operates on the results of docking runs, in most cases processed by Dockres. The following inputs are implemented:

A PDB file of the docked complexes, generated by Dockres
A list file (extension .lst) of ligand ids and scores, generated by Dockres
A Glide ligand file, either straight out of Glide (*_pv.mae) or extracted from the *_pv.mae file by Dockres
Combined score list generated by Dockres
A PDB file of the top-scoring ligands generated by Dockres

Most operations require the user to specify the number of ligands to be searched from each list (Nsearch).

If ligand names containing '_' or '-' are seen (these usually indicate different tautomers of the same ligand), the user has to specify whether the '_*' or '-*' part of the ligand names can be ignored.
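The effect of ignoring the '_*' or '-*' part can be illustrated with a minimal Python sketch. It is not taken from the Compligset source: the function name base_ligand_name, the example ligand ids, and the choice to cut at the first separator are assumptions made here for illustration.

```python
def base_ligand_name(name, ignore_suffix=True):
    """Return the ligand name with its '_*' or '-*' part removed.

    Assumption: the name is cut at the first '_' or '-' that is not the
    very first character; the actual program may use a different rule.
    """
    if not ignore_suffix:
        return name
    # Positions of the first '_' and '-' (find() returns -1 if absent).
    positions = [i for i in (name.find("_"), name.find("-")) if i > 0]
    if not positions:
        return name  # no separator: nothing to strip
    return name[:min(positions)]


# Hypothetical ligand ids: two tautomers collapse to the same base name.
names = ["ZINC00012345_1", "ZINC00012345_2", "LIG7-1", "LIG8"]
unique_ligands = sorted({base_ligand_name(n) for n in names})
```

With the suffix ignored, the two ZINC entries above count as one ligand; with ignore_suffix=False they remain distinct.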

The user can also specify minimum COM-COM distance and minimum RMSD thresholds, beyond which two ligand poses will be treated as different ligands (and labeled accordingly).
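A minimal Python sketch of such a pose comparison follows. Several points are assumptions, not statements about the program: the centre of mass is approximated by the unweighted centroid, the RMSD is computed without superposition over atoms listed in the same order, and two poses are treated as different when either threshold is exceeded. The function names are hypothetical.

```python
import math


def centroid(coords):
    """Unweighted centroid of a list of (x, y, z) tuples (stand-in for the COM)."""
    n = len(coords)
    return tuple(sum(c[k] for c in coords) / n for k in range(3))


def rmsd(pose_a, pose_b):
    """RMSD between two poses, atoms in the same order, no superposition."""
    if len(pose_a) != len(pose_b):
        raise ValueError("poses must have the same number of atoms")
    sq = sum((p[k] - q[k]) ** 2 for p, q in zip(pose_a, pose_b) for k in range(3))
    return math.sqrt(sq / len(pose_a))


def poses_differ(pose_a, pose_b, min_com_dist, min_rmsd):
    """True if either the COM-COM distance or the RMSD exceeds its threshold.

    Assumption: 'either' rather than 'both'; the source text does not say.
    """
    com_dist = math.dist(centroid(pose_a), centroid(pose_b))
    return com_dist > min_com_dist or rmsd(pose_a, pose_b) > min_rmsd


# Two-atom toy poses, the second shifted by 3 units along z.
pose_1 = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
pose_2 = [(0.0, 0.0, 3.0), (1.0, 0.0, 3.0)]
different = poses_differ(pose_1, pose_2, min_com_dist=2.0, min_rmsd=2.0)
```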

The following operations are implemented:

Look for overlap between ligand sets
Combine (average) scores among ligand sets
Look for selectivity (different targets)
Rank lists by top score (averages)
Merge lists
Remove duplicates from a .lst file
Combine ligand-target contact maps
Extract ligand-target complex
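The first two operations, restricted to the top Nsearch entries of each list, can be sketched in Python as follows. This is an illustration under stated assumptions, not the program's algorithm: each list is taken to be already sorted best-first, lower scores are taken to be better, only the best-ranked pose of a ligand is used per list, and the function name top_overlap is hypothetical.

```python
def top_overlap(score_lists, nsearch):
    """Ligands found in the top `nsearch` entries of every list, with averaged scores.

    score_lists: list of lists of (ligand_id, score), each sorted best-first.
    Returns (ligand_id, average_score) pairs sorted by average score,
    assuming a lower score is better.
    """
    if not score_lists:
        return []
    best_per_list = []
    for entries in score_lists:
        best = {}
        for ligand_id, score in entries[:nsearch]:
            # Keep only the first (best-ranked) pose of each ligand.
            best.setdefault(ligand_id, score)
        best_per_list.append(best)
    # Ligands present in the top-nsearch part of every list.
    common = set(best_per_list[0]).intersection(*best_per_list[1:])
    averaged = {
        lig: sum(best[lig] for best in best_per_list) / len(best_per_list)
        for lig in common
    }
    return sorted(averaged.items(), key=lambda item: item[1])


# Hypothetical score lists for two targets.
target_a = [("L1", -9.0), ("L2", -8.0), ("L3", -7.0)]
target_b = [("L2", -10.0), ("L4", -9.0), ("L1", -5.0)]
shared_top2 = top_overlap([target_a, target_b], nsearch=2)
shared_top3 = top_overlap([target_a, target_b], nsearch=3)
```

In this toy data only L2 is shared within the top two entries, while widening the search to three entries also brings in L1, which shows how the choice of Nsearch controls the size of the overlap.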

Compilation of the program

The program is written in Fortran 77. Its size is governed by the following parameters, established in the first line of the program (the number between the braces is the value set in the source code):

MAXLIG {125000} - maximum number of ligands per target to read
MAXTOP {125000} - maximum number of ligands per target to compare
MAXLIGAT {200} - maximum number of atoms per ligand
MAXTARGET {12} - maximum number of targets
MAXPOSE {10000} - maximum number of ligands per target to use in combining (averaging) scores/ranks
MAXDUP {100000} - maximum number of ligand pairs in the 'duplicate list'
MAXCMEM {200} - maximum number of ligand poses to combine (average) during overlap search

It should be compiled at the highest optimization level for maximum speed. For example, with the g77 compiler, the program can be compiled by

g77 -O4 -o compligset.exe compligset.f
