Limitations

Understanding what Pose Rescorer can and cannot do

Methodological Limitations

Single-Frame Approximation

Rescore performs calculations on a single structure. It does not sample conformational space or account for protein-ligand dynamics. Real binding involves ensemble averaging over many conformations, which is not captured here.

Impact: Scores can be sensitive to the input structure. Small changes in ligand pose or protein conformation can produce different energies.
No Entropy Calculation

The binding free energy is ΔG = ΔH − TΔS. Rescore computes ΔH (enthalpy) using molecular mechanics but does not compute ΔS (entropy). The reported ΔG_bind implicitly assumes ΔS = 0.

Impact: True binding free energies include both enthalpic and entropic contributions. Neglecting entropy can lead to large errors, especially for flexible ligands or systems with significant conformational changes upon binding.
Implicit Solvent Approximation

Generalized Born (GB) and Poisson-Boltzmann (PB) models approximate solvent effects using continuum electrostatics. They do not represent explicit water molecules, hydrogen bonding networks, or desolvation effects accurately.

Impact: Systems where water-mediated interactions are critical may be poorly represented. Explicit solvent MD simulations provide more accurate solvation modeling but are computationally expensive.
Force Field Limitations

AMBER force fields (ff14SB, GAFF2) are parametrized for typical drug-like molecules and standard amino acids. They may not accurately represent:
- Unusual ligands (e.g., organometallics, boron-containing compounds)
- Modified amino acids or non-standard residues
- Covalent inhibitors or transition-state analogs
- Highly charged species or zwitterions

Structural Requirements

Protein Preparation
- The receptor must be a complete protein structure (no missing heavy atoms)
- Protonation states are inferred by pdb4amber—manual curation may be needed for histidines, cysteines, or other ionizable residues
- Waters, ions, and cofactors are removed during preparation
- Multi-chain complexes are supported but not validated extensively
Ligand Preparation
- The ligand must be provided as a MOL2 file with 3D coordinates
- Protonation state is taken from the input file—use tools like MolProbity or Epik to assign correct states
- Ligand must be a single molecule (no multi-component ligands)
- AM1-BCC charge calculation assumes neutral or singly charged molecules
Unsupported Systems

Rescore cannot handle:
- Metal cofactors (e.g., zinc fingers, heme groups)
- Covalent ligands
- DNA/RNA complexes
- Membrane proteins (no lipid bilayer support)

Interpretation of Results

Relative Ranking Only

ΔG_bind values are relative scores for ranking purposes. They should not be interpreted as experimental binding affinities (K_d, K_i, IC₅₀).

Example: A ligand with ΔG = −30 kcal/mol is likely to bind more favorably than one with ΔG = −20 kcal/mol, but the absolute values do not correspond to measured affinities.
Lack of Error Bars

Single-frame calculations provide no statistical uncertainty. There are no error bars or confidence intervals. Scores should be treated as point estimates with unknown precision.
Correlation with Experiment

MM/GBSA scores do not reliably correlate with experimental binding affinities across diverse systems. The method works best for:
- Comparing structurally similar ligands
- Ranking poses for the same ligand
- Filtering large virtual screening libraries
It works poorly for:
- Predicting absolute K_d or ΔG_exp
- Comparing chemically diverse compounds
- Systems with large conformational changes or allosteric effects

Computational Considerations

Performance

A typical calculation for a ~300 residue protein and a small molecule takes on the order of minutes on modern hardware. Batch rescoring scales linearly with the number of ligands (no parallelization across ligands, but AmberTools parallelizes internal GB calculations).
Reproducibility

Results are deterministic for the same input files. However, AM1-BCC charge calculations may have small numerical variations (±0.001 e) between runs, leading to ~1 kcal/mol differences in binding energy.
File Management

Each ligand generates ~50-100 MB of output files (topologies, coordinates, logs). For large batches, ensure sufficient disk space.

Appropriate Use Cases

When to use Rescore:

Post-processing docking results to improve pose ranking
Filtering virtual screening hits for synthesis prioritization
Comparing binding modes from different docking programs
Exploratory studies where speed is more important than precision

When NOT to use Rescore:

Reporting ΔG values in publications as experimental predictions
Building QSAR models without validation on external test sets
Predicting IC₅₀ or K_d quantitatively
Studying systems where entropy, explicit water, or dynamics are critical

Alternative Methods

For more rigorous predictions, consider:

Free energy perturbation (FEP): Alchemical method with explicit solvent and sampling (e.g., FEP+, OpenFE)
Thermodynamic integration (TI): Similar to FEP but with different integration approach
Metadynamics: Enhanced sampling for exploring free energy landscapes
QM/MM calculations: Quantum mechanics for ligand binding site, MM for rest of protein

Best Practices

Always validate docking poses visually before rescoring
Use consistent protonation states across all ligands in a series
Compare scores within a congeneric series, not across diverse scaffolds
Report methods clearly if publishing: state that MM/GBSA is for relative ranking only
Do not over-interpret absolute values—focus on rank order

Limitations

Methodological Limitations

Single-Frame Approximation

No Entropy Calculation

Implicit Solvent Approximation

Force Field Limitations

Structural Requirements

Protein Preparation

Ligand Preparation

Unsupported Systems

Interpretation of Results

Relative Ranking Only

Lack of Error Bars

Correlation with Experiment

Computational Considerations

Performance

Reproducibility

File Management

Appropriate Use Cases

Alternative Methods

Best Practices

Further Reading