FreedomIntelligence

alphafold

489
57
# Install this skill:
npx skills add FreedomIntelligence/OpenClaw-Medical-Skills --skill "alphafold"

Install specific skill from multi-skill repository

# Description

>

# SKILL.md


name: alphafold
description: >
Validate protein designs using AlphaFold2 structure prediction. Use this skill when:
(1) Validating designed sequences fold correctly,
(2) Predicting binder-target complex structures,
(3) Calculating confidence metrics (pLDDT, pTM, ipTM),
(4) Self-consistency validation of designs,
(5) Multi-chain complex prediction with AlphaFold-Multimer.

For faster single-chain prediction, use esm.
For QC thresholds, use protein-qc.
license: MIT
category: design-tools
tags: [structure-prediction, validation, reference]
biomodals_script: modal_alphafold.py


AlphaFold2 Structure Validation

Prerequisites

Requirement Minimum Recommended
Python 3.8+ 3.10
CUDA 11.0+ 12.0+
GPU VRAM 32GB 40GB (A100)
RAM 32GB 64GB
Disk 100GB 500GB (for databases)

How to run

First time? See Installation Guide to set up Modal and biomodals.

cd biomodals
modal run modal_colabfold.py \
  --input-faa sequences.fasta \
  --out-dir output/

GPU: A100 (40GB) | Timeout: 3600s default

Option 2: Local installation

git clone https://github.com/deepmind/alphafold.git
cd alphafold

python run_alphafold.py \
  --fasta_paths=query.fasta \
  --output_dir=output/ \
  --model_preset=monomer \
  --max_template_date=2026-01-01

Option 3: ESMFold (fast single-chain)

modal run modal_esmfold.py \
  --sequence "MKTAYIAKQRQISFVK..."

Key parameters

Parameter Default Options Description
--model_preset monomer monomer/multimer Model type
--num_recycle 3 1-20 Recycling iterations
--max_template_date - YYYY-MM-DD Template cutoff
--use_templates True True/False Use template search

Output format

output/
β”œβ”€β”€ ranked_0.pdb           # Best model
β”œβ”€β”€ ranked_1.pdb           # Second best
β”œβ”€β”€ ranking_debug.json     # Confidence scores
β”œβ”€β”€ result_model_1.pkl     # Full results
β”œβ”€β”€ msas/                  # MSA files
└── features.pkl           # Input features

Extracting metrics

import pickle

with open('result_model_1.pkl', 'rb') as f:
    result = pickle.load(f)

plddt = result['plddt']
ptm = result['ptm']
iptm = result.get('iptm', None)  # Multimer only
pae = result['predicted_aligned_error']

Sample output

Successful run

$ python run_alphafold.py --fasta_paths complex.fasta --model_preset multimer
[INFO] Running MSA search...
[INFO] Running model 1/5...
[INFO] Running model 5/5...
[INFO] Relaxing structures...

Results:
  ranked_0.pdb:
    pLDDT: 87.3 (mean)
    pTM: 0.78
    ipTM: 0.62
    PAE (interface): 8.5

Saved to output/

What good output looks like:
- pLDDT: > 85 (mean, on 0-100 scale) or > 0.85 (normalized)
- pTM: > 0.70
- ipTM: > 0.50 for complexes
- PAE_interface: < 10

Decision tree

Should I use AlphaFold?
β”‚
β”œβ”€ What are you predicting?
β”‚  β”œβ”€ Single protein β†’ ESMFold (faster)
β”‚  β”œβ”€ Protein-protein complex β†’ AlphaFold/ColabFold βœ“
β”‚  β”œβ”€ Protein + ligand β†’ Chai or Boltz
β”‚  └─ Batch of sequences β†’ ColabFold βœ“
β”‚
β”œβ”€ What do you need?
β”‚  β”œβ”€ Highest accuracy β†’ AlphaFold/ColabFold βœ“
β”‚  β”œβ”€ Fast screening β†’ ESMFold
β”‚  └─ MSA-free prediction β†’ Chai or ESMFold
β”‚
└─ Which AF2 option?
   β”œβ”€ Local installation β†’ Full control, slow setup
   β”œβ”€ ColabFold β†’ Easier, MSA server
   └─ Modal β†’ Recommended for batch

Typical performance

Campaign Size Time (A100) Cost (Modal) Notes
100 complexes 1-2h ~$8 With MSA server
500 complexes 5-10h ~$40 Standard campaign
1000 complexes 10-20h ~$80 Large campaign

Per-complex: ~30-60s with MSA server.


Verify

find output -name "ranked_0.pdb" | wc -l  # Should match input count

Troubleshooting

Low pLDDT regions: May indicate disorder or poor design
Low ipTM: Interface not confident, check hotspots
High PAE off-diagonal: Chains may not interact
OOM errors: Use ColabFold with MSA server instead

Error interpretation

Error Cause Fix
RuntimeError: CUDA out of memory Sequence too long Use A100 or split prediction
KeyError: 'iptm' Running monomer on complex Use multimer preset
FileNotFoundError: database Missing MSA databases Use ColabFold MSA server
TimeoutError MSA search slow Reduce num_recycles

Next: protein-qc for filtering and ranking.

# Supported AI Coding Agents

This skill is compatible with the SKILL.md standard and works with all major AI coding agents:

Learn more about the SKILL.md standard and how to use these skills with your preferred AI coding agent.