binder-design - SKILL.md Agent Skill

name: binder-design description: > Guidance for choosing the right protein binder design tool. Use this skill when: (1) Deciding between BoltzGen, BindCraft, or RFdiffusion, (2) Planning a binder design campaign, (3) Understanding trade-offs between different approaches, (4) Selecting tools for specific target types.

For specific tool parameters, use the individual tool skills (boltzgen, bindcraft, rfdiffusion, etc.). license: MIT category: orchestration tags: [guidance, tool-selection, workflow]

Binder Design Tool Selection

Which tool wins

No single tool is best for every target. Hit-rate is strongly target-dependent, so choose by target type, what you want to control, and available compute.

The clearest signal comes from head-to-head competitions where many methods design against the same target. On the Adaptyv Nipah de novo target, the public results show:

Method	Tested	Binders	Hit-rate
Mosaic (gradient, multi-model)	9	8	89%
ProteinMPNN hybrid	28	7	25%
RFdiffusion	60	13	22%
BindCraft	98	7	7%
BoltzGen	182	6	3%

Mosaic had the highest hit-rate here, but on a small, expert-tuned sample. The ranking shifts on other targets, and that target-dependence is true of every method (BoltzGen, Boltz, BindCraft, Mosaic). You cannot know a priori which will win on a new target, so this is not a fixed leaderboard.

Because of that, choose a starting point by cost and effort to a binder, not by assuming a method has the best hit-rate. BoltzGen is the suggested default because it is turnkey and all-atom, so it gets you testable designs fastest with the least setup. Mosaic is the high-ceiling option when you can invest time tuning the objective. On a hard or important target, running more than one method in parallel is reasonable.

De novo binder design?
│
├─ Lowest cost/effort to testable designs → BoltzGen (default)
├─ Hard/important target, can invest tuning → Mosaic (gradient, multi-model)
├─ Ligand / small-molecule binding → BoltzGen (all-atom)
├─ Diversity / exploration → RFdiffusion + ProteinMPNN
├─ End-to-end with built-in validation → BindCraft
└─ Antibody / nanobody (VHH) → germinal skill (also mber, iggm in biomodals)

Tool comparison

Tool	Strengths	Weaknesses	Best for
BoltzGen	All-atom, single-step, turnkey	One model in the loop; mid-range cost per design	Lowest-effort default, ligand binding
Mosaic	Composable multi-model objective, won hard head-to-heads	Needs tuning, local JAX only	Hard or important targets, expert use
BindCraft	End-to-end, built-in AF2 validation	Less diverse	Production campaigns
RFdiffusion	High diversity	Requires ProteinMPNN; not in biomodals	Exploration, diversity
Germinal	Antibody and nanobody formats	Finicky	scFv / VHH design

Compute cost per design

Adaptyv's own tests of these models showed the following compute cost per accepted design, averaged across 7 targets (it varies several-fold by target):

Method	Cost per design
RSO	~$0.15
RFdiffusion	~$0.25
Mosaic	~$0.55
ESMFold2 inversion	~$0.85
mBER	~$1.40
Germinal	~$1.60
BoltzGen	~$1.80
BindCraft	~$2.90

Per-design compute cost is not the same as cost to a binder, which also depends on the hit-rate on your target. The gradient methods (RSO, Mosaic) are cheap per design but need setup and tuning; BoltzGen and BindCraft cost more per design but are turnkey, so their advantage is low human effort rather than lowest compute cost.

Compute vs effort tradeoff

Lowest human effort: BoltzGen needs no tuning and runs through biomodals. Good first pass and good for ligand binding.
Highest ceiling on a hard target: Mosaic, given time to design and tune the objective. It runs locally on a JAX GPU rather than through biomodals, and is cheap per design.
Whatever the generator, validate with boltz or chai and rank with ipsae.

Other biomodals-backed options: modal_rso.py (Rejection Sampling Optimization, an AlphaFold-based gradient method) for minibinders, and modal_mber.py for VHH nanobodies.

Example pipeline: BoltzGen → Chai → QC

BoltzGen provides all-atom design with built-in side-chain packing. This is one turnkey path; swap in Mosaic, RFdiffusion, or BindCraft depending on the target.

Target → BoltzGen → Validate → Filter
 (pdb)  (all-atom)   (chai)     (qc)

1. Target preparation

# Fetch structure from PDB
# Use pdb skill for guidance

Trim to binding region + 10A buffer
Remove waters and ligands
Renumber chains if needed

2. Hotspot selection

Choose 3-6 exposed residues
Prefer charged/aromatic residues
Cluster spatially (within 10-15A)

3. Design with BoltzGen

First, create a YAML config file (e.g., binder.yaml):

entities:
  - protein:
      id: B
      sequence: 70..100

  - file:
      path: target.cif
      include:
        - chain:
            id: A
      binding_types:
        - chain:
            id: A
            binding: 45,67,89

Then run:

modal run modal_boltzgen.py \
  --input-yaml binder.yaml \
  --protocol protein-anything \
  --num-designs 50

Why BoltzGen?

All-atom output (no separate ProteinMPNN step needed)
Better for ligand/small molecule binding
Single-step design (backbone + sequence + side chains)

4. Alternative: RFdiffusion Pipeline

For maximum diversity or when backbone-only is preferred:

# Step 1: Backbone generation (RFdiffusion, run from the official repo)
python run_inference.py \
  inference.input_pdb=target.pdb \
  contigmap.contigs=[A1-150/0 70-100] \
  ppi.hotspot_res=[A45,A67,A89] \
  inference.num_designs=500

# Step 2: Sequence design
modal run modal_ligandmpnn.py \
  --input-pdb backbone.pdb \
  --params-str "--number_of_batches 16 --temperature 0.1"

5. Validation

modal run modal_chai1.py \
  --input-faa sequences.fasta \
  --out-dir predictions/

6. Filtering

Apply standard thresholds:

pLDDT > 0.80
ipTM > 0.50
PAE_interface < 10
scRMSD < 2.0 A

See protein-qc skill for details.

Number of designs

Stage	Count	Purpose
Backbone generation	500-1000	Diversity
Sequences per backbone	8-16	Sequence space
AF2 predictions	All	Validation
After filtering	50-200	Candidates
Experimental testing	10-50	Final selection

Common mistakes

Wrong hotspots

Using buried residues
Too many hotspots (over-constrain)
Wrong chain/residue numbers

Insufficient diversity

Too few designs generated
Low temperature in ProteinMPNN
Not exploring multiple backbones

Poor target preparation

Including full protein instead of binding region
Missing important structural features
Wrong protonation states

Timeline guide

Step	Compute Time
RFdiffusion (500 designs)	2-4 hours
ProteinMPNN (8000 sequences)	1-2 hours
AF2 prediction (8000 sequences)	12-24 hours
Filtering and analysis	1-2 hours

Total: 1-2 days of compute