binder-design

star 136

Guidance for choosing the right protein binder design tool. Use this skill when: (1) Deciding between BoltzGen, BindCraft, or RFdiffusion, (2) Planning a binder design campaign, (3) Understanding trade-offs between different approaches, (4) Selecting tools for specific target types. For specific tool parameters, use the individual tool skills (boltzgen, bindcraft, rfdiffusion, etc.).

adaptyvbio By adaptyvbio schedule Updated 1/16/2026

name: binder-design description: > Guidance for choosing the right protein binder design tool. Use this skill when: (1) Deciding between BoltzGen, BindCraft, or RFdiffusion, (2) Planning a binder design campaign, (3) Understanding trade-offs between different approaches, (4) Selecting tools for specific target types.

For specific tool parameters, use the individual tool skills (boltzgen, bindcraft, rfdiffusion, etc.). license: MIT category: orchestration tags: [guidance, tool-selection, workflow]

Binder Design Tool Selection

Which tool wins

No single tool is best for every target. Hit-rate is strongly target-dependent, so choose by target type, what you want to control, and available compute.

The clearest signal comes from head-to-head competitions where many methods design against the same target. On the Adaptyv Nipah de novo target, the public results show:

Method Tested Binders Hit-rate
Mosaic (gradient, multi-model) 9 8 89%
ProteinMPNN hybrid 28 7 25%
RFdiffusion 60 13 22%
BindCraft 98 7 7%
BoltzGen 182 6 3%

Mosaic had the highest hit-rate here, but on a small, expert-tuned sample. The ranking shifts on other targets, and that target-dependence is true of every method (BoltzGen, Boltz, BindCraft, Mosaic). You cannot know a priori which will win on a new target, so this is not a fixed leaderboard.

Because of that, choose a starting point by cost and effort to a binder, not by assuming a method has the best hit-rate. BoltzGen is the suggested default because it is turnkey and all-atom, so it gets you testable designs fastest with the least setup. Mosaic is the high-ceiling option when you can invest time tuning the objective. On a hard or important target, running more than one method in parallel is reasonable.

De novo binder design?
│
├─ Lowest cost/effort to testable designs → BoltzGen (default)
├─ Hard/important target, can invest tuning → Mosaic (gradient, multi-model)
├─ Ligand / small-molecule binding → BoltzGen (all-atom)
├─ Diversity / exploration → RFdiffusion + ProteinMPNN
├─ End-to-end with built-in validation → BindCraft
└─ Antibody / nanobody (VHH) → germinal skill (also mber, iggm in biomodals)

Tool comparison

Tool Strengths Weaknesses Best for
BoltzGen All-atom, single-step, turnkey One model in the loop; mid-range cost per design Lowest-effort default, ligand binding
Mosaic Composable multi-model objective, won hard head-to-heads Needs tuning, local JAX only Hard or important targets, expert use
BindCraft End-to-end, built-in AF2 validation Less diverse Production campaigns
RFdiffusion High diversity Requires ProteinMPNN; not in biomodals Exploration, diversity
Germinal Antibody and nanobody formats Finicky scFv / VHH design

Compute cost per design

Adaptyv's own tests of these models showed the following compute cost per accepted design, averaged across 7 targets (it varies several-fold by target):

Method Cost per design
RSO ~$0.15
RFdiffusion ~$0.25
Mosaic ~$0.55
ESMFold2 inversion ~$0.85
mBER ~$1.40
Germinal ~$1.60
BoltzGen ~$1.80
BindCraft ~$2.90

Per-design compute cost is not the same as cost to a binder, which also depends on the hit-rate on your target. The gradient methods (RSO, Mosaic) are cheap per design but need setup and tuning; BoltzGen and BindCraft cost more per design but are turnkey, so their advantage is low human effort rather than lowest compute cost.

Compute vs effort tradeoff

  • Lowest human effort: BoltzGen needs no tuning and runs through biomodals. Good first pass and good for ligand binding.
  • Highest ceiling on a hard target: Mosaic, given time to design and tune the objective. It runs locally on a JAX GPU rather than through biomodals, and is cheap per design.
  • Whatever the generator, validate with boltz or chai and rank with ipsae.

Other biomodals-backed options: modal_rso.py (Rejection Sampling Optimization, an AlphaFold-based gradient method) for minibinders, and modal_mber.py for VHH nanobodies.

Example pipeline: BoltzGen → Chai → QC

BoltzGen provides all-atom design with built-in side-chain packing. This is one turnkey path; swap in Mosaic, RFdiffusion, or BindCraft depending on the target.

Target → BoltzGen → Validate → Filter
 (pdb)  (all-atom)   (chai)     (qc)

1. Target preparation

# Fetch structure from PDB
# Use pdb skill for guidance
  • Trim to binding region + 10A buffer
  • Remove waters and ligands
  • Renumber chains if needed

2. Hotspot selection

  • Choose 3-6 exposed residues
  • Prefer charged/aromatic residues
  • Cluster spatially (within 10-15A)

3. Design with BoltzGen

First, create a YAML config file (e.g., binder.yaml):

entities:
  - protein:
      id: B
      sequence: 70..100

  - file:
      path: target.cif
      include:
        - chain:
            id: A
      binding_types:
        - chain:
            id: A
            binding: 45,67,89

Then run:

modal run modal_boltzgen.py \
  --input-yaml binder.yaml \
  --protocol protein-anything \
  --num-designs 50

Why BoltzGen?

  • All-atom output (no separate ProteinMPNN step needed)
  • Better for ligand/small molecule binding
  • Single-step design (backbone + sequence + side chains)

4. Alternative: RFdiffusion Pipeline

For maximum diversity or when backbone-only is preferred:

# Step 1: Backbone generation (RFdiffusion, run from the official repo)
python run_inference.py \
  inference.input_pdb=target.pdb \
  contigmap.contigs=[A1-150/0 70-100] \
  ppi.hotspot_res=[A45,A67,A89] \
  inference.num_designs=500

# Step 2: Sequence design
modal run modal_ligandmpnn.py \
  --input-pdb backbone.pdb \
  --params-str "--number_of_batches 16 --temperature 0.1"

5. Validation

modal run modal_chai1.py \
  --input-faa sequences.fasta \
  --out-dir predictions/

6. Filtering

Apply standard thresholds:

  • pLDDT > 0.80
  • ipTM > 0.50
  • PAE_interface < 10
  • scRMSD < 2.0 A

See protein-qc skill for details.

Number of designs

Stage Count Purpose
Backbone generation 500-1000 Diversity
Sequences per backbone 8-16 Sequence space
AF2 predictions All Validation
After filtering 50-200 Candidates
Experimental testing 10-50 Final selection

Common mistakes

Wrong hotspots

  • Using buried residues
  • Too many hotspots (over-constrain)
  • Wrong chain/residue numbers

Insufficient diversity

  • Too few designs generated
  • Low temperature in ProteinMPNN
  • Not exploring multiple backbones

Poor target preparation

  • Including full protein instead of binding region
  • Missing important structural features
  • Wrong protonation states

Timeline guide

Step Compute Time
RFdiffusion (500 designs) 2-4 hours
ProteinMPNN (8000 sequences) 1-2 hours
AF2 prediction (8000 sequences) 12-24 hours
Filtering and analysis 1-2 hours

Total: 1-2 days of compute

Install via CLI
npx skills add https://github.com/adaptyvbio/protein-design-skills --skill binder-design
Repository Details
star Stars 136
call_split Forks 16
navigation Branch main
article Path SKILL.md
More from Creator