Skill

openfold3-nim

Predicts biomolecular structures (proteins, RNA, DNA, ligands, multi-chain complexes) using OpenFold3 NIM via hosted NVIDIA API or local Docker. Covers endpoint choice, auth, payloads, output artifacts, and container setup.

ai-ml

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/bionemo-agent-toolkit:openfold3-nim

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

BashReadWriteAskUserQuestion

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Predict biomolecular structures with OpenFold3. It supports proteins, DNA, RNA,

Supporting Files

references/api.mdreferences/examples.mdreferences/parameters.mdreferences/science.mdreferences/validation.md

SKILL.md

196 lines · ~1.8k tokens

Stats

LanguagePython

Parent stars30

Parent forks4

MaintenanceGood

Last CommitJun 23, 2026

Actions

View Source View Plugin View on GitHub View README

OpenFold3 NIM

Predict biomolecular structures with OpenFold3. It supports proteins, DNA, RNA, small-molecule ligands, and multi-entity assemblies. Use this SKILL.md for basic hosted/local NIM use; load supplemental files only when the task needs deeper context:

references/api.md: exact endpoints, schemas, Docker flags, response fields.
references/science.md: purpose, strengths, limitations, and model handoffs.
references/parameters.md: molecule fields, MSAs, templates, samples, tuning.
references/validation.md: artifact checks and scientific sanity checks.
references/examples.md: compact hosted/local request patterns.

Choose Mode

Ask only when context is unclear:

Hosted NVIDIA API or local Docker NIM?

Hosted URL: https://health.api.nvidia.com/v1/biology/openfold/openfold3/predict
Local URL: http://localhost:8000/biology/openfold/openfold3/predict
Local readiness: http://localhost:8000/v1/health/ready

Mode difference: the local prediction path has no /v1/ prefix. Hosted requests use Authorization: Bearer $NGC_API_KEY. Supported local Docker startup uses NGC_API_KEY (or NVIDIA_API_KEY via the preflight) for registry login, entitlement checks, and first-run model downloads; pass it into the container with -e NGC_API_KEY. Local inference requests use no auth header after readiness. Warm-cache key-free startup varies by image/version and should not be assumed.

Auth And Environment

Do not print API keys. Confirm they exist with shell tests, not echoes.

Hosted needs NGC_API_KEY in the request header. Local startup needs NGC_API_KEY, or NVIDIA_API_KEY as a fallback, plus LOCAL_NIM_CACHE. A repo-root .env file may be sourced as a local override before validation.

Local Docker

Use the official OpenFold3 NIM image and mount LOCAL_NIM_CACHE at /opt/nim/.cache. First startup downloads model artifacts and can take several minutes.

When writing local setup commands, copy the preflight below exactly. Do not replace it with a simple : "${NGC_API_KEY:?Set NGC_API_KEY}" check, do not drop NVIDIA_API_KEY, and do not invent a default LOCAL_NIM_CACHE; those lines are the repo's local NIM env contract. The default single-GPU launch should show the literal --gpus "device=0"; choose a different device only when the user asks.

set -a
[ -f .env ] && . ./.env
set +a

if [ -z "${NGC_API_KEY:-}" ] && [ -n "${NVIDIA_API_KEY:-}" ]; then
  export NGC_API_KEY="$NVIDIA_API_KEY"
fi
: "${NGC_API_KEY:?Set NGC_API_KEY or NVIDIA_API_KEY}"
: "${LOCAL_NIM_CACHE:?Set LOCAL_NIM_CACHE}"

echo "$NGC_API_KEY" | docker login nvcr.io --username '$oauthtoken' --password-stdin

mkdir -p "${LOCAL_NIM_CACHE}"
chmod 777 "${LOCAL_NIM_CACHE}"

docker run --rm --name openfold3 \
  --runtime=nvidia \
  --gpus "device=0" \
  --shm-size=16g \
  -e NGC_API_KEY \
  -v "${LOCAL_NIM_CACHE}:/opt/nim/.cache" \
  -p 8000:8000 \
  nvcr.io/nim/openfold/openfold3:latest

Readiness check:

until curl -sf http://localhost:8000/v1/health/ready; do sleep 5; done

Request Pattern

Use requests.post(..., json=payload, timeout=300). For local Docker tasks, set hosted = False after the readiness check passes.

import os
import requests

hosted = True
url = (
    "https://health.api.nvidia.com/v1/biology/openfold/openfold3/predict"
    if hosted
    else "http://localhost:8000/biology/openfold/openfold3/predict"
)
headers = {"Content-Type": "application/json"}
if hosted:
    headers["Authorization"] = f"Bearer {os.environ['NGC_API_KEY']}"

seq = "MKTVRQERLKSIVR"
payload = {
    "inputs": [{
        "input_id": "prediction_1",
        "output_format": "pdb",
        "molecules": [{
            "type": "protein",
            "id": "A",
            "sequence": seq,
            "diffusion_samples": 1,
            "msa": {
                "main": {
                    "a3m": {
                        "alignment": f">query\n{seq}",
                        "format": "a3m"
                    }
                }
            }
        }]
    }]
}

response = requests.post(url, headers=headers, json=payload, timeout=300)
response.raise_for_status()
result = response.json()

Payload gotchas:

Top level is {"inputs": [...]} and OpenFold3 accepts exactly one input.
molecules can contain 1-32 objects with type: protein, dna, rna, or ligand.
Protein/RNA MSAs are optional but, when supplied, alignment must start with a FASTA header such as >query\nSEQUENCE.
Ligands use either smiles or ccd_codes, for example {"type": "ligand", "id": "L", "ccd_codes": "ATP"}.
DNA/RNA entities use sequence, for example {"type": "dna", "id": "B", "sequence": "ATCGATCG"}.
diffusion_samples is 1-5. output_format is pdb or cif.

Save And Interpret Output

Save every returned structure as a scientific artifact. Main response path: result["outputs"][0]["structures_with_scores"].

output = result["outputs"][0]
for i, sample in enumerate(output["structures_with_scores"], start=1):
    fmt = sample["format"]
    with open(f"openfold3_structure_{i}.{fmt}", "w", encoding="utf-8") as fh:
        fh.write(sample["structure"])
    print("confidence_score", sample.get("confidence_score"))
    print("complex_plddt_score", sample.get("complex_plddt_score"))
    print("ptm_score", sample.get("ptm_score"))
    print("iptm_score", sample.get("iptm_score"))
    print("complex_pde_score", sample.get("complex_pde_score"))

Higher confidence_score, complex_plddt_score, ptm_score, and iptm_score are generally better; lower complex_pde_score is generally better. Treat toy or very short sequences as API smoke tests, not meaningful structural biology. For why and when OpenFold3 is scientifically appropriate, read references/science.md.

Common Limits

Inputs per request: 1.
Molecules per input: 1-32.
Diffusion samples: 1-5.
TensorRT path supports shorter sequences; PyTorch path can support longer sequences, but long inputs need much more GPU memory.
Sequences over roughly 1800 residues require at least 80 GB GPU memory.
Local NIM is single-GPU only; choose the target device in the Docker flag.

Troubleshooting

401: missing, expired, or unauthorized NGC API key.
422: invalid molecule type, invalid sequence characters, bad MSA shape, or diffusion_samples outside 1-5.
MSA errors: ensure the alignment starts with >query\n.
Local 404: remove /v1/ from the prediction URL.
Local startup stalls: first run may be downloading 10-15 GB of model weights into LOCAL_NIM_CACHE.
Memory errors: shorten the sequence, reduce samples, or use a larger GPU.

openfold3-nim

Popularity

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

openfold3-nim

Popularity

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

OpenFold3 NIM

Choose Mode

Auth And Environment

Local Docker

Request Pattern

Save And Interpret Output

Common Limits

Troubleshooting

Similar Skills

OpenFold3 NIM

Choose Mode

Auth And Environment

Local Docker

Request Pattern

Save And Interpret Output

Common Limits

Troubleshooting

Similar Skills