Tool: Boltz

Boltz is an artificial-intelligence method for predicting biomolecular structures consisting of proteins, RNA, DNA, and other molecules such as ligands, cofactors, and drugs. Inspired by AlphaFold 3, Boltz is fully open-source and freely available for both academic and commercial use under the MIT license. See:

Boltz-1 Democratizing Biomolecular Interaction Modeling. Wohlwend J, Corso G, Passaro S, Reveiz M, Leidal K, Swiderski W, Portnoi T, Chinn I, Silterra J, Jaakkola T, Barzilay R. bioRxiv [Preprint]. 2024 Dec 27:2024.11.19.624167.

The ChimeraX Boltz tool installs and runs Boltz (currently version 2.1.1) on the local machine. The prediction runs on Mac, Linux, and Windows without requiring an Nvidia GPU, typically taking on the order of minutes, and is run in background so that ChimeraX can be used for other tasks.

The ChimeraX Boltz tool can be opened from the Structure Prediction section of the Tools menu and manipulated like other panels (more...). It is also implemented as the boltz command.

Boltz-predicted structures vary in confidence levels (see coloring) and should be interpreted with caution. Residue-residue alignment errors for the modeled structures are shown in the Error Plot. See the ChimeraX Boltz details and video Boltz structure prediction in ChimeraX. See also: AlphaFold, ESMFold, Modeller Comparative, Model Loops, computational screening for protein-protein interactions

Boltz Installation
Defining and Running a Prediction
Error Plot
Limitations

[back to top: Boltz]

Boltz Installation

Boltz installation only needs to be done once per computer, as long as the ChimeraX installation is not moved or deleted. Clicking the Install Boltz button on the tool dialog creates a Python virtual environment to install Boltz from PyPi. Boltz uses Torch and other packages, and the total installation including Boltz, the trained neural network weights, and the PDB Chemical Component Dictionary for defining residue types is over 4 GB and may take 10 minutes or more to download and install, depending on network speed. Boltz (currently version 2.1.1) is installed in ~/boltz2 in the user's home directory. This directory will be created, or if it already exists must be empty. The Boltz network parameters and Chemical Component Dictionary are downloaded to ~/.boltz. Installation can also be done with the command boltz install.

[back to top: Boltz]

Defining and Running a Prediction

The specified Prediction name will be used in naming the output folder and files, as detailed in the options. The structure to predict is defined by Adding one or more molecular components. For assemblies containing multiple copies of the same chain, that component should be added multiple times. Components can be defined by:

chain identifiers in currently open atomic structures
sequence pasted in as plain text: protein, RNA, or DNA
UniProt name or accession
3- or 5-letter residue name in the PDB Chemical Component Dictionary (CCD) (ligands, solvent, ions)
SMILES string (small organic molecules)
each ligand SMILES string – a list of ligands for batch prediction of each single ligand with the other components; each line in the list should be of the form name,SMILES (or simply SMILES to give names ligand1, ligand2, ... in the results)

Batch prediction results will be shown in a table listing the ligand names, ipTM confidence scores, affinity scores (if predicted, see options), binding probabilities, and SMILES strings. Clicking a column header sorts the table by the values in that column, and selecting one or more rows and clicking Open opens the corresponding structure prediction(s). The results are written as a comma-separated values file (.bzlig), and this file can be accessed later from the File History to reshow the table. Another way to reshow the table is with the command boltz ligandtable, by specifying the directory in which Boltz was run. The table's context menu includes an option to write the results as a .bzlig file.

The current set of components to model are listed in a table, with the polymer residue count tallied underneath to help assess the size of the calculation (see the ChimeraX Boltz details for guidelines on run times based on size and resources). The Clear button can be used to clear the table contents to start over, and Delete selected rows to remove just the row(s) currently highlighted in the table.

The Options button shows/hides additional options:

Results directory (initial default ~/Desktop/boltz_[name] – the pathname (name and location) of a folder or directory in which to store prediction results, where [name] (if included) indicates substitution of the prediction name specified by the user in the main dialog. The folder does not need to exist already, but if it does exist and is not empty, a numeric suffix will be appended automatically as needed to avoid overwriting previous results. Clicking Browse brings up a file browser window for choosing a folder interactively.
Number of predicted structures (initial default 1) – how many predictions to generate; if more than one, they typically have only small variations
Predict ligand binding affinity for
- none (initial default)
- last ligand – the last non-biopolymer listed in the table of components
- ligand-name – ligand CCD code or SMILES string as listed in the table of components
Boltz can predict the binding affinity in µM for a single ligand. It was trained using Kd, Ki, and IC50 affinity values, treating them as equivalent, so the predicted affinity should be interpreted as a qualitative affinity without a precise definition. Only one affinity prediction is made even if the system contains multiple ligands, and the affinity cannot be predicted for ligands that occur in more than one copy.
Use steering potentials. May be more accurate, but slower. (initial default off) – whether to use Boltz diffusion steering potentials
Use multiple sequence alignment cache (initial default on) – whether to cache (and potentially reuse) the deep sequence alignments generated by the Colabfold server for protein chains. The alignment cache location is ~/Downloads/ChimeraX/BoltzMSA/ Reusing the alignment saves time when multiple predictions will be performed for the same protein or set of proteins but different small-molecule ligands. Because the alignments for different proteins in an assembly are paired to match ones from the same organisms, the cached alignments can only be reused for assemblies with the exact same set of proteins. Alignments computed for individual proteins from multiple different runs cannot be used for an assembly of those proteins.
Compute device – whether to use the CPU always, GPU always (requires an Nvidia or Mac M series GPU), or GPU if available (initial default), as it will be faster than the CPU
Boltz install location – the folder containing a virtual Python environment in which Boltz is installed

Clicking Save default options saves the current option settings as user preferences. More options are available as part of the boltz command.

Clicking Predict launches the calculation (see the ChimeraX Boltz details for run times on various systems). The Boltz prediction is run in the background so that ChimeraX can be used for other tasks. Clicking Stop halts a calculation in progress. When the prediction finishes, the resulting structure(s) are opened automatically.

When first opened, the predicted structures are colored by the pLDDT confidence measure (same as for AlphaFold models) in the B-factor field:

100

to 90

– high accuracy expected
90

to 70

– backbone expected to be modeled well
70

to 50

– low confidence, caution
50

to 0

– should not be interpreted, may be disordered

...in other words, using

color bfactor palette alphafold

The Color Key graphical interface or a command can be used to draw a corresponding color key, for example:

key red:low orange: yellow: cornflowerblue: blue:high [other-key-options]

A prediction with at least one component specified by structure chain will be superimposed on the pre-existing chain with matchmaker. If more than one chain in the predicted assembly was specified by an existing chain ID, only the first one is used for superposition.

Error plot shows a plot of the predicted aligned error (PAE), in which color gradations show (for each pairwise combination of residues) the expected error in position of one residue when the true and predicted structures are aligned based on the other residue.

[back to top: Boltz]

Error Plot

Besides the per-residue pLDDT confidence measure, Boltz gives for each pair of residues (X,Y) the expected position error at residue X if the predicted and true structures were aligned on residue Y. These residue-residue “predicted aligned error” (PAE) values can be shown in a plot by clicking the Error plot button on the Boltz dialog.

When the mouse cursor is over the plot, the residue pair and PAE value at its current position are reported in the bottom right corner of the window.

Clicking Color PAE Domains clusters the residues into coherent domains (sets of residues with relatively low PAE values) and uses randomly chosen colors to distinguish these domains in the structure (details...). Clicking Color pLDDT returns the structure to the default confidence coloring.

The plot's context menu includes:

Dragging box colors structure (initial default checked on) – whether dragging a box on the plot highlights the corresponding parts of the 3D structure with bright colors and makes everything else gray; if this option is unchecked, highlighting will be done with selection instead of coloring
Color plot from structure – color the plot to match the 3D structure where the pair of residues represented by an X,Y point have the same ribbon color; show the rest of the plot in shades of gray
Color plot rainbow – use the pae palette (default) to color the plot, with colors assigned to values as follows:
0 5 10 15 20 25 30
Color plot green – use the paegreen palette to color the plot:
0 5 10 15 20 25 30
Show chain divider lines (initial default checked on) – for multimer predictions, draw lines on the plot demarcating the end of one chain and the start of another; the lines may obscure a few chain-terminal residues in the plot, and can be hidden if this is problematic
Save image – save the plot as a PNG file

The Color Key graphical interface or a command can be used to draw (in the main graphics window) a color key for the PAE plot. For example, to make a color key that matches the pae or paegreen scheme, respectively:

key pae :0 : : :15 : : :30 showTool true
key paegreen :0 : : :15 : : :30 showTool true

A title for the color key (e.g., “Predicted Aligned Error (Å)”) would need to be created separately with 2dlabels.

[back to top: Boltz]

Limitations

Structure size. Boltz uses a lot of memory, and the amount of available memory limits the size of structures that can be predicted. For a computer with 32 Gbytes, the size limit is roughly 1000 residues plus ligand atoms (called "tokens"). Consumer Nvidia GPUs with 8 or 12 GB of memory (e.g. RTX 3070) only handle 300-500 residues before using CPU memory on Windows, which slows the prediction 10-20 fold. On Linux, it will not use CPU memory. Consumer Nvidia GPUs with 24 GB (RTX 3090 and RTX 4090) are able to predict 1000 tokens, or about 1400 with 16-bit floating point. Prediction size limits are perhaps the most important shortcoming of Boltz compared to AlphaFold 3, which handles memory more efficiently and is able to predict 5000 tokens with 80GB of GPU memory, about twice the size that Boltz can predict. A drawback of AlphaFold 3 is that it requires Linux and an Nvidia GPU, in addition to various licensing restrictions. We hope that in the future, Boltz will optimize memory use to allow predicting larger structures.

Run time. The computation time increases quadratically with the number of tokens, so a prediction with 3 times the number of residue and ligand atoms will take approximately 9 times longer to run. For a table of the run times to predict assemblies of different sizes on various desktop and laptop computers, see the ChimeraX Boltz details.

Nvidia GPU support on Windows. Installing Boltz will get a CUDA-enabled version of the torch machine learning package if it detects Nvidia graphics. It decides if you have Nvidia graphics by seeing if the file C:/Windows/System32/nvidia-smi.exe exists. Otherwise it gets a cpu-only version of torch. If you install an Nvidia graphics driver after installing Boltz, you will have to reinstall Boltz to get the CUDA version. The installed torch is for CUDA 12.6 or newer. If your computer has a version of CUDA older than 12.6 but newer than 11.8, you can run the following commands in a Windows Command Prompt to install a CUDA 11.8 version of torch. For other CUDA versions, refer to the Torch installation page for the correct pip install command.

  > cd C:\Users\username\boltz\Scripts
  > pip.exe uninstall torch
  > pip.exe install torch --index-url https://download.pytorch.org/whl/cu118

Nvidia GPU support on Linux. On Linux, the installed Boltz will work with CUDA 12.6 or newer if you have Nvidia graphics. If you have an older system CUDA version it may still work, or you can refer to the Torch installation page for the correct pip install command and replace torch with the following shell commands:

  $ cd ~/boltz/bin
  $ ./pip uninstall torch
  $ ./pip install torch --index-url https://download.pytorch.org/whl/cu118

No covalently linked ligands. Although Boltz can predict covalently linked ligands, that capability is not yet available in the ChimeraX interface or command. Similarly, post-translational modifications such as phosphorylation are not yet supported.

No chain identifiers assigned. It can be helpful to assign chain identifiers (A,B,C...) to the different molecular components to match existing structures. Boltz is capable of this, but the ChimeraX user interface does not currently allow it.

Multiple sequence alignments (MSAs). Boltz uses the Colabfold MSA server (https://api.colabfold.com) for computing deep sequence alignments. This requires internet connectivity and is subject to outages if that server (located in Korea currently) is down. By default, the sequence alignments are cached in ~/Downloads/ChimeraX/BoltzMSA so that they can be reused for subsequent predictions with the same set of polymers.

UCSF Resource for Biocomputing, Visualization, and Informatics / November 2025