Chimera Interface to Modeller
Chimera provides a graphical interface to running the program
Modeller,
either locally or via a web service hosted by the
UCSF
RBVI. Two types of calculations are available:
- Comparative (homology) modeling.
Theoretical models of a protein are generated using at least one known related 
structure and a sequence alignment of the known and unknown structures.
The protein to be modeled is the target, and a related 
known structure used for modeling is a template.
The inputs for comparative modeling can be generated in
several ways.
 
 
- Building 
parts of a protein without using a template.
Missing segments can be built de novo, or existing segments
refined by generating additional possible conformations.
Parts that need building or refinement are often loop regions.
Modeller is developed by the 
Sali Lab.
Use of Modeller, whether a previously
downloaded copy or via web service, 
requires a license key.
Academic users can register free of charge to receive a license key. 
(Commercial entities and government research labs, please see
Modeller licensing.) Modeller users should cite:
Comparative protein modelling by satisfaction of spatial restraints.
Šali A, Blundell TL.
J Mol Biol. 1993 Dec 5;234(3):779-815.
See also: 
Build Structure,
Rotamers,
Multalign Viewer,
mda,
ModBase fetch,
modifying and saving data, the
Comparative Modeling tutorial
Comparative modeling requires a structure to serve as a template,
and a target-template sequence alignment.
There are several ways
to obtain these inputs using Chimera.  Sequence alignments are shown in 
Multalign Viewer.
The Chimera interface to comparative modeling with Modeller
can be started by choosing
Structure...
Modeller (homology) from the Multalign Viewer menu,
or called with mda.
With this interface,
*only a single chain or subunit can be modeled at a time.*
Modeling a multimer or complex requires running Modeller outside of Chimera.
- Choose the target (sequence to be modeled)
- the name of the target sequence should be chosen from the pulldown menu
of all sequences in the current alignment
 
 
- Choose at least one template
- at least one structure to use as a template should be chosen from the table,
by clicking or dragging with the left mouse button to highlight the
corresponding rows.  Ctrl-click toggles the status of a single row. 
The choices are actually the sequences in the alignment, and 
choosing a sequence indicates using its associated structure(s).
Some of the columns are blank initially, but clicking 
Fetch Structures/Annotations 
loads the structures into Chimera and fills in the table, where possible.
The table can be sorted by the contents of any column by clicking the
column header:
  
  - Sequence - sequence name; 
  choosing a sequence indicates using its associated structure as a template
  (the table lists all of the sequences in the alignment except the target)
  
- Structure ID - identifier for the structure associated 
  with the sequence, if any; usually a 4-character PDB ID, but could be a 
  SCOP domain ID
  (which includes a PDB ID)
  
- %ID - percent sequence identity of the sequence as compared to
  the target, computed from the alignment
  
- Title - title of structure (from PDB entry)
  
- Organism - source organism of structure (from PDB entry)
  
 Clicking Fetch Structures/Annotations:
  - fetches the structure for any sequence not already associated with a
  structure, if that structure can be deduced from the sequence name (as in the 
  Structure
  preferences in
  Multalign Viewer)
  
- uses the structure IDs to look up additional information
  
 Before comparative modeling with multiple templates fetched by
mda, use the command
reset overlay to position the template structures for best results.
 
 
- Choose where to run Modeller:
 
 
  - Run Modeller via web service
    
    - Modeller license key 
    - a license key is required 
    to run the program; the Modeller home page includes links
    to register for a key and to download the program
    
 
- Run Modeller locally
    
    - Location of Modeller executable
    - the location of the Modeller executable file; 
    the license key will
    have been entered somewhere already during local installation
    
- Modeller script file (optional, overrides dialog)
    - use the specified Modeller script to control the calculation; this
    will override the settings in the dialog.  The script corresponding
    to the current dialog settings can be viewed in
    IDLE by clicking 
    Get Current Modeller Script, saved to a file using the IDLE menu,
    and edited by hand as desired. For more details on scripting Modeller, 
    consult the Modeller manual.
    
 
 
 
- Advanced Options:
 
 
  - Number of output models [N] (max 1000)
  (default 5)
  
- Include non-water HETATM residues from template
  (off by default)
  - whether to include HETATM residues other than water (ligands, ions, 
  detergent molecules, etc.) from the template in the output models.
  This option will propagate all qualifying residues, even from multiple
  templates; those not desired in the output should be
  deleted 
  from the template(s) beforehand.
  
- Include water molecules from template
  (off by default)
  - whether to include water residues from the template in the output models.
  This option will propagate all qualifying residues, even from multiple
  templates; those not desired in the output should be
  deleted 
  from the template(s) beforehand.
  
- Build models with hydrogens (warning: slow)
  (off by default)
  
- Use fast/approximate mode (produces only one model)
  (off by default)
  - use fast/approximate mode (~3 times faster) to get a rough idea of 
  model appearance or to confirm that the alignment is reasonable.  
  This mode does not randomize the starting structure (generates only a
  single model) and performs very little optimization of the target function.
  
- Use thorough optimization (recommended with MDA)
  - optimize more thoroughly than the default, as recommended for modeling
  large targets with MultiDomain Assembler
  (the mda command)
  
- Temporary folder location (optional)
  - use the specified location for temporary files; otherwise, a
  location will be generated automatically
  
- Distance restraints file (optional)
  - Specify an input file containing distance restraints 
  (see example file distres.txt). 
  Each line in the file should be of the format:
  
  res1  res2  dist  stdev
   where res1 and res2 are residue numbers or 
  ranges of residue numbers in the target sequence, dist is the 
  distance in Å, and stdev is the standard deviation. 
  If a single residue is specified, its Cα will be used to 
  anchor the restraint. If a residue range (e.g. 233-275) is
  specified, the range's center of mass will be used to anchor the restraint.
 
OK starts the calculation and dismisses the panel,
while Apply starts the calculation without dismissing the panel.
Close dismisses the panel without performing any calculation.
Help brings up this manual page in a browser window.
Running Modeller is a background task.  
Clicking the information icon  in the Chimera status 
line will bring up the Task Panel,
in which the job can be canceled if desired.
 
in the Chimera status 
line will bring up the Task Panel,
in which the job can be canceled if desired.
After the calculation has finished, the comparative models are opened 
in Chimera and can be saved
in the usual ways.  The models are automatically superimposed onto the template 
(or the lowest-numbered of multiple templates) using 
matchmaker
defaults, and the view is focused on that template. 
The models are associated 
with the target sequence, and the RMSD
header displayed in the
sequence window.
Model scores are shown in a
Model List,
the same dialog used
for comparative models fetched
from ModBase.  Loops or other parts of a model
can be subjected to further refinement.
Running Modeller with identical inputs on different machines may give
different (but equally valid) results, due to small numerical differences
that can lead to finding different local optima of the 
modeling objective function.
The only required input for Modeller building or refinement 
is a protein structure.  
Missing segments can be built de novo, or existing segments
refined by generating additional possible conformations.
Building and refinement can be applied to
protein structures regardless of whether they were
modeled or determined experimentally.
The Chimera interface to Modeller for building or refinement can be accessed
by starting
Model/Refine Loops, a tool in the Structure Editing category, or
by choosing Structure...
Modeller (loops/refinement) from the
Multalign Viewer (Sequence tool) menu.
Using Model/Refine Loops is equivalent to using the
Sequence tool
to show the sequence of a chain, then using its menu to show the interface.
If the structure chain is
associated with a sequence in an
alignment in Multalign Viewer,
however, the alignment sequence can be used instead of the individual sequence.
- 
The output models will include any water molecules and other 
HETATM residues from the input model; those not desired in the output 
should be deleted 
from the input beforehand.
 
 
- Model/remodel - which parts of the input model to build or refine; 
they will be considered collectively even if composed of multiple
noncontiguous sequence segments
 
 
 
- Allow this many residues adjacent to missing regions to move 
(default 1) - if one of the missing structure options is used,
how many existing residues at each missing/existing junction 
to allow to move relative to the input structure
- Number of models to generate (default
5)
- Loop modeling protocol 
  
  - standard (default)
  
- DOPE - Discrete Optimized Protein Energy score (see
Shen
and Sali, Protein Sci 15:2507 (2006)) with Lennard-Jones
  potential and GB/SA implicit solvent interaction; generally higher quality
  but more computationally expensive than the standard method, and more
  prone to calculation failure (resulting in fewer models than requested)
  
- DOPE-HR - same as DOPE, except higher precision
  
 
- Run modeller using
  
  - web service (default)
  
- local installation
  
 If web service:
   - Modeller license key 
    - a license key is required 
    to run the program; the Modeller home page includes links
    to register for a key and to download the program
    
 If local installation:
    - Location of Modeller executable
    - the location of the Modeller executable file; 
    the license key will
    have been entered somewhere already during local installation
    
- Custom Modeller script file (optional, overrides dialog)
    - use the specified Modeller script to control the calculation; this
    will override the settings in the dialog.  The script corresponding
    to the current dialog settings can be viewed in
    IDLE by clicking 
    Get Current Modeller Script, saved to a file using the IDLE menu,
    and edited by hand as desired. For more details on scripting Modeller, 
    consult the Modeller manual.
    
 
- Temporary folder location (optional)
  - use the specified location for temporary files; otherwise, a
  location will be generated automatically
  
OK starts the calculation and dismisses the panel,
while Apply starts the calculation without dismissing the panel.
Close dismisses the panel without performing any calculation.
Help brings up this manual page in a browser window.
Running Modeller is a background task.  
Clicking the information icon  in the Chimera status 
line will bring up the Task Panel,
in which the job can be canceled if desired.
 
in the Chimera status 
line will bring up the Task Panel,
in which the job can be canceled if desired.
After the calculation has finished, the models (each including the
unchanged parts of the protein in addition to what was built or refined) 
are opened in Chimera and can be 
saved in the usual ways.  
The models are associated 
with the sequence, and the RMSD
header displayed in the
sequence window.
Model scores are shown in a
Model List,
the same dialog used
for comparative models fetched
from ModBase.
UCSF Computer Graphics Laboratory / July 2015