MatchMaker MatchMaker icon

MatchMaker superimposes structures by first constructing a sequence alignment and then performing a least-squares fit to superimpose the aligned residue pairs. Fitting uses one point per residue. The sequence alignment is based on some combination of residue identity/similarity and secondary structure correspondence. For an informal introduction, see the Superpositions and Alignments tutorial. See also: Match -> Align, Multalign Viewer, and

Meng, E.C., Pettersen, E.F., Couch, G.S., Huang, C.C., and Ferrin, T.E. "Tools for integrated sequence-structure analysis with UCSF Chimera." BMC Bioinformatics 7(1):339 (2006).
If it is already known which residue numbers in each structure should be used for matching, the match command is a faster alternative (it does not include a sequence alignment step).

There are several ways to start MatchMaker, a tool in the Structure Comparison category. MatchMaker is also implemented as the command mmaker (or matchmaker).

The Chain pairing options are mutually exclusive and control which chain sequences are used to construct each reference-match alignment:

If an amino acid substitution matrix is chosen for scoring, only peptide chains will be aligned; if a nucleic acid matrix is chosen, only nucleic acid chains will be aligned. An error message will appear if there are no reference-match pairs of the appropriate type. Regardless of which chain(s) in a model to be matched are aligned in sequence to the reference, the entire model will be reoriented.

The sequence Alignment algorithm can be Needleman-Wunsch (global; default) or Smith-Waterman (local). Alignment scoring can use secondary structure information along with residue similarity (more details below). The option to Compute secondary structure assignments is available when secondary structure scoring is turned on. It indicates that helices and strands should first be identified with the ksdssp algorithm, overriding any pre-existing secondary structure assignments. A reason to use this option despite existing secondary structure assignments is that the use of consistent criteria tends to improve MatchMaker results. Pre-existing secondary structure information may have been determined with different methods or parameters for different structures. Ksdssp parameter defaults can be adjusted with the compute SS dialog (opened from the Model Panel).

When Show alignment(s) in Multalign Viewer is checked, each pairwise reference-match sequence alignment will be shown in a separate window. (Note that Match -> Align can be used to construct a multiple sequence alignment from a multiple superposition.) When fit iteration is employed, the pairs used in the final fit will be shown in the alignment as a region named matched residues. The header named RMSD is automatically shown and other headers hidden. This line shows the spatial variation among residues associated with a column. In the pairwise case, the value is simply the distance between atoms in the two residues associated with a column.

*Note: The main purpose of MatchMaker is to superimpose related structures; the sequence alignments are just a by-product. Successful superposition only requires a partially correct sequence alignment, as incorrect portions tend to be omitted during fit iteration. If the sequences are easy to align, the MatchMaker sequence alignment is likely to be correct from beginning to end. However, if they are more distantly related, parts of the sequence alignment may be incorrect even when the resulting iterated match looks very good. If the goal is to obtain not just a structural superposition but also an alignment of dissimilar sequences, Match -> Align is recommended for generating a structurally verified sequence alignment after the structures have been superimposed. Furthermore, matching the structures (again) using this structurally verified sequence alignment, while having little effect on the superposition, will provide better statistics for describing structural similarity (RMSD, etc.) because more columns will be aligned correctly.
Iterate by pruning long atom pairs until no pair exceeds [x] angstroms refers to an iterative fitting procedure. The sequence alignment is not changed, but residue pairs in the alignment can be removed from the "match list" used to superimpose the structures. Fitting uses one point per residue: CA atoms in amino acids and C4' atoms in nucleic acids. If a nucleic acid residue lacks a C4' atom (some lower-resolution structures are P traces), its P atom will be paired with the P atom of the aligned residue. In each cycle, atom pairs are removed from the match list and the remaining pairs are fitted, until no matched pair is more than x angstroms apart (x=2.0 by default). The atom pairs removed are either the 10% farthest apart of all pairs or the 50% farthest apart of all pairs exceeding the cutoff, whichever is the lesser number of pairs. The result is that the best-matching "core" regions are maximally superimposed; conformationally dissimilar regions such as flexible loops are not included in the final fit, even though they may be aligned in the sequence alignment.

Sequence alignment scores, parameter values, and final match RMSDs will be reported in the Reply Log.

Save settings writes the current MatchMaker parameters to the preferences file. Reset to defaults resets the dialog to the factory default parameter settings without changing any preferences.

OK performs the aligning/matching and dismisses the dialog, whereas Apply performs the operations without dismissing the dialog. Cancel simply dismisses the dialog. Help opens this manual page in a browser window.

See also: match, rmsd, matrixcopy, Match -> Align

ALIGNMENT SCORING

Sequence alignment scores can include a residue identity/similarity term, a secondary structure term, and gap penalties.

Residue identity/similarity. This term is based on any of several common substitution matrices.
Secondary structure correspondence. This term contributes to the score when Include secondary structure score (N%) is turned on. Clicking Show parameters reveals the secondary structure scoring parameters. N reflects the relative weights of the terms, which can be adjusted by moving the slider. If N is 30, for example,
total score = 0.70(residue similarity score) + 0.30(secondary structure score) – gap penalties
The values in the secondary structure Scoring matrix (for all pairwise combinations of H helix, S strand, and O other) and the secondary-structure-specific Gap opening penalties can be adjusted. Reset secondary structure scoring parameters to defaults can be used to restore the default values of all secondary structure scoring parameters.
Gap penalties. When Include secondary structure score is turned on, the secondary-structure-specific Gap opening penalties (Intra-helix, Intra-Strand, Any other) are used instead of the single Gap opening penalty. The same Gap extension penalty is used, however. The default gap parameters are not necessarily optimal, so it is reasonable to adjust them as needed.
Note that turning off Include secondary structure score is not the same as moving the slider to zero (N = 0%) with the option on. When secondary structure scoring is on:
UCSF Computer Graphics Laboratory / August 2008