Self-consistent mean field (biology)

From Wikipedia, the free encyclopedia

The self-consistent mean field (SCMF) method is an adaptation of mean field theory used in protein structure prediction to determine the optimal amino acid side chain packing given a fixed protein backbone. It is faster but less accurate than dead-end elimination and is generally used in situations where the protein of interest is too large for the problem to be tractable by DEE.

Like dead-end elimination, the SCMF method explores conformational space by discretizing the dihedral angles of each side chain into a set of rotamers for each position in the protein sequence. The method iteratively develops a probabilistic description of the relative population of each possible rotamer at each position, and the probability of a given structure is defined as a function of the probabilities of its rotamer components.

The basic requirements for an effective SCMF implementation are:

  1. A well-defined finite set of discrete independent variables
  2. A precomputed numerical value (considered the "energy") associated with each element in the set of variables, and associated with each binary element pair.
  3. An initial probability distribution describing the starting population of each rotamer
  4. A way of updating rotamer energies and probabilities as a function of the mean-field energy

The process is generally initialized with a uniform probability distribution over the rotamers—that is, if there are rotamers at the position in the protein, then the probability of any individual rotamer is. The conversion between energies and probabilities is generally accomplished via the Boltzmann distribution, which introduces a temperature factor (thus making the method amenable to simulated annealing). Lower temperatures increase the likelihood of converging to a single solution, rather than to a small subpopulation of solutions.

Mean-field energies

The energy of an individual rotamer is dependent on the "mean-field" energy of the other positions—that is, at every other position, each rotamer's energy contribution is proportional to its probability. For a protein of length with rotamers per residue, the energy at the current iteration is described by the following expression. Note that for clarity, the mean-field energy at iteration is denoted by, whereas the precomputed energies are denoted by, and the probability of a given rotamer is denoted by.

These mean-field energies are used to update the probabilities through the Boltzmann law:

Where is the Boltzmann constant, and is the temperature factor.

Energy of the system

Although computing the system energy is not required in carrying out the SCMF method, it is useful to know the overall energies of the converged results. The system energy consists of two sums:

Where the addends are defined as:

Convergence

Accuracy

References

Related Articles

Wikiwand AI