Training HyperBF networks involves estimation of weights
, shape and centers of neurons
and
. Poggio and Girosi (1990) describe the training method with moving centers and adaptable neuron shapes. The outline of the method is provided below.
Consider the quadratic loss of the network
. The following conditions must be satisfied at the optimum:

,

,

where
. Then in the gradient descent method the values of
that minimize
can be found as a stable fixed point of the following dynamic system:

,

,

where
determines the rate of convergence.
Overall, training HyperBF networks can be computationally challenging. Moreover, the high degree of freedom of HyperBF leads to overfitting and poor generalization. However, HyperBF networks have an important advantage that a small number of neurons is enough for learning complex functions.[2]