For degree-d polynomials, the polynomial kernel is defined as[2]

where x and y are vectors of size n in the input space, i.e. vectors of features computed from training or test samples and c ≥ 0 is a free parameter trading off the influence of higher-order versus lower-order terms in the polynomial. When c = 0, the kernel is called homogeneous.[3] (A further generalized polykernel divides xTy by a user-specified scalar parameter a.[4])
As a kernel, K corresponds to an inner product in a feature space based on some mapping φ:

The nature of φ can be seen from an example. Let d = 2, so we get the special case of the quadratic kernel. After using the multinomial theorem (twice—the outermost application is the binomial theorem) and regrouping,

From this it follows that the feature map is given by:

generalizing for
,
where
,
and applying the multinomial theorem:

The last summation has
elements, so that:

where
and
