Draft:Normal Cone (variational analysis)

Constructions in nonsmooth analysis From Wikipedia, the free encyclopedia

In variational analysis, set-valued analysis and optimization, the concept of a normal cone to a subset of a space generalizes that of the orthogonal complement / annihilator of a vector space, (outward) normal vector fields to surfaces — or more generally of the normal bundle of an embedded submanifold — to possibly non-smooth subsets of vector spaces.

Normal cones provide, among other things, the geometrical foundation for generalizing the convex subdifferential to non-convex functions. Of particular note is also their role in generalizing Fermat's rule to give necessary (and sometimes also sufficient) first order optimality conditions for constrained and non-smooth optimization problems. Moreover they play a role in defining coderivatives of set-valued maps. [1]

In the non-convex case there are several inequivalent definitions for a normal cone that all turn out to be useful and interesting for different problems — whereas in the convex case these all coincide which greatly simplifies things. For clarity and approachability this article first discusses the convex case over Hilbert spaces before going into the non-convex case over more general spaces.

Conventions

All vector spaces in this article are assumed to be real. The set denotes the extended real numbers and a function is called proper if it is nowhere equal to and also somewhere finite. We also assume that all cones contain zero. Sums of sets throughout the article are to be interpreted as Minkowski sums.

A set together with the normal cone at one of its points.

Convex Case

Definition

The convex normal cone to a non-empty convex subset of a (pre-)Hilbert space is defined byfor all , and for all . [2][3] Here denotes the inner product of .

Geometric Interpretation

This definition can intuitively be understood as follows: suppose and , then is the vector from to . If then would be orthogonal to , while the condition additionally allows to "point away" from more than 90°. So the (convex) normal cone at is the set of all vectors that point at least 90° away from all the vectors from to ; but translated to the origin.

Examples

  • When is a subspace of , then is the linear-algebraic normal space / orthogonal complement to for . If is instead an affine subspace then is instead the orthogonal complement to the underlying parallel vector space of .
  • Let be a non-empty, closed, convex set and consider its convex indicator function Then
  • For a proper, lower-semicontinuous, convex function we have , where denotes the epigraph of and the convex subdifferential of . This relationship serves to define further subdifferentials based on normal cones in the non-convex case.

Properties

  • is a non-empty, convex cone for all .
  • If , then .[3] The forward direction holds even if is infinite dimensional, while the backward direction may fail unless has nonempty interior.[4]
  • Let be non-empty, closed, convex. Then , where is the metric projection of onto . [3]
  • Intersection Rule: for any two nonempty, convex subsets of a tvs and . Provided that the qualification condition holds, the reverse inclusion is also true such that The qualification condition may be further weakened, especially if is a nicer space. [4] For example if is Banach and the two convex sets are closed, it is sufficient that , while in finite dimensions one can show that for any finite family of convex sets whose relative interiors intersect.
  • For further calculus rules cf. the properties of the non-convex generalizations below.

Another very important property is what is sometimes called Fermat's rule: let be convex, nonempty, closed and convex. Then, assuming some very mild constraint qualifications (c.f. for example Proposition 27.8 in [3]), solves the constrained minimization problem if and only if . If is differentiable this reduces to (here is the Fréchet gradient of ; i.e. the pointwise Riesz representative of the Fréchet derivative of ).

Generalization

More generally, one may define for any topological vector space , with topological dual . In this case is the duality pairing of and In this more general setting the normal cone can be recognized to be the set of all that attain their maximum on at . [1] This connects normal cones to another important class of objects, the so-called support function of a set. One has .

Non-Convex Case

Related Articles

Wikiwand AI